Telegram alerts

Telegram Server Alerts for CPU, RAM, Disk, SSL and Services

Get useful server alerts in Telegram and follow them with status cards, threshold controls and safe maintenance actions.

Open Telegram bot Back to overview

Overview

What server alerts can be sent to Telegram

Online Server Monitor can send alerts for server health signals such as CPU, RAM, disk usage, load average, SSL certificate expiry and systemd service state. Each alert should be actionable and easy to confirm with a fresh status card.

CPU

CPU alert example

A CPU alert is useful when a process consumes too much compute for too long. The right threshold depends on your workload, but it should separate normal short spikes from sustained pressure.

RAM

RAM alert example

A RAM alert warns when memory usage is high enough to risk swap pressure, killed processes or slow application responses. Tune RAM thresholds after observing normal traffic.

Disk

Disk alert example

Disk alerts are often the most urgent because a full disk can break databases, mail queues, logs and uploads. Pair disk alerts with controlled log cleanup actions.

Load

Load average alert example

Load average helps identify servers doing more work than available CPU capacity can comfortably handle. Treat it together with CPU and service state.

SSL

SSL certificate expiry alert example

An SSL alert warns when a certificate for a discovered nginx or Apache domain is close to expiry. The default threshold is 14 days, which gives time to repair renewals before browsers and clients start rejecting HTTPS.

Services

systemd service down alert

If a key systemd service is stopped or failed, Telegram can show the service state and, when allowed, a restart button. This keeps the first response close to the alert.

Resolved

Resolved alerts

Resolved notifications help close the loop. They tell the chat that the metric returned below threshold or the service recovered, so operators do not keep investigating stale alerts.

Groups

Telegram group alerts

Add the bot to an operations group when multiple people need visibility. Use careful permissions and do not forward pairing tokens into public channels.

Tuning

Threshold tuning

Start with conservative thresholds, watch normal behavior, then tune values. CPU and load may need higher tolerance on bursty servers, while disk alerts should fire early enough to leave time for cleanup.

Severity

Alert severity and response order

Not every alert deserves the same reaction. A disk alert near capacity may require immediate cleanup or expansion. A short CPU spike may only need observation. A failed database or web service can be urgent because it affects users directly. Define the expected response for each alert type before relying on the bot in production.

Resolved logic

Why resolved messages matter

Without resolved messages, a team can keep investigating an issue that already passed. A resolved notification helps close the loop and keeps the chat useful. It also makes alert noise easier to review because you can see which events self-recovered and which required action.

Maintenance from alerts

From alert to safe action

A good alert should lead to a safe next step. Disk pressure can lead to reviewing logs or running controlled cleanup. A failed service can lead to checking status and then restarting the detected service. High load can lead to looking at recent deploys, backups, traffic spikes or stuck processes before rebooting.

Group etiquette

Alert etiquette in Telegram groups

Keep operational groups focused. Avoid forwarding raw pairing commands, do not discuss passwords in the alert thread, and acknowledge when someone is handling an incident. Telegram is fast, but clarity still matters when several people can press buttons.

Examples

Practical threshold examples

Disk: choose a value that leaves enough time to clean logs or expand storage.
RAM: tune after watching normal traffic and scheduled jobs.
CPU: avoid alerting on tiny spikes; look for sustained pressure.
Load: compare with CPU count and normal workload.
SSL: keep the expiry threshold high enough to fix broken renewals before users see HTTPS errors.
Service down: treat important web, database and queue services as higher priority.

False positives

How to handle false positives

A false positive usually means the threshold is too sensitive, the sampling moment caught a normal spike, or a maintenance job looks like an incident. Do not simply disable alerts. Adjust thresholds, add operational context, and watch whether the alert becomes useful over the next few days.

Maintenance windows

Alerts during planned maintenance

During deploys, package updates, backups or migrations, alerts may fire for expected reasons. Use the Telegram thread to note that maintenance is in progress and avoid unnecessary restart or reboot actions. After maintenance, confirm that resolved messages arrive and metrics return to normal.

Escalation

When an alert should escalate

Escalate when an alert repeats, affects a user-facing service, or combines several signals such as high load plus failed web service plus disk pressure. Telegram alerts are most useful when the team already knows which cases require immediate action.

Noise

Alert fatigue prevention

Too many alerts become invisible. Keep thresholds meaningful, use resolved messages, and avoid turning every short spike into a notification.

FAQ

Telegram alert FAQ

Can alerts go to groups? Yes, pair the server from the group chat.
Can thresholds be changed from Telegram? Yes, use the threshold controls.
Can an alert lead to a service restart? Yes, when the service is detected and restart is allowed.
Where is the setup guide? Read How to Install Online Server Monitor on Linux.

Start monitoring

Connect a Linux server from Telegram.

Open the bot, request a one-time pairing command, run it on the server and receive the first status report in Telegram.

Open Telegram bot Read security model