Telegram alerts
Telegram Server Alerts for CPU, RAM, Disk, SSL and Services
Get useful server alerts in Telegram and follow them with status cards, threshold controls and safe maintenance actions.
Overview
What server alerts can be sent to Telegram
Online Server Monitor can send alerts for server health signals such as CPU, RAM, disk usage, load average, SSL certificate expiry and systemd service state. Each alert should be actionable and easy to confirm with a fresh status card.
CPU
CPU alert example
A CPU alert is useful when a process consumes too much compute for too long. The right threshold depends on your workload, but it should separate normal short spikes from sustained pressure.
RAM
RAM alert example
A RAM alert warns when memory usage is high enough to risk swap pressure, killed processes or slow application responses. Tune RAM thresholds after observing normal traffic.
Disk
Disk alert example
Disk alerts are often the most urgent because a full disk can break databases, mail queues, logs and uploads. Pair disk alerts with controlled log cleanup actions.
Load
Load average alert example
Load average helps identify servers doing more work than available CPU capacity can comfortably handle. Treat it together with CPU and service state.
SSL
SSL certificate expiry alert example
An SSL alert warns when a certificate for a discovered nginx or Apache domain is close to expiry. The default threshold is 14 days, which gives time to repair renewals before browsers and clients start rejecting HTTPS.
Services
systemd service down alert
If a key systemd service is stopped or failed, Telegram can show the service state and, when allowed, a restart button. This keeps the first response close to the alert.
Resolved
Resolved alerts
Resolved notifications help close the loop. They tell the chat that the metric returned below threshold or the service recovered, so operators do not keep investigating stale alerts.
Groups
Telegram group alerts
Add the bot to an operations group when multiple people need visibility. Use careful permissions and do not forward pairing tokens into public channels.
Tuning
Threshold tuning
Start with conservative thresholds, watch normal behavior, then tune values. CPU and load may need higher tolerance on bursty servers, while disk alerts should fire early enough to leave time for cleanup.
Severity
Alert severity and response order
Not every alert deserves the same reaction. A disk alert near capacity may require immediate cleanup or expansion. A short CPU spike may only need observation. A failed database or web service can be urgent because it affects users directly. Define the expected response for each alert type before relying on the bot in production.
Resolved logic
Why resolved messages matter
Without resolved messages, a team can keep investigating an issue that already passed. A resolved notification helps close the loop and keeps the chat useful. It also makes alert noise easier to review because you can see which events self-recovered and which required action.
Maintenance from alerts
From alert to safe action
A good alert should lead to a safe next step. Disk pressure can lead to reviewing logs or running controlled cleanup. A failed service can lead to checking status and then restarting the detected service. High load can lead to looking at recent deploys, backups, traffic spikes or stuck processes before rebooting.
Group etiquette
Alert etiquette in Telegram groups
Keep operational groups focused. Avoid forwarding raw pairing commands, do not discuss passwords in the alert thread, and acknowledge when someone is handling an incident. Telegram is fast, but clarity still matters when several people can press buttons.
Examples
Practical threshold examples
- Disk: choose a value that leaves enough time to clean logs or expand storage.
- RAM: tune after watching normal traffic and scheduled jobs.
- CPU: avoid alerting on tiny spikes; look for sustained pressure.
- Load: compare with CPU count and normal workload.
- SSL: keep the expiry threshold high enough to fix broken renewals before users see HTTPS errors.
- Service down: treat important web, database and queue services as higher priority.
False positives
How to handle false positives
A false positive usually means the threshold is too sensitive, the sampling moment caught a normal spike, or a maintenance job looks like an incident. Do not simply disable alerts. Adjust thresholds, add operational context, and watch whether the alert becomes useful over the next few days.
Maintenance windows
Alerts during planned maintenance
During deploys, package updates, backups or migrations, alerts may fire for expected reasons. Use the Telegram thread to note that maintenance is in progress and avoid unnecessary restart or reboot actions. After maintenance, confirm that resolved messages arrive and metrics return to normal.
Escalation
When an alert should escalate
Escalate when an alert repeats, affects a user-facing service, or combines several signals such as high load plus failed web service plus disk pressure. Telegram alerts are most useful when the team already knows which cases require immediate action.
Noise
Alert fatigue prevention
Too many alerts become invisible. Keep thresholds meaningful, use resolved messages, and avoid turning every short spike into a notification.
FAQ
Telegram alert FAQ
- Can alerts go to groups? Yes, pair the server from the group chat.
- Can thresholds be changed from Telegram? Yes, use the threshold controls.
- Can an alert lead to a service restart? Yes, when the service is detected and restart is allowed.
- Where is the setup guide? Read How to Install Online Server Monitor on Linux.
Start monitoring
Connect a Linux server from Telegram.
Open the bot, request a one-time pairing command, run it on the server and receive the first status report in Telegram.