Technical14 min read

How Uptime Monitoring Detects Outages: HTTP, TCP, DNS & Multi-Location Checks Explained (2026)

DevOps Team

DevOps Engineers

January 29, 2026

The Mechanics of Outage Detection

Uptime monitoring services work by simulating user access to your services. At scheduled intervals, they send requests to your servers and analyze the responses. When responses indicate problems—errors, timeouts, or unexpected content—the monitoring system flags potential outages.

But a single failed request doesn't confirm an outage. Network glitches, temporary overload, and routing issues can cause transient failures. Sophisticated monitoring includes verification steps to distinguish real outages from noise.

HTTP/HTTPS Checks

The most common monitoring method for websites. The monitoring service sends an HTTP GET request to your URL and examines the response.

What Gets Checked

HTTP Status Code: A 200 status indicates success. 4xx errors (client errors like 404 Not Found) and 5xx errors (server errors like 500 Internal Server Error) indicate problems. Some monitoring considers 3xx redirects acceptable; others flag them.
Response Time: How long until the server responds. Slow responses might indicate overload even when the eventual response is successful.
Response Body: Optional content validation checks whether expected text appears in the response. This catches scenarios where the page loads but contains error messages or unexpected content.
Headers: Some monitors verify specific response headers, useful for checking cache behavior, security headers, or custom application headers.

HTTPS Considerations

HTTPS monitoring includes TLS/SSL handshake in the connection process. The monitor verifies:

Certificate validity (not expired, not revoked)
Certificate chain (properly signed by trusted authority)
Domain match (certificate covers the requested domain)
Protocol support (TLS version compatibility)

SSL-related failures produce specific error types distinct from HTTP errors.

TCP/Port Checks

TCP checks verify that a server accepts connections on a specific port without sending application-layer requests. Useful for monitoring services that don't speak HTTP:

Database servers (MySQL on 3306, PostgreSQL on 5432)
Mail servers (SMTP on 25/587, IMAP on 993)
Custom application ports
SSH access (port 22)

The monitor attempts to establish a TCP connection. Success means the service accepts connections; failure indicates the port is unreachable or the service isn't listening.

TCP Check Limitations

A successful TCP connection only proves something accepts connections on that port. It doesn't verify the application behind the port functions correctly. A database server might accept connections but reject all queries due to disk space issues. For application-level health, HTTP checks or custom scripts provide better insight.

ICMP/Ping Checks

ICMP echo requests (ping) verify basic network reachability. The monitor sends an ICMP packet to your server's IP address and waits for a response.

What Ping Tells You

The server's network interface is up
The server responds to ICMP packets
Network routing between monitor and server works
Round-trip latency

What Ping Doesn't Tell You

Whether your web server process is running
Whether your application works correctly
Whether specific ports are accessible (firewalls may block ports but allow ICMP)

Many servers and firewalls disable ICMP responses for security reasons. Ping checks fail even when services work perfectly. Use ICMP cautiously and verify your infrastructure allows it.

DNS Monitoring

DNS monitoring verifies that your domain resolves correctly. Issues here prevent users from reaching your site even when your servers work fine.

DNS Check Types

Resolution: Does the domain resolve to any IP address?
Specific Record: Does the domain resolve to the expected IP address?
Record Types: A records, CNAME records, MX records, TXT records
Propagation: Does the domain resolve correctly from multiple DNS servers?

When DNS Monitoring Helps

Detecting DNS provider outages
Catching unauthorized DNS changes (potential security incident)
Verifying DNS propagation after changes
Monitoring email-related records (MX, SPF, DKIM)

Multi-Location Verification

A single monitoring location can't distinguish between "your site is down" and "the network between me and your site has issues." Professional monitoring services check from multiple geographic locations.

How It Works

When a check fails from one location, the monitoring service immediately tests from other locations. If all locations report failure, the outage is confirmed. If only one location fails, it's likely a regional network issue rather than your site.

Benefits

Reduces false positives from localized network problems
Detects regional outages affecting specific user populations
Identifies CDN or geographic routing issues
Provides global perspective matching your user base distribution

False Positive Mitigation

False positives—alerts for outages that didn't actually occur—waste time and erode trust in monitoring. Several techniques minimize them:

Consecutive Failure Requirements

Rather than alerting on a single failed check, require 2-3 consecutive failures. Transient issues often resolve within a check interval; real outages persist across multiple checks.

Multi-Location Confirmation

As described above, verifying failure from multiple locations filters out network-specific issues between monitoring locations and your server.

Timeout Tuning

Setting appropriate timeout thresholds balances between catching slow responses and avoiding false alarms from temporary delays. A 30-second timeout catches most real issues without triggering on brief slowdowns.

Maintenance Windows

Scheduled maintenance causes expected downtime. Monitoring services let you define maintenance windows during which checks run but don't trigger alerts.

Alert Deduplication

When an outage is confirmed, you need one alert—not one per failed check. Deduplication ensures you receive notification when the outage starts and when it resolves, not continuous alerts during the incident.

Detection Timing Deep Dive

Understanding detection timing helps set expectations for incident response.

Components of Detection Time

Check Interval: Maximum time until the next check runs (e.g., 1-5 minutes)
Check Duration: Time for the check itself to complete or timeout (typically seconds)
Verification Checks: Additional checks from other locations or consecutive intervals
Alert Processing: Time to evaluate results and decide whether to alert
Alert Delivery: Time to send notifications via email, SMS, etc.

Best-Case Scenario

Your site fails immediately before a scheduled check. The check fails quickly (fast timeout on connection refused). A second location confirms instantly. Alert delivers within seconds.

Total: seconds to 1-2 minutes

Worst-Case Scenario

Your site fails immediately after a check passes. The next check runs a full interval later. The check takes maximum timeout to fail. Verification from another location takes additional timeout. Alert queuing adds delay.

Total: check interval + (timeout × verification attempts) + delivery delay

Practical Expectations

With 1-minute check intervals and typical verification settings, expect detection within 2-5 minutes for most outages. Faster detection requires shorter intervals, which may increase costs.

Synthetic Monitoring Beyond Simple Checks

Basic HTTP checks verify pages load. Synthetic monitoring goes further, scripting actual user interactions:

Log into an account
Add an item to a shopping cart
Complete a checkout flow
Submit a form and verify confirmation

These scripted transactions catch failures that simple checks miss—a login page that loads but doesn't actually authenticate users, for example.

Synthetic monitoring requires more setup (writing scripts, maintaining them as your site changes) but provides deeper visibility into application health.

Choosing the Right Check Types

For Standard Websites

HTTP/HTTPS checks with content validation cover most needs. Monitor homepage, key landing pages, and any critical functionality like contact forms or checkout.

For APIs

HTTP checks against API endpoints, potentially with response body validation (checking that JSON structure matches expectations). Consider authentication if your API requires it.

For Infrastructure

TCP checks for databases, mail servers, and internal services. ICMP only if you've verified it's allowed and useful in your environment.

For Complex Applications

Layer multiple check types. HTTP for frontend, TCP for backend services, synthetic for critical user flows. Each layer catches different failure modes.

Integration with Broader Observability

External uptime monitoring complements internal observability tools:

Application Performance Monitoring (APM): Detailed internal metrics, traces, and error tracking
Log aggregation: Centralized logs for debugging
Infrastructure monitoring: CPU, memory, disk metrics from agents
Real User Monitoring (RUM): Performance data from actual user sessions

External monitoring answers "can users reach us?" Internal tools answer "what's happening inside?" Both perspectives inform complete operational awareness.

Frequently Asked Questions

How does an uptime monitoring service know when a website is down?

The monitoring service sends an HTTP request to your URL from one or more server locations at regular intervals. If the server doesn't respond within the timeout period, returns a 5xx error code, or the response doesn't contain expected content, the check is marked as failed. To confirm it's a real outage (not a network hiccup), the service typically retries from other geographic locations before sending an alert.

What is the difference between HTTP monitoring and TCP port monitoring?

HTTP monitoring sends a full web request and checks the response code, content, and timing — it tests that your web application works end-to-end. TCP monitoring only verifies that a server accepts connections on a specific port, without sending any application request. TCP checks are useful for non-HTTP services like databases (port 3306), mail servers (port 587), or SSH (port 22), but they can't verify the application behind the port functions correctly.

Why do monitoring services check from multiple locations?

A single monitoring location can't distinguish between "your site is down globally" and "there's a network problem between that monitoring server and your server." By checking from multiple geographic locations simultaneously, the service can confirm: if all locations fail, it's your site; if only one location fails, it's likely a regional network issue. Multi-location checks also detect CDN failures and regional outages that affect only some of your users.

What causes false positive alerts in uptime monitoring?

Common causes: (1) transient network issues between the monitoring server and your server that resolve in seconds; (2) DNS propagation delays after a DNS change; (3) monitoring server IP temporarily blocked by your firewall or CDN; (4) SSL certificate briefly failing during renewal; (5) timeout thresholds set too aggressively for your server's normal response time. Multi-location verification and requiring 2–3 consecutive failures before alerting eliminate most false positives.

How fast can an uptime monitoring service detect an outage?

Detection time = (time until next scheduled check) + (check duration/timeout) + (verification from second location) + (alert delivery). Best case with 1-minute intervals: 30 seconds to 2 minutes total. Worst case: 1 full check interval + timeout × verification attempts + delivery delay. In practice, AlertSleep detects most outages within 1–3 minutes with 1-minute check intervals. Alert delivery via SMS adds 10–30 seconds.

What is DNS monitoring and when do I need it?

DNS monitoring verifies that your domain name resolves to the correct IP address. If your DNS provider has an outage, or if DNS records are accidentally changed (or maliciously altered), your site becomes unreachable even though your servers are fine. DNS monitoring is especially valuable for: sites using multiple DNS providers, domains with complex CDN routing, businesses that have experienced DNS attacks, and any site where DNS changes are made frequently.

Can uptime monitoring detect partial outages (when some pages work but others don't)?

Yes, but only if you monitor the specific failing pages. A single homepage monitor won't detect a broken checkout page. To catch partial outages, set up separate monitors for each critical endpoint: homepage, product pages, checkout, API endpoints, login page. Content validation adds another layer — if a page loads but shows an error message instead of expected content, the monitor flags it even though the HTTP status is 200 OK.

#technical #http checks #tcp #dns #outage detection #false positives

Share this article

About the Author

DevOps Team

DevOps Engineers

Our DevOps team brings decades of experience in building and maintaining reliable infrastructure.

Technical14 min read

How to Check Server Status: 5 Methods Compared — Agent-Based, External & More (2026)

5 proven methods to check server status in 2026: agent-based monitoring, agentless (SNMP/SSH/WMI), external uptime checks, internal monitoring, and push vs. pull. Compare pros, cons, and when to use each — with a quick-reference table.

Technical13 min read

Synthetic Monitoring vs Real User Monitoring (RUM): Key Differences (2026)

Synthetic monitoring vs RUM (Real User Monitoring): learn how each works, when to use them, and key differences in 2026. Synthetic checks run 24/7 for uptime alerts; RUM captures real visitor data including LCP, FID, and CLS. Comparison table + FAQ included.

Stay Updated

Get the latest articles and monitoring tips delivered to your inbox.

Start Free Trial

The Mechanics of Outage Detection

HTTP/HTTPS Checks

What Gets Checked

HTTPS Considerations

TCP/Port Checks

TCP Check Limitations

ICMP/Ping Checks

What Ping Tells You

What Ping Doesn't Tell You

DNS Monitoring

DNS Check Types

When DNS Monitoring Helps

Multi-Location Verification

How It Works

Benefits

False Positive Mitigation

Consecutive Failure Requirements

Multi-Location Confirmation

Timeout Tuning

Maintenance Windows

Alert Deduplication

Detection Timing Deep Dive

Components of Detection Time

Best-Case Scenario

Worst-Case Scenario

Practical Expectations

Synthetic Monitoring Beyond Simple Checks

Choosing the Right Check Types

For Standard Websites

For APIs

For Infrastructure

For Complex Applications

Integration with Broader Observability

Frequently Asked Questions

How does an uptime monitoring service know when a website is down?

What is the difference between HTTP monitoring and TCP port monitoring?

Why do monitoring services check from multiple locations?

What causes false positive alerts in uptime monitoring?

How fast can an uptime monitoring service detect an outage?

What is DNS monitoring and when do I need it?

Can uptime monitoring detect partial outages (when some pages work but others don't)?

Share this article

About the Author

DevOps Team

Related Articles

How to Check Server Status: 5 Methods Compared — Agent-Based, External & More (2026)

Synthetic Monitoring vs Real User Monitoring (RUM): Key Differences (2026)

Stay Updated