Using Real-Time Analysis with Uptime.com

Real-Time Analysis provides a chronological timeline of the events and alerts that have been issued to a particular check. It is designed for users with access to learn the state of an outage, and develop a timeline for the outage, with technical details for each step in the process.

Table of Contents

Where to Find Real-Time Analysis

Return

It is possible to view this screen from multiple locations in the Uptime.com interface:

From the Dashboard, under Latest Alerts: click View Details next to the alert you wish to view, then scroll to the bottom of the outage popup. Click the green Real-Time Analysis button.

Screen_Shot_2021-01-25_at_3.19.06_PM.png

You can also click a check card from your Dashboard, or view any check's Report to find a link to real-time analysis:

rta-link.png

From Monitoring>Checks: Click Actions>Analysis

checks-actions.png

Additionally, Real-Time Analysis is also available as a direct link from the alert emails issued to your contacts by Uptime.com. Click the link for “Real Time Analysis” to see the Analysis for the check in question.

Clients using the Uptime.com API can also request this information for delivery via the `/checks/{pk}/analysis` endpoint.

Root Cause Analysis

API and Transaction checks have a root cause analysis tool used to identify the specific step where failure occurred to assist in diagnosing the problem. You can also access root cause analysis from the alert email issued to your check’s contacts.

RTA.png

Root cause analysis provides detailed check results, including response time metrics for each step the check was able to complete prior to failure, as well as screenshots and any notes that you have saved corresponding to the check. 

Root Cause Analysis Examples

Return

Transaction and API Check Alert Details also include a screenshot of what Uptime.com encountered, at which specific step, and technical data on the expected output. Note in this screenshot that the expected text was a misspelling of the word “Uptime”:

Scroll down to review the Check Results and the Browser Console log:

Check Results displays the steps that the API or Transaction Check took, as well as technical details on the failure. 

In this API check example, we see the probe server reporting an error with the x509 certificate, which is causing failure at Step 1:

real-time-analysis.png

Viewing Headers

Both API and Transaction check types allow users to view header information when a request is sent. Run a test check and click the “i” icon:

view-headers.png

This example illustrates returned data for a failed API check:

API-request-headers.png

Where to Find Traceroute

Uptime.com can run a traceroute from a specific probe server location (including Private Location probe servers). This feature is enabled by default for all enterprise accounts, and is available as an add-on to other plans. Traceroute is useful when you need to detect connection anomalies between Uptime.com and your service.

Scroll to the bottom of your Real-Time Analysis page for the check in question, accessed via the instructions above, and you will see a dropdown menu that contains the probe server locations assigned to your check. Select the location to run a traceroute from and click Submit.

Traceroute runs are limited to 10 per minute, 50 per hour, and 100 per day per account.

Please note: Traceroute functionality is not available for once-per-day check types, including: WHOIS/Domain Expiry, SSL Certificate checks, Domain Blacklist, and Malware/Virus checks, as well as Custom checks (heartbeat and webhook).

traceroute_el.jpg

Real-Time Analysis Details

Return

The Real-Time Analysis screen contains summary details of each outage related to the last 100 alerts, and links to specific alerts issued when an outage occurred.

Technical Outage Data

Return

Users will find detailed technical data on where the outage or failure occurred. The status of the check signals when and how the check was triggered, and helps explain the conditions that created or triggered the alert. Includes response time, response codes, and links to the specific alert. Where specified, the data will designate whether the response matched the user-defined value, and whether the check failed as a result.

Probe Server Data

Return

Additionally, users will find probe server data that discusses when an outage has occurred (whether downtime was alerted or not). This screen may show a probe server outage that did not trigger an alert, for example a single probe server failing on a check that has a sensitivity of two. It is also possible to view the dates a probe server was assigned.

Load Real-Time Check Status

Return

Real-Time-Analysis-Button.png

Uptime.com includes a function to check the status of the probe servers used in the check you are analyzing. We encourage users to utilize this real-time check status button to investigate the status of a probe, especially when you're confirming the status of a check. 

Location Status

Return

Location-Status.png

The first column tells you which Location the probe servers are monitoring from, and provides the IP address for each Assigned Probe Server for your check. The State tells you whether the probe server is reporting as OK or Critical. The check will issue a DOWN alert when the number of probe servers reporting CRITICAL matches or exceeds the number you designated as the check’s sensitivity.

Each probe server lists a Last Alert Date, in your account's preferred timezone, at which the last alert occurred for that probe server. Last Alert Details provide a brief summation of the technical data collected during the last alert. Scroll to Recent Alerts Per Location for a full report on outages the check has experienced.  

This section provides the current status for each location configured for a check. Uptime.com can be used to confirm a check status with the Real Time Check Status button at the top of the Real-Time Analysis screen. This button returns the results of the last time the check ran from each location.

Recent Alerts Per Location

Return

mceclip3.png

The recent alert details table includes the date, along with some additional information:

  • Location - The probe server region where a state change occurred.
  • Probe Server - The specific probe server reporting a change in status. Hover over a Probe Server to see the IP address for the probe in question. 
  • Locations State Changed To - Either OK, WARNING, CRITICAL, or UNKNOWN. This is the state of the probe server location. 
  • Details - The specific alert details each probe server is reporting. 
  • # Locations Down - The number of probe server locations down at the reported time. If your check uses 5 probe servers, and each server reports an outage, you will see the locations DOWN begin at 0 and progress to 5 as each location reports an outage. This number will decrease to 0 as probe servers return to UP status.
  • Overall Check State - The state of the check at the timestamp reported (Either UP or DOWN). 

When Check State changes to DOWN, recent alerts will contain a link to Alert Details for probe server outages that met your check’s sensitivity settings. Locate the date and time that the alert was issued, then click Alert Details from the server that triggered the alert.

mceclip2.png

The alert details provide access to the alert data that was generated at the time of failure.

Alert-Data-Raw.png

Discrepancies Between Check State and State

When a check is returning to UP status, the Check State will not return to UP until the State of all probe servers is OK. If your Check State is down, be sure to review the State of each probe server as you conduct your troubleshooting. 

We also suggest a review of your check’s sensitivity, which is the number of locations that must report State as CRITICAL before Check State is changed to DOWN. 

Was this article helpful?
1 out of 1 found this helpful