The workload status is derived from the alerting status of the entities in your workload.
Why it matters
Workload status:
- Is a quick indicator of how your system is doing, and tells you if you need to take action on any of your workloads in just a glance.
- Adapts to your needs and to how important each entity is.
- Allows you to share the status of your workloads. Other teams that depend on your services or infrastructure can learn the status of the workload without them needing to understand your system's architecture details, or look at custom dashboards.
Get started with workload status
We provide a status value for all your monitored entities, which is based on the results of your alert incidents. You can check the color-coded alert status for each entity from the All entities view.
With workloads you get a single, global value that summarizes the status of all the entities in your workload. This helps you quickly detect when the workload stops being operational, or anticipate any potential incident or loss of quality of service.
A workload can have one of the following status values:
- Operational: The workload is working fine.
- Degraded: The workload is showing some degradation in performance or errors, but it's still providing an acceptable level of service, and you don't need to take any urgent action.
- Critical or Disrupted: The workload is not providing an acceptable level of service, and you need to take urgent action.
- Unknown: You haven't configured how to calculate workload status, or there aren't any alert conditions set up that can determine the status of the workload entities.
To learn how to define or edit the workload status, refer to Workload status configuration.
Get notified when the workload status changes
You may need to follow the status of a workload, either because it represents the services your team is accountable for, or because your own services depend on that workload, which is managed by another team.
The status of all workloads is calculated regularly and the result is stored in NRDB through a WorkloadStatus
event. This allows you to set up an alert condition to notify you whenever the workload goes into a Disrupted
or Degraded
status.
To set up the alert condition:
Go to one.newrelic.com and select Alerts. Select the policy where you want to add the new alert condition, or create a new policy with the appropriate notification channel. Then click on Create a condition.
Where prompted to Select a product, click NRQL.
Add the following NRQL query:
SELECT latest(statusValueCode) FROM WorkloadStatusWHERE workloadGuid = 'YOUR_WORKLOAD_GUID'FACET workloadGuid AS 'entity.guid', entity.nameSome tips for writing your query:
- Get the workload GUID by clicking on the See metadata and manage tags in the workloads UI.
- Write the
WHERE
clause so the condition applies to just one workload (as in the example) or more than one. Or remove theWHERE
clause if you want the alert condition to apply to all the workloads on the account. - Using
FACET
you can use these fields on the alert description (explained below).
Set one of the following static thresholds:
- (Recommended) Critical when the query returns a value equal to 3 for at least 1 minute, if you want to be notified when the workload status is disrupted.
- Critical when the query returns a value equal to 2 for at least 1 minute, if you want to be notified when the workload status is degraded.
- Remember that a warning threshold doesn't generate an incident or send a notification. As a result, you need to create two alert conditions with a critical threshold (as explained above) if you want to be notified of any status change.
Complete the alert condition:
- Set an incident time limit, to automatically force-close a long-lasting incident after the selected amount of time you select.
- Choose to fill data gaps with last known value.
Optionally, you can also add a custom incident description that includes the workload name and permanent link to the UI in the alert notification:
Workload:Direct link: https://one.newrelic.com/redirect/entity/