Introduction
This document provides troubleshooting steps on Alerts generated by OpsRamp, and the issues related to metering gateway. It includes guidance on identifying common problems, verifying configurations, and applying corrective actions to restore normal operations.
Metering gateway troubleshooting
See Troubleshooting for guidance on metering gateway errors.
Alerts troubleshooting
Understanding the Alert Details
OpsRamp-generated alerts follow a standard structure. Each alert includes the following fields:
- Date and time - When the alert was generated
- Source - Origin of the alert
- Alert Type - Category of the issue
- Billing Account - Associated account for billing context
- Service name - The impacted service or resource
- Message - Detailed description of the alert
The OpsRamp-originated sources of alerts fall under the following categories:
- Collection Failure: Indicates issues in collecting metering or usage data from resources. Common causes include invalid credentials, network connectivity problems, or device-specific issues.
- Transport Failure: Indicates problems in transferring collected data to the destination system. Common causes include connectivity timeouts or service unavailability.
Alert Category: Collection Failure
| Alert subject | Reason | Debugging Steps | Remediation |
|---|---|---|---|
| Invalid Credentials | Authentication failed due to incorrect username/password or expired token. Credentials at the device end may have been changed but not updated in OpsRamp configuration. | Check app for corresponding service and validate credentials | Update the passwords in OpsRamp's credentials management so that the updated encrypted credentials can be used for metering. |
| Incomplete Usage Data Collection | Usage data missing for some resources due to credential or device issues. | Verify missing resource IPs; triggered before scheduled usage data delivery | Update credentials |
| Un-metered Resource | Resource is down or unavailable for metering. | Check app for resource and confirm availability status | Check resource status |
| Metering Not Active | Metering app not installed or onboarding failed. | Check zero-touch onboarding failure | Re-install app for service metering |
| Usage Data Exception | Triggered once in 24 hrs; check OpsRamp logs | Trigger on-demand metering and confirm |
Alert Category: Transport Failure
| Alert subject | Reason | Debugging Steps | Remediation |
|---|---|---|---|
| Connectivity Failure | Network timeouts or platform down. | connection time-outs/Usage data is not available at the destination for processing | Check with the team for data platform issue. |