- Create an API Key in Datadog.
- Create a Datadog Integration in Zebrium using the information from step 1.
- Create a Webhook Integration in Datadog using the information from step 2.
- Add Webhook notifications to your Triggered Monitors in Datadog.
- Add Zebrium Root Cause Reports to your Datadog Dashboard.
- From the Main Navigation panel, hover over your Datadog Login Name and select Organization Settings.
- Click on API Keys.
- Click the + New Key button.
- Enter a Name for the API Key and click Create Key.
- Copy and save the Key for use in STEP 2.
- From the User menu area in Zebrium, click on the Settings (hamburger) Menu.
- Select Integrations.
- Scroll to the Observability Dashboards section and click on Datadog Events and Metrics.
- Click on the Create a New Integration button.
- Click on the General tab.
- Enter an Integration Name for this integration.
- Select the Deployment for the integration.
- Select the Service Group(s) for the integration.
- Click on the Receive Signals tab.
- Click on the Enabled button.
- Enter the API Key created in STEP 1 above.
- Click the Save button.
- Copy the Webhook URL and save for use in STEP 3 and click OK.
- From the Main Navigation panel, navigate to Integrations/Integrations.
- Locate the Webhooks integration card and click Configure.
- Click on the New button located in the Webhooks **section and enter a **Name and the URL saved in STEP 2.
- In the Payload section, add:
- Click Save.
- From the Main Navigation panel, navigate to Monitors/Manage Monitors.
- Click on the Monitor you wish to trigger Root Cause Reports.
- Choose Edit from the gear icon on the Monitor page.
- Add the webhook (from STEP 3) in the Notify your team list.
- Click Save.
Zebrum sends events to Datadog as follows:
- Events - events are sent each time a Zebrium Root Cause Report Detection occurs.
Here is a sample Chart visualization showing:
- A Root Cause Finder panel that displays a vertical bar whenever a Zebrium detection occurs. This allows you to easily see detections that are aligned with other metrics on your dashboards.
- A Root Cause Reports Summary panel that list summary information for each Zebrium detection.
Here is a screen shot showing the definition of the Root Cause Finder Panel:
Here is a screen shot showing the definition of the Root Cause Reports Summary Panel:
|zebrium.logs.all.count||Count of all log events received in a one minute duration (per service_group and deployment)|
|zebrium.logs.anomalies.count||Count of anomaly log events received in a one minute duration (per service_group and deployment)|
|zebrium.logs.errors.count||Count of error log events received in a one minute duration (per service_group and deployment)|
|ze_service_group||Zebrium service group name for the corresponding metric or event|
|ze_deployment||Zebrium deployment name for the corresponding metric or event|
|ze_significance||Significance of the Root Cause Report (low, medium or high)|
If you need help with this integration, please contact Zebrium by sending email to firstname.lastname@example.org.