Link

Note: In addition to the integrations described below, we also provide a custom Datadog Dashboard Widget. Select Integrations in your Datadog UI and search for Zebrium for more details (contact Zebrium for further information).

Datadog Events and Metrics

  • Automatically adds Root Cause Reports as Events in Datadog. This allows you to see details of root cause on any Datadog dashboard.
  • Automatically adds Log count metrics in Datadog.
  • Each Zebrium RC Report includes a summary, word cloud and a set of log events showing symptoms and root cause. Plus a link to the full report in the Zebrium UI.
  • This means faster MTTR and less time manually hunting for root cause.

How it Works

Our recommended mode of operation for Observability Dashboard integrations is to use Zebrium’s Auto-Detect mode as an accurate mechanism for explaining the reason something went wrong. In this mode, you continue to use your existing rules, alerts and metrics as the primary source of problem detection. You can then review Zebrium RC Report findings directly in your Datadog Dashboards alongside other metrics to explain the reason behind problems you were alerted on.

Augment mode is useful when you have monitors defined in Datadog and you want a Root Cause Report automatically generated at the time of the alert. In this mode, Zebrium uses a Datadog webhook as a notification channel and will update your Dashboard with Root Cause Reports that coincide with the triggering monitor so they’re immediately visible to you as you work the issue.

The two modes of operation are independent. You can configure Auto-Detect and/or Augment modes depending on your operational use-case.

Auto-Detect: Send Root Cause Detections to your Datadog Dashboards

  1. Zebrium continuously monitors all application logs and uses unsupervised machine learning to find anomalous log patterns that indicate a problem. These are automatically turned into Root Cause Reports highlighting details of any problems with over 95% accuracy.
  2. Root Cause Report summaries are sent to Datadog using the event API and Root Cause details are visible on your Datadog Dashboards.
  3. If you need to drill down further to look at correlated logs across your entire app, it’s just one click from your Dashboard.
  4. Log metrics are also sent to Datadog via the series API for visualization on your Datadog Dashboards.

CLICK HERE To send Root Cause Detections to your Datadog Dashboards

Augment: Receive Signals from Datadog Triggered Monitors

  1. Any Datadog Monitor can trigger a webhook request for Root Cause Analysis from Zebrium.
  2. Zebrium finds anomalous log patterns from your application that coincide with the event and creates a Root Cause Report.
  3. Root Cause Report summaries are sent to Datadog using the event API and Root Cause details are visible on your Datadog Dashboards.
  4. If you need to drill down further to look at correlated logs across your entire app, it’s just one click from your Dashboard.

CLICK HERE To receive Signals from Datadog Triggered Monitors