Link

Third Party Integrations

This section contains information on various integrations that are available for Zebrium including PagerDuty and Slack. You can also build your own using our Zebrium Webhook support.

Zebrium’s Autonomous Incident & Root Cause Detection works in two modes:

  1. It can autonomously detect and create incident alerts by applying machine learning to an incoming stream of logs and metrics. The incident alerts can be consumed via custom webhook, Slack or email.
  2. Zebrium can also consume an external signal that indicates an incident HAS occurred, and in response it will create an incident report consisting of correlated log and metric anomalies, including likely root cause and symptoms surrounding the incident.

zebrium

A special class of integrations relates to this second mode, including integrations with PagerDuty and Slack.

PagerDuty + Zebrium Integration

Zebrium’s integration with PagerDuty automatically adds root cause to a PagerDuty incident triggered by any existing monitoring, APM, log manager, help desk tools, etc.

  • Each incident is augmented with a clear set of events and charts showing relevant anomalies in logs and metrics, including likely root cause and symptoms.
  • This means faster MTTR and less hunting for root cause.

How it Works

  1. Any existing monitoring, APM, logger, help desk tool raises an alarm.
  2. Through an existing integration with PagerDuty, an incident is created.
  3. At that same instant, PagerDuty automatically calls an outbound webhook to Zebrium with all the incident details.
  4. Zebrium correlates those incident details with its Autonomous Incident Detection and Root Cause by looking across logs and metrics.
  5. The PagerDuty incident is updated with Zebrium Incident details and likely root cause via the PagerDuty API.
  6. If you need to drill down further to logs or metrics, it’s just one click from your PagerDuty Incident.

Slack + Zebrium Integration

If you’re using a Slack workspace to collaborate with colleagues on incident resolution, you can use Zebrium to accelerate incident root cause and resolution.

  • Each incident is augmented with a clear set of events and charts showing relevant anomalies in logs and metrics, including likely root cause and symptoms.
  • This means faster MTTR and less hunting for root cause.

How it Works

  1. Install Zebrium’s custom Slack application in your workspace.
  2. Just type the /zebrium analyze command (or get your Bot to do it) to notify Zebrium about an incident and ask for an incident report. We’ll pull together all the relevant anomalies, logs, metrics and provide near instant drill down capabilities to get you to resolution fast!

Table of contents