MCP servers that query logs, metrics & traces
Let your AI investigate incidents across Datadog, Grafana, Prometheus & more.
10 servers · Last updated June 17, 2026
TL;DR: These servers connect your agent to observability data — querying logs, metrics, traces and alerts so it can help investigate incidents in plain English. They're the foundation of an on-call/SRE assistant. Coverage varies by backend, so match the server to the stack you actually run.
Bottom line: if you only try one, PostHog MCP Server (Official Remote) is the most popular, verified option for this (350★). 9 more compared below.
Compare 10 servers
| Server | Transport | Auth | Verified | Stars | Tools for this |
|---|---|---|---|---|---|
| PostHog MCP Server (Official Remote) | Local (stdio) | API key | 350 | insight-query, query-error-tracking-issues-list, query-llm-traces-list | |
| Prometheus MCP Server | Local (stdio) | No auth | 340 | execute_query, execute_range_query, list_metrics +1 | |
| Datadog MCP Server (Official Remote) | Remote (HTTP) | OAuth | 250 | search_datadog_logs, get_logs / query log analytics, query_timeseries_data +4 | |
| Honeycomb MCP Server | Local (stdio) | API key | 250 | run_query, get_trace_link | |
| Dynatrace MCP Server | Local (stdio) | OAuth | 200 | create_dynatrace_notebook | |
| VictoriaMetrics MCP Server | Local (stdio) | API key | 130 | query, query_range, metrics +4 | |
| Axiom MCP Server (Official Remote) | Local (stdio) | OAuth | 130 | queryApl | |
| Grafana Tempo MCP Server | Local (stdio) | No auth | — | 90 | tempo_query |
| Last9 MCP Server | Local (stdio) | API key | 90 | get_logs, get_service_logs, get_traces +3 | |
| New Relic MCP Server | Local (stdio) | API key | 60 | query_nrql, nerdgraph_query, list_alert_policies +1 |
The servers
Official PostHog server: product analytics, feature flags, experiments, error tracking and SQL.
Run PromQL queries and analyze Prometheus metrics from any MCP client.
Datadog's managed remote server: query logs, metrics, traces, monitors and incidents.
Honeycomb observability via AI: query datasets, alerts and boards (community + official).
Official Dynatrace server: run DQL over logs, events, spans and metrics.
Official VictoriaMetrics server: MetricsQL/PromQL queries plus embedded docs search.
Query Axiom logs and events with APL via Axiom's hosted remote MCP server.
Query distributed traces in Grafana Tempo with TraceQL (archived; Tempo now has embedded MCP).
Connect AI to Last9 production observability: logs, metrics, traces, exceptions and alerts.
Query New Relic via NerdGraph: entity search, NRQL, deployments and infrastructure.
Use these in a stack
FAQ
Which observability tools have MCP servers?
Datadog, Grafana, Prometheus, New Relic, Dynatrace, Honeycomb, Sentry and more — pick the one matching your existing stack.
Can an AI actually triage incidents with these?
It can fetch and correlate logs/metrics/traces and summarize, which speeds triage. Keep humans in the loop for remediation actions.