Author

Yasir Ekinci

1 article

Introducing o11y-bench: an open benchmark for AI agents running observability workflows

Grafana has introduced o11y-bench, an open-source benchmark designed to evaluate the effectiveness of AI agents performing complex observability tasks, such as incident investigati…

Yasir Ekinci

Apr 21, 2026
