📊

Observability

Monitoring and debugging tools for agent apps

7 projects

AgentOps

AgentOps is an observability platform for AI agents, providing monitoring, debugging, and evaluation to help developers optimize agent performance.

observabilitymonitoringdebugging +1

Arize Phoenix

8.9k · Jupyter Notebook

Active

Phoenix is an open-source observability and evaluation tool for LLM and agent applications, supporting online tracing and offline diagnosis.

observabilityevaltracing +1

DeepEval

14.1k · Python

Active

DeepEval is an open-source evaluation framework for LLM applications. It provides rich evaluation metrics and tools, supporting unit testing and integration testing to help developers build reliable LLM applications.

llmevaluationtesting +1

Ragas

12.9k · Python

Active

Ragas is a framework for evaluating RAG (Retrieval Augmented Generation) systems. It provides various evaluation metrics including faithfulness, answer relevance, context precision, helping developers optimize RAG application performance.

ragevaluationllm +1

Helicone

5.2k · TypeScript

Active

Helicone is an open-source proxy and observability platform for LLM applications, offering request tracing, caching, and cost analytics.

observabilityproxyanalytics +1

Langfuse

23.1k · TypeScript

Active

Langfuse is an open-source observability platform for LLM applications, supporting tracing, evaluation, prompt versioning, and cost analytics.

observabilitytracingllm +1

TruLens

3.2k · Python

Active

TruLens is an open-source tool for evaluating and tracking LLM apps. It provides specialized evaluation for RAG applications including context relevance, groundedness, and answer relevance.

llmevaluationobservability +1