logo
Free, unlimited AI code reviews that run on commit
git-lrc git-lrc GitHub Install Now We'd appreciate a star git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt git-lrc - Free, unlimited AI code reviews that run on commit | Product Hunt

phoenix

Provides advanced observability and evaluation tools for AI applications, facilitating performance tracking, dataset management, and prompt engineering. Integrates with various frameworks and large language model providers for seamless operation.

Author

phoenix logo

Arize-ai

Other

Quick Info

GitHub GitHub Stars 7158
NPM Weekly Downloads 0
Tools 1
Last Updated 2026-02-19

Tags

aiarizeanalyticsarize aianalytics arizeai phoenix

phoenix banner

CA7LwkvHqaIJ9pLI6Lmy1BigDy2EV8tjdzh_8XB6MGSLKH4INsZXDJ8MGhIBK_Mrpo_GnRIBO_MrZjFAFxoTNBwCvj6u4qvSZJiM3iNX4yvmHoA9Sh4PF0QAzBEBMEcEwBwRAHNEAMwRAXBGKfUfr5hKvglRfO4AAAAASUVORK5CYII_labelColor_grey_color_blue_logoColor_white_label v1_message_Community_logo_slack_labelColor_grey_color_blue_logoColor_white_label phoenix_blue_svg_color_blue_labelColor_gray_logo_bluesky ArizePhoenix_blue_svg_color_blue_labelColor_gray_logo_x arize_phoenix_color_blue arize_phoenix_svg_color_blue arize_phoenix phoenix_sort_semver_logo_docker_label_image_color_blue Helm_blue_style_flat_logo_helm_labelColor_grey badge_mcpx_dev_status_on Add Arize Phoenix MCP server to Cursor

Phoenix is an open-source AI observability platform designed for experimentation, evaluation, and troubleshooting. It provides:

  • Tracing - Trace your LLM application's runtime using OpenTelemetry-based instrumentation.
  • Evaluation - Leverage LLMs to benchmark your application's performance using response and retrieval evals.
  • Datasets - Create versioned datasets of examples for experimentation, evaluation, and fine-tuning.
  • Experiments - Track and evaluate changes to prompts, LLMs, and retrieval.
  • Playground- Optimize prompts, compare models, adjust parameters, and replay traced LLM calls.
  • Prompt Management- Manage and test prompt changes systematically using version control, tagging, and experimentation.

Phoenix is vendor and language agnostic with out-of-the-box support for popular frameworks (🦙LlamaIndex, 🦜⛓LangChain, Haystack, 🧩DSPy, 🤗smolagents) and LLM providers (OpenAI, Bedrock, MistralAI, VertexAI, LiteLLM, Google GenAI and more). For details on auto-instrumentation, check out the OpenInference project.

Phoenix runs practically anywhere, including your local machine, a Jupyter notebook, a containerized deployment, or in the cloud.

Installation

Install Phoenix via pip or conda

pip install arize-phoenix

Phoenix container images are available via Docker Hub and can be deployed using Docker or Kubernetes. Arize AI also provides cloud instances at app.phoenix.arize.com.

Packages

The arize-phoenix package includes the entire Phoenix platfom. However if you have deployed the Phoenix platform, there are light-weight Python sub-packages and TypeScript packages that can be used in conjunction with the platfrom.

Subpackages

Package Version & Docs Description
arize-phoenix-otel PyPI Version Docs Provides a lightweight wrapper around OpenTelemetry primitives with Phoenix-aware defaults
arize-phoenix-client PyPI Version Docs Lightweight client for interacting with the Phoenix server via its OpenAPI REST interface
arize-phoenix-evals PyPI Version Docs Tooling to evaluate LLM applications including RAG relevance, answer relevance, and more
@arizeai/phoenix-client NPM Version Docs Client for the Arize Phoenix API
@arizeai/phoenix-evals NPM Version Docs TypeScript evaluation library for LLM applications (alpha release)
@arizeai/phoenix-mcp NPM Version Docs MCP server implementation for Arize Phoenix providing unified interface to Phoenix's capabilities

Tracing Integrations

Phoenix is built on top of OpenTelemetry and is vendor, language, and framework agnostic. For details about tracing integrations and example applications, see the OpenInference project.

Python Integrations | Integration | Package | Version Badge | |------------------|-----------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------| | OpenAI | openinference-instrumentation-openai | PyPI Version | | OpenAI Agents | openinference-instrumentation-openai-agents | PyPI Version | | LlamaIndex | openinference-instrumentation-llama-index | PyPI Version | | DSPy | openinference-instrumentation-dspy | PyPI Version | | AWS Bedrock | openinference-instrumentation-bedrock | PyPI Version | | LangChain | openinference-instrumentation-langchain | PyPI Version | | MistralAI | openinference-instrumentation-mistralai | PyPI Version | | Google GenAI | openinference-instrumentation-google-genai | PyPI Version | | Google ADK | openinference-instrumentation-google-adk | PyPI Version | | Guardrails | openinference-instrumentation-guardrails | PyPI Version | | VertexAI | openinference-instrumentation-vertexai | PyPI Version | | CrewAI | openinference-instrumentation-crewai | PyPI Version | | Haystack | openinference-instrumentation-haystack | PyPI Version | | LiteLLM | openinference-instrumentation-litellm | PyPI Version | | Groq | openinference-instrumentation-groq | PyPI Version | | Instructor | openinference-instrumentation-instructor | PyPI Version | | Anthropic | openinference-instrumentation-anthropic | PyPI Version | | Smolagents | openinference-instrumentation-smolagents | PyPI Version | | Agno | openinference-instrumentation-agno | PyPI Version | | MCP | openinference-instrumentation-mcp | PyPI Version | | Pydantic AI | openinference-instrumentation-pydantic-ai | PyPI Version | | Autogen AgentChat | openinference-instrumentation-autogen-agentchat | PyPI Version | | Portkey | openinference-instrumentation-portkey | PyPI Version |

Span Processors

Normalize and convert data across other instrumentation libraries by adding span processors that unify data.

Package Description Version
openinference-instrumentation-openlit OpenInference Span Processor for OpenLIT traces. PyPI Version
openinference-instrumentation-openllmetry OpenInference Span Processor for OpenLLMetry (Traceloop) traces. PyPI Version

JavaScript Integrations

Integration Package Version Badge
OpenAI @arizeai/openinference-instrumentation-openai NPM Version
LangChain.js @arizeai/openinference-instrumentation-langchain NPM Version
Vercel AI SDK @arizeai/openinference-vercel NPM Version
BeeAI @arizeai/openinference-instrumentation-beeai NPM Version
Mastra @arizeai/openinference-mastra NPM Version

Java Integrations

Integration Package Version Badge
LangChain4j openinference-instrumentation-langchain4j Maven Central
SpringAI openinference-instrumentation-springAI Maven Central

Platforms

Platform Description Docs
BeeAI AI agent framework with built-in observability Integration Guide
Dify Open-source LLM app development platform Integration Guide
Envoy AI Gateway AI Gateway built on Envoy Proxy for AI workloads Integration Guide
LangFlow Visual framework for building multi-agent and RAG applications Integration Guide
LiteLLM Proxy Proxy server for LLMs Integration Guide

Community

Join our community to connect with thousands of AI builders.

Breaking Changes

See the migration guide for a list of breaking changes.

Copyright 2025 Arize AI, Inc. All Rights Reserved.

Portions of this code are patent protected by one or more U.S. Patents. See the IP_NOTICE.

This software is licensed under the terms of the Elastic License 2.0 (ELv2). See LICENSE.

See Also

`