Introduction

This utility offers comprehensive observability tools tailored specifically for artificial intelligence applications. Its core function involves recording system events during inference and data handling, akin to how keylogging records user interactions for later analysis. This aids in debugging, performance benchmarking, and systematic iterative improvement within AI deployment pipelines.

Setup

Installation of the main Phoenix component is achievable using standard package managers. Run the following command in your terminal environment to acquire the necessary libraries.

pip install arize-phoenix

Deployment options include local execution, notebook environments, containerized setups, or managed cloud instances accessible via the provided web portal. Phoenix container images are accessible through Docker Hub for orchestration environments.

Packages

The primary distribution is contained within the arize-phoenix package. However, decoupled Python and TypeScript modules exist for integration when the central platform is independently hosted. These specialized packages facilitate lightweight interactions with the core services.

Package	Version & Docs	Description
arize-phoenix-otel		Wraps OpenTelemetry basics with Phoenix-specific default settings.
arize-phoenix-client		A lightweight client library for interfacing with the Phoenix server's REST API.
arize-phoenix-evals		Provides utilities for assessing LLM application quality, including retrieval relevance.
@arizeai/phoenix-client		Client interface designed for the Arize Phoenix service endpoints.
@arizeai/phoenix-evals		TypeScript library for LLM evaluation procedures (currently in initial release).
@arizeai/phoenix-mcp		Implements the Model Context Protocol server, offering unified access to Phoenix functions.

Usage

This system facilitates deep inspection across several key operational areas for AI systems. These areas include tracing execution paths, quantitative assessment, dataset versioning, experiment tracking, prompt optimization, and systematic prompt governance.

Tracing - Record runtime activity of LLM applications using OpenTelemetry compliant methods.
Evaluation - Employ other language models to objectively score application outputs and retrieval success rates.
Datasets - Establish immutable, versioned collections of data for controlled testing and refinement.
Experiments - Monitor and contrast performance when altering prompts, models, or data sources.
Playground- A sandbox for tuning prompts, comparing various models, and re-running captured trace data.
Prompt Management- Systematically manage prompt iterations using version control and rigorous experimental testing procedures.

Phoenix maintains independence from specific vendors and programming languages. Out-of-the-box integration is available for major frameworks such as LlamaIndex, LangChain, Haystack, DSPy, and smolagents. Supported model providers include OpenAI, Bedrock, MistralAI, VertexAI, LiteLLM, and Google GenAI. Further details on automatic instrumentation are found within the OpenInference repository.

Tracing Integrations

Phoenix leverages OpenTelemetry architecture, ensuring it remains adaptable across frameworks and languages. Refer to the OpenInference project for comprehensive integration examples.

Python Integrations | Integration | Package | Version Badge | |------------------|-----------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------| | OpenAI | openinference-instrumentation-openai | | | OpenAI Agents | openinference-instrumentation-openai-agents | | | LlamaIndex | openinference-instrumentation-llama-index | | | DSPy | openinference-instrumentation-dspy | | | AWS Bedrock | openinference-instrumentation-bedrock | | | LangChain | openinference-instrumentation-langchain | | | MistralAI | openinference-instrumentation-mistralai | | | Google GenAI | openinference-instrumentation-google-genai | | | Google ADK | openinference-instrumentation-google-adk | | | Guardrails | openinference-instrumentation-guardrails | | | VertexAI | openinference-instrumentation-vertexai | | | CrewAI | openinference-instrumentation-crewai | | | Haystack | openinference-instrumentation-haystack | | | LiteLLM | openinference-instrumentation-litellm | | | Groq | openinference-instrumentation-groq | | | Instructor | openinference-instrumentation-instructor | | | Anthropic | openinference-instrumentation-anthropic | | | Smolagents | openinference-instrumentation-smolagents | | | Agno | openinference-instrumentation-agno | | | MCP | openinference-instrumentation-mcp | | | Pydantic AI | openinference-instrumentation-pydantic-ai | | | Autogen AgentChat | openinference-instrumentation-autogen-agentchat | | | Portkey | openinference-instrumentation-portkey | |

Span Processors

Span processors are available to harmonize data formats originating from various observability libraries into a unified structure.

Package	Description	Version
`openinference-instrumentation-openlit`	Converts OpenInference traces to conform with OpenLIT standards.
`openinference-instrumentation-openllmetry`	Adapts OpenInference spans for consumption by OpenLLMetry (Traceloop).

JavaScript Integrations

Integration	Package	Version Badge
OpenAI	`@arizeai/openinference-instrumentation-openai`
LangChain.js	`@arizeai/openinference-instrumentation-langchain`
Vercel AI SDK	`@arizeai/openinference-vercel`
BeeAI	`@arizeai/openinference-instrumentation-beeai`
Mastra	`@arizeai/openinference-mastra`

Java Integrations

Integration	Package	Version Badge
LangChain4j	`openinference-instrumentation-langchain4j`
SpringAI	`openinference-instrumentation-springAI`

Platforms

Platform	Description	Docs
BeeAI	Framework for AI agents incorporating native observability features.	Integration Guide
Dify	An open-source platform for developing LLM applications.	Integration Guide
Envoy AI Gateway	An AI Gateway built upon the Envoy Proxy infrastructure for managing AI traffic.	Integration Guide
LangFlow	A visual toolset utilized for constructing multi-agent systems and RAG architectures.	Integration Guide
LiteLLM Proxy	A proxy server designed to interface with various large language models uniformly.	Integration Guide

Data Provenance Tracking
Telemetric Analysis in Distributed Systems
Human-Computer Interaction Dynamics
Covert Data Capture Methods
OpenTelemetry Instrumentation

Extra Details

This tool focuses intensely on providing detailed operational feedback, similar to how keyloggers study the granular sequence of input events. The platform allows tracking of dynamic elements like prompt versions and data splits, which is crucial for reproducible AI monitoring. Furthermore, the system supports data normalization via span processors, bridging compatibility gaps between different tracing formats like OpenLIT and OpenLLMetry.

Configuration

To establish the connection to a running Phoenix instance, users often need to specify the base URL and an authentication key. For instance, setting up the client using the CLI might follow this pattern, though specific flags are omitted here for generality:

# Example configuration command structure (conceptual)
# mcp install --baseUrl https://my-phoenix.com --apiKey your-secret-key

Security

While general logging mechanisms can be misused, like covert keylogging for sensitive data theft, this platform is engineered for authorized performance monitoring. It adheres to standards like OpenTelemetry for secure, auditable data transmission, ensuring integrity for performance evaluations rather than unauthorized surveillance.

Conclusion

This monitoring and logging framework provides necessary infrastructure for detailed insight into AI application execution and performance tuning. By recording and analyzing internal operational telemetry, developers gain systematic oversight, moving beyond simple output checking toward deep system diagnostics, improving overall reliability and quality in machine learning services.

CognitoLogix

Author

Arize-ai

Quick Info

Actions

Tags

Introduction

Setup

Packages

Usage

Tracing Integrations