A comprehensive AI model proxy and discovery platform that provides unified access to multiple AI providers including OpenAI, Anthropic, Azure OpenAI, Cohere, and more.
- Automatic Model Discovery: Real-time discovery and cataloging of AI models from all configured providers
- High-Performance Proxy: Intelligent routing with circuit breakers, caching, and connection pooling
- Comprehensive Monitoring: Prometheus metrics, health checks, and detailed analytics
- Chaos Engineering: Fault injection and resilience testing
- Cost Optimization: Context condensation and smart caching to reduce API costs
- Enterprise Security: Rate limiting, authentication, and audit logging
```bash
# Clone the repository
git clone https://github.com/your-org/proxyapi.git
cd proxyapi

# Start with Docker Compose
docker-compose up -d

# Access the web interface
open http://localhost:8000
```

```bash
# Install dependencies
pip install -r requirements.txt

# Set environment variables
export OPENAI_API_KEY="your-openai-key"
export API_KEY="your-proxy-key"

# Start the application
python main.py
```

That's it! Your proxy API is now running at http://localhost:8000.
- Python 3.11+
- Docker & Docker Compose (recommended)
- 2GB RAM minimum, 4GB recommended
```bash
# Clone repository
git clone https://github.com/your-org/proxyapi.git
cd proxyapi

# Configure environment
cp .env.example .env
# Edit .env with your API keys

# Start services
docker-compose up -d
```

```bash
# Install Python dependencies
pip install -r requirements.txt

# For enhanced performance (optional; quoted so the extra doesn't glob in zsh)
pip install "httpx[http2]" aiofiles watchdog psutil

# Configure providers
cp config.yaml.example config.yaml
# Edit config.yaml with your API keys

# Start application
python main_dynamic.py
```

Create a `config.yaml` file with your provider configurations:
```yaml
providers:
  - name: "openai"
    type: "openai"
    api_key_env: "OPENAI_API_KEY"
    models:
      - "gpt-3.5-turbo"
      - "gpt-4"
    enabled: true

  - name: "anthropic"
    type: "anthropic"
    api_key_env: "ANTHROPIC_API_KEY"
    models:
      - "claude-3-haiku"
      - "claude-3-sonnet"
    enabled: true
```
```python
import requests

# Make a chat completion request
response = requests.post(
    "http://localhost:8000/v1/chat/completions",
    headers={
        "Content-Type": "application/json",
        "X-API-Key": "your-proxy-key"
    },
    json={
        "model": "gpt-4",
        "messages": [
            {"role": "user", "content": "Hello, how are you?"}
        ]
    }
)
print(response.json())
```
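Assuming the proxy returns the OpenAI-compatible response shape, the assistant's reply can be pulled out of the JSON like this (the payload below is an illustrative sample, not a live response):

```python
# Illustrative OpenAI-compatible completion payload.
sample = {
    "id": "chatcmpl-123",
    "model": "gpt-4",
    "choices": [
        {"index": 0,
         "message": {"role": "assistant", "content": "Hello! I'm doing well."},
         "finish_reason": "stop"}
    ],
    "usage": {"prompt_tokens": 12, "completion_tokens": 8, "total_tokens": 20},
}

def assistant_text(payload):
    """Return the first choice's message content from a completion payload."""
    return payload["choices"][0]["message"]["content"]

print(assistant_text(sample))  # -> Hello! I'm doing well.
```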
```python
import requests

# Get all available models
response = requests.get("http://localhost:8000/v1/models")
models = response.json()
for model in models["data"]:
    print(f"{model['id']}: {model['description']}")
```
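The listing can also be filtered client-side. The sketch below assumes capability fields such as `supports_vision` and `cost_per_token` on each model entry (inferred from the `/v1/models/search` parameters later in this README, not a documented schema):

```python
# Sample /v1/models-style listing; field names are assumptions.
models = {
    "data": [
        {"id": "gpt-4", "supports_vision": False, "cost_per_token": 0.00003},
        {"id": "gpt-4-vision", "supports_vision": True, "cost_per_token": 0.00003},
        {"id": "claude-3-haiku", "supports_vision": True, "cost_per_token": 0.00000025},
    ]
}

def find_models(listing, supports_vision=None, max_cost=None):
    """Return IDs of models matching the given capability filters."""
    hits = []
    for m in listing["data"]:
        if supports_vision is not None and m.get("supports_vision") != supports_vision:
            continue
        if max_cost is not None and m.get("cost_per_token", 0.0) > max_cost:
            continue
        hits.append(m["id"])
    return hits

print(find_models(models, supports_vision=True, max_cost=0.000001))  # -> ['claude-3-haiku']
```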
```bash
# Quick health check
curl http://localhost:8000/health

# Detailed system status
curl http://localhost:8000/v1/health
```

- Getting Started Guide - Step-by-step tutorial for first-time users
- Quick Start Guide - Rapid setup and basic usage
- Installation Guide - Detailed installation instructions
- API Reference - Complete API documentation
- Integration Guide - Integration patterns and best practices
- Configuration Guide - Advanced configuration options
- Model Discovery Guide - Using the model discovery system
- Performance Guide - Optimization and performance tuning
- Monitoring Guide - Metrics, logging, and observability
- Security Guide - Security features and best practices
- Deployment Guide - Production deployment strategies
- Load Testing Guide - Performance testing and chaos engineering
- Troubleshooting Guide - Common issues and solutions
- Contributing Guide - How to contribute to the project
- Architecture Overview - System architecture and design
- Testing Guide - Testing strategies and practices
Automatically discovers and catalogs available AI models from all configured providers with real-time pricing and capabilities.
```bash
# Refresh model cache
curl -X POST http://localhost:8000/v1/models/refresh

# Search models by capabilities
curl "http://localhost:8000/v1/models/search?supports_vision=true&max_cost=0.01"
```

Automatically summarizes long contexts to reduce API costs and improve performance.
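Condensation is triggered server-side, but a rough client-side token estimate helps reason about when it might kick in. Both the 4-characters-per-token heuristic and the 8,192 threshold below are assumptions for illustration, not ProxyAPI settings:

```python
def estimate_tokens(messages):
    """Crude token estimate: roughly 4 characters per token for English text."""
    chars = sum(len(m.get("content", "")) for m in messages)
    return chars // 4

# Illustrative threshold; the real condensation trigger is configured server-side.
CONDENSE_THRESHOLD = 8192

messages = [{"role": "user", "content": "Very long text..." * 2000}]
needs_condensing = estimate_tokens(messages) > CONDENSE_THRESHOLD
print(needs_condensing)  # -> True
```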
```python
import requests

# Long context is automatically handled
messages = [{"role": "user", "content": "Very long text..." * 1000}]

# The API automatically condenses the context if needed
response = requests.post("http://localhost:8000/v1/chat/completions", json={
    "model": "gpt-4",
    "messages": messages
})
```

Comprehensive monitoring with Prometheus metrics and health checks.
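When scripting against the metrics endpoint, the Prometheus text exposition format is line-oriented and easy to parse. A minimal sketch follows; the metric names are illustrative, not ProxyAPI's actual metrics, and the parser ignores labels containing spaces:

```python
def parse_prometheus(text):
    """Parse simple `name{labels} value` lines into a dict; skips comments."""
    metrics = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        name, _, value = line.rpartition(" ")
        metrics[name] = float(value)
    return metrics

# Sample exposition text (illustrative metric names).
sample_metrics = """\
# HELP proxy_requests_total Total requests handled.
# TYPE proxy_requests_total counter
proxy_requests_total{provider="openai"} 1027
proxy_requests_total{provider="anthropic"} 441
"""

print(parse_prometheus(sample_metrics))
```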
```bash
# Get metrics
curl http://localhost:8000/metrics

# Prometheus format
curl http://localhost:8000/metrics/prometheus
```

We welcome contributions! Please see our Contributing Guide for details.
```bash
# Clone repository
git clone https://github.com/your-org/proxyapi.git
cd proxyapi

# Install development dependencies
pip install -r requirements-dev.txt

# Run tests
pytest tests/

# Run linting
flake8 src/
black src/
mypy src/
```

- Documentation: Comprehensive docs in the docs/ directory
- Issues: GitHub Issues
- Discussions: GitHub Discussions
- Email: support@proxyapi.com
- Discord: Join our Discord
- Twitter: @ProxyAPI
- Blog: proxyapi.com/blog
This project is licensed under the MIT License - see the LICENSE file for details.
- OpenAI for GPT models and API
- Anthropic for Claude models
- Microsoft for Azure OpenAI
- FastAPI for the excellent web framework
- All contributors who helped make this possible
Star this repository if you find it useful!