@inference-gateway

Inference Gateway

An open-source, cloud-native, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers

📖 Documentation · 🚀 Getting Started · 💬 Discussions · 🐛 Issues


🌐 What is Inference Gateway?

Inference Gateway is a proxy server that provides a unified API to interact with multiple large language model (LLM) providers — from local solutions like Ollama to major cloud providers like OpenAI, Anthropic, Groq, Cohere, Cloudflare, and DeepSeek.

Stop managing multiple SDKs and API keys. Route all your LLM traffic through a single, production-ready gateway.

# One endpoint. Every provider.
curl -X POST http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "openai/gpt-4o", "messages": [{"role": "user", "content": "Hello!"}]}'
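The same request can be built from a script. The following is a minimal sketch using only the Python standard library; it assumes the gateway is listening on `localhost:8080` as in the curl command above, and the helper names `build_chat_request` and `send` are hypothetical, not part of any official SDK.

```python
import json
import urllib.request

# Endpoint taken from the curl example; adjust it if the gateway runs elsewhere.
GATEWAY_URL = "http://localhost:8080/v1/chat/completions"


def build_chat_request(model: str, prompt: str, url: str = GATEWAY_URL) -> urllib.request.Request:
    """Build (but do not send) a chat completion request for the gateway."""
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response (needs a running gateway)."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

With a gateway running, `send(build_chat_request("openai/gpt-4o", "Hello!"))` issues the same call as the curl command. Judging from the `openai/gpt-4o` example, the provider appears to be selected by the prefix of the `model` field, so switching providers should only require a different model string; treat that as an assumption and check the documentation for the exact naming.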

✨ Key Features

| Feature | Description |
| --- | --- |
| 🔀 Unified API | One OpenAI-compatible endpoint for all LLM providers |
| 🔌 MCP Integration | Native Model Context Protocol support for automatic tool discovery |
| 🤖 A2A Protocol | Agent-to-Agent coordination across specialized agents |
| 🌊 Streaming | Real-time token streaming from all supported providers |
| ☸️ Kubernetes Ready | First-class K8s support with Operator and HPA scaling |
| 📊 Observability | OpenTelemetry integration for monitoring and tracing |
| 🔒 Privacy First | Self-hosted, zero data collection, MIT licensed |
| 🌿 Lightweight | ~10.8MB binary with minimal resource footprint |
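To illustrate the Streaming feature, here is a hedged sketch of how a client might consume streamed tokens. It assumes the gateway emits the OpenAI-style server-sent events format (`data: {...}` lines ending with `data: [DONE]`), which the "OpenAI-compatible" description suggests but this page does not spell out. The function name `iter_stream_text` is hypothetical.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield text fragments from OpenAI-style streaming lines (assumed format)."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separator lines and non-data fields
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel in the OpenAI format
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            # A delta may carry only a role, or null content, so guard both cases.
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text
```

The function takes any iterable of decoded lines, so it can be fed from an HTTP response read line by line or from a recorded transcript when testing offline.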

🏗️ Ecosystem

Core

| Repository | Description |
| --- | --- |
| inference-gateway | The core gateway server |
| operator | Kubernetes Operator for lifecycle management |
| cli | Agentic CLI assistant with project context awareness |
| adl-cli | Scaffold and manage A2A-powered enterprise agents |

SDKs

| Repository | Language |
| --- | --- |
| sdk | Go |
| rust-sdk | Rust |

A2A Agents

| Repository | Description |
| --- | --- |
| adk | Agent Development Kit for building A2A-compatible agents |
| google-calendar-agent | Google Calendar scheduling & automation |
| browser-agent | Browser automation via Playwright |
| documentation-agent | Context7-style documentation access for agents |

🚀 Quick Start

# Run with Docker
docker run -p 8080:8080 \
  -e OPENAI_API_KEY=your-key \
  ghcr.io/inference-gateway/inference-gateway:latest

# Or install the CLI
curl -fsSL https://raw.githubusercontent.com/inference-gateway/cli/main/install.sh | bash
infer init && infer chat

👉 Full setup guide: docs.inference-gateway.com/getting-started
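When the Docker command above is started from a script, it can help to wait until the published port accepts connections before sending traffic. The following is a minimal sketch; `wait_for_port` is a hypothetical helper, and an open TCP port only shows that something is listening on 8080, not that the gateway is fully initialized.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Return True once host:port accepts a TCP connection, False after timeout seconds."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Connection refused or timed out: retry until the deadline passes.
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)
```

For the Quick Start setup, `wait_for_port("localhost", 8080)` would block for up to 30 seconds until the container's port is reachable.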


🤝 Contributing

We welcome contributions of all kinds — bug reports, feature requests, documentation improvements, and code!


Released under the MIT License · Built with ❤️ in Go

Repositories

Showing 10 of 30 repositories
  • cli Public

    An agentic command-line assistant that writes code, understands project context, and uses tools to perform real tasks.

    Go · MIT · Updated Mar 9, 2026
  • docs Public

    Extensive documentation of the inference-gateway

    MDX · MIT · Updated Mar 7, 2026
  • schemas Public

    This repository contains the project's schemas, such as MCP, A2A, and OpenAPI

    JavaScript · MIT · Updated Mar 7, 2026
  • inference-gateway Public

    An open-source, cloud-native, high-performance gateway unifying multiple LLM providers, from local solutions like Ollama to major cloud providers such as OpenAI, Groq, Cohere, Anthropic, Cloudflare and DeepSeek.

    Go · MIT · Updated Mar 7, 2026
  • .github Public

    Updated Mar 6, 2026
  • operator Public

    This project provides a Kubernetes Operator for managing the lifecycle of the inference-gateway and its related components. It simplifies deployment, configuration, and scaling of the gateway within Kubernetes clusters, enabling seamless integration of inference workflows.

    Go · MIT · Updated Mar 2, 2026
  • browser-agent Public

    A2A agent server for browser automation and web testing using Playwright

    Go · MIT · Updated Jan 27, 2026
  • google-calendar-agent Public

    A2A agent server enabling Google Calendar scheduling, retrieval, and automation

    Go · MIT · Updated Jan 27, 2026
  • documentation-agent Public

    A2A agent server that provides Context7-style documentation capabilities for your agents

    Updated Jan 27, 2026
  • adl-cli Public

    A command-line tool to scaffold and manage enterprise-ready AI Agents powered by the A2A (Agent-to-Agent) protocol

    Go · MIT · Updated Jan 27, 2026
