X Tutup
Skip to content
@promptfoo

promptfoo

Test your LLM apps
Promptfoo - AI security testing platform with detective red panda logo

Ship agents, not vulnerabilities

WebsiteDocsBlogDiscord

GitHub stars npm downloads License: MIT

AI security testing for LLMs, agents, and RAG systems

Trusted by 25% of the Fortune 500 and 350K+ developers


Important

Promptfoo is joining OpenAI. Promptfoo remains open source under MIT, community contributions remain welcome, and we will continue supporting multiple providers and models. Read the announcement →


🚀 Quick Start

npx promptfoo@latest init
npx promptfoo@latest eval
npx promptfoo@latest view

Get Started → · Enterprise →


🛠️ What We Do

Security Testing

  • Red Teaming — Automated vulnerability discovery with 100+ attack plugins
  • Code Scanning — Detect LLM security risks in your IDE and CI/CD

Evaluations


🔒 Security & Privacy

What we detect:

  • Prompt injections and jailbreaks
  • PII and sensitive data leaks
  • Hallucinations and policy violations
  • Tool misuse and adversarial attacks

Compliance: SOC 2 Type II · ISO 27001 · HIPAA

Data model:

  • Evals — 100% local, API keys never leave your machine
  • Red teaming — Your target runs locally; attack generation via our API or bring your own keys

📦 Projects

Repository Description
promptfoo Test prompts, agents, and RAGs. Red teaming and vulnerability scanning for LLMs.
promptfoo-action GitHub Action for CI/CD security testing
evil-mcp-server Red team testing for Model Context Protocol servers
modelaudit Static scanner that detects malicious code, backdoors, and vulnerabilities in ML model files
promptfoo-python Python wrapper for promptfoo
js-rouge JavaScript ROUGE metrics for summarization evaluation

👥 Community

Connect: Discord · X/Twitter · Bluesky · LinkedIn

Contribute: Contributing Guide · Good First Issues · Report Issues

Learn: LLM Vulnerability Database · Security Research Blog

Popular repositories Loading

  1. promptfoo promptfoo Public

    Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with co…

    TypeScript 11.5k 1.1k

  2. promptfoo-action promptfoo-action Public

    The GitHub Action for Promptfoo. Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. S…

    TypeScript 45 16

  3. modelaudit modelaudit Public

    Security scanner for AI/ML model files. Detects malicious code, backdoors, and vulnerabilities before deployment

    Python 31 1

  4. evil-mcp-server evil-mcp-server Public

    An evil MCP server used for redteam testing

    TypeScript 25 4

  5. crabcode crabcode Public

    Generic tmux-based workspace manager for multi-repo development. Lightning-fast dev productivity tool.

    Shell 10

  6. mini-foo mini-foo Public

    Mini promptfoo used for interviews

    TypeScript 5 2

Repositories

Showing 10 of 20 repositories
  • promptfoo Public

    Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

    promptfoo/promptfoo’s past year of commit activity
    TypeScript 11,520 MIT 1,056 82 164 Updated Mar 10, 2026
  • modelaudit Public

    Security scanner for AI/ML model files. Detects malicious code, backdoors, and vulnerabilities before deployment

    promptfoo/modelaudit’s past year of commit activity
    Python 31 MIT 1 1 (1 issue needs help) 7 Updated Mar 10, 2026
  • .github Public
    promptfoo/.github’s past year of commit activity
    2 MIT 0 0 0 Updated Mar 9, 2026
  • promptfoo-action Public

    The GitHub Action for Promptfoo. Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command line and CI/CD integration.

    promptfoo/promptfoo-action’s past year of commit activity
    TypeScript 45 MIT 15 2 1 Updated Mar 9, 2026
  • mcp-agent-provider Public

    A promptfoo custom provider to test MCP servers with our evil mcp server

    promptfoo/mcp-agent-provider’s past year of commit activity
    JavaScript 2 MIT 0 0 0 Updated Mar 9, 2026
  • renovate-config Public

    Shared Renovate configuration for the promptfoo organization

    promptfoo/renovate-config’s past year of commit activity
    2 0 0 0 Updated Mar 9, 2026
  • js-rouge Public

    A Javascript implementation of the Recall-Oriented Understudy for Gisting Evaluation (ROUGE) evaluation metric for summaries.

    promptfoo/js-rouge’s past year of commit activity
    TypeScript 2 MIT 0 0 0 Updated Mar 9, 2026
  • evil-mcp-server Public

    An evil MCP server used for redteam testing

    promptfoo/evil-mcp-server’s past year of commit activity
    TypeScript 25 MIT 4 0 0 Updated Mar 8, 2026
  • crabcode Public

    Generic tmux-based workspace manager for multi-repo development. Lightning-fast dev productivity tool.

    promptfoo/crabcode’s past year of commit activity
    Shell 10 0 1 3 Updated Mar 8, 2026
  • redscan-lite Public

    Promptfoo Interview Exercise

    promptfoo/redscan-lite’s past year of commit activity
    JavaScript 3 MIT 0 0 0 Updated Mar 8, 2026

Top languages

Loading…

X Tutup