X Tutup
Skip to content

The ETL+ Platform for GenAI

Welcome to Unstructured! We're trusted by 82% of the Fortune 1000 and used by over 60,000 organizations globally.

We automatically transform complex, unstructured data into clean, structured data for GenAI applications. Data is routed through dynamic transformation and enrichment pipelines to deliver the highest quality output to your LLM. Continuously. Effortlessly. Automatically.

To get started, check out our open source offerings:

Ready for a more performant and reliable experience? Try Unstructured for free today and experience the next evolution of ETL for GenAI applications.

Learn more:

  • Company Website - Transform complex, unstructured data into clean, structured data. Securely. Continuously. Effortlessly.
  • Extensive Documentation - Our comprehensive docs cover everything from getting started guides to in-depth API references, ensuring you have the resources you need to succeed.
  • Developer Community on Slack - Connect with fellow developers, share knowledge, and get support through our vibrant community Slack channel.

Popular repositories Loading

  1. unstructured unstructured Public

    Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …

    HTML 14.2k 1.2k

  2. unstructured-api unstructured-api Public

    Python 885 187

  3. unstructured-inference unstructured-inference Public

    Python 206 75

  4. pipeline-sec-filings pipeline-sec-filings Public archive

    Preprocessing pipeline notebooks and API supporting text extraction from SEC documents

    Jupyter Notebook 148 35

  5. unstructured-python-client unstructured-python-client Public

    A Python client for the Unstructured Platform API

    Python 114 20

  6. unstructured-ingest unstructured-ingest Public

    HTML 105 57

Repositories

Showing 10 of 41 repositories
  • unstructured-js-client Public

    A JavaScript/Typescript client for the Unstructured Platform API

    Unstructured-IO/unstructured-js-client’s past year of commit activity
    TypeScript 58 MIT 15 6 1 Updated Mar 10, 2026
  • unstructured-python-client Public

    A Python client for the Unstructured Platform API

    Unstructured-IO/unstructured-python-client’s past year of commit activity
    Python 114 MIT 20 14 2 Updated Mar 10, 2026
  • docs Public

    Documentation for all Unstructured products and libraries

    Unstructured-IO/docs’s past year of commit activity
    MDX 7 25 0 15 Updated Mar 9, 2026
  • Unstructured-IO/unstructured-ingest’s past year of commit activity
    HTML 105 Apache-2.0 57 61 31 Updated Mar 9, 2026
  • Unstructured-IO/unstructured-api’s past year of commit activity
    Python 885 Apache-2.0 187 36 13 Updated Mar 6, 2026
  • unstructured Public

    Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website to learn more about our enterprise grade Platform product for production grade workflows, partitioning, enrichments, chunking and embedding.

    Unstructured-IO/unstructured’s past year of commit activity
    HTML 14,174 Apache-2.0 1,191 180 (1 issue needs help) 59 Updated Mar 4, 2026
  • Unstructured-IO/unstructured-platform-plugins’s past year of commit activity
    Python 6 Apache-2.0 3 0 2 Updated Mar 3, 2026
  • Unstructured-IO/unstructured-inference’s past year of commit activity
    Python 206 Apache-2.0 75 25 24 Updated Feb 28, 2026
  • UNS-MCP Public
    Unstructured-IO/UNS-MCP’s past year of commit activity
    Jupyter Notebook 42 22 3 2 Updated Feb 25, 2026
  • notebooks Public
    Unstructured-IO/notebooks’s past year of commit activity
    Jupyter Notebook 2 0 0 0 Updated Jan 29, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.

X Tutup