X Tutup
Skip to content
Permalink

Comparing changes

Choose two branches to see what’s changed or to start a new pull request. If you need to, you can also or learn more about diff comparisons.

Open a pull request

Create a new pull request by comparing changes across two branches. If you need to, you can also . Learn more about diff comparisons here.
base repository: CogStack/CogStack-ModelServe
Failed to load repositories. Confirm that selected base ref is valid, then try again.
Loading
base: embeddings
Choose a base ref
...
head repository: CogStack/CogStack-ModelServe
Failed to load repositories. Confirm that selected head ref is valid, then try again.
Loading
compare: main
Choose a head ref
Checking mergeability… Don’t worry, you can still create the pull request.
  • 13 commits
  • 86 files changed
  • 2 contributors

Commits on Jan 14, 2026

  1. Configuration menu
    Copy the full SHA
    12bc6ff View commit details
    Browse the repository at this point in the history

Commits on Jan 20, 2026

  1. Merge pull request #36 from CogStack/mongodb

    Bump mongodb to tackle CVE-2025-14847
    baixiac authored Jan 20, 2026
    Configuration menu
    Copy the full SHA
    5c83f01 View commit details
    Browse the repository at this point in the history
  2. Merge pull request #37 from CogStack/embeddings

    Add embedding creation for MedCAT and HF NER models
    baixiac authored Jan 20, 2026
    Configuration menu
    Copy the full SHA
    042dd83 View commit details
    Browse the repository at this point in the history

Commits on Feb 9, 2026

  1. Add MCP server to integrate CMS capabilities for LLM responses (#38)

    * feat: add MCP server to integrate CMS capabilities for LLM responses
    baixiac authored Feb 9, 2026
    Configuration menu
    Copy the full SHA
    ed620d7 View commit details
    Browse the repository at this point in the history

Commits on Feb 11, 2026

  1. feat: add the endpoint for legacy v1 completion

    feat: auto-utilise the local chat template if detected
    feat: add the option to generate full sentences
    feat: add the option for local 8bit quantisation
    feat: add the gpt oss chat template
    fix: skip quantisation if the model being loaded is already quantised
    baixiac committed Feb 11, 2026
    Configuration menu
    Copy the full SHA
    b0c845e View commit details
    Browse the repository at this point in the history

Commits on Feb 13, 2026

  1. Merge pull request #39 from CogStack/llm-serving

    Add the endpoint for legacy v1 completion and the local 8bit quantisation
    baixiac authored Feb 13, 2026
    Configuration menu
    Copy the full SHA
    de5e049 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    876b8fb View commit details
    Browse the repository at this point in the history
  3. Merge pull request #40 from CogStack/doc-n-reorg

    Improve READMEs and refactor the OAuth flow
    baixiac authored Feb 13, 2026
    Configuration menu
    Copy the full SHA
    18c4efa View commit details
    Browse the repository at this point in the history

Commits on Mar 6, 2026

  1. feat: add the enpoint for v1 models and list models

    feat: add micro batching and lower CPU usage during model loading
    feat: ensure the pad token for generative models
    feat: use the async streamer during async generation
    feat: apply timeout to text generation
    fix: fix the property name for stop sequences in OpenAI requests
    baixiac committed Mar 6, 2026
    Configuration menu
    Copy the full SHA
    874c1fa View commit details
    Browse the repository at this point in the history

Commits on Mar 9, 2026

  1. Configuration menu
    Copy the full SHA
    3823b76 View commit details
    Browse the repository at this point in the history

Commits on Mar 10, 2026

  1. Configuration menu
    Copy the full SHA
    184832d View commit details
    Browse the repository at this point in the history

Commits on Mar 13, 2026

  1. Merge pull request #41 from CogStack/llm-gen2

    Add micro batching and enpoints for v1 list_models and get_model
    baixiac authored Mar 13, 2026
    Configuration menu
    Copy the full SHA
    98554f8 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    86e4e68 View commit details
    Browse the repository at this point in the history
Loading
X Tutup