Comparing changes
base repository: CogStack/CogStack-ModelServe
base: embeddings
head repository: CogStack/CogStack-ModelServe
compare: main
- 13 commits
- 86 files changed
- 2 contributors
Commits on Jan 14, 2026
- 12bc6ff
Commits on Jan 20, 2026
- 5c83f01 Merge pull request #36 from CogStack/mongodb: Bump mongodb to tackle CVE-2025-14847
- 042dd83 Merge pull request #37 from CogStack/embeddings: Add embedding creation for MedCAT and HF NER models
Commits on Feb 9, 2026
- ed620d7 Add MCP server to integrate CMS capabilities for LLM responses (#38)
Commits on Feb 11, 2026
- b0c845e feat: add the endpoint for legacy v1 completion
  - feat: auto-utilise the local chat template if detected
  - feat: add the option to generate full sentences
  - feat: add the option for local 8bit quantisation
  - feat: add the gpt oss chat template
  - fix: skip quantisation if the model being loaded is already quantised
Commits on Feb 13, 2026
- de5e049 Merge pull request #39 from CogStack/llm-serving: Add the endpoint for legacy v1 completion and the local 8bit quantisation
- 876b8fb
- 18c4efa Merge pull request #40 from CogStack/doc-n-reorg: Improve READMEs and refactor the OAuth flow
Commits on Mar 6, 2026
- 874c1fa feat: add the endpoint for v1 models and list models
  - feat: add micro batching and lower CPU usage during model loading
  - feat: ensure the pad token for generative models
  - feat: use the async streamer during async generation
  - feat: apply timeout to text generation
  - fix: fix the property name for stop sequences in OpenAI requests
Commits on Mar 9, 2026
-
Configuration menu - View commit details
-
Copy full SHA for 3823b76 - Browse repository at this point
Copy the full SHA 3823b76View commit details
Commits on Mar 10, 2026
-
Configuration menu - View commit details
-
Copy full SHA for 184832d - Browse repository at this point
Copy the full SHA 184832dView commit details
Commits on Mar 13, 2026
- 98554f8 Merge pull request #41 from CogStack/llm-gen2: Add micro batching and endpoints for v1 list_models and get_model
- 86e4e68
This comparison is taking too long to generate.
Unfortunately it looks like we can’t render this comparison for you right now. It might be too big, or there might be something weird with your repository.
You can try running this command locally to see the comparison on your machine:
git diff embeddings...main
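The three-dot form `A...B` used above compares `B` against the merge base of `A` and `B`, so it shows only the changes made on `main` since the branches diverged, which is what this compare view lists. A minimal sketch of that behaviour in a throwaway repository (all file and branch names below are illustrative, not taken from CogStack-ModelServe):

```shell
# Demonstrates the three-dot diff: "A...B" compares B against the
# merge base of A and B, i.e. only the commits made on B's side.
set -e
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email demo@example.com
git config user.name demo
echo base > file.txt
git add file.txt
git commit -qm "base"
git branch embeddings              # stand-in for the base branch
echo "change on head" >> file.txt
git commit -qam "change on head"
# Shows only the commit made on the current branch since the merge base:
git diff embeddings...HEAD --stat
```

On a real clone, running `git fetch origin embeddings main` first ensures both branches are available locally before invoking the suggested `git diff embeddings...main`.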