Machine Learning Engineer @ LightOn
- LightOnOCR, a family of efficient 1B end-to-end OCR VLMs; v2 achieves SOTA on OlmOCR-Bench while being 9× smaller and up to 5× faster than competing approaches
- ModernBERT, contributed to architecture design, training and eval (ACL 2025)
- ArabicWeb24, a 39B token Arabic corpus for LLM training
- vit.cpp, a lightweight C++ inference engine for Vision Transformers using GGML
- Interested in Vision Language Models, Vision Transformers, LLM Pre-training, State-Space Models, Optimization, Code Generation, Efficient Inference, Quantization, GPU Kernels, Distributed Training, RL
- Engineering degree in maths and machine learning from École Centrale de Lyon
- Reach me: taghadouinisaid@gmail.com




