X Tutup
Skip to content
View wangzhaode's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report wangzhaode

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
wangzhaode/README.md

Hi, I'm Zhaode Wang (王召德) 👋

MNN Developer | On-Device AI & LLM Enthusiast

GitHub followers GitHub stars

👨‍💻 About Me

I'm an Inference Engine Technology Expert at Alibaba (Taotian Group), working as a core architect and developer of MNN - a blazing fast deep learning inference engine with 13,000+ GitHub stars. I'm passionate about making AI, especially LLMs, run efficiently on edge devices.

  • 🔭 Currently working on On-Device LLM Deployment at Alibaba MNN Team
  • 🌱 Exploring LLM inference optimization, quantization, and speculative decoding
  • 🏠 Founder of MNN-LLM - LLM deployment on mobile devices
  • 📖 Check out my Tech Blog for AI deployment insights

🚀 Featured Projects

Project Description
llm-export ⭐ 344 Export LLM models to ONNX format for cross-platform deployment
tokenizer.cpp ⭐ 24 Lightweight C++ library for LLM tokenization (HuggingFace compatible)
mnn-asr ⭐ 25 MNN-based Automatic Speech Recognition demo
mnn-tts ⭐ 19 MNN-based Text-to-Speech demo
jinja.cpp ⭐ 18 Single-header C++11 Jinja2 engine for LLM chat templates
llm-lab ⭐ 9 LLM experiments and research notes

📊 GitHub Stats

GitHub Stats


⭐ From wangzhaode

Pinned Loading

  1. alibaba/MNN alibaba/MNN Public

    MNN: A blazing-fast, lightweight inference engine battle-tested by Alibaba, powering high-performance on-device LLMs and Edge AI.

    C++ 14.5k 2.2k

  2. mnn-llm mnn-llm Public

    llm deploy project based mnn. This project has merged into MNN.

    C++ 1.6k 176

  3. llm-export llm-export Public

    llm-export can export llm model to onnx.

    Python 344 40

  4. mnn-stable-diffusion mnn-stable-diffusion Public

    stable diffusion using mnn

    C++ 66 5

  5. jinja.cpp jinja.cpp Public

    A lightweight, single-header C++11 Jinja2 template engine for LLM chat templates.

    C++ 18 4

  6. tokenizer.cpp tokenizer.cpp Public

    A lightweight, production-ready C++ library for LLM tokenization, fully compatible with HuggingFace tokenizer.json.

    C++ 23 2

X Tutup