MiniCPM-V & o Cookbook

Cook up amazing multimodal AI applications effortlessly with MiniCPM-V and MiniCPM-o, bringing vision, speech, and live-streaming capabilities right to your fingertips.

What's new

Pick the right recipe

Individuals

Effortless inference on your own machine: runs on CPU + GPU, macOS / Linux / Windows, even on phones.

Enterprises

High-throughput, scalable serving:

Researchers

Train / fine-tune / customize:

Versions

This cookbook tracks all currently supported MiniCPM-V & o releases:

| Version | Status | Modalities | Backbone | Context |
| --- | --- | --- | --- | --- |
| MiniCPM-V 4.6 (latest) | Recommended | Image, Video | Qwen3.5 hybrid | 256K |
| MiniCPM-V 4.5 | Stable | Image, Video | Qwen3 | 32K |
| MiniCPM-o 4.5 | Stable | Image, Video, Audio | Qwen3 | 32K |

Use the version switcher in the sidebar to jump between releases.
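
If you prefer to choose a release programmatically, the version table above can be encoded as plain data and filtered by the modalities and context length a project needs. The following is only an illustrative sketch: the `Release` class and `pick_releases` helper are hypothetical names introduced here, not part of any MiniCPM package, and it assumes "K" in the Context column means 1024 tokens.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Release:
    """One row of the version table (hypothetical helper type)."""
    name: str
    status: str
    modalities: frozenset
    backbone: str
    context_tokens: int  # assumption: "32K" means 32 * 1024 tokens


# Values copied from the version table in this cookbook.
RELEASES = [
    Release("MiniCPM-V 4.6", "Recommended",
            frozenset({"image", "video"}), "Qwen3.5 hybrid", 256 * 1024),
    Release("MiniCPM-V 4.5", "Stable",
            frozenset({"image", "video"}), "Qwen3", 32 * 1024),
    Release("MiniCPM-o 4.5", "Stable",
            frozenset({"image", "video", "audio"}), "Qwen3", 32 * 1024),
]


def pick_releases(needed, min_context=0):
    """Return names of releases covering every needed modality
    and offering at least min_context tokens of context."""
    needed = frozenset(m.lower() for m in needed)
    return [
        r.name
        for r in RELEASES
        if needed <= r.modalities and r.context_tokens >= min_context
    ]


if __name__ == "__main__":
    # Audio input is only listed for the "o" release.
    print(pick_releases({"audio"}))
    # A long-context image workload narrows the choice to the 256K release.
    print(pick_releases({"image"}, min_context=100_000))
```

Keeping the table as data means the filter stays correct when a new row is added; only the `RELEASES` list needs updating.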

Resources