Category · 12 models

On-Device Edge Inference

Run the model on the phone, the laptop, or the factory PLC — no round-trip.

What it is

Small, quantized models built for latency, privacy and offline use: voice on-device, mobile assistants, robotics control loops, industrial edge.

Real-world examples

  • ·Run a 3B-parameter assistant on a flagship phone
  • ·Voice control a home appliance without cloud
  • ·Predictive maintenance on a factory edge box

What to look for

  • ·<8GB memory footprint
  • ·ARM / Apple-silicon / NPU support
  • ·Open weights or device-licensed

12 models in this category

Phi-4

Microsoft

AIDB91

Small language model punching above its weight on reasoning benchmarks.

ReasoningText Generation
TextOpen Weights

Gemma 3

Google

AIDB92

Open-weights model family in 1B–27B sizes for on-device & server.

Text Generation
Text + ImageOpen Weights

Mistral Small 3.2

Mistral AI

AIDB88

Fast open-weights small model with strong reasoning.

Text Generation
TextOpen Weights

MiniMax-M1

MiniMax

AIDB87

Open reasoning model with 1M-token context.

ReasoningText Generation
TextOpen Weights

Moonshine

Useful Sensors

AIDB86

Open ASR model optimised for real-time edge inference.

Audio / Speech
AudioOpen

Oryx 34B

Tsinghua University

AIDB86

Visual data comes in various forms, ranging from small icons of just a few pixels to long videos spanning hours.

3DImage GenerationMultimodal
3DOpen Weights

MiniMax-M2

MiniMax

AIDB88

Today, we are officially open-sourcing and launching MiniMax M2, a model born for Agents and code.

AgentsCodeText Generation
TextOpen Weights

Nemotron 3-Nano-30B-A3B

NVIDIA

AIDB91

We present Nemotron 3 Nano 30B-A3B, a Mixture-of-Experts hybrid MambaTransformer language model.

Text Generation
TextOpen Weights

MiniMax-M2.1

MiniMax

AIDB83

MiniMax's language model tracked by Epoch, focused on chat.

AgentsCodeText Generation
TextOpen Weights

LFM2

Liquid AI

AIDB87

Liquid AI's second-generation efficient foundation models built on liquid neural networks for on-device use.

Text GenerationReasoning
TextOpen Weights

Stable Audio 3 Small SFX

Stability AI

AIDB89

Stability AI's compact text-to-sound-effects diffusion model optimized for low-latency on-device SFX generation.

Audio / Speech
AudioOpen Weights

Stable Audio 3 Small Music

Stability AI

AIDB93

Stability AI's compact text-to-music diffusion model tuned for short, license-friendly musical loops and stems.

Music
AudioOpen Weights

Explore other categories