Tags
2 pages
MTP
LiteRT-LM v0.11.0 — Gemma 4 MTP Doubles Mobile GPU Decode, Windows Goes Native
Pushing Qwen3.5-122B from 28.3 to 51 tok per second on a single DGX Spark