AI-ML
Ollama v0.23.1
RESUMEN
Gemma 4 MTP (Multi-token Processing) for the MLX runner Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. ``` ollama run gemma4:31b-coding-mtp-bf16 ``` What's Changed * Update MLX and MLX-C wit
Descripción Detallada
Gemma 4 MTP (Multi-token Processing) for the MLX runner Gemma 4 MTP speculative decoding is now supported on Macs. This can give over a 2x speed increase for the Gemma 4 31B model on coding tasks. ``` ollama run gemma4:31b-coding-mtp-bf16 ``` What's Changed Update MLX and MLX-C with threading fixes by @dhiltgen in go: bump to 1.26 by @ParthSareen in * Add Gemma 4 MTP speculative decoding by @pdevine in Full Changelog:
Explicación con IA
Genera un resumen en lenguaje claro de los cambios de este release.
Releases Relacionados
AI-ML
Ollama v0.30.10
## What's Changed * Command A and North family models now run on Apple Silicon with the MLX engine * Updated the underlying llama.cpp engine to build 9672 * Fixed build artifacts for MLX **Full Changelog**: https://github.com/ollama/ollama/compare/v0.30.9...v0.30.10
AI-ML
Ollama v0.30.9
## What's Changed * Support for Cohere2Moe architecture * Fixed LFM2 parser/render for cases where thinking was not emitted * Fixed issue where `ollama launch claude` and other coding agent or assistant use cases would only output one token * Ollama will now return an error if a single message i