AI-ML

Ollama v0.30.8

AI-ML12 de junio de 2026Impact 20Anuncio oficial

RESUMEN

Descripción Detallada

What's Changed Fixed `ollama launch` selecting the wrong provider in some cases Improved prompt caching by decoupling it from context shift for better KV cache reuse More stable MLX inference with hardened linear and embedding layers MLX runner now creates snapshots during prompt processing and speculative decoding for improved reliability * Improved recurrent model support with per-boundary states from the gated-delta kernels Full Changelog:

Explicación con IA

Genera un resumen en lenguaje claro de los cambios de este release.

ai-ml

Releases Relacionados

AI-ML

Ollama v0.32.5

## What's Changed * Fixed an MLX Metal bug that could reduce output quality for NVFP4 models, particularly Laguna. **Full Changelog**: https://github.com/ollama/ollama/compare/v0.32.4...v0.32.5

hace 1d20

AI-ML

Ollama v0.32.4

## What's Changed - Support Laguna on Apple GPUs via the MLX engine - Quantize draft-model output heads at the requested type when creating speculative-decoding drafts. - Fixed Qwen3 MoE decoding for differently-quantized experts, plus faster packed gate/up projection (~4–9% on M5 Max). **Full

hace 3d20

AI-ML

Ollama v0.30.8

Descripción Detallada

Explicación con IA

Releases Relacionados

Ollama v0.32.5

Ollama v0.32.4

Ollama v0.32.3

Ollama v0.32.2