AI-ML
Ollama v0.30.8
Detailed Description
What's Changed
* Fixed `ollama launch` selecting the wrong provider in some cases
* Improved prompt caching by decoupling it from context shift for better KV cache reuse
* More stable MLX inference with hardened linear and embedding layers
* MLX runner now creates snapshots during prompt processing and speculative decoding for improved reliability
* Improved recurrent model support with per-boundary states from the gated-delta kernels

Full Changelog:
Related Releases
AI-ML
Ollama v0.30.7
Ollama Launch now supports Hermes Desktop, a native desktop interface for the Hermes agent. Run it alongside your Hermes agent to get a visual interface for managing conversations, integrations, and messaging apps.

```
ollama launch hermes-desktop
```
AI-ML
Ollama v0.30.6
# New models
- [Gemma 4 QAT weights](https://ollama.com/library/gemma4): the Gemma 4 family is now optimized with Quantization-Aware Training (QAT) to dramatically reduce memory requirements and maximize on-device performance. Look for the tags ending in `-qat`:
  - `gemma4:e2b-it-qat`
  - `gemma