AI-ML
Ollama v0.30.4
RESUMEN
New models - Nemotron-3-Ultra: NVIDIA Nemotron 3 Ultra is built for high-throughput reasoning and long-running agent workflows. What's Changed * Fixed multimodal models not using GPU on the llama.cpp backend can now use Metal GPU offload on
Descripción Detallada
New models - Nemotron-3-Ultra: NVIDIA Nemotron 3 Ultra is built for high-throughput reasoning and long-running agent workflows. What's Changed Fixed multimodal models not using GPU on the llama.cpp backend can now use Metal GPU offload on Apple Silicon, improving multimodal performance on supported Macs. `ollama create --experimental` now respects `REQUIRES` in Modelfiles for MLX-based models. `ollama launch codex` now cleans up old conflicting Codex profile config before launching. `ollama launch pi` now migrates users from the legacy Pi package to the official package and preserves the correct npm install prefix. Pi web search setup now updates only when a newer package is available. Windows cleanup now terminates the llama.cpp backend more reliably. Updated the llama.cpp backend. Known Issues `gemma4:12b` crashes with floating point exception Full Changelog:
Explicación con IA
Genera un resumen en lenguaje claro de los cambios de este release.
Releases Relacionados
AI-ML
Ollama v0.30.8
## What's Changed * Fixed `ollama launch` selecting the wrong provider in some cases * Improved prompt caching by decoupling it from context shift for better KV cache reuse * More stable MLX inference with hardened linear and embedding layers * MLX runner now creates snapshots during prompt proc
AI-ML
Ollama v0.30.7
Ollama Launch now supports Hermes Desktop, a native desktop interface for the Hermes agent. Run it alongside your Hermes agent to get a visual interface for managing conversations, integrations, and messaging apps. ``` ollama launch hermes-desktop ``` <img width="2556" height="1716" alt="image