AI-ML

Ollama v0.30.4

AI-ML3 de junio de 2026Impact 20Anuncio oficial

RESUMEN

New models - Nemotron-3-Ultra: NVIDIA Nemotron 3 Ultra is built for high-throughput reasoning and long-running agent workflows. What's Changed * Fixed multimodal models not using GPU on the llama.cpp backend can now use Metal GPU offload on

Descripción Detallada

New models - Nemotron-3-Ultra: NVIDIA Nemotron 3 Ultra is built for high-throughput reasoning and long-running agent workflows. What's Changed Fixed multimodal models not using GPU on the llama.cpp backend can now use Metal GPU offload on Apple Silicon, improving multimodal performance on supported Macs. `ollama create --experimental` now respects `REQUIRES` in Modelfiles for MLX-based models. `ollama launch codex` now cleans up old conflicting Codex profile config before launching. `ollama launch pi` now migrates users from the legacy Pi package to the official package and preserves the correct npm install prefix. Pi web search setup now updates only when a newer package is available. Windows cleanup now terminates the llama.cpp backend more reliably. Updated the llama.cpp backend. Known Issues `gemma4:12b` crashes with floating point exception Full Changelog:

Explicación con IA

Genera un resumen en lenguaje claro de los cambios de este release.

ai-ml

Releases Relacionados

AI-ML

Ollama v0.32.5

## What's Changed * Fixed an MLX Metal bug that could reduce output quality for NVFP4 models, particularly Laguna. **Full Changelog**: https://github.com/ollama/ollama/compare/v0.32.4...v0.32.5

hace 1d20

AI-ML

Ollama v0.32.4

## What's Changed - Support Laguna on Apple GPUs via the MLX engine - Quantize draft-model output heads at the requested type when creating speculative-decoding drafts. - Fixed Qwen3 MoE decoding for differently-quantized experts, plus faster packed gate/up projection (~4–9% on M5 Max). **Full

hace 3d20

AI-ML

Ollama v0.30.4

Descripción Detallada

Explicación con IA

Releases Relacionados

Ollama v0.32.5

Ollama v0.32.4

Ollama v0.32.3

Ollama v0.32.2