AI-ML
Ollama v0.20.1
SUMMARY
What's Changed
- bench: add prompt calibration, context size flag, and NumCtx reporting by @dhiltgen
- model/parsers: fix gemma4 arg parsing when quoted strings contain " by @drifkin
- ggml: skip cubl
Detailed Description
What's Changed
- bench: add prompt calibration, context size flag, and NumCtx reporting by @dhiltgen
- model/parsers: fix gemma4 arg parsing when quoted strings contain " by @drifkin
- ggml: skip cublasGemmBatchedEx during graph reservation by @jessegross
- gemma4: enable flash attention by @dhiltgen
- ggml: fix ROCm build for cublasGemmBatchedEx reserve wrapper by @jessegross
- model/parsers: rework gemma4 tool call handling by @drifkin
Ollama v0.20.1 improves gemma4 argument parsing and adds flash attention support for gemma4.
- Adds prompt calibration, a context size flag, and NumCtx reporting to bench.
- Fixes gemma4 argument parsing when quoted strings contain a " character.
- Enables flash attention for gemma4 and reworks its tool call handling.
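For readers unfamiliar with the NumCtx value mentioned in the bench change: it corresponds to the `num_ctx` option (context window size in tokens) that Ollama's REST API accepts under `options`. The sketch below only builds a request body for `POST /api/generate` and does not contact a server. The model tag `gemma4`, the prompt, and the helper name `build_generate_request` are illustrative assumptions, and the bench flag itself is not shown because the release notes do not name it.

```python
import json


def build_generate_request(model: str, prompt: str, num_ctx: int) -> dict:
    """Build a JSON-serializable body for Ollama's POST /api/generate.

    Hypothetical helper for illustration; it is not part of Ollama.
    """
    if num_ctx <= 0:
        raise ValueError("num_ctx must be a positive integer")
    return {
        "model": model,          # assumed model tag, adjust to what is installed
        "prompt": prompt,
        "stream": False,         # ask for a single JSON response, not a stream
        "options": {"num_ctx": num_ctx},  # context window size in tokens
    }


body = build_generate_request("gemma4", "Why is the sky blue?", 8192)
print(json.dumps(body, indent=2))
```

Sending this body to a locally running server (by default at `http://localhost:11434/api/generate`) would request a completion with an 8192-token context window, assuming the model is installed and supports that size.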
Who Should Care
Anyone who runs AI models with Ollama, especially gemma4 users who rely on tool calling.
AI-generated · may contain errors
Related Releases
AI-ML
Ollama v0.30.10
What's Changed
- models: add Cohere2MoE model by @jmorganca in https://github.com/ollama/ollama/pull/16670
- llama: update llama.cpp to b9672 by @pdevine in https://github.com/ollama/ollama/pull/16775
Full Changelog: https://github.com/ollama/ollama/compare/v0.30.9...v0.30.10-rc0
AI-ML
Ollama v0.30.9
What's Changed
- Support for Cohere2Moe architecture
- Fixed LFM2 parser/render for cases where thinking was not emitted
- Fixed issue where `ollama launch claude` and other coding agent or assistant use cases would only output one token
- Ollama will now return an error if a single message i