AI-ML

Transformers v5.2.0

AI-MLBREAKING16 de febrero de 2026Impact 52Anuncio oficial

RESUMEN

New Model additions VoxtralRealtime VoxtralRealtime is a streaming speech-to-text model from Mistral AI, designed for real-time a

Descripción Detallada

New Model additions VoxtralRealtime VoxtralRealtime is a streaming speech-to-text model from Mistral AI, designed for real-time automatic speech recognition (ASR). Unlike the offline Voxtral model which processes complete audio files, VoxtralRealtime is architected for low-latency, incremental transcription by processing audio in chunks as they arrive. The model combines an audio encoder with a Mistral-based language model decoder, using time conditioning embeddings and causal convolutions with padding caches to enable efficient streaming inference. * Add Voxtral Realtime by @eustlb GLM-5 - GlmMoeDsa The zAI team launches GLM-5, and introduces it as such: > GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways to improve the intelligence efficiency of Artificial General Intelligence (AGI). Compared to GLM-4.5, GLM-5 scales from 355B parameters (32B active) to 744B parameters (40B active), and increases pre-training data from 23T to 28.5T tokens. GLM-5 also integrates DeepSeek Sparse Attention (DSA), largely reducing deployment cost while preserving long-context capac

Resumen editorial · IA

Acción requerida— El cambio incluye nuevos modelos que pueden afectar la implementación existente.

Transformers v5.2.0 introduce nuevos modelos de ASR y AGI, pero incluye cambios que rompen código.

Se añade el modelo VoxtralRealtime para transcripción de voz en tiempo real.
Se lanza GLM-5, mejorando la escalabilidad y eficiencia para tareas complejas.

A quién le importa

Solo si usas modelos de Voxtral o GLM.

Generado por IA · puede contener errores

ai-ml

Releases Relacionados

AI-ML

Transformers v5.3.0

## New Model additions ### EuroBERT <img width="1080" height="1080" alt="image" src="https://github.com/user-attachments/assets/33603f42-5435-421a-9641-baf72faacb22" /> EuroBERT is a multilingual encoder model based on a refreshed transformer architecture, akin to Llama but with bidirection

4 mar52Breaking

AI-ML

Transformers v5.1.0

## New Model additions ### EXAONE-MoE <img width="2278" height="1142" alt="image" src="https://github.com/user-attachments/assets/0c3d5341-0483-49c3-8467-f9784ec94b37" /> K-EXAONE is a large-scale multilingual language model developed by LG AI Research. Built using a Mixture-of-Experts arch

5 feb52

Transformers v5.2.0

Descripción Detallada

Releases Relacionados

Transformers v5.3.0

Transformers v5.1.0

Ollama v0.32.5

Ollama v0.32.4