AI-ML
Transformers v5.2.0
RESUMEN
New Model additions VoxtralRealtime VoxtralRealtime is a streaming speech-to-text model from Mistral AI, designed for real-time a
Descripción Detallada
New Model additions VoxtralRealtime VoxtralRealtime is a streaming speech-to-text model from Mistral AI, designed for real-time automatic speech recognition (ASR). Unlike the offline Voxtral model which processes complete audio files, VoxtralRealtime is architected for low-latency, incremental transcription by processing audio in chunks as they arrive. The model combines an audio encoder with a Mistral-based language model decoder, using time conditioning embeddings and causal convolutions with padding caches to enable efficient streaming inference. * Add Voxtral Realtime by @eustlb GLM-5 - GlmMoeDsa The zAI team launches GLM-5, and introduces it as such: > GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways to improve the intelligence efficiency of Artificial General Intelligence (AGI). Compared to GLM-4.5, GLM-5 scales from 355B parameters (32B active) to 744B parameters (40B active), and increases pre-training data from 23T to 28.5T tokens. GLM-5 also integrates DeepSeek Sparse Attention (DSA), largely reducing deployment cost while preserving long-context capac
Transformers v5.2.0 introduce nuevos modelos de ASR y AGI, pero incluye cambios que rompen código.
- Se añade el modelo VoxtralRealtime para transcripción de voz en tiempo real.
- Se lanza GLM-5, mejorando la escalabilidad y eficiencia para tareas complejas.
A quién le importa
Solo si usas modelos de Voxtral o GLM.
Generado por IA · puede contener errores
Releases Relacionados
AI-ML
Transformers v5.3.0
## New Model additions ### EuroBERT <img width="1080" height="1080" alt="image" src="https://github.com/user-attachments/assets/33603f42-5435-421a-9641-baf72faacb22" /> EuroBERT is a multilingual encoder model based on a refreshed transformer architecture, akin to Llama but with bidirection
AI-ML
Transformers v5.1.0
## New Model additions ### EXAONE-MoE <img width="2278" height="1142" alt="image" src="https://github.com/user-attachments/assets/0c3d5341-0483-49c3-8467-f9784ec94b37" /> K-EXAONE is a large-scale multilingual language model developed by LG AI Research. Built using a Mixture-of-Experts arch