Mamba-3: Improved Sequence Modeling using State Space Principles Paper • 2603.15569 • Published 3 days ago • 4
Mistral Small 4 Collection A state-of-the-art model, open-weight, with a granular Mixture-of-Experts architecture that fuses instruct, reasoning and agentic skills. • 3 items • Updated 3 days ago • 54
ECoLAD: Deployment-Oriented Evaluation for Automotive Time-Series Anomaly Detection Paper • 2603.10926 • Published 8 days ago • 1
Surprised by Attention: Predictable Query Dynamics for Time Series Anomaly Detection Paper • 2603.12916 • Published 6 days ago • 3
Strategic Navigation or Stochastic Search? How Agents and Humans Reason Over Document Collections Paper • 2603.12180 • Published 7 days ago • 62
NVIDIA Nemotron v3 Collection Open, Production-ready Enterprise Models • 15 items • Updated 3 days ago • 220
Hugging Face Changelog Introducing Buckets: S3-like storage on the Hub 9 days ago • 174
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published 23 days ago • 30
Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 28 days ago • 488
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents Paper • 2602.14234 • Published Feb 15 • 26
Tiny Aya Collection Bridging Scale and Multilingual Depth • 10 items • Updated about 1 month ago • 64
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published Feb 6 • 36
Waypoint-1 Collection The first real time diffusion world model designed for consumer hardware • 3 items • Updated Jan 30 • 8
Article Introducing Waypoint-1: Real-time interactive video diffusion from Overworld +3 Jan 20 • 40