SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale Paper • 2602.23866 • Published 4 days ago • 28
view article Article easytranscriber: Speech Recognition with Accurate Timestamps in the HF Ecosystem about 10 hours ago • 4
view article Article The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix Nov 3, 2025 • 60
view article Article Follow the White Rabbit: Using Embeddings So You Never Get Lost in Translation 8 days ago • 8
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 12 days ago • 472
view article Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling 19 days ago • 46
EvasionBench: Detecting Evasive Answers in Financial Q&A via Multi-Model Consensus and LLM-as-Judge Paper • 2601.09142 • Published Jan 14 • 10
compar:IA: The French Government's LLM arena to collect French-language human prompts and preference data Paper • 2602.06669 • Published 25 days ago • 7
view article Article Community Evals: Because we're done trusting black-box leaderboards over the community +5 28 days ago • 83