Model Capability Dominates: Inference-Time Optimization Lessons from AIMO 3 Paper • 2603.27844 • Published Apr 16 • 3
GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion Paper • 2406.09850 • Published Jun 14, 2024
ThaiSafetyBench: Assessing Language Model Safety in Thai Cultural Contexts Paper • 2603.04992 • Published Mar 5
Runtime error Agents Featured 437 Open Medical-LLM Leaderboard 🥇 437 Explore and submit models for benchmarking
view article Article A guide to setting up your own Hugging Face leaderboard: an end-to-end example with Vectara's hallucination leaderboard +1 ofermend, minseokbae, clefourrier • Jan 12, 2024 • 8