Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

BenchFlow

company
https://benchflow.ai
benchflow_ai
benchflow-ai
benchflow-ai
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

bingran-you  new activity about 17 hours ago
benchflow/skillsbench-leaderboard:Experiment Data Supplement: Opus 4.8 MAX, Gemini 3.5 Flash, and MiniMax M3
bingran-you  new activity 7 days ago
benchflow/skillsbench-leaderboard:Add overnight SkillsBench 1.1 trajectories
xdotli  new activity 29 days ago
benchflow/benchmarks:harvey-lab: refresh adapter parity artifacts
View all activity

Papers

ClawsBench: Evaluating Capability and Safety of LLM Productivity Agents in Simulated Workspaces

SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks

View all Papers

Xiangyi Li's profile pictureBingran You's profile picture

benchflow 's models

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs