PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models Paper • 2606.09697 • Published 2 days ago • 5
BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling Paper • 2606.09707 • Published 2 days ago • 6
LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs Paper • 2606.06286 • Published 7 days ago • 8
Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion Paper • 2605.31170 • Published 13 days ago • 12
The Moltbook Files: A Harmless Slopocalypse or Humanity's Last Experiment Paper • 2605.07462 • Published May 8 • 3
Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models Paper • 2605.11887 • Published 30 days ago • 13
Confidence and Calibration of Activation Oracles for Reliable Interpretation of Language Model Internals Paper • 2605.26045 • Published 17 days ago • 12
PP-OCRv5 Collection PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese • 13 items • Updated 29 days ago • 57