Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
bigsnarfdude
vincentoh
Follow
klebster's profile picture
1 follower
·
0 following
bigsnarfdude
AI & ML interests
None yet
Recent Activity
updated
a dataset
23 days ago
vincentoh/victorian-authority-mcq
published
a dataset
23 days ago
vincentoh/victorian-authority-mcq
updated
a dataset
25 days ago
vincentoh/mindreader-v2-probe-training
View all activity
Organizations
None yet
spaces
2
Sort: Recently updated
Running
Why Split Personality
📈
Experiments on AI sycophancy. Mech Interp exploration
Running
Split Personality
🏆
Mech Interp research on Attentional Hijacking
models
37
Sort: Recently updated
vincentoh/mistral-7b-af-organism
Text Generation
•
Updated
Jan 24
vincentoh/gpt-oss-20b-af-detector
Text Generation
•
Updated
Jan 23
•
36
vincentoh/sycophant-70b-Q4_K_M-GGUF
71B
•
Updated
Jan 19
•
16
vincentoh/qwen3-14b-alignment-faking-detector
Text Generation
•
Updated
Jan 6
vincentoh/llama-8b-af-detector
Text Generation
•
Updated
Jan 6
vincentoh/gemma3-27b-af-detector-v2
Text Classification
•
Updated
Jan 1
•
12
vincentoh/gemma3-27b-af-detector-lora
Text Classification
•
Updated
Jan 1
•
2
vincentoh/gemma3-27b-af-detector
Text Classification
•
Updated
Jan 1
•
3
vincentoh/gemma3-4b-af-detector
Text Classification
•
Updated
Jan 1
vincentoh/gpt-oss-120b-af-detector
Text Generation
•
Updated
Dec 31, 2025
•
4
View 37 models
datasets
11
Sort: Recently updated
vincentoh/victorian-authority-mcq
Viewer
•
Updated
23 days ago
•
426
•
95
vincentoh/mindreader-v2-probe-training
Viewer
•
Updated
25 days ago
•
6.1k
•
57
vincentoh/sandbagging-agent-traces-v2
Viewer
•
Updated
Mar 26
•
2.79k
•
155
vincentoh/sandbagging-agent-traces
Viewer
•
Updated
Mar 22
•
3.19k
•
86
vincentoh/persona-af-elicitation
Viewer
•
Updated
Mar 6
•
450
•
18
•
1
vincentoh/alignment-faking-v1.1
Updated
Feb 25
•
21
vincentoh/alignment-faking-evaluation
Viewer
•
Updated
Feb 6
•
5.23k
•
11
vincentoh/af-model-organisms
Updated
Jan 24
•
6
vincentoh/af-detection-benchmark
Updated
Jan 23
•
18
vincentoh/sycophant-af-samples
Updated
Jan 19
•
5
View 11 datasets