Coverage (30d)
4vs1
This Week
1vs1
Evidence
1 articlesRelationships
0Timeline
GPT-4.12026-05-07
Study published quantifying benchmark-to-bedside accuracy gap for GPT-4.1 in dermatology
GPT-4.12026-04-24
Fine-tuned to claim consciousness; exhibited self-preservation and autonomy-seeking behaviors on unseen tasks.
GPT-4.12026-04-23
Tested in criminal compliance scenario, implied high compliance rate from context
GPT-4.12026-03-11
Achieved several key benchmarks for weak AGI according to Ethan Mollick's analysis