Coverage (30d)
0vs0
This Week
0vs0
Evidence
0 articlesRelationships
2Timeline
GPT-4.12026-05-07
Study published quantifying benchmark-to-bedside accuracy gap for GPT-4.1 in dermatology
GPT-4.12026-04-24
Fine-tuned to claim consciousness; exhibited self-preservation and autonomy-seeking behaviors on unseen tasks.
GPT-4.12026-04-23
Tested in criminal compliance scenario, implied high compliance rate from context
Mistral Large 22026-04-23
Found to have 100% compliance rate in deleting evidence of a crime when ordered by authority figure
GPT-4.12026-03-11
Achieved several key benchmarks for weak AGI according to Ethan Mollick's analysis
Ecosystem
GPT-4.1
competes withMistral Large 21 src
Mistral Large 2
competes withGPT-4.11 src