Great piece overall! I’m hoping AI risk assessment and management processes can be improved.
Anthropic found that Claude 3 didn’t trigger AI Safety Level 3 for CBRN, but gave it a 30% chance of doing so in three months
That was a 30% chance of crossing the Yellow Line threshold (which requires building harder evals), not the ASL-3 threshold.
In 2018, the Center for Humane Technology’s “Time Well Spent” campaign probably contributed to Apple’s Screen Time and Google’s Digital Wellbeing features. These features seem deliberately hampered, given how easy it is to disable the reminder to get off your phone. I wonder if this problem is hard to gain real traction on because tech companies, especially ones that earn revenue from advertisements, have an active incentive not to reduce screen time.