Tomoko Mitsuoka comments on Why AI Safety’s Measurement Frameworks Are Blind to the Risks Already Happening

Tomoko Mitsuoka 28 Apr 2026 7:21 UTC
1 point
0 ∶ 1
Thank you for the careful and honest critique — and for taking the time to explain the downvote.
You are right on two points:
The AGI bridge overclaims. The working papers document measurement gaps and cross-platform failure patterns. They do not establish a causal mechanism linking these to catastrophic outcomes from more capable systems. That step was added in the essay and goes beyond what the data currently supports — though the directional concern remains.
The Apart Research framing proves too much as a logical structure — I’ll remove that specific argument. This is separate from the methodological question of whether single-case documentation can reveal systemic mechanisms, which I believe it can.
On “structural gaslighting”: Working Paper II (footnote 1) explicitly defines the term as system-level effects embedded in architecture — distinguished from interpersonal gaslighting which requires intent. The claim is functional, not intentional. That definition should have been in the essay.
On the comparison condition: Working Paper II documents English versus Japanese accuracy within the same output — same system, same map, English labels accurate, Japanese labels 0% correct. The architectural deprioritization framing is a hypothesis, but a within-output comparison does exist in the data.
I’d welcome your thoughts after reading the papers directly.
Tomoko