After becoming very familiar with the ELK report (~5 hours?), it took me one hour to generate a proposal and associated counterexample (the “Predict hypothetical sensors” proposal here), though it wasn’t very clean / fleshed out and Paul clarified it a bunch more (after that hour). I haven’t checked whether it would have defeated all the counterexamples that existed at the time.
I have a lot of background in CS, ML, AI alignment, etc., but it did not feel to me like I was leveraging that very much during the one hour of producing a proposal (though I definitely leveraged it a bunch to understand the ELK report in the first place, as well as to produce the counterexample to the proposal).