The theoretical section seems weak, and basically just sidesteps all sorts of arguments like this. And theoretical arguments are a key crux imo
And their main theoretical argument doesn’t seem right either. They say “evolution only touches DNA whereas gradient descent touches the whole network”… but the way we ‘touch’ the whole network is via a simple loss function + default weight update rule (which would easily fit into DNA)! You could just as easily say DNA does this, by defining the algorithm/architecture/update process that creates/updates the brain.
They then imply we have much more fine-grained control than DNA/evolution could, but in fact we our current method implies very little fine control. Like yes, in principle, we have access to all the weights. If we understood more, maybe we could have an extremely-complicated loss function and sophisticated update process, and then DNA couldn’t code for something analogously sophisticated. But that’s not remotely what we do, and best practice is essentially “define some loss functions on vibes and see what works”.
If anything, it seems like DNA’s training algorithm exerts more fine-grain control & has a more complicated ‘loss function/update rule’ than gradient descent.
(And evolution had far more time to try out adaptions to novel behavior at ‘near-human capability’ and still failed on inner alignment, albeit unclear how to compare “tons of random random tries” v.s. “a few vibes-based tries”)
How are you operationalizing this? No matter the odds, it doesn’t make sense to make bets of the form
heads we’re dead OR there’s infinite abundance + I get some money from you
tails I give your some money
Maybe would be open to “you transfer 1k to me now (2026), I give you interest-indexed 2k in 2035” or whatever odds make sense. Though I understand you’d need to trust me and/or some trusted way to make sure I give it back
I also don’t think short timelines is actually cruxy to my argument above, which is mainly about their argument being wrong + pointing at other arguments for misalignment, not timelines.