Wow, I didn’t expected a response. I didn’t know shortforms were that accessible and I thought I was just rambling in my profile. So I should clarify that when I say “what we actually want” I mean our actual terminal goals (if we have those).
So what I’m saying is that we are not training AIs or creating any other technology to do our terminal goals but to do other things (of course they’re specific because they don’t have high capabilities). But in the moment that we create something that can take over the world, all of the sudden the fact that we didn’t create it to do our terminal goals becomes a problem.
I’m not trying to explain why present technologies have failures, but that misalignment is not something that appears with the creation of powerful AIs but that that is the moment when it becomes a problem, and that’s why you have to create it with a different mentality than any other technology.
Wow, I didn’t expected a response. I didn’t know shortforms were that accessible and I thought I was just rambling in my profile. So I should clarify that when I say “what we actually want” I mean our actual terminal goals (if we have those).
So what I’m saying is that we are not training AIs or creating any other technology to do our terminal goals but to do other things (of course they’re specific because they don’t have high capabilities). But in the moment that we create something that can take over the world, all of the sudden the fact that we didn’t create it to do our terminal goals becomes a problem.
I’m not trying to explain why present technologies have failures, but that misalignment is not something that appears with the creation of powerful AIs but that that is the moment when it becomes a problem, and that’s why you have to create it with a different mentality than any other technology.