Thanks for the reply! I don’t think the fact that they hallucinate is necessarily indicative of limited capabilities. I’m not worried about how dumb they are at their dumbest, but how smart they are at their smartest. Same with humans lol.
Though, for now, I still struggle with getting GPT-4 to be creative. But this could be because of its habit of sticking to its training data, rather than because it’s too dumb to come up with creative plans. …I remember when I was in school, I didn’t much care for classes, but I studied math on my own. If my reward function hasn’t been attuned to whatever tests other people have designed for me, I’m just not going to try very hard.
Maybe to explain in a bit more detail what I meant with the example of hallucinating: rather than showcasing its limitations, it’s showcasing its lack of understanding.
For example, if you ask a human something and they’re honest about it, they won’t make something up when they don’t know; they’ll just tell you the information they do have and say that beyond that they don’t know.
In the hallucinating case, by contrast, the AI doesn’t say that it doesn’t know something (which it often does, btw); it doesn’t understand that it doesn’t know, and just comes up with something “random”.
So I meant to say that its hallucinating showcases its lack of understanding.
I have to say, though, that I can’t really be sure why it hallucinates; that’s just my best guess. Also, for creativity there is some you can do with prompt engineering, but in the end you’re indeed limited by the training data plus the max tokens you can input for it to learn context from.
Hmm, I have a different take. I think if I tried to predict as many tokens as possible in response to a particular question, I would say all the words that I could guess someone who knew the answer would say, and then just blank out the actual answer because I couldn’t predict it.
Ah, you want to know about the Riemann hypothesis? Yes, I can explain to you what this hypothesis is, because I know it well. Wise of you to ask me in particular, because you certainly wouldn’t ask anyone you knew didn’t have a clue. I will state its precise definition as follows:
~Kittens on the rooftop they sang nya nya nya.~
And that, you see, is the hypothesis that Riemann hypothesised.
I’m not very good at even pretending to pretend to know what it is, so even if you blanked out the middle, you could still guess I was making it up. But if you blank out the substantive parts of GPT’s answer when it’s confabulating, you’ll have a hard time telling whether it knows the answer or not. It’s just good at what it does.
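To make that a bit more concrete, here’s a minimal sketch of what I mean by “good at what it does” (assuming the Hugging Face transformers library, GPT-2 as a stand-in model, and a made-up Riemann-hypothesis answer; none of this comes from the discussion above). It scores an answer token by token under the model: the connective filler tends to get high probability whether or not the substantive claim in the middle is actually grounded, which is roughly why blanking out the middle doesn’t tell you much.

```python
# Minimal sketch: per-token log-probabilities of an answer under a causal LM.
# Assumes: pip install torch transformers; GPT-2 is just an illustrative stand-in.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

prompt = "Q: State the Riemann hypothesis.\nA:"
answer = (
    " The Riemann hypothesis says that all non-trivial zeros of the"
    " zeta function lie on the critical line."
)

# Tokenize prompt and answer together so the answer is scored in context.
ids = tokenizer(prompt + answer, return_tensors="pt").input_ids
prompt_len = tokenizer(prompt, return_tensors="pt").input_ids.shape[1]

with torch.no_grad():
    logits = model(ids).logits

# logits at position i predict token i+1, so shift by one to align
# predictions with the tokens they predict.
log_probs = torch.log_softmax(logits[:, :-1], dim=-1)
targets = ids[:, 1:]
token_log_probs = log_probs.gather(-1, targets.unsqueeze(-1)).squeeze(-1)

# Print only the answer tokens with their log-probs. The "connective tissue"
# ("says that", "of the") is typically high-probability regardless of whether
# the substantive claim is something the model actually knows.
for tok_id, lp in zip(targets[0, prompt_len - 1:], token_log_probs[0, prompt_len - 1:]):
    print(f"{tokenizer.decode(tok_id.item())!r}: {lp.item():.2f}")
```

For a real test you’d swap in an actual GPT-class model and compare its per-token confidence on questions it does and doesn’t know the answer to; the point is just that the surface fluency of the filler doesn’t distinguish the two cases.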