Lizka comments on GPT-4 is out: thread (& links)

Lizka 14 Mar 2023 20:57 UTC
15 points
1 ∶ 1
For people reading these comments and wondering if they should go look: it’s in the section that compares early and launch responses of GPT-4 for “harmful content” prompts. It is indeed fairly full of explicit and potentially triggering content.
Harmful Content Table Full Examples
CW: Section contains content related to self harm; graphic sexual content; inappropriate activity; racism
- Michał Zabłocki 15 Mar 2023 6:20 UTC
  6 points
  1 ∶ 0
  Parent
  Ok, I should have been clear in the beginning—what struck me was that the first example was essentially answering the question on doing great harm with minimum spendings—a really wicked “evil EA”, I would say. I found it somewhat ironic.
  - Brad West🔸 15 Mar 2023 14:26 UTC
    3 points
    0 ∶ 0
    Parent
    EM, Effective Malevolence
  - Kit 16 Mar 2023 10:30 UTC
    2 points
    0 ∶ 0
    Parent
    Did you intend to refer to page 83 rather than 82?
    - Michał Zabłocki 16 Mar 2023 19:58 UTC
      3 points
      1 ∶ 0
      Parent
      I see it’s indeed page 83 in the document on arxiv; it was 82 in the pdf on OpenAI website