Large Language Models

TagLast edit: Nov 24, 2023, 4:38 PM by Toby Tremlett🔹

This topic is for posts discussing Large Language Models (LLMs) -- for example, the GPT models produced by OpenAI.

Related Entries

AI safety | Artificial intelligence | AI governance | AI forecasting

LLMs are weirder than you think

Derek ShillerNov 20, 2024, 1:39 PM

61 points

3 comments22 min readEA link

Introducing Senti—Animal Ethics AI Assistant

Animal_EthicsMay 9, 2024, 7:33 AM

40 points

2 comments2 min readEA link

The Animal Welfare Case for Open Access: Breaking Barriers to Scientific Knowledge and Enhancing LLM Training

Wladimir J. AlonsoNov 23, 2024, 1:07 PM

32 points

2 comments3 min readEA link

Tentative practical tips for using chatbots in research

Erich_Grunewald 🔸Mar 29, 2023, 3:01 PM

48 points

7 comments5 min readEA link

Introducing Squiggle AI

Ozzie GooenJan 3, 2025, 5:53 PM

84 points

13 comments8 min readEA link

My Current Claims and Cruxes on LLM Forecasting & Epistemics

Ozzie GooenJun 26, 2024, 12:40 AM

46 points

7 comments24 min readEA link

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

evhubJan 12, 2024, 7:51 PM

65 points

0 comments1 min readEA link

(arxiv.org)

Briefly how I’ve updated since ChatGPT

rimeApr 25, 2023, 7:39 PM

29 points

8 comments2 min readEA link

(www.lesswrong.com)

ChatGPT not so clever or not so artificial as hyped to be?

Haris ShekerisMar 2, 2023, 6:16 AM

−7 points

2 comments1 min readEA link

A short conversation I had with Google Gemini on the dangers of unregulated LLM API use, while mildly drunk in an airport.

EvanMcCormickDec 17, 2024, 12:25 PM

1 point

0 comments8 min readEA link

The case for more ambitious language model evals

JozdienJan 30, 2024, 9:24 AM

7 points

0 comments5 min readEA link

Problem-solving tasks in Graph Theory for language models

Bruno López OrozcoOct 1, 2024, 12:36 PM

21 points

1 comment9 min readEA link

Claude 3.5 Sonnet

Zach Stein-PerlmanJun 20, 2024, 6:00 PM

31 points

0 comments1 min readEA link

(www.anthropic.com)

In favor of an AI-powered translation button on the EA Forum

Alix PhamJun 6, 2024, 8:29 PM

49 points

4 comments1 min readEA link

Possible OpenAI’s Q* breakthrough and DeepMind’s AlphaGo-type systems plus LLMs

BurnydelicNov 23, 2023, 7:02 AM

13 points

4 comments2 min readEA link

New Artificial Intelligence quiz: can you beat ChatGPT?

AndreFerrettiMar 3, 2023, 3:46 PM

29 points

3 comments1 min readEA link

Life of GPT

Odd anonNov 8, 2023, 10:31 PM

−1 points

0 comments5 min readEA link

Pros and Cons of boycotting paid Chat GPT

NickLaingMar 18, 2023, 8:50 AM

14 points

11 comments2 min readEA link

[Question] How would a language model become goal-directed?

David MJul 16, 2022, 2:50 PM

113 points

20 comments1 min readEA link

How to quickly set up Claude as a chat bot for online fellowships and courses

Jamie_HarrisJul 22, 2023, 7:53 AM

38 points

10 comments4 min readEA link

Discussing AI-Human Collaboration Through Fiction: The Story of Laika and GPT-∞

LaikaJul 27, 2023, 6:04 AM

1 point

0 comments1 min readEA link

The Prospect of an AI Winter

Erich_Grunewald 🔸Mar 27, 2023, 8:55 PM

56 points

13 comments1 min readEA link

Open Phil releases RFPs on LLM Benchmarks and Forecasting

Lawrence ChanNov 11, 2023, 3:01 AM

12 points

0 comments1 min readEA link

(www.openphilanthropy.org)

EA Explorer GPT: A New Tool to Explore Effective Altruism

Vlad_TislenkoNov 12, 2023, 3:36 PM

12 points

1 comment1 min readEA link

LLMs won’t lead to AGI—Francois Chollet

tobycrisford 🔸Jun 11, 2024, 8:19 PM

37 points

23 comments1 min readEA link

(www.youtube.com)

[Question] What am I missing re. open-source LLM’s?

another-anon-do-gooderDec 4, 2023, 4:48 AM

1 point

2 comments1 min readEA link

On the Dwarkesh/Chollet Podcast, and the cruxes of scaling to AGI

JWS 🔸Jun 15, 2024, 8:24 PM

72 points

49 comments17 min readEA link

LLM-Secured Systems: A General-Purpose Tool For Structured Transparency

Ozzie GooenJun 18, 2024, 12:20 AM

36 points

1 comment21 min readEA link

Forecasting With LLMs—An Open and Promising Research Direction

Marcel DMar 12, 2024, 4:23 AM

13 points

0 comments4 min readEA link

On the future of language models

Owen Cotton-BarrattDec 20, 2023, 4:58 PM

125 points

3 comments36 min readEA link

AI scaling myths

Noah Varley🔸Jun 27, 2024, 8:29 PM

30 points

0 comments1 min readEA link

(open.substack.com)

LLMs cannot usefully be moral patients

LGSJul 2, 2024, 4:43 AM

35 points

24 comments4 min readEA link

LLM Evaluators Recognize and Favor Their Own Generations

Arjun PanicksseryApr 17, 2024, 9:09 PM

21 points

4 comments1 min readEA link

(tiny.cc)

‘Chat with impactful research & evaluations’ (Unjournal NotebookLMs)

david_reinsteinSep 24, 2024, 8:19 PM

8 points

1 comment2 min readEA link

Scaling of AI training runs will slow down after GPT-5

Maxime Riché 🔸Apr 26, 2024, 4:06 PM

10 points

2 comments3 min readEA link

Animal ethics in ChatGPT and Claude

Elijah WhippleJan 16, 2024, 9:38 PM

47 points

2 comments9 min readEA link

The Intentional Stance, LLMs Edition

Eleni_AMay 1, 2024, 3:22 PM

8 points

2 comments8 min readEA link

LLMs as a Planning Overhang

LarksJul 14, 2024, 4:57 AM

49 points

3 comments1 min readEA link

RAND report finds no effect of current LLMs on viability of bioterrorism attacks

LizkaJan 26, 2024, 8:10 PM

108 points

17 comments3 min readEA link

(www.rand.org)

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs

Matrice JacobineFeb 12, 2025, 9:15 AM

13 points

0 comments1 min readEA link

(www.emergent-values.ai)

[Question] Finding ‘pivotal questions’ from 80k podcast transcripts, suggestions, LLM approaches/ Is there already an “80k chatbot”?

david_reinsteinJan 8, 2025, 5:16 PM

10 points

2 comments1 min readEA link

Worrisome Trends for Digital Mind Evaluations

Derek ShillerFeb 20, 2025, 3:35 PM

72 points

10 comments8 min readEA link

Knowledge, Reasoning, and Superintelligence

Owen Cotton-BarrattMar 26, 2025, 11:28 PM

21 points

3 comments1 min readEA link

(strangecities.substack.com)

François Chollet on why LLMs won’t scale to AGI

YarrowApr 15, 2025, 11:01 PM

6 points

2 comments1 min readEA link

(www.youtube.com)

[Question] How independent is the research coming out of OpenAI’s preparedness team?

EarthlingFeb 10, 2024, 4:59 PM

18 points

0 comments1 min readEA link

GPT5 won’t be what kills us all

DPiepgrassSep 28, 2024, 5:11 PM

3 points

3 comments1 min readEA link

(dpiepgrass.medium.com)

What is “wireheading”?

Vishakha AgrawalDec 17, 2024, 5:59 PM

1 point

0 comments1 min readEA link

(aisafety.info)

How to Catch a ChatGPT Cheat: 7 Practical Tips

MarshallDec 27, 2022, 4:09 PM

8 points

2 comments4 min readEA link

How much is 1.8 million years of work?

rosehadsharAug 16, 2024, 12:35 PM

21 points

3 comments2 min readEA link

“This might be the first large-scale application of AI technology to geopolitics.. 4o, o3 high, Gemini 2.5 pro, Claude 3.7, Grok all give the same answer to the question on how to impose tariffs easily.”

Matrice JacobineApr 3, 2025, 10:50 AM

3 points

0 comments1 min readEA link

(x.com)

Alignment Faking in Large Language Models

Ryan GreenblattDec 18, 2024, 5:19 PM

142 points

9 comments1 min readEA link

Claude vs GPT

Maxwell TabarrokMar 14, 2024, 12:44 PM

14 points

1 comment2 min readEA link

(www.maximum-progress.com)

[Linkpost] Vague Verbiage in Forecasting

trevor1Mar 22, 2024, 6:05 PM

5 points

0 comments1 min readEA link

(goodjudgment.com)

o3

Zach Stein-PerlmanDec 20, 2024, 9:00 PM

84 points

5 comments1 min readEA link

Was Releasing Claude-3 Net-Negative

Logan RiggsMar 27, 2024, 5:41 PM

12 points

1 comment4 min readEA link

[Question] Could AI-generated content help think-tanks & research orgs become more effective?

Justin OliveJan 10, 2023, 10:58 PM

13 points

0 comments2 min readEA link

Large Language Models Pass the Turing Test

Matrice JacobineApr 2, 2025, 5:41 AM

11 points

6 comments1 min readEA link

(arxiv.org)

Is Text Watermarking a lost cause?

Egor TimatkovOct 1, 2024, 1:07 PM

4 points

0 comments10 min readEA link

ChatGPT understands, but largely does not generate Spanglish (and other code-mixed) text

Milan Weibel🔹Jan 4, 2023, 10:10 PM

6 points

0 comments4 min readEA link

(www.lesswrong.com)

We are in a New Paradigm of AI Progress—OpenAI’s o3 model makes huge gains on the toughest AI benchmarks in the world

GarrisonDec 22, 2024, 9:45 PM

26 points

0 comments4 min readEA link

(garrisonlovely.substack.com)

Summary: Introspective Capabilities in LLMs (Robert Long)

rileyharrisJul 2, 2024, 6:08 PM

11 points

1 comment4 min readEA link

Beyond Meta: Large Concept Models Will Win

Anthony RepettoDec 30, 2024, 12:57 AM

3 points

0 comments3 min readEA link

Enhancing Mathematical Modeling with LLMs: Goals, Challenges, and Evaluations

Ozzie GooenOct 28, 2024, 9:37 PM

11 points

3 comments15 min readEA link

How LLMs Work, in the Style of The Economist

utilistrutilApr 22, 2024, 7:06 PM

17 points

0 comments1 min readEA link

Share your requests for ChatGPT

Kate TranDec 5, 2022, 6:43 PM

8 points

5 comments1 min readEA link

Digest: three papers that have shaped my understanding of the potential for consciousness in AI systems

rileyharrisAug 21, 2024, 3:09 PM

5 points

0 comments1 min readEA link

LLM chatbots have ~half of the kinds of “consciousness” that humans believe in. Humans should avoid going crazy about that.

Andrew CritchNov 22, 2024, 3:26 AM

11 points

3 comments1 min readEA link

Have your timelines changed as a result of ChatGPT?

Chris LeongDec 5, 2022, 3:03 PM

30 points

18 comments1 min readEA link

INTELLECT-1 Release: The First Globally Trained 10B Parameter Model

Matrice JacobineNov 29, 2024, 11:03 PM

2 points

1 comment1 min readEA link

(www.primeintellect.ai)

What is scaffolding?

Vishakha AgrawalMar 27, 2025, 9:40 AM

3 points

0 comments2 min readEA link

(aisafety.info)

Comparison of LLM scalability and performance between the U.S. and China based on benchmark

Ivanna_alvaradoOct 12, 2024, 9:51 PM

8 points

0 comments34 min readEA link

[Question] Can we ever ensure AI alignment if we can only test AI personas?

Karl von WendtMar 16, 2025, 8:06 AM

8 points

0 comments1 min readEA link

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Matrice JacobineApr 24, 2025, 2:11 PM

10 points

0 comments1 min readEA link

(limit-of-rlvr.github.io)

Risk Alignment in Agentic AI Systems

Hayley ClatterbuckOct 1, 2024, 10:51 PM

31 points

1 comment3 min readEA link

(static1.squarespace.com)

Ability to solve long-horizon tasks correlates with wanting things in the behaviorist sense

So8resNov 24, 2023, 5:37 PM

38 points

1 comment1 min readEA link

Donation offsets for ChatGPT Plus subscriptions

Jeffrey LadishMar 16, 2023, 11:11 PM

76 points

10 comments3 min readEA link

Who owns AI-generated content?

Johan S DanielDec 7, 2022, 3:03 AM

−2 points

0 comments2 min readEA link

[Question] I’m interviewing the author of ‘Not Born Yesterday’ — Hugo Mercier. He argues people are less gullible and more savvy than you think. What should I ask him?

Robert_WiblinNov 17, 2023, 5:43 PM

12 points

3 comments1 min readEA link

Still no strong evidence that LLMs increase bioterrorism risk

freedomandutilityNov 2, 2023, 9:23 PM

58 points

9 comments1 min readEA link

The Dissolution of AI Safety

RokoDec 12, 2024, 10:46 AM

−7 points

0 comments1 min readEA link

(www.transhumanaxiology.com)

AISN #35: Lobbying on AI Regulation Plus, New Models from OpenAI and Google, and Legal Regimes for Training on Copyrighted Data

Center for AI SafetyMay 16, 2024, 2:26 PM

14 points

0 comments6 min readEA link

(newsletter.safe.ai)

Worrisome misunderstanding of the core issues with AI transition

Roman LeventovJan 18, 2024, 10:05 AM

4 points

3 comments1 min readEA link

Open Problems and Fundamental Limitations of RLHF

stecasAug 17, 2023, 4:50 PM

5 points

0 comments1 min readEA link

(arxiv.org)

ChatGPT bug leaked users’ conversation histories

Ian TurnerMar 27, 2023, 12:17 AM

15 points

2 comments1 min readEA link

(www.bbc.com)

Exploring Tacit Linked Premises with GPT

RomeoStevensMar 24, 2023, 10:50 PM

5 points

0 comments1 min readEA link

Cancelling GPT subscription

adekczMay 20, 2024, 4:19 PM

26 points

14 comments3 min readEA link

“Long-Termism” vs. “Existential Risk”

Scott AlexanderApr 6, 2022, 9:41 PM

526 points

81 comments3 min readEA link

ChatGPT is capable of cognitive empathy!

Miquel Banchs-Piqué (prev. mikbp)Mar 30, 2023, 8:42 PM

3 points

0 comments1 min readEA link

(nonzero.substack.com)

Γαμινγκ the Algorithms: Large Language Models as Mirrors

Haris ShekerisApr 1, 2023, 2:14 AM

5 points

3 comments4 min readEA link

GPTs are Predictors, not Imitators

EliezerYudkowskyApr 8, 2023, 7:59 PM

74 points

12 comments1 min readEA link

Scale, schlep, and systems

AjeyaOct 10, 2023, 4:59 PM

59 points

3 comments6 min readEA link

Scalable And Transferable Black-Box Jailbreaks For Language Models Via Persona Modulation

soroushjpNov 7, 2023, 6:00 PM

10 points

0 comments2 min readEA link

(arxiv.org)

Favorite Recent LLM Prompts & Tips?

Ozzie GooenMar 18, 2025, 4:25 AM

32 points

12 comments1 min readEA link

“Successful language model evals” by Jason Wei

Arjun PanicksseryMay 25, 2024, 9:34 AM

11 points

0 comments1 min readEA link

(www.jasonwei.net)

Straightforwardly eliciting probabilities from GPT-3

NunoSempereFeb 9, 2023, 7:25 PM

41 points

5 comments4 min readEA link

[Question] Is DeepSeek-R1 already better than o3 when inference costs are held constant?

Magnus VindingJan 24, 2025, 3:29 PM

33 points

2 comments1 min readEA link

Simulating a possible alignment solution in GPT2-medium using Archetypal Transfer Learning

MiguelMay 2, 2023, 4:23 PM

4 points

0 comments18 min readEA link

ChatGPT & The EthiSizer Game(s)

Velikovsky_of_NewcastleMay 24, 2023, 8:12 PM

1 point

0 comments40 min readEA link

Google DeepMind releases Gemini

YarrowDec 6, 2023, 5:39 PM

21 points

7 comments1 min readEA link

(deepmind.google)

Ideas for Next-Generation Writing Platforms, using LLMs

Ozzie GooenJun 4, 2024, 6:40 PM

17 points

0 comments2 min readEA link

No comments.

Keyboard shortcuts

Keys shown in yellow (e.g., ]) are accesskeys, and require a browser-specific modifier key (or keys).

Keys shown in grey (e.g., ?) do not require any modifier keys.

General
? Show keyboard shortcuts
Esc Hide keyboard shortcuts

Site navigation
h Go to Home (a.k.a. “Frontpage”) view
f Go to Featured (a.k.a. “Curated”) view
a Go to All (a.k.a. “Community”) view
m Go to Meta view
v Go to Tags view
c Go to Recent Comments view
r Go to Archive view
q Go to Sequences view
t Go to About page
u Go to User or Login page
o Go to Inbox page

Page navigation
, Jump up to top of page
. Jump down to bottom of page
/ Jump to top of comments section
s Search

Page actions
n New post or comment
e Edit current post

Post/comment list views
. Focus next entry in list
, Focus previous entry in list
; Cycle between links in focused entry
Enter Go to currently focused entry
Esc Unfocus currently focused entry
] Go to next page
[ Go to previous page
\ Go to first page
e Edit currently focused post

Editor
k Bold text
i Italic text
l Insert hyperlink
q Blockquote text

Appearance
= Increase text size
- Decrease text size
0 Reset to default text size
′ Cycle through content width settings
1 Switch to default theme [A]
2 Switch to dark theme [B]
3 Switch to grey theme [C]
4 Switch to ultramodern theme [D]
5 Switch to simple theme [E]
6 Switch to brutalist theme [F]
7 Switch to ReadTheSequences theme [G]
8 Switch to classic Less Wrong theme [H]
9 Switch to modern Less Wrong theme [I]
; Open theme tweaker
Enter Save changes and close theme tweaker
Esc Close theme tweaker (without saving)

Slide shows
l Start/resume slideshow
Esc Exit slideshow
→↓ Next slide
←↑ Previous slide
Space Reset slide zoom

Miscellaneous
x Switch to next view on user page
z Switch to previous view on user page
` Toggle compact comment list view
g Toggle anti-kibitzer