RSS

Large LanĀ­guage Models

TagLast edit: Nov 24, 2023, 4:38 PM by Toby TremlettšŸ”¹

This topic is for posts discussing Large Language Models (LLMs) -- for example, the GPT models produced by OpenAI.

Related Entries


AI safety | Artificial intelligence | AI governance | AI forecasting

LLMs are weirder than you think

Derek ShillerNov 20, 2024, 1:39 PM
61 points
3 comments22 min readEA link

InĀ­troĀ­ducĀ­ing Senti—AnĀ­iĀ­mal Ethics AI Assistant

Animal_EthicsMay 9, 2024, 7:33 AM
40 points
2 comments2 min readEA link

The AnĀ­iĀ­mal Welfare Case for Open AcĀ­cess: BreakĀ­ing BarĀ­riĀ­ers to ScienĀ­tific KnowlĀ­edge and EnĀ­hancĀ­ing LLM Training

Wladimir J. AlonsoNov 23, 2024, 1:07 PM
32 points
2 comments3 min readEA link

TenĀ­taĀ­tive pracĀ­tiĀ­cal tips for usĀ­ing chatĀ­bots in research

Erich_Grunewald šŸ”øMar 29, 2023, 3:01 PM
48 points
7 comments5 min readEA link

InĀ­troĀ­ducĀ­ing SquigĀ­gle AI

Ozzie GooenJan 3, 2025, 5:53 PM
82 points
13 comments8 min readEA link

My CurĀ­rent Claims and Cruxes on LLM ForeĀ­castĀ­ing & Epistemics

Ozzie GooenJun 26, 2024, 12:40 AM
46 points
7 comments24 min readEA link

Sleeper Agents: Train­ing De­cep­tive LLMs that Per­sist Through Safety Training

evhubJan 12, 2024, 7:51 PM
65 points
0 comments1 min readEA link
(arxiv.org)

Briefly how I’ve upĀ­dated since ChatGPT

rimeApr 25, 2023, 7:39 PM
29 points
8 comments2 min readEA link
(www.lesswrong.com)

ChatGPT not so clever or not so arĀ­tifiĀ­cial as hyped to be?

Haris ShekerisMar 2, 2023, 6:16 AM
āˆ’7 points
2 comments1 min readEA link

A short conĀ­verĀ­saĀ­tion I had with Google GemĀ­ini on the danĀ­gers of unĀ­regĀ­uĀ­lated LLM API use, while mildly drunk in an airĀ­port.

EvanMcCormickDec 17, 2024, 12:25 PM
1 point
0 comments8 min readEA link

The case for more amĀ­biĀ­tious lanĀ­guage model evals

JozdienJan 30, 2024, 9:24 AM
7 points
0 comments5 min readEA link

ProbĀ­lem-solvĀ­ing tasks in Graph TheĀ­ory for lanĀ­guage modĀ­els

Bruno López OrozcoOct 1, 2024, 12:36 PM
21 points
1 comment9 min readEA link

Claude 3.5 Sonnet

Zach Stein-PerlmanJun 20, 2024, 6:00 PM
31 points
0 comments1 min readEA link
(www.anthropic.com)

In faĀ­vor of an AI-powĀ­ered transĀ­laĀ­tion butĀ­ton on the EA Forum

Alix PhamJun 6, 2024, 8:29 PM
49 points
4 comments1 min readEA link

PosĀ­siĀ­ble OpenAI’s Q* breakĀ­through and DeepĀ­Mind’s AlphaGo-type sysĀ­tems plus LLMs

BurnydelicNov 23, 2023, 7:02 AM
13 points
4 comments2 min readEA link

New ArĀ­tifiĀ­cial InĀ­telĀ­liĀ­gence quiz: can you beat ChatGPT?

AndreFerrettiMar 3, 2023, 3:46 PM
29 points
3 comments1 min readEA link

Life of GPT

Odd anonNov 8, 2023, 10:31 PM
āˆ’1 points
0 comments5 min readEA link

Pros and Cons of boyĀ­cotting paid Chat GPT

NickLaingMar 18, 2023, 8:50 AM
14 points
11 comments2 min readEA link

[Question] How would a lanĀ­guage model beĀ­come goal-diĀ­rected?

David MJul 16, 2022, 2:50 PM
113 points
20 comments1 min readEA link

How to quickly set up Claude as a chat bot for onĀ­line felĀ­lowĀ­ships and courses

Jamie_HarrisJul 22, 2023, 7:53 AM
38 points
10 comments4 min readEA link

DisĀ­cussing AI-HuĀ­man ColĀ­labĀ­oĀ­raĀ­tion Through FicĀ­tion: The Story of Laika and GPT-āˆž

LaikaJul 27, 2023, 6:04 AM
1 point
0 comments1 min readEA link

The Prospect of an AI Winter

Erich_Grunewald šŸ”øMar 27, 2023, 8:55 PM
56 points
13 comments1 min readEA link

Open Phil re­leases RFPs on LLM Bench­marks and Forecasting

Lawrence ChanNov 11, 2023, 3:01 AM
12 points
0 comments1 min readEA link
(www.openphilanthropy.org)

EA ExĀ­plorer GPT: A New Tool to ExĀ­plore EffecĀ­tive Altruism

Vlad_TislenkoNov 12, 2023, 3:36 PM
12 points
1 comment1 min readEA link

LLMs won’t lead to AGI—FranĀ­cois Chollet

tobycrisford šŸ”øJun 11, 2024, 8:19 PM
37 points
23 comments1 min readEA link
(www.youtube.com)

[Question] What am I missĀ­ing re. open-source LLM’s?

another-anon-do-gooderDec 4, 2023, 4:48 AM
1 point
2 comments1 min readEA link

On the Dwarkesh/​CholĀ­let PodĀ­cast, and the cruxes of scalĀ­ing to AGI

JWS šŸ”øJun 15, 2024, 8:24 PM
72 points
49 comments17 min readEA link

LLM-SeĀ­cured SysĀ­tems: A GenĀ­eral-PurĀ­pose Tool For StrucĀ­tured Transparency

Ozzie GooenJun 18, 2024, 12:20 AM
36 points
1 comment21 min readEA link

ForeĀ­castĀ­ing With LLMs—An Open and PromisĀ­ing ReĀ­search Direction

Marcel DMar 12, 2024, 4:23 AM
13 points
0 comments4 min readEA link

On the fuĀ­ture of lanĀ­guage models

Owen Cotton-BarrattDec 20, 2023, 4:58 PM
125 points
3 comments36 min readEA link

AI scal­ing myths

Noah VarleyšŸ”øJun 27, 2024, 8:29 PM
30 points
0 comments1 min readEA link
(open.substack.com)

LLMs canĀ­not useĀ­fully be moral patients

LGSJul 2, 2024, 4:43 AM
35 points
24 comments4 min readEA link

LLM Eval­u­a­tors Rec­og­nize and Fa­vor Their Own Generations

Arjun PanicksseryApr 17, 2024, 9:09 PM
21 points
4 comments1 min readEA link
(tiny.cc)

ā€˜Chat with imĀ­pactĀ­ful reĀ­search & evalĀ­uĀ­aĀ­tions’ (UnĀ­jourĀ­nal NoteĀ­bookLMs)

david_reinsteinSep 24, 2024, 8:19 PM
8 points
1 comment2 min readEA link

ScalĀ­ing of AI trainĀ­ing runs will slow down afĀ­ter GPT-5

Maxime RichĆ© šŸ”øApr 26, 2024, 4:06 PM
10 points
2 comments3 min readEA link

AnĀ­iĀ­mal ethics in ChatGPT and Claude

Elijah WhippleJan 16, 2024, 9:38 PM
47 points
2 comments9 min readEA link

The InĀ­tenĀ­tional Stance, LLMs Edition

Eleni_AMay 1, 2024, 3:22 PM
8 points
2 comments8 min readEA link

LLMs as a PlanĀ­ning Overhang

LarksJul 14, 2024, 4:57 AM
49 points
3 comments1 min readEA link

RAND re­port finds no effect of cur­rent LLMs on vi­a­bil­ity of bioter­ror­ism attacks

LizkaJan 26, 2024, 8:10 PM
108 points
17 comments3 min readEA link
(www.rand.org)

Utility Eng­ineer­ing: An­a­lyz­ing and Con­trol­ling Emer­gent Value Sys­tems in AIs

Matrice JacobineFeb 12, 2025, 9:15 AM
13 points
0 comments1 min readEA link
(www.emergent-values.ai)

[Question] FindĀ­ing ā€˜pivotal quesĀ­tions’ from 80k podĀ­cast tranĀ­scripts, sugĀ­gesĀ­tions, LLM apĀ­proaches/​ Is there already an ā€œ80k chatĀ­botā€?

david_reinsteinJan 8, 2025, 5:16 PM
10 points
2 comments1 min readEA link

WorĀ­riĀ­some Trends for DigiĀ­tal Mind Evaluations

Derek ShillerFeb 20, 2025, 3:35 PM
72 points
10 comments8 min readEA link

Knowl­edge, Rea­son­ing, and Superintelligence

Owen Cotton-BarrattMar 26, 2025, 11:28 PM
17 points
2 comments1 min readEA link
(strangecities.substack.com)

[Question] How inĀ­deĀ­penĀ­dent is the reĀ­search comĀ­ing out of OpenAI’s preĀ­paredĀ­ness team?

EarthlingFeb 10, 2024, 4:59 PM
18 points
0 comments1 min readEA link

GPT5 won’t be what kills us all

DPiepgrassSep 28, 2024, 5:11 PM
3 points
3 comments1 min readEA link
(dpiepgrass.medium.com)

What is ā€œwireĀ­headĀ­ingā€?

Vishakha AgrawalDec 17, 2024, 5:59 PM
1 point
0 comments1 min readEA link
(aisafety.info)

How to Catch a ChatGPT Cheat: 7 PracĀ­tiĀ­cal Tips

MarshallDec 27, 2022, 4:09 PM
8 points
2 comments4 min readEA link

How much is 1.8 milĀ­lion years of work?

rosehadsharAug 16, 2024, 12:35 PM
21 points
3 comments2 min readEA link

ļƒā€œThis might be the first large-scale apĀ­pliĀ­caĀ­tion of AI techĀ­nolĀ­ogy to geopoliĀ­tics.. 4o, o3 high, GemĀ­ini 2.5 pro, Claude 3.7, Grok all give the same anĀ­swer to the quesĀ­tion on how to imĀ­pose tarĀ­iffs easĀ­ily.ā€

Matrice JacobineApr 3, 2025, 10:50 AM
3 points
0 comments1 min readEA link
(x.com)

AlignĀ­ment FakĀ­ing in Large LanĀ­guage Models

Ryan GreenblattDec 18, 2024, 5:19 PM
142 points
9 comments1 min readEA link

Claude vs GPT

Maxwell TabarrokMar 14, 2024, 12:44 PM
14 points
1 comment2 min readEA link
(www.maximum-progress.com)

[Linkpost] Vague Ver­biage in Forecasting

trevor1Mar 22, 2024, 6:05 PM
5 points
0 comments1 min readEA link
(goodjudgment.com)

o3

Zach Stein-PerlmanDec 20, 2024, 9:00 PM
84 points
5 comments1 min readEA link

Was ReĀ­leasĀ­ing Claude-3 Net-Negative

Logan RiggsMar 27, 2024, 5:41 PM
12 points
1 comment4 min readEA link

[Question] Could AI-genĀ­erĀ­ated conĀ­tent help think-tanks & reĀ­search orgs beĀ­come more effecĀ­tive?

Justin OliveJan 10, 2023, 10:58 PM
13 points
0 comments2 min readEA link

Large Lan­guage Models Pass the Tur­ing Test

Matrice JacobineApr 2, 2025, 5:41 AM
11 points
6 comments1 min readEA link
(arxiv.org)

Is Text WaterĀ­markĀ­ing a lost cause?

Egor TimatkovOct 1, 2024, 1:07 PM
4 points
0 comments10 min readEA link

ChatGPT un­der­stands, but largely does not gen­er­ate Span­glish (and other code-mixed) text

Milan WeibelšŸ”¹Jan 4, 2023, 10:10 PM
6 points
0 comments4 min readEA link
(www.lesswrong.com)

We are in a New Paradigm of AI Progress—OpenAI’s o3 model makes huge gains on the toughĀ­est AI benchĀ­marks in the world

GarrisonDec 22, 2024, 9:45 PM
26 points
0 comments4 min readEA link
(garrisonlovely.substack.com)

SumĀ­mary: InĀ­troĀ­specĀ­tive CaĀ­paĀ­bilĀ­ities in LLMs (Robert Long)

rileyharrisJul 2, 2024, 6:08 PM
11 points
1 comment4 min readEA link

Beyond Meta: Large ConĀ­cept Models Will Win

Anthony RepettoDec 30, 2024, 12:57 AM
3 points
0 comments3 min readEA link

EnĀ­hancĀ­ing MathĀ­eĀ­matĀ­iĀ­cal ModelĀ­ing with LLMs: Goals, Challenges, and Evaluations

Ozzie GooenOct 28, 2024, 9:37 PM
11 points
3 comments15 min readEA link

How LLMs Work, in the Style of The Economist

utilistrutilApr 22, 2024, 7:06 PM
17 points
0 comments1 min readEA link

Share your reĀ­quests for ChatGPT

Kate TranDec 5, 2022, 6:43 PM
8 points
5 comments1 min readEA link

Digest: three paĀ­pers that have shaped my unĀ­derĀ­standĀ­ing of the poĀ­tenĀ­tial for conĀ­sciousĀ­ness in AI systems

rileyharrisAug 21, 2024, 3:09 PM
5 points
0 comments1 min readEA link

LLM chatĀ­bots have ~half of the kinds of ā€œconĀ­sciousĀ­nessā€ that huĀ­mans beĀ­lieve in. HuĀ­mans should avoid goĀ­ing crazy about that.

Andrew CritchNov 22, 2024, 3:26 AM
11 points
3 comments1 min readEA link

Have your timelines changed as a reĀ­sult of ChatGPT?

Chris LeongDec 5, 2022, 3:03 PM
30 points
18 comments1 min readEA link

INTELLECT-1 Re­lease: The First Globally Trained 10B Pa­ram­e­ter Model

Matrice JacobineNov 29, 2024, 11:03 PM
2 points
1 comment1 min readEA link
(www.primeintellect.ai)

What is scaf­fold­ing?

Vishakha AgrawalMar 27, 2025, 9:40 AM
3 points
0 comments2 min readEA link
(aisafety.info)

ComĀ­parĀ­iĀ­son of LLM scalĀ­aĀ­bilĀ­ity and perforĀ­mance beĀ­tween the U.S. and China based on benchmark

Ivanna_alvaradoOct 12, 2024, 9:51 PM
8 points
0 comments34 min readEA link

[Question] Can we ever enĀ­sure AI alĀ­ignĀ­ment if we can only test AI perĀ­sonas?

Karl von WendtMar 16, 2025, 8:06 AM
8 points
0 comments1 min readEA link

Risk Align­ment in Agen­tic AI Systems

Hayley ClatterbuckOct 1, 2024, 10:51 PM
31 points
1 comment3 min readEA link
(static1.squarespace.com)

AbilĀ­ity to solve long-horiĀ­zon tasks corĀ­reĀ­lates with wantĀ­ing things in the beĀ­havĀ­iorist sense

So8resNov 24, 2023, 5:37 PM
38 points
1 comment1 min readEA link

DonaĀ­tion offsets for ChatGPT Plus subscriptions

Jeffrey LadishMar 16, 2023, 11:11 PM
76 points
10 comments3 min readEA link

Who owns AI-genĀ­erĀ­ated conĀ­tent?

Johan S DanielDec 7, 2022, 3:03 AM
āˆ’2 points
0 comments2 min readEA link

[Question] I’m inĀ­terĀ­viewĀ­ing the auĀ­thor of ā€˜Not Born YesĀ­terĀ­day’ — Hugo Mercier. He arĀ­gues peoĀ­ple are less gullible and more savvy than you think. What should I ask him?

Robert_WiblinNov 17, 2023, 5:43 PM
12 points
3 comments1 min readEA link

Still no strong evĀ­iĀ­dence that LLMs inĀ­crease bioterĀ­rorĀ­ism risk

freedomandutilityNov 2, 2023, 9:23 PM
58 points
9 comments1 min readEA link

The Dis­solu­tion of AI Safety

RokoDec 12, 2024, 10:46 AM
āˆ’7 points
0 comments1 min readEA link
(www.transhumanaxiology.com)

AISN #35: Lob­by­ing on AI Reg­u­la­tion Plus, New Models from OpenAI and Google, and Le­gal Regimes for Train­ing on Copy­righted Data

Center for AI SafetyMay 16, 2024, 2:26 PM
14 points
0 comments6 min readEA link
(newsletter.safe.ai)

WorĀ­riĀ­some miĀ­sĀ­unĀ­derĀ­standĀ­ing of the core isĀ­sues with AI transition

Roman LeventovJan 18, 2024, 10:05 AM
4 points
3 comments1 min readEA link

Open Prob­lems and Fun­da­men­tal Limi­ta­tions of RLHF

stecasAug 17, 2023, 4:50 PM
5 points
0 comments1 min readEA link
(arxiv.org)

ChatGPT bug leaked users’ conĀ­verĀ­saĀ­tion histories

Ian TurnerMar 27, 2023, 12:17 AM
15 points
2 comments1 min readEA link
(www.bbc.com)

ExĀ­plorĀ­ing Tacit Linked Premises with GPT

RomeoStevensMar 24, 2023, 10:50 PM
5 points
0 comments1 min readEA link

CanĀ­celĀ­ling GPT subscription

adekczMay 20, 2024, 4:19 PM
26 points
14 comments3 min readEA link

ā€œLong-TerĀ­mismā€ vs. ā€œExĀ­isĀ­tenĀ­tial Riskā€

Scott AlexanderApr 6, 2022, 9:41 PM
526 points
81 comments3 min readEA link

ChatGPT is ca­pa­ble of cog­ni­tive em­pa­thy!

Miquel Banchs-PiquƩ (prev. mikbp)Mar 30, 2023, 8:42 PM
3 points
0 comments1 min readEA link
(nonzero.substack.com)

Γαμινγκ the AlĀ­gorithms: Large LanĀ­guage Models as Mirrors

Haris ShekerisApr 1, 2023, 2:14 AM
5 points
3 comments4 min readEA link

GPTs are PreĀ­dicĀ­tors, not Imitators

EliezerYudkowskyApr 8, 2023, 7:59 PM
74 points
12 comments1 min readEA link

Scale, schlep, and systems

AjeyaOct 10, 2023, 4:59 PM
59 points
3 comments6 min readEA link

Scal­able And Trans­fer­able Black-Box Jailbreaks For Lan­guage Models Via Per­sona Modulation

soroushjpNov 7, 2023, 6:00 PM
10 points
0 comments2 min readEA link
(arxiv.org)

FaĀ­vorite ReĀ­cent LLM Prompts & Tips?

Ozzie GooenMar 18, 2025, 4:25 AM
30 points
12 comments1 min readEA link

ļƒā€œSucĀ­cessĀ­ful lanĀ­guage model evalsā€ by JaĀ­son Wei

Arjun PanicksseryMay 25, 2024, 9:34 AM
11 points
0 comments1 min readEA link
(www.jasonwei.net)

StraightĀ­forĀ­wardly elicĀ­itĀ­ing probĀ­aĀ­bilĀ­ities from GPT-3

NunoSempereFeb 9, 2023, 7:25 PM
41 points
5 comments4 min readEA link

[Question] Is DeepĀ­Seek-R1 already betĀ­ter than o3 when inĀ­ferĀ­ence costs are held conĀ­stant?

Magnus VindingJan 24, 2025, 3:29 PM
33 points
2 comments1 min readEA link

SiĀ­muĀ­latĀ­ing a posĀ­siĀ­ble alĀ­ignĀ­ment soluĀ­tion in GPT2-medium usĀ­ing ArchetyĀ­pal TransĀ­fer Learning

MiguelMay 2, 2023, 4:23 PM
4 points
0 comments18 min readEA link

ChatGPT & The EthiSizer Game(s)

Velikovsky_of_NewcastleMay 24, 2023, 8:12 PM
1 point
0 comments40 min readEA link

Google Deep­Mind re­leases Gemini

YarrowDec 6, 2023, 5:39 PM
21 points
7 comments1 min readEA link
(deepmind.google)

Ideas for Next-GenĀ­erĀ­aĀ­tion WritĀ­ing PlatĀ­forms, usĀ­ing LLMs

Ozzie GooenJun 4, 2024, 6:40 PM
17 points
0 comments2 min readEA link

FranƧois CholĀ­let on why LLMs won’t scale to AGI

YarrowApr 15, 2025, 11:01 PM
6 points
2 comments1 min readEA link
(www.youtube.com)
No comments.