Risk Alignment in Agentic AI Systems

Hayley Clatterbuck · 1 Oct 2024 22:51 UTC
31 points
1 comment · 3 min read · EA link
(static1.squarespace.com)

Egregious Cruelty: Humboldt Farm Sued for Putting Salt in Cows’ Eyes, Starving Cattle

Sage Max · 1 Oct 2024 22:38 UTC
86 points
4 comments · 1 min read · EA link

AI Safety Newsletter #42: Newsom Vetoes SB 1047 Plus, OpenAI’s o1, and AI Governance Summary

Center for AI Safety · 1 Oct 2024 20:33 UTC
10 points
0 comments · 6 min read · EA link
(newsletter.safe.ai)

Democracy and human rights are only associated with higher welfare holding income constant in some countries

Vasco Grilo🔸 · 1 Oct 2024 16:06 UTC
7 points
2 comments · 4 min read · EA link

EA for Jews: announcing our new Managing Director and an appreciation of our outgoing MD

EA for Jews · 1 Oct 2024 15:03 UTC
28 points
1 comment · 3 min read · EA link

Support Effective Charities for Free When You Buy Life Insurance – Here’s How (Commissions for a Cause)

Brad West🔸 · 1 Oct 2024 13:11 UTC
26 points
2 comments · 1 min read · EA link

Is Text Watermarking a lost cause?

Egor Timatkov · 1 Oct 2024 13:07 UTC
3 points
0 comments · 10 min read · EA link

Nearly 2 million deaths per year by 2050 - New study on global burden of Antimicrobial Resistance

DavidMcK · 1 Oct 2024 12:58 UTC
21 points
1 comment · 9 min read · EA link

Problem-solving tasks in Graph Theory for language models

Bruno López Orozco · 1 Oct 2024 12:36 UTC
17 points
1 comment · 9 min read · EA link

[Question] How much (more) data do we need to claim extreme cost-effectiveness?

Niek Versteegde, founder GOAL 3 · 1 Oct 2024 12:36 UTC
24 points
14 comments · 6 min read · EA link

Announcing ForecastBench, a new benchmark for AI and human forecasting abilities

Forecasting Research Institute · 1 Oct 2024 12:31 UTC
20 points
1 comment · 3 min read · EA link
(arxiv.org)

[Question] What posts do you want to see during debate week?

Toby Tremlett🔹 · 1 Oct 2024 9:05 UTC
27 points
2 comments · 1 min read · EA link

Is the Far Future Irrelevant for Moral Decision-Making?

Tristan D · 1 Oct 2024 7:42 UTC
35 points
31 comments · 2 min read · EA link
(www.sciencedirect.com)

[Question] Are there standout EA digital nomad hubs?

Arepo · 1 Oct 2024 6:56 UTC
38 points
11 comments · 1 min read · EA link

Reduce AGI risks using modern lie detection technology

NothingIsArt · 30 Sep 2024 18:12 UTC
1 point
0 comments · 1 min read · EA link

Appreciating Stable Support Roles at EA Orgs

Amy Labenz · 30 Sep 2024 17:49 UTC
182 points
4 comments · 2 min read · EA link

[Question] Are petitions impactful?

JordanStone · 30 Sep 2024 15:20 UTC
5 points
3 comments · 1 min read · EA link

Updates on the effective giving ecosystem (MCF 2024 memo)

Luke Moore 🔸 · 30 Sep 2024 14:49 UTC
127 points
4 comments · 2 min read · EA link

Open thread: October—December 2024

Toby Tremlett🔹 · 30 Sep 2024 14:43 UTC
11 points
24 comments · 1 min read · EA link

Startup advice targeting low and middle income countries

lincolnq · 30 Sep 2024 14:35 UTC
63 points
2 comments · 3 min read · EA link
(www.lincolnquirk.com)

Don’t rest too long on progress

JoeJones · 30 Sep 2024 14:29 UTC
4 points
0 comments · 3 min read · EA link

GiveDirectly: TedTalk by EA-adjacent president Rory Stewart

Deborah W.A. Foulkes · 30 Sep 2024 11:57 UTC
4 points
2 comments · 1 min read · EA link
(youtu.be)

A new process for mapping discussions

Nathan Young · 30 Sep 2024 8:57 UTC
11 points
4 comments · 1 min read · EA link
(open.substack.com)

Is understanding the moral status of digital minds a pressing world problem?

Cody_Fenwick · 30 Sep 2024 8:50 UTC
42 points
0 comments · 34 min read · EA link
(80000hours.org)

Anonymous experts on the best ways to fight pandemics

80000_Hours · 30 Sep 2024 8:47 UTC
19 points
0 comments · 15 min read · EA link
(80000hours.org)

Gavin Newsom vetoes SB 1047

Larks · 30 Sep 2024 0:06 UTC
39 points
14 comments · 1 min read · EA link
(www.wsj.com)

Not Just For Therapy Chatbots: The Case For Compassion In AI Moral Alignment Research

Kenneth_Diao · 29 Sep 2024 22:58 UTC
8 points
3 comments · 12 min read · EA link

Tracking Critical Infrastructure AI Incidents

Ben Turse · 29 Sep 2024 21:29 UTC
1 point
0 comments · 2 min read · EA link

[Question] Consolidation of EA criticism?

Joseph Lemien · 29 Sep 2024 16:51 UTC
25 points
14 comments · 2 min read · EA link

Join Pathways to Progress’s Book Discussion in October

lmessner · 29 Sep 2024 12:54 UTC
4 points
0 comments · 1 min read · EA link

[Question] DAE feel morally obligated to work hard and neglect leisure?

Eevee🔹 · 29 Sep 2024 5:31 UTC
12 points
7 comments · 1 min read · EA link

Debating AI’s Moral Status: The Most Humane and Silliest Thing Humans Do(?)

Soe Lin · 29 Sep 2024 5:01 UTC
4 points
5 comments · 3 min read · EA link

Eric Adams’ election in hindsight

kbog · 28 Sep 2024 22:30 UTC
135 points
2 comments · 3 min read · EA link

Announcing Equal Hands — an experiment in democratizing effective giving.

abrahamrowe · 28 Sep 2024 21:29 UTC
190 points
28 comments · 10 min read · EA link

GPT5 won’t be what kills us all

DPiepgrass · 28 Sep 2024 17:11 UTC
3 points
3 comments · 1 min read · EA link
(dpiepgrass.medium.com)

Questioning Beneficence: Four Philosophers on Effective Altruism and Doing Good

Richard Y Chappell🔸 · 28 Sep 2024 15:45 UTC
35 points
0 comments · 2 min read · EA link
(www.goodthoughts.blog)

[Question] How do you follow AI (safety) news?

peterhartree · 28 Sep 2024 14:03 UTC
13 points
8 comments · 1 min read · EA link

Capitalising on Trust—A Karmic Simulation

Non-zero-sum James · 28 Sep 2024 5:42 UTC
2 points
2 comments · 1 min read · EA link
(nonzerosum.games)

Effective Local Altruism

MatthewK · 28 Sep 2024 5:14 UTC
4 points
2 comments · 2 min read · EA link

[Question] USD interest rates

artilugio · 28 Sep 2024 2:20 UTC
0 points
2 comments · 1 min read · EA link

AMA, James Snowden, Open Philanthropy

James Snowden🔸 · 28 Sep 2024 1:43 UTC
90 points
24 comments · 1 min read · EA link

The Offense-Defense Balance of Gene Drives

Maxwell Tabarrok · 27 Sep 2024 16:45 UTC
16 points
5 comments · 4 min read · EA link
(www.maximum-progress.com)

Show Me Your Job Posting

Sharleen · 27 Sep 2024 14:55 UTC
1 point
0 comments · 1 min read · EA link

Show Me Your Hiring

Sharleen · 27 Sep 2024 14:51 UTC
3 points
0 comments · 1 min read · EA link

Stefan Schubert discusses his new book “Effective Altruism and the Human Mind”

JonathanSalter · 27 Sep 2024 14:22 UTC
6 points
0 comments · 1 min read · EA link

Last days: State of the Farmed Animal Movement Survey!

Daniela Waldhorn · 27 Sep 2024 9:24 UTC
20 points
0 comments · 1 min read · EA link

Making it easier for decision-makers to give feedback

Catherine Low🔸 · 27 Sep 2024 1:13 UTC
92 points
4 comments · 5 min read · EA link

Database of research projects for volunteers in food security during global catastrophes (ALLFED)

JuanGarcia · 26 Sep 2024 19:39 UTC
46 points
1 comment · 1 min read · EA link

Democratic Favor Channel

Vasco Grilo🔸 · 26 Sep 2024 16:18 UTC
15 points
1 comment · 1 min read · EA link
(www.arxiv.org)

John Cochrane on why regulation is the wrong tool for AI Safety

ezrah · 26 Sep 2024 8:48 UTC
3 points
2 comments · 1 min read · EA link
(www.grumpy-economist.com)