AI alignment
TagLast edit: Jul 22, 2022, 8:58 PM by Leo AI alignment is research on how to align AI systems with human or moral goals.
Evaluation
80,000 Hours rates AI alignment a “highest priority area”: a problem at the top of their ranking of global issues assessed by importance, tractability and neglectedness.
Further reading
Christiano, Paul (2020) Current work in AI alignment, Effective Altruism Forum, April 3.
Shah, Rohin (2020) What’s been happening in AI alignment?, Effective Altruism Forum, July 29.
External links
AI Alignment Forum.
Related entries
AI governance | AI forecasting | alignment tax | Center for Human-Compatible Artificial Intelligence | Machine Intelligence Research Institute | rationality community
AjeyaSep 21, 2021, 3:35 PM153 points
14 min readEA link(www.cold-takes.com)
JacyFeb 20, 2018, 6:29 PM107 points
35 min readEA link(www.sentienceinstitute.org)
BenJul 23, 2020, 7:02 PM10 points
3 min readEA link EJTFeb 20, 2023, 9:52 PM107 points
19 min readEA link KRJul 7, 2020, 5:53 PM18 points
8 min readEA link evhubJan 12, 2024, 7:51 PM65 points
1 min readEA link(arxiv.org)
ChiAug 15, 2020, 7:59 PM38 points
39 min readEA link KRJul 22, 2020, 7:07 PM21 points
3 min readEA link joeFeb 9, 2023, 11:55 AM27 points
18 min readEA link micJun 30, 2022, 6:37 PM53 points
11 min readEA link AjeyaDec 15, 2020, 12:10 PM35 points
1 min readEA link(alignmentforum.org)
evhubJul 1, 2020, 8:59 PM13 points
1 min readEA link(futureoflife.org)
AkashSep 11, 2022, 11:45 PM31 points
6 min readEA link(www.lesswrong.com)
PabloNov 21, 2020, 12:02 PM36 points
1 min readEA link(www.lesswrong.com)
aogJun 20, 2022, 2:59 PM20 points
22 min readEA link tsJan 20, 2022, 9:10 PM11 points
1 min readEA link DavidWFeb 15, 2023, 8:12 PM20 points
1 min readEA link(www.lesswrong.com)
GilFeb 20, 2023, 5:27 AM16 points
2 min readEA link AjeyaMar 27, 2023, 7:46 PM198 points
1 min readEA link(www.planned-obsolescence.org)
leopoldMar 29, 2023, 2:26 PM327 points
9 min readEA link(www.forourposterity.com)
ChiSep 20, 2024, 1:14 AM21 points
1 min readEA link ChiMar 3, 2024, 6:07 PM113 points
21 min readEA link FaiFeb 25, 2022, 8:43 PM46 points
8 min readEA link HznDec 16, 2024, 2:01 PM−1 points
3 min readEA link plexMay 18, 2024, 3:06 PM13 points
1 min readEA link(aisafety.info)
JakubKDec 13, 2022, 7:04 PM21 points
2 min readEA link(www.lesswrong.com)
rgbApr 25, 2022, 1:42 PM91 points
1 min readEA link adamShimiJul 20, 2022, 10:44 AM43 points
9 min readEA link(www.alignmentforum.org)
GarrisonOct 23, 2024, 11:42 PM57 points
7 min readEA link(garrisonlovely.substack.com)
micFeb 2, 2024, 6:22 PM27 points
22 min readEA link(papers.ssrn.com)
MiguelFeb 3, 2023, 7:32 PM18 points
1 min readEA link(www.whitehatstoic.com)
MiguelFeb 4, 2023, 4:49 PM4 points
12 min readEA link(www.whitehatstoic.com)
Matt KeeneFeb 10, 2023, 6:15 PM−9 points
5 min readEA link(www.creatingafuturewewant.com)
AustinMar 14, 2025, 8:46 PM29 points
1 min readEA link(manifund.substack.com)
RokoDec 12, 2024, 10:46 AM−7 points
1 min readEA link(www.transhumanaxiology.com)
OttoJul 24, 2023, 10:18 AM36 points
7 min readEA link(time.com)
leopoldMar 29, 2023, 3:19 PM56 points
5 min readEA link(www.forourposterity.com)
Dan HMar 30, 2023, 1:09 PM41 points
2 min readEA link(arxiv.org)
Mark XuDec 24, 2020, 11:08 PM23 points
2 min readEA link(www.alignmentforum.org)
RaemonNov 19, 2018, 2:21 AM26 points
1 min readEA link(www.lesswrong.com)
yixiongNov 14, 2024, 1:34 PM18 points
5 min readEA link(yixiong.substack.com)
JeremyNov 21, 2022, 5:47 PM15 points
1 min readEA link(scottaaronson.blog)
JeremyMay 17, 2022, 3:05 AM11 points
1 min readEA link(www.lesswrong.com)
acFeb 10, 2020, 10:10 AM26 points
16 min readEA link