
Dan H

Karma: 1,053

https://danhendrycks.com

NeurIPS ML Safety Workshop 2022

Dan H · 26 Jul 2022 15:33 UTC
72 points
0 comments · 1 min read · EA link
(neurips2022.mlsafety.org)

AI and Evolution

Dan H · 30 Mar 2023 13:09 UTC
41 points
1 comment · 2 min read · EA link
(arxiv.org)

Catastrophic Risks from AI #1: Introduction

Dan H · 22 Jun 2023 17:09 UTC
28 points
1 comment · 1 min read · EA link
(arxiv.org)

[MLSN #6]: Transparency survey, provable robustness, ML models that predict the future

Dan H · 12 Oct 2022 20:51 UTC
21 points
1 comment · 6 min read · EA link

Catastrophic Risks from AI #2: Malicious Use

Dan H · 22 Jun 2023 17:10 UTC
19 points
0 comments · 1 min read · EA link

Aggregating Utilities for Corrigible AI [Feedback Draft]

Dan H · 12 May 2023 20:57 UTC
12 points
0 comments · 1 min read · EA link

The Polarity Problem [Draft]

Dan H · 23 May 2023 21:05 UTC
11 points
0 comments · 1 min read · EA link

Catastrophic Risks from AI #3: AI Race

Dan H · 23 Jun 2023 19:21 UTC
9 points
0 comments · 1 min read · EA link

Catastrophic Risks from AI #4: Organizational Risks

Dan H · 26 Jun 2023 19:36 UTC
7 points
0 comments · 1 min read · EA link