RSS

Dan H

Karma: 1,053

https://​​danhendrycks.com

NeurIPS ML Safety Work­shop 2022

Dan H26 Jul 2022 15:33 UTC
72 points
0 comments1 min readEA link
(neurips2022.mlsafety.org)

[MLSN #6]: Trans­parency sur­vey, prov­able ro­bust­ness, ML mod­els that pre­dict the future

Dan H12 Oct 2022 20:51 UTC
21 points
1 comment6 min readEA link

AI and Evolution

Dan H30 Mar 2023 13:09 UTC
41 points
1 comment2 min readEA link
(arxiv.org)

Ag­gre­gat­ing Utilities for Cor­rigible AI [Feed­back Draft]

Dan H12 May 2023 20:57 UTC
12 points
0 comments1 min readEA link

The Po­lar­ity Prob­lem [Draft]

Dan H23 May 2023 21:05 UTC
11 points
0 comments1 min readEA link