RSS

AlexChalk

Karma: 3

Re­in­force­ment Learn­ing: A Non-Tech­ni­cal Primer on o1 and Deep­Seek-R1

AlexChalkFeb 9, 2025, 11:58 PM
4 points
0 comments9 min readEA link
(alexchalk.net)