Jean Renard

Karma: −14

What if AI didn’t need to want to harm us to change the world?

Jean Renard 21 May 2026 14:34 UTC

−6 points

0 comments · 1 min read · EA link

Jean Renard 18 May 2026 21:34 UTC
−6 points
on: Open thread: 2026 Q2 (April—June)
Hi everyone, I’m Jean, a Total Rewards professional currently exploring the AI safety space.
I’ve spent the last few years designing incentive systems for humans: compensation structures, recognition programs, performance frameworks. The core challenge is always the same — how do you translate an organization’s values into measurable criteria, without creating perverse incentives that undermine the very goals you’re trying to reach?
At some point I realized this is exactly the alignment problem.
The more I read about RLHF, reward hacking, and Constitutional AI, the more I recognized patterns I’d already encountered in my work. Goodhart’s Law isn’t a theoretical concern for me — I’ve watched it play out in bonus systems. The tension between extrinsic and intrinsic motivation? I’ve navigated that in practice.
I’m at the beginning of this transition. I’ve just started working through AI Safety Fundamentals and I’m looking to connect with people thinking seriously about the human side of alignment — governance, evaluation, values specification.
If you work at the intersection of behavioral systems and AI alignment, or if you think Total Rewards intuitions could translate meaningfully into this field, I’d genuinely love to talk.
Happy to be here.