Strad Slater

Karma: 34

Grokking: When AI Suddenly Starts to Understand

Strad Slater1 Dec 2025 8:00 UTC

4 points

1 comment4 min readEA link

(williamslater2003.medium.com)

How Goodfire Is Turning AI Interpretability Into Real Products

Strad Slater30 Nov 2025 11:00 UTC

0 points

0 comments4 min readEA link

(williamslater2003.medium.com)

4 Lessons From Anthropic on Scaling Interpretability Research

Strad Slater29 Nov 2025 11:22 UTC

4 points

0 comments4 min readEA link

(williamslater2003.medium.com)

Why Explaining AI Is Not the Same as Understanding It

Strad Slater28 Nov 2025 10:38 UTC

2 points

0 comments4 min readEA link

(williamslater2003.medium.com)

Reflections on Dario Amodei’s ‘Urgency of Interpretability’

Strad Slater27 Nov 2025 8:30 UTC

2 points

0 comments5 min readEA link

(williamslater2003.medium.com)

The Causal Inner Product: How LLMs Turn Concepts Into Directions (Part 2)

Strad Slater26 Nov 2025 11:03 UTC

2 points

0 comments4 min readEA link

(williamslater2003.medium.com)

Inside the Linear Representation Hypothesis: How LLMs Turn Concepts Into Directions (Part 1)

Strad Slater25 Nov 2025 11:26 UTC

4 points

0 comments4 min readEA link

(williamslater2003.medium.com)

The Hidden Problem Inside Every AI Model: Superposition

Strad Slater24 Nov 2025 10:14 UTC

4 points

1 comment4 min readEA link

(williamslater2003.medium.com)

Goodfire — The Startup Trying to Decode How AI Thinks

Strad Slater23 Nov 2025 10:22 UTC

2 points

1 comment5 min readEA link

(williamslater2003.medium.com)

Are AI Models Escaping Plato’s Cave?

Strad Slater22 Nov 2025 11:46 UTC

2 points

0 comments5 min readEA link

(williamslater2003.medium.com)

The Universality Hypothesis — Do All AI Models Think The Same?

Strad Slater21 Nov 2025 10:55 UTC

2 points

0 comments4 min readEA link

(williamslater2003.medium.com)

Mechanistic Interpretability — Make AI Safe By Understanding Them

Strad Slater20 Nov 2025 10:52 UTC

2 points

0 comments6 min readEA link

(williamslater2003.medium.com)

6 Insights From Anthropic’s Recent Discussion On LLM Interpretability

Strad Slater19 Nov 2025 10:51 UTC

2 points

0 comments5 min readEA link

(williamslater2003.medium.com)

6 Ways AI Can Harm You — and How to Stop It

Strad Slater18 Nov 2025 10:36 UTC

3 points

0 comments6 min readEA link

(williamslater2003.medium.com)

Why You Should Think About What You Tell ChatGPT : Data Privacy In A World With AI

Strad Slater17 Nov 2025 10:15 UTC

2 points

1 comment5 min readEA link

(williamslater2003.medium.com)

How I Deal With My Anxiety Around AI

Strad Slater16 Nov 2025 11:30 UTC

4 points

0 comments4 min readEA link

(williamslater2003.medium.com)

Do Humans Have A Competitive Edge Against AGI?

Strad Slater15 Nov 2025 10:20 UTC

4 points

0 comments7 min readEA link

(williamslater2003.medium.com)

Why ChatGPT Can’t Be Your Therapist

Strad Slater14 Nov 2025 10:07 UTC

13 points

0 comments4 min readEA link

(williamslater2003.medium.com)

The Need for an Effective AI Incident Reporting Framework

Strad Slater13 Nov 2025 8:53 UTC

2 points

0 comments4 min readEA link

(williamslater2003.medium.com)

The Merge: A Potentially Positive Future with Super Intelligence

Strad Slater12 Nov 2025 8:00 UTC

2 points

0 comments4 min readEA link

(williamslater2003.medium.com)