alt.hn

2/7/2026 at 12:53:39 PM

Reinforcement Learning from Human Feedback

https://rlhfbook.com/

by onurkanbkrc

2/7/2026 at 2:47:32 PM

Last time I saw Nathan say something about the book, he was actively working on the next version and looking for feedback. Check his socials.

by verdverm

2/7/2026 at 4:38:53 PM

You could say he's also learning from human feedback

by leggerss
