Reinforcement Learning from Human Feedback

rlhfbook.com

95 points

onurkanbkrc

9 hours ago


6 comments

verdverm 8 hours ago

Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials

  • leggerss 6 hours ago

    You could say he's also learning from human feedback