Hacker News new | past | comments | ask | show | jobs | submit

Reinforcement Learning from Human Feedback

https://rlhfbook.com/
loading story #46926118
loading story #46924305
loading story #46923832
loading story #46923959