Acknowledgments
Initial versions of this book were compiled as lecture notes to the class CS329H: Machine Learning from Human Preferences at Stanford University taught in Fall 2023 and Fall 2024. We thank Rehaan Ahmad, Ahmed Ahmed, Jirayu Burapacheep, Michael Byun, Akash Chaurasia, Andrew Conkey, Tanvi Deshpande, Eric Han, Laya Iyer, Adarsh Jeewajee, Shreyas Kar, Arjun Karanam, Jared Moore, Aashiq Muhamed, Bidipta Sarkar, William Shabecoff, Stephan Sharkov, Max Sobol Mark, Kushal Thaman, Joe Vincent, Yibo Zhang, Duc Nguyen, Grace Sodunke, Ky Nguyen, and Mykkel Kochenderfer for their early contributions and feedback.
Citation
Thanks for reading our book! We hope you find this book useful in your research and teaching.
BibTeX citation:
@book{mlhp,
author = {Truong, Sang and Haupt, Andreas and Koyejo, Sanmi},
title = {{Machine Learning from Human Preferences}},
year = {2025},
publisher = {Stanford University},
doi = {},
note = {}
}
For attribution, please cite this work as:
S. Truong, A. Haupt, and S. Koyejo. 2025. Machine Learning from Human Preferences. Stanford University.