Gathering human feedback

OpenAI Blog · Aug 3, 2017

RL-Teacher is an open-source tool for training AIs via human feedback rather than hand-crafted reward functions. It is an early piece of work on reinforcement learning from human feedback (RLHF).

Categories: OSS & Tools, Research

Excerpt

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as a step towards safe AI systems, but also applies to reinforcement learning problems with rewards that are hard to specify.
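To make the idea of replacing a hand-crafted reward function with human feedback concrete, here is a minimal sketch. It is not RL-Teacher's code or API. It assumes the feedback arrives as pairwise comparisons between short behavior segments, and that the reward is linear in some per-step features. The names segment_features, fit_reward_model, simulated_human, and run_demo are all hypothetical, and the "human" is simulated by a hidden weight vector so the example runs on its own.

```python
import math
import random

def segment_features(segment):
    """Sum the per-step feature vectors of a segment (hypothetical featurization)."""
    dim = len(segment[0])
    return [sum(step[i] for step in segment) for i in range(dim)]

def predicted_return(weights, segment):
    """Linear reward model: total predicted reward of a segment."""
    return sum(w * f for w, f in zip(weights, segment_features(segment)))

def preference_probability(weights, seg_a, seg_b):
    """Bradley-Terry style model: P(a preferred over b) = sigmoid(R(a) - R(b))."""
    diff = predicted_return(weights, seg_a) - predicted_return(weights, seg_b)
    diff = max(-30.0, min(30.0, diff))  # clamp to keep exp() well-behaved
    return 1.0 / (1.0 + math.exp(-diff))

def fit_reward_model(comparisons, dim, lr=0.1, epochs=200):
    """Fit reward weights by stochastic gradient ascent on the log-likelihood
    of the observed preferences. Each comparison is (seg_a, seg_b, label),
    with label 1.0 if seg_a was preferred and 0.0 otherwise."""
    weights = [0.0] * dim
    for _ in range(epochs):
        for seg_a, seg_b, label in comparisons:
            p = preference_probability(weights, seg_a, seg_b)
            fa, fb = segment_features(seg_a), segment_features(seg_b)
            for i in range(dim):
                weights[i] += lr * (label - p) * (fa[i] - fb[i])
    return weights

def random_segment(rng, steps=5, dim=2):
    """A toy segment: a few steps of random features in [-1, 1]."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(steps)]

def simulated_human(seg_a, seg_b, true_weights):
    """Stand-in for a human rater: prefers the segment with higher hidden return."""
    better = predicted_return(true_weights, seg_a) > predicted_return(true_weights, seg_b)
    return 1.0 if better else 0.0

def run_demo(seed=0, n_train=200, n_test=200):
    """Train on simulated comparisons; return learned weights and held-out agreement."""
    rng = random.Random(seed)
    true_weights = [1.0, -0.5]  # hidden preferences, never shown to the learner

    def make_pairs(n):
        pairs = []
        for _ in range(n):
            a, b = random_segment(rng), random_segment(rng)
            pairs.append((a, b, simulated_human(a, b, true_weights)))
        return pairs

    train, test = make_pairs(n_train), make_pairs(n_test)
    weights = fit_reward_model(train, dim=2)
    agree = sum(
        1 for a, b, label in test
        if (preference_probability(weights, a, b) > 0.5) == (label == 1.0)
    )
    return weights, agree / len(test)

if __name__ == "__main__":
    learned, agreement = run_demo()
    print("learned weights:", learned)
    print("held-out agreement with simulated rater:", agreement)
```

In a full system the learned reward would then be handed to a reinforcement learning algorithm in place of a hand-written reward function. That step is omitted here, and a real reward model would typically be a neural network rather than a linear function.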

Read at source: https://openai.com/index/gathering-human-feedback