Learning to play Minecraft with Video PreTraining
OpenAI's Video PreTraining (VPT) learned Minecraft gameplay from unlabeled human videos and, after fine-tuning, can craft diamond tools, a task that takes proficient humans over 20 minutes (24,000 actions). It is a step toward general computer-using agents.
Excerpt
We trained a neural network to play Minecraft by Video PreTraining (VPT) on a massive unlabeled video dataset of human Minecraft play, while using only a small amount of labeled contractor data. With fine-tuning, our model can learn to craft diamond tools, a task that usually takes proficient humans over 20 minutes (24,000 actions). Our model uses the native human interface of keypresses and mouse movements, making it quite general, and represents a step towards general computer-using agents.
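The "20 minutes (24,000 actions)" figure in the excerpt is consistent with one action per game tick at 20 actions per second. That rate is an assumption here, since the excerpt does not state it. A minimal sketch of the arithmetic, using the hypothetical helper `actions_for_minutes`:

```python
# Assumption (not stated in the excerpt): the agent issues one action per
# game tick, at 20 ticks per second.
ACTIONS_PER_SECOND = 20


def actions_for_minutes(minutes: float, actions_per_second: int = ACTIONS_PER_SECOND) -> int:
    """Return the number of actions issued over the given number of minutes."""
    seconds = minutes * 60
    return int(seconds * actions_per_second)


# 20 minutes * 60 s/min * 20 actions/s = 24,000 actions
print(actions_for_minutes(20))
```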
Read at source: https://openai.com/index/vpt