This repository contains the official release of data generation code, simulation environments, and datasets for the CoRL 2023 paper "MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations".
The released datasets contain over 48,000 task demonstrations across 12 tasks and the MimicGen data generation tool can create as many as you'd like.
Website: https://mimicgen.github.io
Paper: https://arxiv.org/abs/2310.17596
Documentation: https://mimicgen.github.io/docs/introduction/overview.html
For business inquiries, please submit this form: NVIDIA Research Licensing
- [07/09/2024] v1.0.0: Full code release, including data generation code
- [04/04/2024] v0.1.1: Dataset license changed to CC-BY 4.0, which is less restrictive (see License)
- [09/28/2023] v0.1.0: Initial code and paper release
Some helpful suggestions on useful documentation pages to view next:
- Getting Started
- Launching Several Data Generation Runs
- Reproducing Published Experiments and Results
- Data Generation for Custom Environments
- Overview of MimicGen Codebase
Please see the troubleshooting section for common fixes, or submit an issue on our github page.
The code is released under the NVIDIA Source Code License and the datasets are released under CC-BY 4.0.
Please cite the MimicGen paper if you use this code in your work:
@inproceedings{mandlekar2023mimicgen,
title={MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations},
author={Mandlekar, Ajay and Nasiriany, Soroush and Wen, Bowen and Akinola, Iretiayo and Narang, Yashraj and Fan, Linxi and Zhu, Yuke and Fox, Dieter},
booktitle={7th Annual Conference on Robot Learning},
year={2023}
}