Undergraduate Researcher · RAIVN Lab
Mentored by Nabil Omi · Advised by Prof. Ali Farhadi
- Co-authoring a paper (under review) training 145,000+ policies across 114 datasets to study how reporting and hyperparameter choices affect offline-RL benchmark conclusions.
- Implemented and adapted offline RL algorithms (CQL, BCQ) in PyTorch and d4rl; integrated ViZDoom environments into the MuJoCo-based benchmark suite.
- Built data pipelines for JumpStart — the project's open-source resource suite of trained models, per-model hyperparameter and reward data, and strong baselines across 114 datasets.
- PyTorch
- d4rl
- MuJoCo
- ViZDoom
- CQL
- BCQ