Intro
Hi, I'm Div. I am a computer scientist and AI researcher. I am on a leave of absence from a CS PhD at Stanford University. I am building MultiOn - a Personal AI Agent startup with the mission of building general-purpose AI similar to JARVIS to do your digital tasks and bring it to everyday consumer use.
I am passionate about all things Deep Learning as well as its applications to Robotics and Vision. My recent focuses have been on building systems that are generally intelligent and able to reason, designing learning algorithms that can learn as efficiently
as humans and making Reinforcement Learning work in the real world.
I created and taught the first course on Transformers at Stanford — CS 25: Transformers United — discussing the latest breakthroughs and broad implications of Transformers in AI. The class invites people at the forefront of Transformers research in various fields to spark cross-collaborative research, including eminent speakers like Prof. Geoffrey Hinton. Our lectures have received an overwhelming public reception with over 1M views on Youtube.
I have given invited talks at OpenAI, DeepMind and Apple on my research. I recently worked on a new RL algorithm — Extreme Q-Learning — that builds on a 2000 Nobel prize work in Economics to solve a previously assumed intractable problem and reaches state-of-the-art in Offline RL. My past work — IQ-Learn — introduces a novel imitation learning framework to have AI agents learn from observing videos of humans, creating agents that can play video games like Atari at human performance and won the #1 place in NeurIPS '21 Minecraft AI competition. The work received media coverage from Stanford HAI.
In my undergrad at Cornell, I created the first working camera-based self-driving car system and published 4 major conference papers over 2 years in 3D Vision. My work was widely covered in the media: Forbes, The Robot Report, Gizmodo.
I have been fortunate to be mentored by Ian Goodfellow (and was his first intern at Apple).
In a past life, I was a child prodigy in Physics and won medals in a few International Olympiads.
Recent News:
- Jan 2024: Announcing MultiOn: Building a Brighter Future for Humanity with AI Agents
- April 2023: I gave research talks on AI Autonomous Agents at the Stanford NLP group & Langchain Agents Webinar on Youtube.
- March 2023: Released a blog post on the future of software: Software 3.0
- Feb 2023: Extreme Q-learning was accepted in ICLR 2023 as Oral!
- Nov 2022: Released my very first blog post on Learning to Imitate to improve current AI systems on the Stanford AI blog.
- Sept 2022: Got Oral in European Workshop for Reinforcement Learning (EWRL) for IQ-Learn
- Aug 2022: Publicly released our CS 25: Transformers United lectures on Youtube with great reception!
- May 2022: Stanford HAI featured a story on IQ-Learn: Training Smarter Bots for the Real World!
- March 2022: Released LISA: a heirarchical framework to make robots better understand natural language for solving long-range tasks on Arxiv
- Dec 2021: Won #1 place in creating an AI bot to play Minecraft in NeurIPS '21 MineRL Basalt challenge using only recorded videos of human players
- Oct 2021: Gave invited research talk at Apple on IQ-Learn
- Sept 2021: IQ-Learn was accepted in NeurIPS 2021 with Spotlight!
- Aug 2021: Teaching Stanford's first class on Transformers: CS 25
- July 2021: I gave a invited research talk at OpenAI on improving imitation by more than 5x with my recent work: IQ-Learn.
- June 2021: Preprint on our state-of-art Imitation Learning method IQ-Learn available on Arxiv
- Nov 2020: I gave a talk on a youtube channel: Computer Vision Talks
- Sept 2020: My paper for statistical learning of stereo depth was accepted in NeurIPS 2020 with Spotlight!
- March 2020: Started Apple internship under Ian Goodfellow.
- Feb 2020: New paper on 3D Object Detection accepted in CVPR 2020.
Experience
I worked as a research intern in Apple Special Projects Group. Directly supervised by renowned researcher Ian Goodfellow. Researched on RL, Inverse RL and Generative Modeling.
CV Research
I did Computer Vision research with Prof. Kilian Q. Weinberger and Prof. Bharath Hariharan . My focus was on camera-only depth estimation and 3D object detection. I created a state-of-the-art model for stereo-only 3D object detection.
Publications
Extreme Q-Learning: MaxEnt RL without Entropy
LISA: Learning Interpretable Skill Abstractions from Language
IQ-Learn: Inverse soft-Q Learning for Imitation
Media Coverage: Stanford HAI
Wasserstein Distances for Stereo Disparity Estimation
End-to-end Pseudo-LiDAR for Image-Based 3D Object Detection
Pseudo-LiDAR++: Accurate Depth for 3D Object Detection in Autonomous Driving
Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving
Media Coverage: Cornell Chronicle, Forbes, NSF, Teslarati, Gizmodo
* denotes equal contribution