Aashish Rai
I'm a Computer Science Ph.D. student at Brown University, supervised by Srinath Sridhar at the
Interactive 3D Vision & Learning Lab (IVL).
I'm driven by the vision of building AI systems that understand the world like humans do.
My research focuses on two main areas: first, multimodal learning, where I study the interplay of vision, sound, and language;
and second, leveraging 2D foundation models for efficient 3D and 4D world reconstruction.
I hope my work can contribute to AI systems that effectively integrate different modalities for a richer understanding of dynamic environments.
I also spend time at Meta Reality Labs in Burlingame, California, working with Aayush Prakash.
Previously, I worked as a full-time Research Assistant at the Robotics Institute, Carnegie
Mellon University, advised by Fernando De la Torre at the Human Sensing Lab.
My work involved realistic 3D face generation by leveraging 2D models in collaboration with Meta Reality Labs.
Before joining CMU, I completed my undergraduate studies in ECE at the National Institute of Technology (NIT) Surat, India.
During this time, I worked with
Kishor Upla on problems in Deep Learning and Computer Vision. I also had the opportunity to work with researchers at
McGill University, the Norwegian Biometrics Laboratory,
and the Indian Space Research Organization (ISRO).
I am always open to research collaborations in vision, multimodal learning, and related fields. Feel free to email me to discuss potential collaborations.
Email / Google Scholar / GitHub / LinkedIn / CV
RESEARCH EXPERIENCE
I am fortunate to have worked with some of the best people at the following places:
Researcher (CW) Meta Reality Labs
Burlingame, CA, USA
(May/2024 - Present)
Hosted by: Aayush Prakash
Research Assistant Robotics Institute, Carnegie Mellon University
Pittsburgh, PA, USA
(Sept/2021 - June/2023)
Advisor: Fernando De la Torre
Research Intern Shared Reality Lab, McGill University
Montreal, Canada
(May/2020 - Mar/2021)
Advisor: Jeremy Cooperstock
Undergraduate Researcher Norwegian Biometrics Laboratory, NTNU Norway
(Dec/2019 - May/2020)
Advisors: Kishor Upla, Christoph Busch
Summer Research Intern IIRS, Indian Space Research Organization (ISRO)
Dehradun, India
(May/2019 - Jul/2019)
Advisor: Anil Kumar
Undergraduate Researcher MLCV Lab, NIT Surat
Surat, India
(Jan/2019 - Nov/2019)
Advisor: Kishor Upla
UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping
Aashish Rai, Dilin Wang, Mihir Jain, Nikolaos Sarafianos, Arthur Chen, Srinath Sridhar, Aayush Prakash
Accepted to CVPR 2025
project page | pdf
EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos
Aashish Rai, Srinath Sridhar
Winter Conference on Applications of Computer Vision (WACV), 2025
project page | pdf
Towards Realistic Generative 3D Face Models
Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando de la Torre
Winter Conference on Applications of Computer Vision (WACV), 2024
project page | pdf
Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance
Fariborz Taherkhani, Aashish Rai, Quankai Gao, Shaunak Srivastava, Xuanbai Chen, Fernando de la Torre, Steven Song, Aayush Prakash, Daeil Kim
Winter Conference on Applications of Computer Vision (WACV), 2023
project page | pdf
Improved Attribute Manipulation in the Latent Space of StyleGAN for Semantic Face Editing
Aashish Rai, Clara Ducher, Jeremy Cooperstock
20th IEEE International Conference on Machine Learning and Applications, Pasadena, CA, USA, 2021
pdf | project page
ComSupResNet: A Compact Super-Resolution Network for Low-Resolution Face Images
Aashish Rai, Vishal Chudasama, Kishor Upla, Kiran Raja, Raghavendra Ramachandra, Christoph Busch
8th International Workshop on Biometrics and Forensics (IWBF), Porto, Portugal, 2020
pdf | project page
(An extended version has been accepted to IEEE Transactions on Biometrics, Behavior, and Identity Science (T-BIOM).)
MORE
[Mentor] Google exploreCSR 2025 Ph.D. mentor.
[Teaching] Teaching Assistant (Fall 2024): CSCI 1430, Computer Vision.
[Reviewer] CVPR, ECCV, NeurIPS, ICLR, ICML, WACV, SIGGRAPH, etc.
[Hobbies] I love photography, scenic drives, biking, and reading books. Some of my photography work has been featured on Google Pixel and Pexels.