Aashish Rai

I'm a Computer Science Ph.D. student at Brown University, supervised by Srinath Sridhar at the Interactive 3D Vision & Learning Lab (IVL). My research focuses on two main areas: first, multimodal learning, where I study the interplay of vision, sound, and language; and second, leveraging 2D foundation models for efficient 3D and 4D world reconstruction. I hope my work can contribute to building AI systems that can effectively integrate different modalities for richer dynamic environmental understanding.

I also spent time at Meta Reality Labs in Burlingame, California, working with Aayush Prakash on leveraging 2D foundation models for 3D asset synthesis.

Previously, I worked as a full-time Research Assistant at Robotics Institute, Carnegie Mellon University, advised by Fernando De la Torre at Human Sensing Lab. My work involved realistic 3D face generation by leveraging 2D models in collaboration with Meta Reality Labs.

Before joining CMU, I completed my undergraduate studies in ECE at the National Institute of Technology (NIT) Surat, India. During this time, I worked with Kishor Upla on problems in Deep Learning and Computer Vision. I also had the opportunity to work with McGill University, Norwegian Biometrics Lab, and others.

I am always open to research collaborations in vision, multimodal learning, and related fields. Feel free to email me to discuss potential collaborations.

Email / Google Scholar / GitHub / LinkedIn / CV

RECENT UPDATES

[Feb 2025] One paper [ UVGS ] accepted at CVPR 2025. See you in Nashville TN!
[Oct 2024] One paper [ EgoSonics ] accepted at WACV 2025. See you in Arizona!
[May 2024] Started working at Meta Reality Labs in Burlingame, CA.
[Oct 2023] One paper [ Towards Realistic Generative 3D Face Models ] accepted at WACV 2024.
[Aug 2023] Started Ph.D. in Computer Science at Brown University.
[May 2023] Our work [ Towards Realistic Generative 3D Face Models ] is trending on Twitter. Covered by media, Synced and many other!!
[Jan 2023] Presented one paper on 3D Face Generation at WACV 2023 in Hawaii.
[Sep 2022] Collaborated with Pavlos Protopapas's group at Harvard University to work on Black Holes research.

RESEARCH EXPERIENCE

I am fortunate to have worked at the following places:

	Researcher/Intern (CW) Meta Reality Labs Burlingame, CA, USA (May/2024 - 2025) Hosted by: Aayush Prakash
	Research Assistant Robotics Institute, Carnegie Mellon University Pittsburgh, PA, USA (Sept/2021 - June/2023) Advisor: Fernando De la Torre
	Research Intern Shared Reality Lab, McGill University Montreal, Canada (May/2020 - Mar/2021) Advisor: Jeremy Cooperstock
	Undergraduate Researcher Norwegian Biometrics Laboratory, NTNU Norway (Dec/2019 - May/2020) Advisor: Kishor Upla, Christoph Busch
	Undergraduate Researcher MLCV Lab, NIT Surat Surat, India (Jan/2019 - Nov/2019) Advisor: Kishor Upla

PUBLICATIONS/SUBMISSIONS

	GaussUV: Long-Duration Free-Viewpoint Video with Gaussian UV Maps Aashish Rai, Xiaoyan Cong, Angela Xing, Zekun Li, Tao Lu, Srinath Sridhar in submission...
	UVGS: Reimagining Unstructured 3D Gaussian Splatting using UV Mapping Aashish Rai, Dilin Wang, Mihir Jain, Nikolaos Sarafianos, Arthur Chen, Srinath Sridhar, Aayush Prakash CVPR 2025 project page \| pdf
	EgoSonics: Generating Synchronized Audio for Silent Egocentric Videos Aashish Rai, Srinath Sridhar Winter Conference on Applications of Computer Vision (WACV), 2025 project page \| pdf
	Towards Realistic Generative 3D Face Models Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando de la Torre Winter Conference on Applications of Computer Vision (WACV), 2024 project page \| pdf
	Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance Fariborz Taherkhani, Aashish Rai, Quankai Gao, Shaunak Srivastava, Xuanbai Chen, Fernando de la Torre, Steven Song, Aayush Prakash, Daeil Kim Winter Conference on Applications of Computer Vision (WACV), 2023 project page \| pdf
	Improved Attribute Manipulation in the Latent Space of StyleGAN for Semantic Face Editing Aashish Rai, Clara Ducher, Jeremy Cooperstock 20th IEEE International Conference on Machine Learning and Applications, Pasadena, CA, USA, 2021 pdf \| project page
	ComSupResNet: A Compact Super-Resolution Network for Low-Resolution Face Images. Aashish Rai, Vishal Chudasama, Kishor Upla, Kiran Raja, Raghavendra Ramachandra, Christoph Busch 8th International Workshop on Biometrics and Forensics (IWBF), Porto, Portugal, 2020 pdf \| project page (extended version is accepted in IEEE Transactions on Biometrics, Behavior and Identity Science (T-BIOM))

[Mentor] Google exploreCSR 2025 Ph.D. mentor.
[Teaching] Teaching Assistant (Fall 2024): CSCI 1430, Computer Vision.
[Reviewer] CVPR, ECCV, NeurIPS, ICLR, ICML, WACV, SIGGRAPH, etc.
[Hobbies] I love photography, scenic drives, biking, and reading books. Some of my photography work has been featured on Google Pixel and Pexels.

(website template modified from repo )

Hey folks from around the globe!!