Publications
For a complete and up to date list of publications please visit Google Scholar.
-
ICLR
2026
FlashDLM: Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion
Zhanqiu Hu, Jian Meng, Yash Akhauri, Mohamed S. Abdelfattah, Jae-sun Seo, Zhiru Zhang, Udit Gupta
To appear in ICLR 2026
2026
-
DATE
COFFEE: A Carbon-Modeling and Optimization Framework for HZO-based FeFET eNVMs
Hongbang Wu, Xuesi Chen, Shubham Jadhav, Amit Lal, Lillian Pentecost, and Udit Gupta
To appear in DATE 2026
-
ICCAD
2025
EPiCarbon: A Carbon Modeling Tool for Electro-Photonic Accelerators
Farbin Fayza, Cansu Demirkiran, Satyavolu Papa Rao, Darius Bunandar, Udit Gupta, Ajay Joshi
IEEE/ACM International Conference On Computer Aided Design
2025
-
ICCAD
CarbonClarity: Understanding and Addressing Uncertainty in Embodied Carbon for Sustainable Computing
Xuesi Chen, Leo Han, Anvita Bhagavathula, Udit Gupta
IEEE/ACM International Conference On Computer Aided Design
-
NatC
Photonics for sustainable AI
Farbin Fayza, Cansu Demirkiran, Satyavolu Papa Rao, Darius Bunandar, Udit Gupta, Ajay Joshi
Nature Communications Physics
-
NEJM
Sustainably Advancing Health AI: A Decision Framework to Mitigate the Energy, Emissions, and Cost of AI Implementation
Anu Ramachandran, Chethan Sarabu, Udit Gupta, Shomit Ghose, Vivian S Lee
New England Journal of Medicine (NEJM) Catalyst Innovations in Care Delivery
-
SIGM
Metrics for Data Center Embodied Carbon
Leo Han, Yueying Lisa Li, Udit Gupta
ACM SIGMETRICS Performance Evaluation Review
-
SIGM
Fair, Practical, and Efficient Carbon Accounting for LLM Serving
Yueying Lisa Li, Leo Han, G Edward Suh, Christina Delimitrou, Fiodar Kazhamiaka, Esha Choukse, Rodrigo Fonseca, Liangcheng Yu, Jonathan Mace, Udit Gupta
ACM SIGMETRICS Performance Evaluation Review
-
DIS
Silicon Foraging: Harvesting Excess Compute for Sustainable Edge Computing
Ilan Mandel, Udit Gupta
ACM Designing Interactive Systems Conference
-
SIGM
From Component to System: Rethinking Edge Computing Design through a Carbon-Aware Lens
Xuesi Chen, Ariel Goldner, Eren Yildiz, Ilan Mandel, Tingyu Cheng, Josiah Hester, Udit Gupta
ACM SIGENERGY Energy Informatics Review
-
CACM
Empowering Users to Make Sustainability-Forward Decisions for Computing Services
Sydney Young, Udit Gupta , Josiah Hester
Communications of the ACM
-
ISCA
Hermes: Algorithm-System Co-design for Efficient Retrieval-Augmented Generation At-Scale
Michael Shen, Muhammad Umar, Kiwan Maeng, G Edward Suh, Udit Gupta
International Symposium on Computer Architecture -
ISCA
Fair-CO2: Fair attribution for cloud carbon emissions
Leo Han, Jash Kakadia, Benjamin C Lee, Udit Gupta
International Symposium on Computer Architecture -
JCSS
Slower is Greener: Acceptance of Eco-feedback Interventions on Carbon Heavy Internet Services
Hyeonwook Kim, Sydney Young, Xuesi Chen, Udit Gupta , Josiah Hester
ACM Journal on Computing and Sustainable Societies
-
IoTJ
GreenScale: Carbon Optimization for Edge Computing
Yonglak Son, Udit Gupta, Andrew McCrabb, Young Geun Kim, Valeria Bertacco, David Brooks, Carole-Jean Wu
IEEE Internet of Things Journal
-
HPCA
CORDOBA: Carbon-efficient optimization framework for computing systems
Mariam Elgamal, Doug Carmean, Elnaz Ansari, Okay Zed, Ramesh Peri, Srilatha Manne, Udit Gupta, Gu-Yeon Wei, David Brooks, Gage Hills, Carole-Jean Wu
IEEE International Sympsoium on High Performance Computer Architecture
-
arXiv
Ecoserve: Designing carbon-aware ai inference systems
Yueying Li, Zhanqiu Hu, Esha Choukse, Rodrigo Fonseca, G Edward Suh, Udit Gupta
-
JXCDC
2024
Energy-/carbon-aware evaluation and optimization of 3d ic architecture with digital compute-in-memory designs
Hyung Joon Byun, Udit Gupta , Jae-sun Seo
IEEE Journal on Exploratory Solid-State Computational Devices and Circuits
2024
-
ISPLED
3d ic architecture evaluation and optimization with digital compute-in-memory designs
Hyung Joon Byun, Udit Gupta , Jae-sun Seo
International Symposium on Low Power Electronics and Design
-
USENIX
Information flow control in machine learning through modular model architecture
Trishita Tiwari, Suchin Gururangan, Chuan Guo, Weizhe Hua, Sanjay Kariyappa, Udit Gupta , Wenjie Xiong, Kiwan Maeng, Hsien-Hsin S Lee, G Edward Suh
USENIX Security Symposium
-
ASPLOS
2023
MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation
Samuel Hsia, Udit Gupta, Bilge Acun, Newsha Ardalani, Pan Zhong, Gu-Yeon Wei, David Brooks, Carole-Jean Wu
International Conference on Architectural Support for Programming Languages and Operating Systems
2023
ASPLOS
A Holistic Approach for Designing Carbon Aware Datacenters
Bilge Acun, Benjamin Lee, Kiwan Maeng, Manoj Chakkaravarthy, Udit Gupta, David Brooks, Carole-Jean Wu
International Conference on Architectural Support for Programming Languages and Operating Systems
-
ISCA
2022
ACT: Designing Sustainable Computer Systems With An Architectural Carbon Modeling Tool
Udit Gupta , Mariam Elgamal, Gage Hills, Gu-Yeon Wei, Hsien-Hsin S. Lee, David Brooks, Carole-Jean Wu
International Symposium on Computer Architecture
[PDF] [Slides] [Github]
IEEE MICRO Top Picks Award!2022
-
MLSys
Sustainable AI: Environmental Implications, Challenges and Opportunities
Carole-Jean Wu, Ramya Raghavendra, Udit Gupta, Bilge Acun, Newsha Ardalani, Kiwan Maeng, et. al.
Machine Learning and Systems
[ArXiv] -
HPCA
Hercules: Heterogeneity-Aware Inference Serving for At-scale Personalized Recommendation
Liu Ke, Udit Gupta, Mark Hempstead, Carole-Jean Wu, Hsien-Hsin Sean Lee, Xuan Zhang
IEEE International Symposium on High-Performance Computer Architecture -
MICRO
2021
RecPipe: Co-designing Models and Hardware to Jointly Optimize Recommendation Quality and Performance
Udit Gupta, Samuel Hsia, Jeff Zhang, Mark Wilkening, Javin Pombra, Hsien-Hsin S. Lee, Gu-Yeon Wei, Carole-Jean Wu, David Brooks
IEEE/ACM International Symposium on Microarchitecture
[ArXiv] [Github] [MICRO 2021 Talk] [MICRO 2021 Lightning Talk] [PDF]
[Artifact] (available, functional, and reproducible)2021
-
WCAE
Mentoring Opportunities in Computer Architecture: Analyzing the Past to Develop the Future
Elba Garza, Gururaj Saileshwar, Udit Gupta, Tianyi Liu, Abdulrahman Mahmoud, Saugata Ghose, Joel Emer
Workshop on Computer Architecture Education (WCAE) co-located with ISCA
[PDF] -
HPCA
RecSSD: near data processing for solid state drive based recommendation inference
Mark Wilkening, Udit Gupta, Samuel Hsia, Caroline Trippel, Carole-Jean Wu, David Brooks, Gu-Yeon Wei
IEEE International Sympsoium on High Performance Computer Architecture
[PDF] -
HPCA
Chasing Carbon: The Elusive Environmental Footprint of Computing
Udit Gupta, Young Geun Kim, Silvia Lee, Jordan Tse, Hsien-Hsin S. Lee, Gu-Yeon Wei, David Brooks, Carole-Jean Wu
IEEE International Sympsoium on High Performance Computer Architecture
[PDF]
IEEE MICRO Top Picks Award! -
IISWC
2020
Cross-stack workload characterization of deep recommendation systems
Samuel Hsia, Udit Gupta , Mark Wilkening, Carole-Jean Wu, Gu-Yeon Wei, David Brooks
International Sympsoium on Workload Characterization
[PDF]2020
-
ISCA
DeepRecSys: A System for Optimizing End-to-End At-scale Neural Recommendation Inference
Udit Gupta, Samuel Hsia, Vikram Saraph, Xiaodong Wang, Brandon Reagen, Gu-Yeon Wei, Hsien-Hsin S. Lee, David Brooks, Carole-Jean Wu
International Sympsoium on Computer Architecture
[PDF] -
ISCA
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing
Liu Ke, Udit Gupta, Benjamin Youngjae Cho, David Brooks, et. al.
International Sympsoium on Computer Architecture
[PDF] -
MLSys
MLPerf Training Benchmark
Peter Mattson, Christine Cheng, Gregory Diamos, Cody Coleman, Paulius Mickievicius, David Patterson, et. al.
Proceedings of Machine Learning and Systems
[PDF] -
MLSys
The Architectural Implications of Facebook's DNN-bsed Personalized Recommendation Models
Udit Gupta, Carole-Jean Wu, Xiaodong Wang, Maxim Naumov, Brandon Reagen, David Brooks, et. al.
International Symposium on High Performance Computer Architecture
[PDF]
IEEE MICRO Top Picks Honorable Mention! -
ProTools
2019
ChampVis: Comparative Hierarchical Analysis of Microarchitectural Performance
Lillian Pentecost, Udit Gupta, Elisa Ngan, Johanna Beyer, Gu-Yeon Wei, David Brooks
IEEE/ACM International Workshop on Programming and Performance Visualization Tools (ProTools) co-located with Super Computing
2019
-
MICRO
MaxNVM: Maximizing DNN Storage Density and Inference Efficiency with Sparse Encoding and Error Mitigation
Lillian Pentecost, Marco Donato, Brandon Reagen, Udit Gupta, Siming Ma, Gu-Yeon Wei, David Brooks
IEEE/ACM International Symposium on Microarchitecture
-
PACT
MASR: A Modular Accelerator for Sparse RNNs
Udit Gupta, Brandon Reagen, Lillian Pentecost, Marco Donato, Thierry Tambe, Alexander Rush, Gu-Yeon Wei, David Brooks,
International Conference on Parallel Architecture and Compilation Techniques
[PDF]
Best paper nominee! -
VLSI
A 16nm 25mm2 SoC with 54.5x Flexibility-Efficiency Range from Dual-Core ARM Cortex-A53 to eFPGA and Cache-Coherent Accelerators
Paul Whatmough, Sae Kyu Lee, Marco Donato, Thomas Hsueh, Sam Xi, Udit Gupta, Lillian Pentecost, Glenn Ko, David Brooks, Gu-Yeon Wei
Symposium on VLSI Circuits -
arXiv
Deep Learning Recommendation Model for Personalization and Recommendation Systems
Maxim Naumov, Dheevatsa Mudigere, Michael Whi, Jianyu Huang, Narayanan Sundaraman, Jongoo Park, Xiaodong Wang, Udit Gupta, Carole-Jean Wu et. al.
arXiv -
ICML
2018
Weightless: Lossy Weight Encoding for Deep Neural Network Compression
Brandon Reagen, Udit Gupta, Bob Adolf, Michael Mitzenmacher, Alexander Rush, Gu-Yeon Wei, David Brooks
International Conference on Machine Learning
2018
-
DAC
Ares: A Framework for Quantifying the Resilience of Deep Neural Networks
Brandon Reagen, Udit Gupta, Lillian Pentecost, Paul Whatmough, Sae Kyu Lee, Niam Mullholland, David Brooks, Gu-Yeon Wei
Design Automation Conference
Best paper nominee! -
DAC
On-chip Deep Neural Network Storage with Multi-level eNVM
Marco Donato, Brandon Reagen, Lillian Pentecost, Udit Gupta, David Brooks, Gu-Yeon Wei
Design Automation Conference -
FPGA
Rosetta: A Realistic High-level Synthesis Benchmark Suite for Software Programmable FPGAs
Yuan Zhou, Udit Gupta, Steve Dai, Ritchie Zhao, Nitish Srivastava, Hanchen Jin, Joseph Featherston, Yi-Hsiang Lai, Gai Liu, Gustavo Angarita Velasquez, Wenping Wang, Zhiru Zhang
International Symposium on Field Programmable Gate Arrays -
FPGA
2017
Dynamic Hazard Resolution of Pipelining Irregular Loops in High-level Synthesis
Steve Dai, Ritchie Zhao, Shreesha Srinath, Udit Gupta, Chris Batten, Zhiru Zhang
International Symposium on Field Programmable Gate Arrays2017
-
FPGA
2015
Mapping-aware Constrained Scheduling for LUT-based FPGAs
Mingxing Tan, Steve Dai, Udit Gupta, Zhiru Zhang
International Symposium on Field Programmable Gate Arrays2015
Patents
-
Computer memory module processing device with cache storage
Liu Ke, Silvia Zhang, Udit Gupta , Carole-Jean Wu, Mark Hempstead, Brandon Reagen, Hsien-Hsin S. Lee
Theses and Books
-
Enabling High Performance, Efficient, and Sustainable Deep Learning Systems At Scale
Udit Gupta
PhD Dissertation, Computer Science, Harvard University, 2022
[PDF]