Greetings! Welcome to the IAP Newsletter with research publications, conferences and news regarding applications and infrastructure for AI and machine learning, hardware acceleration, operating systems, networking, security, and storage.


In this edition, we highlight key papers at USENIX ATC, SOUPS, SIGCOMM, MICRO-58, EMNLP, SoCC and NeurIPS, in addition to Awards, Books, Blogs and News Items. We begin with a recap of recent IAP Workshops at UC Berkeley and the University of Washington.

AI and CLOUD WORKSHOPS - 2025

UC Berkeley Workshop on the Future of AI in the Cloud


Tuesday, November 18, 2025


Soda Hall, UC Berkeley, Berkeley, CA 

This workshop was hosted and co-organized by Prof. Sagar Karandikar with keynote speakers Prof. Dave Patterson and Prof. Ion Stoica (portraits above).

Pictured above are Ion Stoica, Natacha Crooks, and Ralph Wittig presenting their research on November 18.


Speakers (in order of appearance):


KEYNOTE: Prof. Dave Patterson, UC Berkeley and Google, the Pardee Professor of Computer Science, Emeritus, "How to Give AI a Bad Carbon Footprint"


Dr. Bilge Acun, Research Scientist, FAIR / Meta Superintelligence Labs, “CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization”


Ralph Wittig, Head of Research & Advanced Development, AMD, "Accelerating the Future: AI, Compute and Innovation at Scale"


Prof. Sagar Karandikar, UC Berkeley, “Agile Hardware/Software Co-Design for Hyperscale Cloud Systems”


Lightning Session for Student Posters: Best Poster Award: "Autocomp: A Powerful and Portable Code Optimizer for Tensor Accelerators" by Charles Hong, Sahil Bhatia, Alvin Cheung and Sophia Shao.


KEYNOTE: Prof. Ion Stoica, UC Berkeley, the Xu Bao Chancellor's Chair Professor, "An AI Stack: from Scaling AI Workloads to Evaluating LLMs"


Dr. Erich Haratsch, Senior Director of Architecture, Marvell, “Data Storage Innovations for Scalable AI Infrastructure”


Prof. Sophia Shao, UC Berkeley, “From Algorithms to Silicon: Accelerating Full-Stack Co-Design for the AI Era”


Dr. Liguang Xie, Engineering Director of Global Compute Infrastructure, ByteDance, “Towards Disaggregated LLM Inference: AIBrix, Agentic Workloads, and the Rise of DPU-Accelerated AI Systems”


Prof. Natacha Crooks, UC Berkeley, “Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First”


This was the third AI and Cloud Workshop hosted by UC Berkeley. Please see the BERKELEY WORKSHOP WEB PAGE for the speaker bios, abstracts and videos of the presentations.

UW WORKSHOP ON THE FUTURE OF AI AND CLOUD COMPUTING


Friday, May 9, 2025


Husky Union Building, UW, Seattle, WA 

Shihang Vic Li and Matthew Giordano won the Best Poster Award for "NEMO: Flexible and High-Fidelity Telemetry on Programmable Memory Controllers." Congratulating them (left to right) are Ulf Hanebutte (Marvell), Prof. Stephanie Wang, Mats Oberg (Marvell), Prof. Tom Anderson, Prof. Baris Kasikci, Prof. Simon Peter, Liguang Xie (ByteDance), Victor Cao (Futurewei) and Brad Beckmann (AMD).


This workshop was hosted by Prof. Stephanie Wang. Please see the UW WORKSHOP WEB PAGE for the speaker bios, abstracts and videos of the presentations.

AWARDS for IAP COLLEAGUES


ACM SIGARCH Maurice Wilkes Award

Carole-Jean Wu, Meta


ACM SIGARCH Alan D. Berenbaum Distinguished Service Award 

Joel Emer, MIT and Nvidia


ACM Charles P. “Chuck” Thacker Breakthrough in Computing Award

Jason Cong, UCLA


2025 Sloan Fellow

Natacha Crooks, University of California, Berkeley


Presidential Early Career Award for Scientists and Engineers

Christina Delimitrou, MIT


BEST DISSERTATION and PAPER AWARDS


2025 ACM SIGARCH/IEEE CS TCCA Outstanding Dissertation Award

Sagar Karandikar, UC Berkeley


2025 NeurIPS Madrona Prize

"Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward"

Yanming Wan, Jiaxing Wu, Marwa Abdulhai, Lior Shani, Natasha Jaques


2025 ACM SIGCOMM Networking Systems Award

"Batfish: An Open-source Network Configuration Analysis and Verification Platform"

Matt Brown, Ari Fogel, Spencer Fraint, Daniel Halperin, Victor Heorhiadi, Ratul Mahajan, Todd Millstein, Corina Miner, and Samir Parikh.


The ICCAD 2025 Ten-Year Retrospective Most Influential Paper Award 

"Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks"

Chen Zhang, Peng Li, Guangyu Sun, Yijin Guan, Bingjun Xiao, and Jason Cong

NEW RESEARCH CENTERS


November 4, 2025

UT Austin launched the Center for Systems Infrastructure and AI to tackle the foundational challenges facing advances in AI, focusing on the co-evolution of nimble AI stacks and intelligent infrastructure. InfraAI@UT is led by Professors Aditya Akella and Chris Rossbach (photos below) and partners in close collaboration with industry stakeholders to build a new generation of systems. The Center builds upon their NSF Expedition in Computing for Learning Directed Operating Systems.

BOOKS


MILESTONE: The 7th Edition of the classic textbook "Computer Architecture: A Quantitative Approach" was published on October 24, 2025. Congrats to the authors John L. Hennessy, David A. Patterson, and Christos Kozyrakis, who was added as an author to this edition, contributing two new chapters. See Christos's comments on LinkedIn regarding the impact of the book (and his co-authors) on himself and tens of thousands of scientists and engineers.

SELECT CONFERENCES and PUBLICATIONS in JULY-DECEMBER 2025

USENIX ATC, Boston, MA, July 7-9, 2025


Resource Multiplexing in Tuning and Serving Large Language Models

Yongjun He and Haofeng Yang, ETH Zurich; Yao Lu, National University of Singapore; Ana Klimovic and Gustavo Alonso, ETH Zurich


PageFlex: Flexible and Efficient User-space Delegation of Linux Paging Policies with eBPF

Anil Yelam and Kan Wu, Google; Zhiyuan Guo, UC San Diego; Suli Yang, Google; Rajath Shashidhara, University of Washington; Wei Xu and Stanko Novaković, Google; Alex C. Snoeren, Google and UC San Diego; Kimberly Keeton, Google


Internet Connection Splitting: What’s Old is New Again

Gina Yuan, Thea Rossman, and Keith Winstein, Stanford University


Cuckoo for Clients: Disaggregated Cuckoo Hashing

Stewart Grant and Alex C. Snoeren, UC San Diego



Accelerating Nested Virtualization with HyperTurtle

Ori Ben Zur and Jakob Krebs, Technion - Israel Institute of Technology; Shai Aviram Bergman, Huawei Zurich Research Center; Mark Silberstein, Technion - Israel Institute of Technology

Awarded Best Paper!

Twenty-First Symposium on Usable Privacy and Security (SOUPS 2025), August 10-12, 2025, Seattle, WA


Design and Evaluation of Privacy-Preserving Protocols for Agent-Facilitated Mobile Money Services in Kenya

Karen Sowon, Indiana University; Collins W. Munyendo, The George Washington University; Lily Klucinec, Carnegie Mellon University; Eunice Maingi and Gerald Suleh, Strathmore University; Lorrie Faith Cranor and Giulia Fanti, Carnegie Mellon University; Conrad Tucker and Assane Gueye, Carnegie Mellon University-Africa

IAPP SOUPS Privacy Award


Adopting AI to Protect Industrial Control Systems: Assessing Challenges and Opportunities from the Operators’ Perspective

Clement Fung, Carnegie Mellon University; Eric Zeng, Georgetown University; Lujo Bauer, Carnegie Mellon University


Misuse, Misreporting, Misinterpretation of Statistical Methods in Usable Privacy and Security Papers

Jenny Tang, Lujo Bauer, and Nicolas Christin, Carnegie Mellon University

SIGCOMM 2025, Coimbra, Portugal, September 8 - 11, 2025.


SpliDT: Partitioned Decision Trees for Scalable Stateful Inference at Line Rate

Murayyiam Parvez (Purdue University); Annus Zulfiqar (University of Michigan); Roman Beltiukov (University of California, Santa Barbara); Shir Landau Feibish (Open University of Israel); Walter Willinger (NIKSUN, Inc.); Arpit Gupta (University of California, Santa Barbara); Muhammad Shahbaz (University of Michigan)


Carbon- and Precedence-Aware Scheduling for Data Processing Clusters

Adam Lechowicz (University of Massachusetts Amherst); Rohan Shenoy (University of California Berkeley); Noman Bashir (Massachusetts Institute of Technology); Mohammad Hajiesmaili (University of Massachusetts Amherst); Adam Wierman (California Institute of Technology); Christina Delimitrou (Massachusetts Institute of Technology)


RANBooster: Democratizing advanced cellular connectivity through fronthaul middleboxes

Xenofon Foukas (Microsoft); Tenzin Samten Ukyab (UC Berkeley); Bozidar Radunovic (Microsoft); Sylvia Ratnasamy (UC Berkeley); Scott Shenker (ICSI and UC Berkeley)


Falcon: A Reliable, Low Latency Hardware Transport

Arjun Singhvi (Google); Nandita Dukkipati (Google LLC); Prashant Chandra, Hassan M. G. Wassel, Naveen Kr. Sharma, Anthony Rebello, Henry Schuh, Praveen Kumar, Behnam Montazeri, Neelesh Bansod, Sarin Thomas, Inho Cho, Hyojeong Lee Seibert, Baijun Wu, Rui Yang, Yuliang Li, Kai Huang, Qianwen Yin, Abhishek Agarwal (Google); Srinivas Vaduvatha (Meta); Weihuang Wang, Masoud Moshref (Nvidia); Tao Ji (Microsoft); David Wetherall, Amin Vahdat (Google)


Firefly: Scalable, Ultra-Accurate Clock Synchronization for Datacenters

Pooria Namyar (USC & Google LLC); Yuliang Li, Weitao Wang, Nandita Dukkipati, KK Yap, Junzhi Gong, Chen Chen, Peixuan Gao (Google LLC); Devdeep Ray (NVIDIA); Gautam Kumar, Yidan Ma (Google LLC); Ramesh Govindan (USC & Google LLC); Amin Vahdat (Google LLC)


Scalable Video Conferencing Using SDN Principles

Oliver Michel, Satadal Sengupta (Princeton University); Hyojoon Kim (University of Virginia); Ravi Netravali, Jennifer Rexford (Princeton University)

IEEE/ACM International Symposium on Microarchitecture (MICRO-58), Seoul, Korea, October 18-22, 2025


AI compilers and inference at scale: efficiency and velocity

Luis Ceze, Nvidia and UW 

KEYNOTE


Can We Do Better?

Onur Mutlu, ETH Zurich

KEYNOTE


LongSight: Compute-Enabled Memory to Accelerate Large-Context LLMs via Sparse Attention

Derrick Quinn, E. Ezgi Yücel, Jinkwon Kim, José F. Martínez, Mohammad Alian (Cornell Univ.)


A Probabilistic Perspective on Tiling Sparse Tensor Algebra

Ritvik Sharma (Stanford Univ.); Fisher Xue (Massachusetts Inst. of Technology); Nathan Zhang, Rubens Lacouture, Fredrik Kjolstad, Sara Achour, Mark Horowitz (Stanford Univ.)


ATR: Out-of-Order Register Release Exploiting Atomic Regions

Yinyuan Zhao, Surim Oh, Mingsheng Xu, Heiner Litz (Univ. of California, Santa Cruz)


Quartz: A Reconfigurable, Distributed Memory Accelerator for Sparse Applications

Courtney Golden, Axel Feldmann (Massachusetts Inst. of Technology); Joel Emer (MIT/NVIDIA); Daniel Sanchez (Massachusetts Inst. of Technology)


Flexing RISC-V Instruction Subset Processors to Extreme Edge

Best Paper Award

Alireza Raisiardali (Pragmatic Semiconductor / KU Leuven); Konstantinos Iordanou, Jedrzej Kufel, Kowshik Gudimetla (Pragmatic Semiconductor); Kris Myny (KU Leuven); Emre Ozer (Pragmatic Semiconductor)

The 2025 Conference on Empirical Methods in Natural Language Processing, November 4-9, Suzhou, China


Open-Science AI: Building Language, Vision, and Reasoning Models that Drive Innovation

Hannaneh Hajishirzi

KEYNOTE


Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index

Hao Xu, Jiacheng Liu, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi

BEST PAPER AWARD


Large Language Models as Realistic Microservice Trace Generators

Donghyun Kim, Sriram Ravula, Taemin Ha, Alex Dimakis, Daehyeok Kim, Aditya Akella


Estimating LLM Consistency: A User Baseline vs Surrogate Metrics

Xiaoyuan Wu, Weiran Lin, Omer Akgul, Lujo Bauer

SENIOR AREA CHAIR HIGHLIGHT AWARD


Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models

Alex Laitenberger, Christopher D Manning, Nelson F. Liu

ACM Symposium on Cloud Computing 2025 (SoCC '25), November 19-21, 2025 (Virtual)


Valet: Efficient Data Placement on Modern SSDs

Devashish R. Purandare, Peter Alvaro (University of California, Santa Cruz); Avani Wildani (Emory University and Cloudflare); Darrell D. E. Long (University of California, Santa Cruz); Ethan L. Miller (Pure Storage / University of California, Santa Cruz)

BEST PAPER AWARD


Towards a Lightweight Sidecar-based Service Mesh for Serverless

Lazar Cvetković, Ana Klimovic (ETH Zurich)


Rethinking Web Cache Design for the AI Era

Yazhuo Zhang, Jinqing Cai (ETH Zurich); Avani Wildani (Cloudflare); Ana Klimovic (ETH Zurich)


VLCs: Managing Parallelism with Virtualized Libraries

Yineng Yan, William Ruys, Hochan Lee, Ian Henriksen, Arthur Peters, Sean Stephens, Bozhi You, Henrique Fingler (University of Texas at Austin); Martin Burtscher (Texas State University); Milos Gligoric, Keshav Pingali, Mattan Erez, George Biros, Christopher J. Rossbach (University of Texas at Austin)


Understanding GPU Resource Interference One Level Deeper

Paul Elvinger, Foteini Strati (ETH Zurich); Natalie Enright Jerger (University of Toronto); Ana Klimovic (ETH Zurich)


Snap & Replay: A new way to analyze uarch-scale performance bottlenecks for ML accelerators

Ioannis Zarkadas (Columbia University); Amanda Tomlinson (University of California, San Diego); Asaf Cidon (Columbia University); Baris Kasikci (University of Washington); Ofir Weisse (Google)

NeurIPS 2025, The Thirty-Ninth Annual Conference on Neural Information Processing Systems, San Diego, CA, December 2-7, 2025


The Art of (Artificial) Reasoning

Yejin Choi

INVITED TALK


Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-Free

Zihan Qiu · Zekun Wang · Bo Zheng · Zeyu Huang · Kaiyue Wen · Songlin Yang · Rui Men · Le Yu · Fei Huang · Suozhi Huang · Dayiheng Liu · Jingren Zhou · Junyang Lin

BEST PAPER AWARD


Why Diffusion Models Don’t Memorize: The Role of Implicit Dynamical Regularization in Training

Tony Bonnaire · Raphaël Urfin · Giulio Biroli · Marc Mezard

BEST PAPER AWARD


Why Do Multi-Agent LLM Systems Fail?

Mert Cemri · Melissa Z Pan · Shuyi Yang · Lakshya A Agrawal · Bhavya Chopra · Rishabh Tiwari · Kurt Keutzer · Aditya Parameswaran · Dan Klein · Kannan Ramchandran · Matei A Zaharia · Joseph Gonzalez · Ion Stoica

Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks

Korneel Van den Berghe · Stein Stroobants · Vijay Janapa Reddi · Guido de Croon


CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization

Irene Wang · Mostafa Elhoushi · H Ekin Sumbul · Samuel Hsia · Daniel Jiang · Newsha Ardalani · Divya Mahajan · Carole-Jean Wu · Bilge Acun

NEW APPOINTMENTS


December 5, 2025

Professor Ken Birman announced he is stepping down from Cornell after teaching there since 1982. Going forward, he’ll be working at Microsoft on the Copilot Tuning effort, headed by Ranveer Chandra.

BLOGS


October 17, 2025

Barbarians at The Gate: How AI is Upending Systems Research

by Audrey Cheng, Shu Liu, Melissa Pan, Ion Stoica, and the ADRS team


December 5, 2025

Glia: A Human-Inspired AI for Systems Design and Optimization

by Pouya Hamadanian, Pantea Karimi, Arash Nash-Esfahany, Kimia Noorbakhsh, Joseph Chandler, Ali Parandeh, Mohammad Alizadeh, and Hari Balakrishnan



NEWS ITEMS


December 2, 2025

Marvell Technology to Acquire Celestial AI in $3.25 Billion Deal


October 9, 2025

How AMD’s AI Software Helped it Score the Multi-Billion Dollar OpenAI Deal


September 30, 2025

The Next Computing Revolution: Bringing Processing Inside Memory


August 9, 2025

How a Berkeley Professor Built Billion-Dollar Companies in his Lab 


August 7, 2025

Congress Wants to Cut the Smartest Investment Taxpayers ever Made

by Professor David Patterson

IAP Workshop Testimonials

 

Professor Christos Kozyrakis, Stanford - “As a starting point, I think of these IAP workshops as ‘Hot Chips meets ISCA’, i.e., an intersection of industry’s newest solutions in hardware (Hot Chips) with academic research in computer architecture (ISCA); but more so, these workshops additionally cover new subsystems and applications, and in a smaller venue where it is easy to discuss ideas and cross-cutting approaches with colleagues.” 

 

Professor Heiner Litz, UC Santa Cruz - “The IAP workshops represent extremely valuable events for all attendees including industry members, students and faculty. On my side, multiple project collaborations and student internships have evolved from these meetings leading to a win-win-win situation for all participants.”


Professor Ana Klimovic, ETH Zurich - “I have attended three IAP workshops as a PhD student at Stanford and was consistently impressed by the quality of the talks and the breadth of the topics covered. These workshops bring top-tier industry and academia together to discuss cutting-edge research challenges. It is a great opportunity to exchange ideas and get inspiration for new research opportunities.”

 

Dr. Nathan Pemberton, Scientist, Amazon Web Services - "IAP workshops provide a valuable chance to explore emerging research topics with a focused group of participants, and without all the time/effort of a full-scale conference. Instead of rushing from talk to talk, you can slow down and dive deep into a few topics with experts in the field." 


Want to support a unique tech forum that brings together academia and industry under your company's banner?

Please feel free to contact us regarding sponsorship opportunities, and for more info about any of the items above.

 

Best,

 

Jim Ballingall

Executive Director

Industry-Academia Partnership (IAP) 

www.industry-academia.org

jim.ballingall@gmail.com

cell: 408-212-1035


Copyright © 2013-2025 Industry-Academia Partnership