|
Greetings! Welcome to the IAP Newsletter with research publications, conferences and news regarding applications and infrastructure for AI and machine learning, hardware acceleration, operating systems, networking, security, and storage.
In this edition, we highlight key papers at USENIX ATC, SOUPS, SIGCOMM, MICRO-58, EMNLP, SoCC and NeurIPS, in addition to Awards, Books, Blogs and News Items. We begin with a recap of recent IAP Workshops at UC Berkeley and the University of Washington.
| | AI and CLOUD WORKSHOPS - 2025 | | |
UC Berkeley Workshop on the Future of AI in the Cloud
Tuesday, November 18, 2025
Soda Hall, UC Berkeley, Berkeley, CA
| This workshop was hosted and co-organized by Prof. Sagar Karandikar with keynote speakers Prof. Dave Patterson and Prof. Ion Stoica (portraits above). | | |
Pictured above are Ion Stoica, Natacha Crooks, and Ralph Wittig presenting their research on November 18.
Speakers (in order of appearance):
KEYNOTE: Prof. Dave Patterson, UC Berkeley and Google, the Pardee Professor of Computer Science, Emeritus, "How to Give AI a Bad Carbon Footprint"
Dr. Bilge Acun, Research Scientist, FAIR / Meta Superintelligence Labs, “CATransformers: Carbon Aware Transformers Through Joint Model-Hardware Optimization”
Ralph Wittig, Head of Research & Advanced Development, AMD, "Accelerating the Future: AI, Compute and Innovation at Scale"
Prof. Sagar Karandikar, UC Berkeley, “Agile Hardware/Software Co-Design for Hyperscale Cloud Systems”
Lightning Session for Student Posters: Best Poster Award: "Autocomp: A Powerful and Portable Code Optimizer for Tensor Accelerators" by Charles Hong, Sahil Bhatia, Alvin Cheung and Sophia Shao.
KEYNOTE: Prof. Ion Stoica, UC Berkeley, the Xu Bao Chancellor's Chair Professor, "An AI Stack: from Scaling AI Workloads to Evaluating LLMs"
Dr. Erich Haratsch, Senior Director of Architecture, Marvell, “Data Storage Innovations for Scalable AI Infrastructure”
Prof. Sophia Shao, UC Berkeley, “From Algorithms to Silicon: Accelerating Full-Stack Co-Design for the AI Era”
Dr. Liguang Xie, Engineering Director of Global Compute Infrastructure, ByteDance, “Towards Disaggregated LLM Inference: AIBrix, Agentic Workloads, and the Rise of DPU-Accelerated AI Systems"
Prof. Natacha Crooks, UC Berkeley, “Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First”
This was the third AI and Cloud Workshop hosted by UC Berkeley. Please see the BERKELEY WORKSHOP WEB PAGE for the speaker bios, abstracts and videos of the presentations.
| |
UW WORKSHOP ON THE FUTURE OF AI AND CLOUD COMPUTING
Friday, May 9, 2025
Husky Union Building, UW, Seattle, WA
| | Shihang Vic Li and Matthew Giordano won the Best Poster Award for "NEMO: Flexible and High-Fidelity Telemetry on Programmable Memeory Controllers." Congratulating them (left to right) are Ulf Hanebutte (Marvell), Prof. Stephanie Wang, Mats Oberg (Marvell), Prof. Tom Anderson, Prof. Baris Kasikci, Prof. Simon Peter, Liguang Xie (ByteDance), Victor Cao (Furturewei) and Brad Beckmann (AMD).
This was hosted by Prof. Stephanie Wang. Please see the UW WORKSHOP WEB PAGE for the speaker bios, abstracts and videos of the presentations.
| | |
AWARDS for IAP COLLEAGUES
ACM SIGARCH Maurice Wilkes Award
Carole-Jean Wu, Meta
ACM SIGARCH Alan D. Berenbaum Distinguished Service Award
Joel Emer, MIT and Nvidia
ACM Charles P. “Chuck” Thacker Breakthrough in Computing Award
Jason Cong, UCLA
2025 Sloan Fellow
Natacha Crooks, University of California, Berkeley
Presidential Early Career Award for Scientists and Engineers
Christina Delimitrou, MIT
BEST DISSERTATION and PAPER AWARDS
2025 ACM SIGARCH/IEEE CS TCCA Outstanding Dissertation Award
Sagar Karandikar, UC Berkeley
2025 NeurIPS Madrona Prize
"Enhancing Personalized Multi-Turn Dialogue with Curiosity Reward"
Yanming Wan, Jiaxing Wu, Marwa Abdulhai, Lior Shani, Natasha Jaques
2025 ACM SIGCOMM Networking Systems Award
"Batfish: An Open-source Network Configuration Analysis and Verification Platform"
Matt Brown, Ari Fogel, Spencer Fraint, Daniel Halperin, Victor Heohiardi, Ratul Mahajan, Todd Millstein, Corina Miner, and Samir Parikh.
The ICCAD 2025 Ten-Year Retrospective Most Influential Paper Award
"Optimizing FPGA-based Accelerator Design for Deep Convolutional Neural Networks"
Chen Zhang, Peng Li, Guangyu Sun, Yijin Guan, and Bingjun Xiao, Jason Cong
| | |
NEW RESEARCH CENTERS
November 4, 2025
UT Austin launched the Center for Systems Infrastructure and AI to tackle the foundational challenges facing advances in AI; focused on the co-evolution of nimble AI stacks and intelligent infrastructure. InfraAI@UT is led by Professors Aditya Akella and Chris Rossbach (photos below) and partners in close collaboration with industry stakeholders to build a new generation of systems. The Center builds upon their NSF Expedition in Computing for Learning Directed Operating Systems.
| | |
BOOKS
MILESTONE: The 7th Edition of the classic text book "Computer Architecture: A Quantitative Approach" was published on October 24, 2025. Congrats to the authors John L. Hennessy, David A. Patterson, and Christos Kozyrakis, who was added as an author to this edition, contributing two new chapters. See Christos's comments on LinkedIn regarding the impact of the book (and his co-authors) on himself and tens of thousands of scientists and engineers.
| | |
SELECT CONFERENCES and PUBLICATIONS in JULY-DECEMBER 2025 | | |
USENIX ATC, Boston, MA, July 7-9, 2025
Yongjun He and Haofeng Yang, ETH Zurich; Yao Lu, National University of Singapore; Ana Klimovic and Gustavo Alonso, ETH Zurich
Anil Yelam and Kan Wu, Google; Zhiyuan Guo, UC San Diego; Suli Yang, Google;Rajath Shashidhara, University of Washington; Wei Xu and Stanko Novaković, Google; Alex C. Snoeren, Google and UC San Diego; Kimberly Keeton, Google
Internet Connection Splitting: What’s Old is New Again
Gina Yuan, Thea Rossman, and Keith Winstein, Stanford University
Stewart Grant and Alex C. Snoeren, UC San Diego
Ori Ben Zur and Jakob Krebs, Technion - Israel Institute of Technology; Shai Aviram Bergman, Huawei Zurich Research Center; Mark Silberstein, Technion - Israel Institute of Technology
Awarded Best Paper!
| | |
Karen Sowon, Indiana University; Collins W. Munyendo, The George Washington University; Lily Klucinec, Carnegie Mellon University; Eunice Maingi and Gerald Suleh, Strathmore University; Lorrie Faith Cranor and Giulia Fanti, Carnegie Mellon University; Conrad Tucker and Assane Gueye, Carnegie Mellon University-Africa
IAPP SOUPS Privacy Award
Clement Fung, Carnegie Mellon University; Eric Zeng, Georgetown University; Lujo Bauer, Carnegie Mellon University
Jenny Tang, Lujo Bauer, and Nicolas Christin, Carnegie Mellon University
| | |
SIGCOMM 2025, Coimbra, Portugal, September 8 - 11, 2025.
SpliDT: Partitioned Decision Trees for Scalable Stateful Inference at Line Rate
Murayyiam Parvez (Purdue University); Annus Zulfiqar (University of Michigan); Roman Beltiukov (University of California, Santa Barbara); Shir Landau Feibish (Open University of Israel); Walter Willinger (NIKSUN, Inc.); Arpit Gupta (University of California, Santa Barbara); Muhammad Shahbaz (University of Michigan)
Carbon- and Precedence-Aware Scheduling for Data Processing Clusters
Adam Lechowicz (University of Massachusetts Amherst); Rohan Shenoy (University of California Berkeley); Noman Bashir (Massachusetts Institute of Technology); Mohammad Hajiesmaili (University of Massachusetts Amherst); Adam Wierman (California Institute of Technology); Christina Delimitrou (Massachusetts Institute of Technology)
RANBooster: Democratizing advanced cellular connectivity through fronthaul middleboxes
Xenofon Foukas (Microsoft); Tenzin Samten Ukyab (UC Berkeley); Bozidar Radunovic (Microsoft); Sylvia Ratnasamy (UC Berkeley); Scott Shenker (ICSI AND UC Berkeley)
Falcon: A Reliable, Low Latency Hardware Transport
Arjun Singhvi (Google); Nandita Dukkipati (Google LLC); Prashant Chandra, Hassan M. G. Wassel, Naveen Kr. Sharma, Anthony Rebello, Henry Schuh, Praveen Kumar, Behnam Montazeri, Neelesh Bansod, Sarin Thomas, Inho Cho, Hyojeong Lee Seibert, Baijun Wu, Rui Yang, Yuliang Li, Kai Huang, Qianwen Yin, Abhishek Agarwal (Google); Srinivas Vaduvatha (Meta); Weihuang Wang, Masoud Moshref (Nvidia); Tao Ji (Microsoft); David Wetherall, Amin Vahdat (Google)
Firefly: Scalable, Ultra-Accurate Clock Synchronization for Datacenters
Pooria Namyar (USC & Google LLC); Yuliang Li, Weitao Wang, Nandita Dukkipati, KK Yap, Junzhi Gong, Chen Chen, Peixuan Gao (Google LLC); Devdeep Ray (NVIDIA); Gautam Kumar, Yidan Ma (Google LLC); Ramesh Govindan (USC & Google LLC); Amin Vahdat (Google LLC)
Scalable Video Conferencing Using SDN Principles
Oliver Michel, Satadal Sengupta (Princeton University); Hyojoon Kim (University of Virginia); Ravi Netravali, Jennifer Rexford
| | |
AI compilers and inference at scale: efficiency and velocity
Luis Ceze, Nvidia and UW
KEYNOTE
Can We Do Better?
Onur Mutlu, ETH Zurich
KEYNOTE
LongSight: Compute-Enabled Memory to Accelerate Large-Context LLMs via Sparse Attention
Derrick Quinn, E. Ezgi Yücel, Jinkwon Kim, José F. Martínez, Mohammad Alian (Cornell Univ.)
A Probabilistic Perspective on Tiling Sparse Tensor Algebra
Ritvik Sharma (Stanford Univ.); Fisher Xue (Massachusetts Inst. of Technology); Nathan Zhang, Rubens Lacouture, Fredrik Kjolstad, Sara Achour, Mark Horowitz (Stanford Univ.)
ATR: Out-of-Order Register Release Exploiting Atomic Regions
Yinyuan Zhao, Surim Oh, Mingsheng Xu, Heiner Litz (Univ. of California, Santa Cruz)
Quartz: A Reconfigurable, Distributed Memory Accelerator for Sparse Applications
Courtney Golden, Axel Feldmann (Massachusetts Inst. of Technology); Joel Emer (MIT/NVIDIA); Daniel Sanchez (Massachusetts Inst. of Technology)
Flexing RISC-V Instruction Subset Processors to Extreme Edge
Best Paper Award
Alireza Raisiardali (Pragmatic Semiconductor / KU Leuven); Konstantinos Iordanou, Jedrzej Kufel, Kowshik Gudimetla (Pragmatic Semiconductor); Kris Myny (KU Leuven); Emre Ozer (Pragmatic Semiconductor)
| | |
Open-Science AI: Building Language, Vision, and Reasoning Models that Drive Innovation
Hannaneh Hajishirzi
KEYNOTE
Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index
Hao Xu, Jiacheng Liu, Yejin Choi, Noah A. Smith, Hannaneh Hajishirzi
BEST PAPER AWARD
Large Language Models as Realistic Microservice Trace Generators
Donghyun Kim, Sriram Ravula, Taemin Ha, Alex Dimakis, Daehyeok Kim, Aditya Akella
Estimating LLM Consistency: A User Baseline vs Surrogate Metrics
Xiaoyuan Wu, Weiran Lin, Omer Akgul, Lujo Bauer
SENIOR AREA CHAIR HIGHLIGHT AWARD
Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
Alex Laitenberger, Christopher D Manning, Nelson F. Liu
| | |
ACM Symposium on Cloud Computing 2025 (SoCC '25), November 19-21, 2025 (Virtual)
Valet: Efficient Data Placement on Modern SSDs
Devashish R. Purandare, Peter Alvaro (University of California, Santa Cruz); Avani Wildani (Emory University and Cloudflare); Darrell D. E. Long (University of California, Santa Cruz); Ethan L. Miller (Pure Storage / University of California, Santa Cruz)
BEST PAPER AWARD
Towards a Lightweight Sidecar-based Service Mesh for Serverless
Lazar Cvetković, Ana Klimovic (ETH Zurich)
Rethinking Web Cache Design for the AI Era
Yazhuo Zhang, Jinqing Cai (ETH Zurich); Avani Wildani (Cloudflare); Ana Klimovic (ETH Zurich)
VLCs: Managing Parallelism with Virtualized Libraries
Yineng Yan, William Ruys, Hochan Lee, Ian Henriksen, Arthur Peters, Sean Stephens, Bozhi You, Henrique Fingler (University of Texas at Austin); Martin Burtscher (Texas State University); Milos Gligoric, Keshav Pingali, Mattan Erez, George Biros, Christopher J. Rossbach (University of Texas at Austin)
Understanding GPU Resource Interference One Level Deeper
Paul Elvinger, Foteini Strati (ETH Zurich); Natalie Enright Jerger (University of Toronto); Ana Klimovic (ETH Zurich)
Snap & Replay: A new way to analyze uarch-scale performance bottlenecks for ML accelerators
Ioannis Zarkadas (Columbia University); Amanda Tomlinson (University of California, San Diego); Asaf Cidon (Columbia University); Baris Kasikci (University of Washington); Ofir Weisse (Google)
| | |
NeurIPS 2025, The Thirty-Ninth Annual Conference on Neural Information Processing Systems, San Diego, CA, December 2-7, 2025
Yejin Choi
INVITED TALK
Zihan Qiu · Zekun Wang · Bo Zheng · Zeyu Huang · Kaiyue Wen · Songlin Yang · Rui Men · Le Yu · Fei Huang · Suozhi Huang · Dayiheng Liu · Jingren Zhou · Junyang Lin
BEST PAPER AWARD
Tony Bonnaire · Raphaël Urfin · Giulio Biroli · Marc Mezard
BEST PAPER AWARD
Mert Cemri · Melissa Z Pan · Shuyi Yang · Lakshya A Agrawal · Bhavya Chopra · Rishabh Tiwari · Kurt Keutzer · Aditya Parameswaran · Dan Klein · Kannan Ramchandran · Matei A Zaharia · Joseph Gonzalez · Ion Stoica
Korneel Van den Berghe · Stein Stroobants · Vijay Janapa Reddi · Guido de Croon
Irene Wang · Mostafa Elhoushi · H Ekin Sumbul · Samuel Hsia · Daniel Jiang · Newsha Ardalani · Divya Mahajan · Carole-Jean Wu · Bilge Acun
| | |
NEW APPOINTMENTS
December 5, 2025
Professor Ken Birman, Cornell, announced he is stepping down from Cornell, after teaching there since 1982. Going forward, he’ll be working at Microsoft, with the Copilot Tuning effort, headed by Ranveer Chandra.
| | |
BLOGS
October 17, 2025
Barbarians at The Gate: How AI is Upending Systems Research
December 5, 2025
Glia: A Human-Inspired AI for Systems Design and Optimization
by Pouya Hamadanian, Pantea Karimi, Arash Nash-Esfahany, Kimia Noorbakhsh, Joseph Chandler, Ali Parandeh, Mohammad Alizadeh, and Hari Balakrishnan
NEWS ITEMS
December 2, 2025
Marvell Technology to Acquire Celestial AI in $3.25 Billion Deal
October 9, 2025
How AMD’s AI Software Helped it Score the Multi-Billion Dollar OpenAI Deal
September 30, 2025
The Next Computing Revolution: Bringing Processing Inside Memory
August 9, 2025
How a Berkeley Professor Built Billion-Dollar Companies in his Lab
August 7, 2025
Congress Wants to Cut the Smartest Investment Taxpayers ever Made
by Professor David Patterson
IAP Workshop Testimonials
Professor Christos Kozyrakis, Stanford - “As a starting point, I think of these IAP workshops as ‘Hot Chips meets ISCA’, i.e., an intersection of industry’s newest solutions in hardware (Hot Chips) with academic research in computer architecture (ISCA); but more so, these workshops additionally cover new subsystems and applications, and in a smaller venue where it is easy to discuss ideas and cross-cutting approaches with colleagues.”
Professor Heiner Litz, UC Santa Cruz - "The IAP workshops represent extremely valuable events for all attendees including industry members, students and faculty. On my side, multiple project collaborations and student internships have evolved from these meetings leading to a win-win-win situation for all participants.”
Professor Ana Klimovic, ETH Zurich - “I have attended three IAP workshops as a PhD student at Stanford and was consistently impressed by the quality of the talks and the breadth of the topics covered. These workshops bring top-tier industry and academia together to discuss cutting-edge research challenges. It is a great opportunity to exchange ideas and get inspiration for new research opportunities."
Dr. Nathan Pemberton, Scientist, Amazon Web Services - "IAP workshops provide a valuable chance to explore emerging research topics with a focused group of participants, and without all the time/effort of a full-scale conference. Instead of rushing from talk to talk, you can slow down and dive deep into a few topics with experts in the field."
Support a unique tech forum that brings together academia and industry under your company's banner?
Please feel free to contact us regarding sponsorship opportunities, and for more info about any of the items above.
Best,
Jim Ballingall
Executive Director
Industry-Academia Partnership (IAP)
www.industry-academia.org
jim.ballingall@gmail.com
cel: 408-212-1035
| | Copyright © 2013-2025 Industry-Academia Partnership | | | | |