Greetings Jim, Welcome to the IAP Newsletter with research publications, conferences and news regarding applications and infrastructure for AI and machine learning, hardware acceleration, operating systems, networking, security, and storage.

AI and CLOUD WORKSHOPS - 2023 and 2024

MIT WORKSHOP ON THE FUTURE OF AI and CLOUD COMPUTING


Friday, September 29, 2023


MIT, Stata Center, Cambridge, MA

Speakers Joel Emer and Carole-Jean Wu (center) pose with attendees during the Poster Session at lunch.


Speakers included:


Dr. Carole-Jean Wu, Meta, "Scaling AI Computing Sustainably"


Prof. Joel Emer, MIT and Nvidia, “Einsums, Fibertrees and Dataflow: Architecture for the Post-Moore Era”


Prof. Vijay Janapa Reddi, Harvard, “Architecture 2.0: Why Architects Need a Data-centric AI Gymnasium”  

 

Dr. Richard Kessler, Marvell, “AI, Cloud, and Marvell Semiconductor”


Prof. Song Han, MIT, "TinyChat for On-device LLM"

Prof. Manya Ghobadi, MIT, “Next-Generation Optical Networks for Machine Learning Jobs”


Prof. Daniel Sanchez, MIT, "A Hardware and Software Architecture to Accelerate Computation on Encrypted Data"


Sundar Dev, Google, “AI-powered infrastructure for the AI-driven future”


Prof. Christina Delimitrou, MIT, “Designing the Next Generation Cloud Systems: To ML or not to ML”

 

Dr. Jiaqi Gao, Alibaba, “Towards a 100,000-GPU Machine Learning Infrastructure"


This was the fourth AI and Cloud Workshop hosted by MIT. Please see the MIT WORKSHOP WEB PAGE for the speaker bios, abstracts and videos of the presentations.

UCSD WORKSHOP ON THE FUTURE OF AI and CLOUD COMPUTING


Friday, April 21, 2023


UCSD, CSE Building, La Jolla, CA

Dr. Doug Terry, Distinguished Scientist and VP at AWS, kicks off the morning session with “Replicating Cloud Data Across Regions for Business Continuity."


Other Speakers on April 21 included:


Prof. Salman Avestimehr, USC, “Collaborative Machine Learning at the Edge”

 

Dr. Yunfei Ma, Alibaba, “Overcome the "Last-mile" Challenge through Wireless and Video Collaboration"

 

Prof. Amy Ousterhout, UCSD, "Optimizing CPU Efficiency and Tail Latency in Datacenters"


Prof. Yiying Zhang, UCSD, “An End-to-End Implementation of A “Server-less” Data Center” 

Dr. Amir Yazdanbakhsh, Google, "From Data to Success: Leveraging Machine Learning for Full Stack Optimization"


Prof. Hung-Wei Tseng, UCR, “The upcoming revolution of general-purpose computing”


Prof. Hyoukjun Kwon, UCI, “XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse”


Prof. Hadi Esmaeilzadeh, UCSD, “Towards Architecting Machines for Empathic Conscious Intelligence”  

 

This was the first AI and Cloud Workshop hosted by UCSD. Please see the UCSD WORKSHOP WEB PAGE for the speaker bios, abstracts and videos of the presentations.

COMING in SPRING 2024


UW AI and CLOUD WORKSHOP


University of Washington

Seattle, WA


Organized by Prof. Arvind Krishnamurthy and the IAP. 


Please watch for IAP Newsletters and our website for further updates on the date and agenda.


At right, Prof. Arvind Krishnamurthy presents at a previous IAP Workshop at UW.

SELECT CONFERENCES and PUBLICATIONS in JUNE-DECEMBER 2023


MLSys 2023, Miami, FL, June 4-8, 2023


RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure Storage, Scheduling, and Networking

Mark Zhao (Stanford) · Dhruv Choudhary (Meta) · Devashish Tyagi (Meta) · Ajay Somani (Meta) · Max Kaplan (Meta) · Sung-Han Lin (Meta) · Sarunya Pumma (Meta) · Jongsoo Park (Meta) · Aarti Basant (Meta) · Niket Agarwal (NVIDIA) · Carole-Jean Wu (Meta) · Christos Kozyrakis (Stanford)


MegaBlocks: Efficient Sparse Training with Mixture-of-Experts Sparsity 1: Models and Algorithms

Trevor Gale (Stanford), Deepak Narayanan (Microsoft Research), Cliff Young (Google), Matei Zaharia (Stanford and Databricks)


XRBench: An Extended Reality (XR) Machine Learning Benchmark Suite for the Metaverse Emerging Models and Domains

Hyoukjun Kwon (UC Irvine), Krishnakumar Nair (Meta) · Jamin Seo (Georgia Tech) · Jason Yik (Harvard) · Debabrata Mohapatra (Meta) · Dongyuan Zhan (Meta) · JINOOK SONG (META) · Peter Capak (Meta) · Peizhao Zhang (Meta) · Peter Vajda (Meta) · Colby Banbury (Harvard) · Mark Mazumder (Harvard) · Liangzhen Lai (Facebook Inc) · Ashish Sirasao (Meta inc) · Tushar Krishna (Georgia Tech) · Harshit Khaitan (Meta) · Vikas Chandra (Meta) · Vijay Janapa Reddi (Harvard)


On Optimizing the Communication of Model Parallelism Parallel and Distributed Systems 2: Communication

Yonghao Zhuang (Carnegie Mellon University) · Hexu Zhao (Tsinghua University) · Lianmin Zheng (UC Berkeley) · Zhuohan Li (UC Berkeley) · Eric Xing (MBZUAI, CMU, and Petuum Inc.) · Qirong Ho (MBZUAI) · Joseph Gonzalez (UC Berkeley) · Ion Stoica (UC Berkeley) · Hao Zhang (UC Berkeley) · Hexu Zhao (Tsinghua University)

ISCA 2023, Orlando, Florida, June 17-22, 2023


Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks

Mingyu Liang, Wenyin Fu, Louis Feng, Zhongyi Lin, Pavani Panakanti, Shengbao Zheng, Srinivas Sridharan, Christina Delimitrou


μManycore: A Cloud-Native CPU for Tail at Scale

Jovan Stojkovic, Chunao Liu, Muhammad Shahbaz, Josep Torrella


Metior: A Comprehensive Model to Evaluate Obfuscating Side-Channel Defense Schemes

Peter W. Deutsch, Weon Taek Na, Thomas Bourgeat, Joel S. Emer, Mengjia Yan


RoboShape: Using Topology Patterns to Scalably and Flexibly Deploy Accelerators Across Robots

Sabrina M. Neuman, Radhika Ghosal, Thomas Bourgeat, Brian Plancher, Vijay Janapa Reddi


CDPU: Co-designing Compression and Decompression Processing Units for Hyperscale Systems

Sagar Karandikar, Aniruddha N. Udipi, Junsun Choi, Joonho Whangbo, Jerry Zhao, Svilen Kanev, Edwin Lim, Jyrki Alakuijala, Vrishab Madduri, Yakun Sophia Shao, Borivoje Nikolic, Krste Asanovic, Parthasarathy Ranganathan


ArchGym: An Open-Source Gymnasium for Machine Learning Assisted Architecture Design

Srivatsan Krishnan, Amir Yazdanbakhsh, Shvetank Prakash, Jason Jabbour, Ikechukwu Uchendu, Susobhan Ghosh, Behzad Boroujerdian, Daniel Richins, Devashree Tripathy, Aleksandra Faust, Vijay Janapa Redd


Profiling Hyperscale Data Processing

Abraham Gonzalez, Aasheesh Kolli, Samira Khan, Sihang Liu, Vidushi Dadu, Sagar Karandikar, Jichuan Chang, Krste Asanović, Parthasarathy Ranganathan


TPU v4: An Optically Reconfigurable Supercomputer for Machine Learning with Hardware Support for Embedding

Norm Jouppi, George Kurian, Sheng Li, Peter Ma, Rahul Nagarajan, Lifeng Nai, Nishant Patil, Suvinay Subramanian, Andy Swin, Brian Towles, Clifford Young, Xiang Zhou, Zongwei Zhou, David A Patterson


A Research Retrospective on AMD’s Exascale Computing Journey

Gabriel H. Loh, Michael J. Schulte, Mike Ignatowski, V. Adhinarayanan, et. al.

USENIX ATC, Boston, MA, July 10-12, 2023


Sky Computing

Ion Stoica, University of California, Berkeley


Comosum: An Extensible, Reconfigurable, and Fault-Tolerant IoT Platform for Digital Agriculture

Gloire Rubambiza, Shiang-Wan Chin, Mueed Rehman, Sachille Atapattu, José F. Martínez, and Hakim Weatherspoon, Cornell University


Tectonic-Shift: A Composite Storage Fabric for Large-Scale ML Training

Mark Zhao, Stanford University and Meta; Satadru Pan, Niket Agarwal, Zhaoduo Wen, David Xu, Anand Natarajan, Pavan Kumar, Shiva Shankar P, Ritesh Tijoriwala, Karan Asher, Hao Wu, Aarti Basant, Daniel Ford, Delia David, Nezih Yigitbasi, Pratap Singh, and Carole-Jean Wu, Meta; Christos Kozyrakis, Stanford University


SAGE: Software-based Attestation for GPU Execution

Andrei Ivanov and Benjamin Rothenberger, ETH Zürich; Arnaud Dethise and Marco Canini, KAUST; Torsten Hoefler and Adrian Perrig, ETH Zürich


Distributed Transactions at Scale in Amazon DynamoDB

Joseph Idziorek, Alex Keyes, Colin Lazier, Somu Perianayagam, Prithvi Ramanathan, James Christopher Sorenson III, Doug Terry, and Akshat Vig, Amazon Web Services

SIGCOMM 2023, New York City, September 10-14, 2023


P4Testgen: An Extensible Test Oracle For P4-16     

Fabian Ruffy (NYU), Jed Liu (Postman), Prathima Kotikalapudi, Vojtech Havel, Hanneli Tavante (Intel), Rob Sherwood (NetDebug.com), Vladyslav Dubina, Volodymyr Peschanenko (Litsoft), Anirudh Sivaraman (NYU), Nate Foster (Cornell)


Host Congestion Control

Saksham Agarwal (Cornell), Arvind Krishnamurthy (Google/University of Washington), Rachit Agarwal (Cornell)


Murphy: Performance Diagnosis of Distributed Cloud Applications

Vipul Harsh (University of Illinois Urbana-Champaign), Wenxuan Zhou (VMware), Sachin Ashok (UIUC), Radhika Niranjan Mysore (VMware Research), Brighten Godfrey (UIUC and VMware), Sujata Banerjee (VMware Research)


Exoshuffle: An Extensible Shuffle Architecture

Frank Sifei Luan, Stephanie Wang, Samyukta Yagati, Sean Kim, Kenneth Lien, Isaac Ong, Tony Hong (UC Berkeley), SangBin Cho, Eric Liang (Anyscale), Ion Stoica (UC Berkeley)


Destination Unreachable: Characterizing Internet Outages and Shutdowns 

Zachary S. Bischof (Georgia Tech), Kennedy Pitcher (UC San Diego), Esteban Carisimo (Northwestern), Amanda Meng (Georgia Tech), Rafael Bezerra Nunes (Yale), Ramakrishna Padmanabhan (Amazon), Margaret E. Roberts (UC San Diego), Alex C. Snoeren (UC San Diego), Alberto Dainotti (Georgia Tech)


Resilient Baseband Processing in Virtualized RANs with Slingshot

Nikita Lazarev (MIT), Tao Ji (UT Austin), Anuj Kalia (Microsoft), Daehyeok Kim (UT Austin and Microsoft), Ilias Marinos, Francis Y. Yan (Microsoft), Christina Delimitrou (MIT), Zhiru Zhang (Cornell), Aditya Akella (The University of Texas at Austin)


CellFusion: Multipath Vehicle-to-Cloud Video Streaming with Network Coding in the Wild

Yunzhe Ni (Alibaba Cloud & Peking Univ.), Zhilong Zheng, Xianshang Lin, Fengyu Gao, Xuan Zeng, Yirui Liu, Tao Xu, Hua Wang, Zhidong Zhang, Senlang Du, Guang Yang, Yuanchao Su, Dennis Cai (Alibaba Cloud), Hongqiang Harry Liu (Uber Technology), Chenren Xu (Peking Univ.), Ennan Zhai, Yunfei Ma (Alibaba Cloud)

ARCHITECTURE 2.0 - Virtual - September 15, 2023

Keynote: Partha Ranganathan

The goal of this virtual event is to identify the next generation of machine learning algorithms, datasets, tools, infrastructure, best practices, education and workforce training required to drive progress in our community.

ACM SIGPLAN Conference on Systems, Programming, Languages, and Applications: Software for Humanity (SPLASH) with Object-Oriented Programming, Systems, Languages & Applications (OOPSLA), Cascais, Portugal, October 22-27, 2023


Hydroflow: A Compiler Target for Fast, Correct Distributed Programs

Keynote

Joseph M. Hellerstein, UC Berkeley


A Full Employment Theorem for PL Researchers: Domain-Specific Languages

Nate Foster, Cornell


Degrees of Separation: A Flexible Type System for Data Race Prevention

Yichen Xu, EPFL, Aleksander Boruch-Gruszecki, EPFL, Martin Odersky, EPFL


Hardware-Aware Static Optimization of Hyperdimensional Computations

Pu (Luke) Yi, Stanford, Sara Achour, Stanford


Formal Abstractions for Packet Scheduling (Distinguished Paper Award)

Anshuman Mohan, Cornell, Yunhe Liu, Cornell University, Nate Foster, Cornell University, Tobias Kappé, Open University of the Netherlands; University of Amsterdam, Dexter Kozen, Cornell

IEEE/ACM International Symposium on Microarchitecture (MICRO-56), Toronto, Canada, October 28-November 1, 2023 

 

Social Infrastructure in the Age of Artificial General Intelligence

Amin Vahdat (Google)

Keynote


AuRORA: Virtualized Accelerator Orchestration for Multi-Tenant Workloads

Seah Kim, Jerry Zhao, Krste Asanovic, Borivoje Nikolic, Yakun Sophia Shao (UC Berkeley)

 

Accelerating RTL Simulation with Hardware-Software Co-Design

Fares Elsabbagh, Shabnam Sheikhha, Victor A. Ying, Quan M. Nguyen (MIT); Joel Emer (MIT/NVIDIA); Daniel

Sanchez (MIT)

 

Simultaneous and Heterogenous Multithreading

Kuan-Chieh Hsu, Hung-Wei Tseng (UC Riverside)

 

Spatula: A Hardware Accelerator for Sparse Matrix Factorization

Axel Feldmann, Daniel Sanchez (MIT)

 

TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs

Haotian Tang, Shang Yang, Zhijian Liu (MIT); Ke Hong (Tsinghua); Zhongming Yu (UC San Diego); Xiuyu Li

(UC Berkeley); Guohao Dai (Shanghai Jiao Tong University); Yu Wang (Tsinghua); Song Han (MIT)

 

SecureLoop: Design Space Exploration of Secure DNN Accelerators

Kyungmi Lee, Mengjia Yan (MIT); Joel Emer (MIT, NVIDIA); Anantha Chandrakasan (MIT)

 

HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity

Yannan Nellie Wu (MIT); Po-An Tsai, Saurav Muralidharan, Angshuman Parashar (NVIDIA); Vivienne Sze (MIT); Joel Emer (MIT/NVIDIA)

 

Victima: Drastically Increasing Address Translation Reach by Leveraging Underutilized Cache Resources

Konstantinos Kanellopoulos, Hong Chul Nam, Nisa Bostanci, Rahul Bera, Mohammad Sadrosadati (ETH Zürich); Rakesh Kumar (Norwegian University of Science and Technology (NTNU)); Davide Basilio Bartolini (Huawei); Onur Mutlu (ETH Zürich)

 

PockEngine: Sparse and Efficient Fine-tuning in a Pocket

Ligeng Zhu (MIT); Lanxiang Hu (Columbia); Ji Lin, Wei-Ming Chen, Wei-Chen Wang, Chuang Gan, Song Han (MIT)

 

Pipestitch: An energy-minimal dataflow architecture with lightweight threads

Nathan Serafin, Souradip Ghosh, Harsh Desai, Nathan Beckmann, Brandon Lucia (Carnegie Mellon University)

ACM Symposium on Cloud Computing 2023 (SoCC '23), Santa Cruz, CA, October 30-November 1, 2023

 

Dissecting Overheads of Service Mesh Sidecars

Xiangfeng Zhu (U of Washington); Guozhen She (Duke); Bowen Xue (U of Washington); Yu Zhang, Yongsu Zhang, Xuan Kelvin Zou, Xiong Chun Duan, Peng He (ByteDance Inc.); Arvind Krishnamurthy (U of Washington); Matthew Lentz (Duke University and VMware Research); Danyang Zhuo (Duke); Ratul Mahajan (U of Washington, Amazon)

 

Enabling Multi-tenancy on SSDs with Accurate IO Interference Modeling

Lokesh N. Jaliminche (UC Santa Cruz); Chandranil (Nil)

Chakraborttii (Trinity College, Hartford, USA); Changho Choi (Samsung

Semiconductor, Inc, USA); Heiner Litz (UC Santa Cruz)

 

TMC: Near-Optimal Resource Allocation for Tiered-Memory Systems

Yuanjiang Ni (Alibaba Group); Pankaj Mehra (Elephance Memory; Ethan Miller (Pure Storage / UC Santa Cruz); Heiner Litz (UC Santa Cruz)


Anticipatory Resource Allocation for ML Training Clusters

Tapan Chugh (U of Washington); Srikanth Kandula (Microsoft); Arvind Krishnamurthy (U of Washington); Ratul Mahajan (U of Washington, Amazon); Ishai Menache (Microsoft)

 

Disaggregating ML Input Data Processing at Scale

Andrew Audibert, Yang Chen (Google); Dan-Ovidiu Graur, Ana Klimovic (ETH Zurich); Jiri Simsa, Chandramohan A. Thekkath (Google)

 

Yama: Providing Performance Isolation for Black-Box Offloads

Tao Ji, Divyanshu Saxena (UT Austin); Brent Stephens (U of Utah); Aditya Akella (UT Austin and Google)

AWARDS


2023 ACM-IEEE CS Eckert-Mauchly Award: The ACM and IEEE Computer Society named Kunle Olukotun, Professor at Stanford University, as the recipient of the ACM-IEEE CS Eckert-Mauchly Award for contributions and leadership in the development of parallel systems, especially multicore and multithreaded processors.


2023 ACM-IEEE CS Ken Kennedy Award: ACM has named Keshav Pingali, the W.A. "Tex" Moncrief Chair of Grid and Distributed Computing at the University of Texas at Austin, as the recipient of the 2023 ACM-IEEE CS Ken Kennedy Award. The Ken Kennedy Award recognizes groundbreaking achievements in parallel and high-performance computing. Pingali is cited for contributions to high-performance parallel computing for irregular algorithms such as graph algorithms. 


BEST PAPER AWARDS


ASPLOS 2023 Distinguished Papers:

MC Mutants: Evaluating and Improving Testing for Memory Consistency Specifications

Reese Levine, Tianhao Guo, Mingun Cho, Alan Baker, Raph Levien, David Neto, Andrew Quinn, Tyler Sorensen


Pond: CXL-Based Memory Pooling Systems for Cloud Platforms

Huaicheng Li, Daniel S. Berger, Lisa Hsu, Daniel Ernst, Pantea Zardoshti, Stanko Novakovic, Monish Shah, Samir Rajadnya, Scott Lee, Ishwar Agarwal, Mark D. Hill, Marcus Fontoura, Ricardo Bianchini


RepCut: Superlinear Parallel RTL Simulation with Replication-Aided Partitioning

Haoyuan Wang and Scott Beamer


GRANTS and FUND RAISING


Hakim Weatherspoon and Robbert van Renesse (Cornell)

Exostellar raises $15M to help companies optimize their cloud spend



BOOKS


MACHINE LEARNING SYSTEMS with TINYML

Machine Learning Systems with TinyML” offers an accessible entry into the fascinating world of computing that enables machine learning, with a special focus on its implementation in resource-constrained environments using TinyML. This is an open source project on GitHub; contributors are welcome!


BLOGS


September 25, 2023

Remembering Luiz Barroso

by Partha Ranganathan


August 3, 2023

Reducing Embodied Carbon is Important

by Daniel S. Berger, David Brooks, Fiodar Kazhamiaka, Mark D. Hill, Ricardo Bianchini, Carole-Jean Wu, Karin Strauss, Kali Frost, Jaylen Wang, Kevin Martins, Sharon Gillett, Esha Choukse, Dan Ernst, Rodrigo Fonseca, Kari Lio, Bhargavi Narayanasetty, Pratyush Patel, Celine Irvene, Akshitha Sriraman, George Porter, Alex Jones, Udit Gupta, Bilge Acun-Uyan, Kim Hazelwood, and Doug Carmean


July 3, 2023

ISCA@50 Retrospective: 1996-2020

by José Martínez


June 14, 2023

Architecture 2.0: Why Computer Architects Need a Data-Centric AI Gymnasium

by Vijay Janapa Reddi and Amir Yazdanbakhsh 



NEWS ITEMS


December 6, 2023

AMD Takes On Nvidia with New GPU for AI


September 13, 2023

Cornell Start-up Exostellar raises $15M to help companies optimize their cloud spend


August 9, 2023

Baidu, Bytedance, Tencent and Alibaba Order $5B of Nvidia Chips to Power AI Ambitions


June 20, 2023

AMD hopes to catch up on AI with Instinct MI300 AI supercomputing hybrid processor


June 14, 2023

OctoML launches OctoAI, a self-optimizing compute service for AI


May 26, 2023 

Marvell Follows Nvidia as the Next Hot AI Stock. The Sector Is on Fire.


IAP Workshop Testimonials

 

Professor Christos Kozyrakis, Stanford - “As a starting point, I think of these IAP workshops as ‘Hot Chips meets ISCA’, i.e., an intersection of industry’s newest solutions in hardware (Hot Chips) with academic research in computer architecture (ISCA); but more so, these workshops additionally cover new subsystems and applications, and in a smaller venue where it is easy to discuss ideas and cross-cutting approaches with colleagues.” 

 

Professor Heiner Litz, UC Santa Cruz - "The IAP workshops represent extremely valuable events for all attendees including industry members, students and faculty. On my side, multiple project collaborations and student internships have evolved from these meetings leading to a win-win-win situation for all participants.” 


Prof. Ana Klimovic, ETH Zurich - “I have attended three IAP workshops as a PhD student at Stanford and was consistently impressed by the quality of the talks and the breadth of the topics covered. These workshops bring top-tier industry and academia together to discuss cutting-edge research challenges. It is a great opportunity to exchange ideas and get inspiration for new research opportunities." 

 

Dr. Nathan Pemberton, Scientist, Amazon Web Services - "IAP workshops provide a valuable chance to explore emerging research topics with a focused group of participants, and without all the time/effort of a full-scale conference. Instead of rushing from talk to talk, you can slow down and dive deep into a few topics with experts in the field." 


Support a unique tech forum that brings together academia and industry under your company's banner? 

Please feel free to contact us regarding sponsorship opportunities, and for more info about any of the items above.

 

Best,

 

Jim Ballingall

Executive Director

Industry-Academia Partnership (IAP) 

www.industry-academia.org

jim.ballingall@gmail.com

cel: 408-212-1035


Copyright © 2013-2024 Industry-Academia Partnership