SELECT CONFERENCES and PUBLICATIONS in 1H 2024
POPL 2024, The 51st ACM Symposium on Principles of Programming Languages, January 14-20, 2024, London, UK
The Network is the Computer: A Programming Language Perspective
Nate Foster, Cornell University and Jane Street
Inside the Scala Capture Checker
Martin Odersky, EPFL
Computational-Bounded Robust Compilation and Universally Composable Security
Robert Künnemann, CISPA Helmholtz Center for Information Security, Ethan Cecchetti, University of Wisconsin-Madison
FAST '24 - The 22nd USENIX Conference on File and Storage Technologies - February 26-29, 2024, Santa Clara, CA, USA
Baleen: ML Admission & Prefetching for Flash Caches Prefetching for Flash Caches
Daniel Lin-Kit Wong, Carnegie Mellon University; Hao Wu, Meta; Carson Molder, UT Austin; Sathya Gunasekar, Jimmy Lu, Snehal Khandkar, and Abhinav Sharma, Meta; Daniel S. Berger, Microsoft and University of Washington; Nathan Beckmann and Gregory R. Ganger, Carnegie Mellon University
Weidong Zhang, Erci Xu, Qiuping Wang, Xiaolu Zhang, Yuesheng Gu, Zhenwei Lu, Tao Ouyang, Guanqun Dai, Wenwen Peng, Zhe Xu, Shuo Zhang, Dong Wu, Yilei Peng, Tianyun Wang, Haoran Zhang, Jiasheng Wang, Wenyuan Yan, Yuanyuan Dong, Wenhui Yao, Zhongjie Wu, Lingjun Zhu, Chao Shi, Yinhu Wang, Rong Liu, Junping Wu, Jiaji Zhu, and Jiesheng Wu, Alibaba Group
Awarded Best Paper!
HPCA 2024 - The 30th IEEE International Symposium on High-Performance Computer Architecture - March 2-6, 2024, Edinburgh, Scotland
Keynote: Terminus: Moving the Center of Cloud Servers to SmartNICs and Beyond, Derek Chiou
Revet: A Language and Compiler for Dataflow Threads
Alexander Rucker, Shiv Sundram, Coleman Smith, Matt Vilim, Raghu Prabhakar, Fredrik Kjolstad, Kunle Olukotun
MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data Computing
Geraldo Francisco De Oliveira Junior, Ataberk Olgun, Giray Yaglikci, Nisa Bostanci, Juan Gómez Luna, Saugata Ghose, Onur Mutlu
CAMEL: Co-Designing AI Models and eDRAMs for Efficient On-Device Learning
Sai Qian Zhang, Thierry Tambe, Nestor Cuevas, Gu-Yeon Wei, David Brooks
LibPreemptible: Enabling Fast, Adaptive, and Hardware-Assisted User-Space Scheduling
Yueying Li, Nikita Lazarev, David Koufaty, Yijun Yin, Andy Anderson, Zhiru Zhang, G. Edward Suh, Kostis Kaffes, Christina Delimitrou, David Koufaty
Data Motion Acceleration: Chaining Cross-Domain Multi Accelerators
Shu-Ting Wang, Hanyang Xu, Amin Mamandipoor, Rohan Mahapatra, Byung Hoon Ahn, Soroush Ghodrati, Krishnan Kailas, Mohammad Alian, Hadi Esmaeilzadeh
ASPLOS 2024 - The 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, April 27-May 1, 2024, San Diego, CA
Keynote: Societal infrastructure in the age of Artificial General Intelligence,
Amin Vahdat
CC-NIC: a Cache-Coherent Interface to the NIC
Henry N. Schuh and Arvind Krishnamurthy (Google and University of Washington); David Culler (Google); Henry M. Levy (Google and University of Washington); Luigi Rizzo (Google); Samira Khan (Google and University of Virginia); Brent E. Stephens (Google and University of Utah)
DREAM: A Dynamic Scheduler for Dynamic Real-time Multi-model ML Workloads
Seah Kim (University of California Berkeley); Hyoukjun Kwon (University of California Irvine); Jinook Song, Jihyuck Jo, Yu-Hsin Chen, Liangzhen Lai, and Vikas Chandra (Meta)
8-bit Transformer Inference and Fine-tuning for Edge Accelerators
Jeffrey Yu, Kartik Prabhu, Yonatan Urman, Robert M. Radway, Eric Han, and Priyanka Raina (Stanford University)
Keynote: Challenges and Opportunities for Systems Using CXL Memory, Emmett Witchel
BaCO: A Fast and Portable Bayesian Compiler Optimization Framework
Erik Orm Hellsten (Lund University);Artur Souza (Federal University of Minas Gerais); Johannes Lenfers (University of Münster); Rubens Lacouture and Olivia Hsu (Stanford University);Adel Ejjeh (University of Illinois at Urbana-Champaign); Fredrik Kjolstad (Stanford University); Michel Steuwer (University of Edinburgh); Kunle Olukotun (Stanford University); Luigi Nardi (Lund University and Stanford University)
Characterizing a Memory Allocator at Warehouse Scale
Zhuangzhuang Zhou (Cornell University); Vaibhav Gogte, Nilay Vaish, Chris Kennelly, Patrick Xia, Svilen Kanev, and Tipp Moseley (Google); Christina Delimitrou (MIT); Parthasarathy Ranganathan (Google)
Tandem Processor: Grappling with Emerging Operators in Neural Networks
Soroush Ghodrati, Sean Kinzer, Hanyang Xu, and Rohan Mahapatra (University of California San Diego); Yoonsung Kim (KAIST); Byung Hoon Ahn (University of California San Diego); Dong Kai Wang (University of Illinois Urbana-Champaign); Lavanya Karthikeyan (University of California San Diego); Amir Yazdanbakhsh (Google DeepMind); Jongse Park (KAIST); Nam Sung Kim (University of Illinois Urbana-Champaign); Hadi Esmaeilzadeh (University of California San Diego)
GPU-based Private Information Retrieval for On-Device Machine Learning Inference
Maximilian Lam (Harvard University); Jeff Johnson (Meta); Wenjie Xiong (Virginia Tech); Kiwan Maeng (Pennsylvania State University); Udit Gupta (Harvard University); Yang Li, Liangzhen Lai, and Ilias Leontiadis (Meta); Minsoo Rhu (KAIST and Meta); Hsien-Hsin S. Lee (Intel); Vijay Janapa Reddi, Gu-Yeon Wei, and David Brooks (Harvard University); Edward Suh (Meta and Cornell University)
RPG^2: Robust Profile-Guided Runtime Prefetch Generation
Yuxuan Zhang, Nathan Sobotka, and Soyoon Park (University of Pennsylvania);Saba Jamilan (University of California Santa Cruz);Tanvir Ahmed Khan (Columbia University);Baris Kasikci (University of Washington and Google);Gilles A Pokam (Intel);Heiner Litz (University of California Santa Cruz);Joseph Devietti (University of Pennsylvania)
NSDI '24 - The 21st USENIX Symposium on Networked Systems Design and Implementation - April 16-18, 2024, Santa Clara, CA
Zhanghao Wu, Wei-Lin Chiang, Ziming Mao, and Zongheng Yang, University of California, Berkeley; Eric Friedman and Scott Shenker, University of California, Berkeley, and ICSI; Ion Stoica, University of California, Berkeley
Awarded Outstanding Paper!
Sieve is Simpler than LRU: An Efficient Turn-Key Eviction Algorithm for Web Caches
Yazhuo Zhang, Juncheng Yang, Yao Yue, Ymir Vigfusson, K. V. Rashmi
Community Award Winner!
Sidekick: In-Network Assistance for Secure End-to-End Transport Protocols
Gina Yuan, Matthew Sotoudeh, and David K. Zhang, Stanford University; Michael Welzl, University of Oslo; David Mazières and Keith Winstein, Stanford University
Outstanding Paper Award and Community Award Winner!
TECC: Towards Efficient QUIC Tunneling via Collaborative Transmission Control
Jiaxing Zhang, Alibaba Group, University of Chinese Academy of Sciences; Furong Yang, Alibaba Group; Ting Liu, Alibaba Group, University of Chinese Academy of Sciences; Qinghua Wu, University of Chinese Academy of Sciences, Purple Mountain Laboratories, China; Wu Zhao, Yuanbo Zhang, Wentao Chen, Yanmei Liu, Hongyu Guo, and Yunfei Ma, Alibaba Group; Zhenyu Li, University of Chinese Academy of Sciences, Purple Mountain Laboratories, China
Nirav Atre, Hugo Sadok, and Justine Sherry, Carnegie Mellon University
Jiaqi Gao, Jiamin Cao, Yifan Li, Mengqi Liu, Ming Tang, Dennis Cai, and Ennan Zhai, Alibaba Cloud
Sudarsanan Rajasekaran and Manya Ghobadi, Massachusetts Institute of Technology; Aditya Akella, UT Austin
Ao Li, Carnegie Mellon University; Shan Lu, Microsoft Research and University of Chicago; Suman Nath, Microsoft Research; Rohan Padhye and Vyas Sekar, Carnegie Mellon University
EuroSys 2024 - April 22-25, 2024, Athens, Greece
Model Selection for Latency-Critical Inference Serving
Daniel Mendoza (Stanford University), Francisco Romero (Stanford University), Caroline Trippel (Stanford University)
Enoki: High Velocity Linux Kernel Scheduler Development
Samantha Miller (University of Washington), Anirudh Kumar (University of Washington), Tanay Vakharia (University of Washington), Ang Chen (University of Michigan), Danyang Zhuo (Duke University), Thomas Anderson (University of Washington)
Orion: Interference-aware, Fine-grained GPU Sharing for ML Applications
Foteini Strati (ETH Zurich), Xianzhe Ma (ETH Zurich), Ana Klimovic (ETH Zurich)
ZKML: An Optimizing System for ML Inference in Zero-Knowledge Proofs
Bing-Jyue Chen (UIUC), Suppakit Waiwitlikhit (Stanford), Ion Stoica (UC Berkeley), Daniel Kang (UIUC)
ISCA 51 - The International Symposium on Computer Architecture
June 29-July 3, 2024, Buenos Aires, Argentina
Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Execution
R. Bera, A. Ranganathan, J. Rakshit, S. Mahto, A. Nori, J. Gaur, A. Olgun, K. Kanellopoulos, M. Sadrosadati, S. Subramoney, O. Mutlu
FireAxe: Partitoned FPGA-Accelerated Simulation of Large-Scale RTL Designs
J. Whangbo, E. Lim, C. Zhang, K. Anderson, A. Gonzalez, R. Gupta, N. Krishnakumar, S. Karandikar, B. Nikolic, Y. Shao, K. Asanovic
MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems
S. Hsia, A. Golden, B. Acun, N. Ardalani, Z. DeVito, G. Wei, D. Brooks, C. Wu
Trapezoid: A Versatile Accelerator for Dense and Sparse Matrix Multiplications
Y. Yang, J. Emer, D. Sanchez
UDP: Utility-Driven Fetch Directed Instruction Prefetching
S. Oh, M. Xu, T. Khan, B. Kasikci, H. Litz
|