Gregory Ganger Professor, Affiliated Faculty Office 2208 Mehrabian Collaborative Innovation Center Email ganger@ece.cmu.edu Phone (412) 268-1297 Department Electrical and Computer Engineering Computer Science Department: Affiliated Research Interests Data-Intensive and Cloud Computing Distributed Systems Advisees Daiyaan Arfeen Sanjith Athlur Suhas Jayaram Subramanya Sara McAllister Hojin Park Ziyue Qiu Daniel L.-K. Wong Timothy Kim CSD Courses Taught 15719 - Spring, 2024 Research/Teaching Statement I have broad research interests in computer systems, including cloud computing, storage/file systems, operating systems and distributed systems. I am involved in several ongoing projects in such areas as systems for large-scale ML, cloud/cluster resource scheduling, and exploitation of new storage/NVM technologies. Big-learning systems for Big Data Modern data analytics often relies on statistical machine learning (ML) to parameterize models that fit observation data, for use in making predictions, correlating causes with effects, etc. Growth in data and desired model precision dictate parallel execution of ML algorithms on clusters, with the corresponding work distribution, synchronization, and data consistency challenges. The big-learning group is exploring powerful new approaches for efficient, scalable, and robust big-learning on Big Data. Cloud Computing We are exploring software systems challenges in efficiently supporting and exploiting cloud computing, such as resource allocation/scheduling and exploiting elasticity for stateful services (e.g., storage) and long-running computations (e.g., large-scale ML). Parallel Data Lab (PDL) As Director of the Parallel Data Lab, I lead and collaborate on a number of storage-related projects in areas such as storage system architecture, file systems, and Big Data systems. For example, in addition to the activities discussed above, we are exploring how system software should change to accommodate new storage technologies like non-volatile RAM (e.g., PCM) and best exploit Flash. Publications Journal Article Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays 2023 • ACM Transactions on Storage • 19(1): Li H, Putra ML, Shi R, Kurnia FI, Lin X, Do J, Kistijantoro AI, Ganger GR, Gunawi HS Conference Mimir: Finding Cost-efficient Storage Configurations in the Public Cloud 2023 • PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023 • 22-34 Park H, Ganger GR, Amvrosiadis G Conference RAIZN: Redundant Array of Independent Zoned Namespaces 2023 • International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS • 660-673 Kim T, Jeon J, Arora N, Li H, Kaminsky M, Andersen DG, Ganger GR, Amvrosiadis G, Bjorling M Conference Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling 2023 • PROCEEDINGS OF THE TWENTY-NINTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, SOSP 2023 • 642-657 Subramanya SJ, Arfeen D, Lin S, Qiao A, Jia Z, Ganger GR Journal Article Kangaroo: Theory and Practice of Caching Billions of Tiny Objects on Flash 2022 • ACM Transactions on Storage • 18(3): MCallister S, Berg B, Tutuncu-Macias J, Yang J, Gunasekar S, Lu J, Berger DS, Beckmann N, Ganger GR
Journal Article Extending and Programming the NVMe I/O Determinism Interface for Flash Arrays 2023 • ACM Transactions on Storage • 19(1): Li H, Putra ML, Shi R, Kurnia FI, Lin X, Do J, Kistijantoro AI, Ganger GR, Gunawi HS
Conference Mimir: Finding Cost-efficient Storage Configurations in the Public Cloud 2023 • PROCEEDINGS OF THE 16TH ACM INTERNATIONAL SYSTEMS AND STORAGE CONFERENCE, SYSTOR 2023 • 22-34 Park H, Ganger GR, Amvrosiadis G
Conference RAIZN: Redundant Array of Independent Zoned Namespaces 2023 • International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS • 660-673 Kim T, Jeon J, Arora N, Li H, Kaminsky M, Andersen DG, Ganger GR, Amvrosiadis G, Bjorling M
Conference Sia: Heterogeneity-aware, goodput-optimized ML-cluster scheduling 2023 • PROCEEDINGS OF THE TWENTY-NINTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES, SOSP 2023 • 642-657 Subramanya SJ, Arfeen D, Lin S, Qiao A, Jia Z, Ganger GR
Journal Article Kangaroo: Theory and Practice of Caching Billions of Tiny Objects on Flash 2022 • ACM Transactions on Storage • 18(3): MCallister S, Berg B, Tutuncu-Macias J, Yang J, Gunasekar S, Lu J, Berger DS, Beckmann N, Ganger GR