CS 728 Spring 2020

CS 728 Reading List

Read the "Task of the Referee" by Alan Jay Smith posted on the Links page.

For each reading, write a concise paragraph for each of the following questions:
1. What is the problem? (How important is it?)
2. What are the insights?
3. What is the solution? (Is it feasible?)
4. What is the takeaway message?
5. Will this paper win the test of time award?
6. Name one reason why this paper should have not appeared in a top venue (e.g., ISCA, SOSP)?

Please submit your answers in a single pdf file before noon (12pm) on Fridays.

Feb. 17: Introduction
	The task of the referee
Feb. 24: Datacenter basics
	Web search for a planet: The Google cluster architecture [Google, IEEE Micro '03]
	A view of cloud computing [Berkeley, CACM '10]
Mar. 02: Datacenter workloads
	Clearing the Clouds: a study of emerging scale-out workloads on modern hardware [EPFL, ASPLOS '12]
	AsmDB: Understanding and Mitigating Front-End Stalls in Warehouse-Scale Computers [Google, ISCA '19]
Mar. 09: Quality of service
	Tail at Scale [Google, CACM '13]
	Few-to-Many: Incremental Parallelism for Reducing Tail Latency in Interactive Services [Microsoft, ASPLOS '15]
Mar. 16: No class
Mar. 23: Data services
	Spanner: Google’s Globally-Distributed Database [Google, OSDI '12]
	MICA: A Holistic Approach to Fast In-Memory Key-Value Storage [CMU & Intel, NSDI '14]
Mar. 30: Networking
	VL2: A Scalable and Flexible Data Center Network [Microsoft, SIGCOMM '09]
	Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities [Stanford & MIT, SIGCOMM '18]
Apr. 06: Microservices
	RPCValet: NI-Driven Tail-Aware Balancing of µs-Scale RPCs [EPFL, ASPLOS '19]
	Optimus Prime: Accelerating Data Transformation in Servers [EPFL, ASPLOS '20]
Apr. 13: No class -- Easter break
Apr. 20: Processor design
	Scale-out Processors [EPFL, ISCA '12]
	Stretch: Balancing QoS and Throughput for Colocated Server Workloads on SMT Cores [uEdinburgh & EPFL, HPCA '19]
Apr. 27: Memory systems
	Scale-Out NUMA [EPFL, ASPLOS '14]
	Efficient Memory Disaggregation with INFINISWAP [uMichigan, NSDI '09]
May. 04: Storage
	Windows Azure Storage: a highly available cloud storage service with strong consistency [Microsoft, SOSP '11]
	f4: Facebook’s Warm BLOB Storage System [Facebook, OSDI '14]
May. 11: Virtualization and serverless computing
	SCONE: Secure Linux Containers with Intel SGX [TU Dresden, OSDI '16]
	Serverless Computation with OpenLambda [uWisconsin, HotCloud '16]
May. 18: Deep learning in datacenters (1)
	TensorFlow: A System for Large-Scale Machine Learning [Google, OSDI '16]
	Scaling Distributed Machine Learning with the Parameter Server [CMU, Baidu, and Google, OSDI '14]
May. 25: Deep learning in datacenters (2)
	A Configurable Cloud-Scale DNN Processor for Real-Time AI [Microsoft, ISCA '18]
	Applied Machine Learning at Facebook: A Datacenter Infrastructure Perspective [Facebook, HPCA '18]

CS 728 Topics on Datacenter DesignSpring 2020

CS 728 Reading List

CS 728 Topics on Datacenter Design
Spring 2020