Course Schedule and Reading List

Introduction


Exercising logic

Handling data

  • Oct. 17: Alper Okcan and Mirek Riedewald. Processing theta-joins using MapReduce . SIGMOD 2011:949-960. (Zhao Zhou)

  • Oct. 19: Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung. The Google file system. In Proc. of the 9th ACM symposium on Operating Systems Principles (SOSP'03), New York, NY, USA. (Subrota)

  • Oct. 19: Yang, H., Dasdan, A., Hsiao, R., and Parker, D. S. Map-reduce-merge: simplified relational data processing on large clusters. In Proceedings of the 2007 ACM SIGMOD international Conference on Management of Data (Beijing, China, June 11 - 14, 2007). SIGMOD '07. 1029-1040. (Jiangchuan Zheng)

  • Oct. 24: Cooper, B. F., Ramakrishnan, R., Srivastava, U., Silberstein, A., Bohannon, P., Jacobsen, H., Puz, N., Weaver, D., and Yerneni, R. PNUTS: Yahoo!'s hosted data serving platform. In Proc. VLDB Endow. 1, 2 (Aug. 2008), 1277-1288. (Chen Qian).

  • Oct. 24: Agrawal, P., Silberstein, A., Cooper, B. F., Srivastava, U., and Ramakrishnan, R. Asynchronous view maintenance for VLSD databases. In Proceedings of the 35th SIGMOD international Conference on Management of Data (Providence, Rhode Island, USA, June 29 - July 02, 2009). SIGMOD '09. 179-192. (Chen Qian)

  • Case study: Google App Engine (Notes...).

  • Oct. 26: Chang, F., Dean, J., Ghemawat, S., Hsieh, W. C., Wallach, D. A., Burrows, M., Chandra, T., Fikes, A., and Gruber, R. E. Bigtable: a distributed storage system for structured data. In Proceedings of the 7th Symposium on Operating Systems Design and Implementation (Seattle, Washington, November 06 - 08, 2006). Operating Systems Design and Implementation. USENIX Association, Berkeley, CA, 205-218. (Jiangchuan Zheng)

  • Oct. 26: DeCandia, G., Hastorun, D., Jampani, M., Kakulapati, G., Lakshman, A., Pilchin, A., Sivasubramanian, S., Vosshall, P., and Vogels, W. Dynamo: Amazon's highly available key-value store. In Proc. of the 21st ACM SIGOPS Symposium on Operating Systems Principles (SOSP'07). Stevenson, Washington, USA, Oct. 14 - 17, 2007, 205-220.

  • Oct. 31: Lab 3: Windows Azure (Notes...) -Room 3654/HP Teaching Lab (inside the Embedded Systems Lab).

  • Nov. 2: Lab 4: Windows Azure (Notes...) -Room 3654/HP Teaching Lab (inside the Embedded Systems Lab).

  • Pike, R., Dorward, S., Griesemer, R., and Quinlan, S. Interpreting the data: Parallel analysis with Sawzall. Sci. Program. 13, 4 (Oct. 2005), 277-298. (Notes...).

  • Jason Baker, Chris Bond, James C. Corbett, JJ Furman, Andrey Khorlin, James Larson, Jean-Michel Leon, Yawei Li, Alexander Lloyd and Vadim Yushprakh. Megastore: Providing Scalable, Highly Available Storage for Interactive Services . In Proc. of the Conference on Innovative Data system Research (CIDR), 2011, pp. 223-234.

  • Olston, C., Reed, B., Srivastava, U., Kumar, R., and Tomkins, A. Pig Latin: a not-so-foreign language for data processing. In Proc. of the 2008 ACM SIGMOD international Conference on Management of Data (Vancouver, Canada, June 09 - 12, 2008). SIGMOD '08. 1099-1110 (Notes...).

  • Sudipto Das, Divyakant Agrawal, and Amr El Abbadi ElasTraS: An Elastic Transactional Data Store in the Cloud HotCloud'09 (Notes...).

  • Nov. 7: Brantner, M., Florescu, D., Graf, D., Kossmann, D., and Kraska, T. Building a database on S3. In Proc. of the 2008 ACM SIGMOD international Conference on Management of Data (Vancouver, Canada, June 09 - 12, 2008). SIGMOD '08. 1099-1110 (Zhao Zhou).


Platforms, supporting services, and applications

  • Nov. 7 Al-Fares, M., Loukissas, A., and Vahdat, A. A scalable, commodity data center network architecture. In Proceedings of the ACM SIGCOMM 2008 Conference on Data Communication (Seattle, WA, USA, August 17 - 22, 2008). SIGCOMM '08. 63-74. (Notes...). (Wang Min)

  • Case study: Clean Slate and the future Internet (Notes...).

  • Nov. 9: Chuanxiong Guo, Haitao Wu, Kun Tan, Lei Shi, Yongguang Zhang, and Songwu Lu. DCell: A Scalable and Fault-Tolerant Network Structure for Data Centers. ACM SIGCOMM 08. (Notes...). (Jiansong)

  • Nov. 9: Chuanxiong Guo, Guohan Lu, Dan Li, Xuan Zhang, Haitao Wu, Yunfeng Shi, Chen Tian, Yongguang Zhang, and Songwu Lu. BCube: A High Performance, Server-centric Network Architecture for Modular Data Centers. ACM SIGCOMM 09. (Notes...). (Jiansong)

  • Nov. 14: Case study: OpenFlow switching (Notes...). (Wang Min)

  • Nov. 14: Greenberg, A., Hamilton, J., Maltz, D. A., and Patel, P. The cost of a cloud: research problems in data center networks. ACM SIGCOMM 08. (Notes...). (Xie Ning)

  • Burrows, M. The Chubby lock service for loosely-coupled distributed systems. In Proc. of the 7th Symposium on Operating Systems Design and Implementation (OSDI'06), December, 2008. Seattle, WA, USA. (Notes...).

  • Nov. 16: Jorge Londono, Azer Bestavros, and Shang-Hua Teng. Colocation Games and Their Application to Distributed Resource Management. HotCloud'09 (Kaixiang)

  • Vishwanath, K. V., Greenberg, A., and Reed, D. A. Modular data centers: how to design them? . Proceedings of the 1st ACM Workshop on Large-Scale System and Application Performance (Garching, Germany, June 10 - 10, 2009). LSAP '09. 3-10.

  • Church, K., Greenberg, A. and Hamilton, J. On Delivering Embarrassingly Distributed Cloud Services ACM Workshop on Hot Topics in Networks, (HotNets-VII), Calgary (Notes...).

  • Abhishek Chandra and Jon Weissman Nebulas: Using Distributed Voluntary Resources to Build Clouds HotCloud'09 (Notes...).

  • Nov. 21: Timothy Wood, Prashant Shenoy, Alexandre Gerber, K.K. Ramakrishnan, and Jacobus Van der Merwe The Case for Enterprise-Ready Virtual Private Clouds. HotCloud'09. (Bo)

  • Buyya, R. Chee Shin Yeo Venugopal, S. Market-Oriented Cloud Computing: Vision, Hype, and Reality for Delivering IT Services as Computing Utilities. The 10th IEEE International Conference on High Performance Computing and Communications, 2008 (Notes...).

  • Nov. 21: Michael Isard, Vijayan Prabhakaran, Jon Currey, Udi Wieder, Kunal Talwar and Andrew Goldberg Quincy: Fair Scheduling for Distributed Computing Clusters. SOSP'09 (Jinguo)

  • Nov. 23 Fan, Xiaobo and Weber, Wolf-Dietrich and Barroso, Luiz Andre. Power provisioning for a warehouse-sized computer. In Proceedings of the 34th Annual international Symposium on Computer Architecture (San Diego, California, USA, June 09 - 13, 2007). ISCA '07. 13-23. (Pang Lei)

  • Nov. 23: Ramya Raghavendra, Parthasarathy Ranganathan, Vanish Talwar, Zhikui Wang, Xiaoyun Zhu No "Power" Struggles: Coordinated Multi-level Power Management for the Data Center. In Proceedings of the International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Seattle, WA, March 2008. (Bo)

  • Nov. 28: Grzegorz Malewicz, Matthew H. Austern, Aart J. C. Bik, James C. Dehnert, Ilan Horn, Naty Leiser, and Grzegorz Czajkowski. Pregel: A System for Large-Scale Graph Processing. SIGMOD'10. (Xiao)

  • Nov. 30: Sergey Melnik, Andrey Gubarev, Jing Jing Long, Geoffrey Romer, Shiva Shivakumar, Matt Tolton and Theo Vassilakis. Dremel: Interactive Analysis of Web-Scale Datasets, Proc. of the 36th Int'l Conf on Very Large Data Bases, 2010, pp. 330-339. (Xiao)

  • Nov. 30: Tyson Condie, Neil Conway, Peter Alvaro, and Joseph M. Hellerstein, Khaled Elmeleegy and Russell Sears. MapReduce Online . In Proc. of the 7th USENIX Symposium on Networked Systems Design and Implementation (NSDI '10). (Pang Lei)

  • Hyunjoo Kim, Shivangi Chaudhari, Manish Parashar, and Christopher Marty Online Risk Analytics on the Cloud In Proc. of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid. (Notes...).

  • Rajagopal Ananthanarayanan, Karan Gupta, Prashant Pandey, Himabindu Pucha, Prasenjit Sarkar, Mansi Shah, and Renu Tewari. Cloud Analytics: Do We Really Need to Reinvent the Storage Stack? HotCloud'09 (Notes...).

  • Dec. 5: Lucian Popa, Mihai Budiu, Yuan Yu, and Michael Isard DryadInc: Reusing Work in Large-scale Computations . HotCloud'09 (Wai Ting)

  • Dec. 5: Leslie Lamport. The part-time parliament ACM Trans. Comput. Syst. 16, 2 (May. 1998), 133-169. (Shaoming)

  • Dec. 7: Roy Campbell, Indranil Gupta, Michael Heath, Steven Y. Ko, Michael Kozuch, Marcel Kunze, Thomas Kwan, Kevin Lai, Hing Yan Lee, Martha Lyons, Dejan Milojicic, David O'Hallaron, and Yeng Chai Soh Open Cirrus Cloud Computing Testbed: Federated Data Centers for Open Source Systems and Services Research HotCloud'09 (Wai Ting)

  • Dec. 7 Dhruba Borthakur, Jonathan Gray, Joydeep Sen Sarma, Kannan Muthukkaruppan, Nicolas Spiegelberg, Hairong Kuang, Karthik Ranganathan, Dmytro Molkov, Aravind Menon, Samuel Rash, Rodrigo Schmidt, and Amitanand Aiyer. Apache Hadoop goes realtime at Facebook. SIGMOD '11. (Shaoming)

  • Dec. 7 Review and summary