User Tools

Site Tools


readings

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
readings [2014/09/30 15:39]
yixinluo
readings [2014/10/27 20:55]
yixinluo
Line 5: Line 5:
 Note that the reviews are due at 11:59 PM on the due date. Note that the reviews are due at 11:59 PM on the due date.
  
-==== Reading List (now in reverse order) ====+==== Reading List (in reverse order) ==== 
 +=== 10/31/2014 === 
 +Review required for the following two papers, due on Friday, Oct 31. 
 +  * **[[http://​www.cs.utexas.edu/​~pingali/​CS395T/​2009fa/​lectures/​herlihy93transactional.pdf|Maurice Herlihy and J. Eliot B. Moss, "​Transactional Memory: Architectural Support for Lock-Free Data Structures",​ ISCA 1993.]]** 
 +  * **[[http://​www.cs.cmu.edu/​~tcm/​tcm_papers/​isca00.pdf|J. Gregory Steffan et al., "A Scalable Approach to Thread-Level Speculation",​ ISCA 2000.]]** 
 +Required videos: 
 +| 9/25 Wed. | 2.5.1 Speculation | [[http://​www.ece.cmu.edu/​~ece742/​f14/​files/​onur-740-fall13-module2.5-speculation.pdf|pdf]],​ [[http://​www.ece.cmu.edu/​~ece742/​f14/​files/​onur-740-fall13-module2.5-speculation.pptx|pptx]],​ [[https://​www.youtube.com/​watch?​v=g3IF8DTtr8c|YouTube]][[http://​cmu.vid.acatar.com/​Panopto/​Pages/​Viewer/​Default.aspx?​id=dbb3baf9-c85e-4007-8c71-1e3204fe9907|Panopto]] | [[http://​www.ece.cmu.edu/​~ece740/​f13/​doku.php?​id=readings#​module_2-5|readings]] | 
 +| 9/27 Fri. | 2.5.2 Speculation | [[https://​www.youtube.com/​watch?​v=FqHk4bxrI8Y|video]] [[http://​cmu.vid.acatar.com/​Panopto/​Pages/​Viewer/​Default.aspx?​id=bf92ac2b-5fb5-4896-9bbd-6cb216a16cdd|Panopto]] | ::: | 
 +| 9/30 Mon. | 2.5.3 Speculation | [[https://​www.youtube.com/​watch?​v=uhyNTy8hvDs|video]] [[http://​cmu.vid.acatar.com/​Panopto/​Pages/​Viewer/​Default.aspx?​id=7dbc66b0-b245-45ba-b047-09a84427676a|Panopto]] | ::: | 
 +| ::: | 2.5.4 Speculation | [[https://​www.youtube.com/​watch?​v=McMyefc8CCE|video]] [[http://​cmu.vid.acatar.com/​Panopto/​Pages/​Viewer/​Default.aspx?​id=e9eddf7c-c5c0-4ab4-a731-30852d601506|Panopto]] | ::: | 
 +Related readings: 
 +  * [[ftp://​ftp.cs.wisc.edu/​sohi/​papers/​1995/​isca.multiscalar.pdf|Sohi et al., “Multiscalar Processors,​” ISCA 1995.]] 
 +  * [[http://​classes.soe.ucsc.edu/​cmpe202/​Spring13/​papers/​12a.pdf|Zhou,​ “Dual-Core Execution: Building a Highly Scalable Single-Thread Instruction Window,” PACT 2005.]] 
 +  * [[http://​pages.cs.wisc.edu/​~rajwar/​papers/​micro01.pdf|Rajwar and Goodman, “Speculative Lock Elision: Enabling Highly Concurrent Multithreaded Execution,​” MICRO 2001.]] 
 + 
 +=== 10/16/2014 === 
 +Review required (one out of two) due on Wednesday night 
 +  * **[[http://​www.eecg.toronto.edu/​~moshovos/​research/​store-wait-free.pdf|Wenisch et al., "​Mechanisms for Store-wait–free Multiprocessors",​ ISCA 2007]]** 
 +    * [[http://​www.cs.utexas.edu/​~pingali/​CS395T/​2009fa/​lectures/​herlihy93transactional.pdf|Herlihy et al., "​Transactional Memory: Architectural Support for Lock-Free Data Structures",​ ISCA 1993.]] 
 +    * [[http://​www.cs.cmu.edu/​~tcm/​tcm_papers/​isca00.pdf|Steffan et al., "A Scalable Approach to Thread-Level Speculation",​ ISCA 2000.]] 
 +  * **[[https://​homes.cs.washington.edu/​~luisceze/​publications/​isca07_bulksc.pdf|Ceze et al., "​BulkSC:​ Bulk Enforcement of Sequential Consistency",​ ISCA 2007.]]** 
 +    * [[https://​homes.cs.washington.edu/​~luisceze/​publications/​isca06_bulk.pdf|Ceze et al., "Bulk Disambiguation of Speculative Threads in Multiprocessors",​ ISCA 2006.]] 
 +    * [[http://​www.ece.cmu.edu/​~ece740/​f13/​lib/​exe/​fetch.php?​media=rajwar01.pdf|Rajwar and Goodman, “Speculative Lock Elision: Enabling Highly Concurrent Multithreaded Execution”,​ MICRO 2001.]] 
 +    * [[http://​www.ece.cmu.edu/​~ece740/​f13/​lib/​exe/​fetch.php?​media=herlihy93.pdf|Herlihy and Moss, “Transactional Memory: Architectural Support for Lock-Free Data Structures”,​ ISCA 1993.]] 
 + 
 +=== 10/14/2014 === 
 +Background lecture and paper (required reading, no need to review) 
 +  * [[https://​www.youtube.com/​watch?​v=Mq24MXW4g3U|Consistency & Coherence Lecture]] 
 +  * [[http://​courses.cs.washington.edu/​courses/​cse548/​10wi/​Lamport.pdf|Leslie Lamport, "How to Make a Multiprocessor Computer That Correctly Executes Multiprocess Programs",​ IEEE Trans. Computers 28(9): 690-691 (1979)]] 
 +Required review (one out of two) due on Monday night 
 +  * **[[http://​hpc.cs.tsinghua.edu.cn/​research/​zwm/​reading/​prof/​2a.pdf|Gharachorloo et al., "​Memory Consistency and Event Ordering in Scalable Shared-Memory Multiprocessors",​ ISCA 1990.]]** 
 +  * **[[https://​courses.engr.illinois.edu/​cs533/​sp2012/​reading_list/​gharachorloo91two.pdf|Gharachorloo et al., "Two Techniques to Enhance the Performance of Memory Consistency Models",​ ICPP 1991.]]** 
 +    * [[http://​www.csd.uoc.gr/​~hy460/​pdf/​kung.pdf|Kung et al., "On Optimistic Methods for Concurrency Control",​ TODS 1981.]] 
 +    * [[https://​engineering.purdue.edu/​~vpai/​Publications/​ranganathan-spaa97.pdf|Ranganathan et al., "Using Speculative Retirement and Larger Instruction Windows to Narrow the Performance Gap Between Memory Consistency Models",​ SPAA 1997.]] 
 +    * [[http://​www.cs.arizona.edu/​~gniady/​papers/​isca99_scrc.pdf|Gniady et al., "Is SC + ILP = RC?", ISCA 1999.]] 
 +Recommended book 
 +  * [[http://​www.morganclaypool.com/​doi/​pdfplus/​10.2200/​S00346ED1V01Y201104CAC016|A Primer on Memory Consistency and Cache Coherence, Chapters 1, 3, 4, 5]] 
 +Further reading in a different area (system level consistency) 
 +  * [[http://​sns.cs.princeton.edu/​docs/​eiger-nsdi13.pdf|Lloyd et al., "​Stronger Semantics for Low-Latency Geo-Replicated Storage",​ NSDI 2013.]] 
 + 
 +=== 10/9/2014 === 
 +Amirali'​s Literature Survey 
 +  * [[http://​www.cs.utah.edu/​events/​thememoryforum/​mike.pdf|Connor et al., "​Highlights of the high-bandwidth memory (HBM) standard",​ Memory Forum 2014.]] 
 +  * [[https://​cs.uwaterloo.ca/​~brecht/​courses/​702/​Possible-Readings/​multiprocessor/​tlb-consistency-computer-1990.pdf|Teller et al., "​Translation-Lookaside Buffer Consistency",​ Computer 1990.]] 
 +  * [[http://​research.cs.wisc.edu/​multifacet/​papers/​isca13_direct_segment.pdf|Basu et al., "​Efficient virtual memroy support in big memory servers",​ ISCA 2013.]] 
 +Jiyuan'​s Paper Discussion 
 +  * **[[http://​mercury.pr.erau.edu/​~davisb22/​papers/​burst_scheduling_hpca13.pdf|Shao et al., "A Burst Scheduling Access Reordering Mechanism",​ HPCA 2007.]]** 
 +    * [[http://​users.ece.cmu.edu/​~omutlu/​pub/​dram-aware-caches-TR-HPS-2010-002.pdf|Lee et al., "​DRAM-Aware Last-Level Cache Writeback: Reducing Write-Caused Interference in Memory Systems",​ HPS 2010.]] 
 +    * [[http://​lca.ece.utexas.edu/​people/​kaseridis/​papers/​ISCA_2010.pdf]] 
 +    * [["​FIRM:​ Fair and High-Performance Memory Control for Persistent Memory Systems"​]] 
 +    * [[Adaptive History-Based Memory Schedulers]] 
 +    * [["​Self Optimizing Memory Controllers:​ A Reinforcement Learning Approach"​]] 
 +  * **[[http://​www.cs.cmu.edu/​~chensm/​LBA_reading_group/​papers/​3Ddram-isca08.pdf|Loh et al., "​3D-Stacked Memory Architectures for Multi-Core Processors",​ ISCA 2008.]]** 
 +    * [[http://​users.ece.gatech.edu/​~moin/​papers/​micro12.pdf|Qureshi et al., "​Fundamental Latency Trade-off in Architecting DRAM Caches: Outperforming Impractical SRAM-Tags with a Simple and Practical Design",​ MICRO 2012.]] 
 +    * [[http://​sampa.cs.washington.edu/​papers/​micro06_mshr.pdf|Tuck et al., "​Scalable Cache Miss Handling for High Memory Level Parallelism",​ MICRO 2006.]] 
 +    * [[http://​comparch.gatech.edu/​hparch/​papers/​sim_isca13.pdf|Sim et al., "​Resilient die-stacked DRAM caches",​ ISCA 2013.]] 
 +  * **[[http://​www.dongpingzhang.com/​wordpress/​wp-content/​uploads/​2013/​06/​MSPC6-Zhang.pdf|Zhang et al., "A New Perspective on Processing-in-memory Architecture Design",​ MSPC 2013.]]** 
 +    * [[https://​www.google.com/​url?​sa=t&​rct=j&​q=&​esrc=s&​source=web&​cd=1&​ved=0CB4QFjAA&​url=http%3A%2F%2Fisca2010.inria.fr%2Fmedia%2Fslides%2FISCA_Needle_A_0610.pptx&​ei=9kE9VMOuMtLCsATIuIGQCQ&​usg=AFQjCNGfI_qA9tHBnR8pJo50uRNYvgVEBw&​sig2=OTCWpdAXmMmekm9jun8uBg&​bvm=bv.77161500,​d.cWc&​cad=rja|Dally et al., "​Moving the needle Computer Architecture Research in Academe and Industry",​ ISCA keynote 2010.]] 
 + 
 +=== 10/7/2014 === 
 +  * **[[http://​tinker.cc.gatech.edu/​pdfs/​MICRO44_Jesse_Beu.pdf|Beu et al., "​Manager-Client Pairing: A Framework for Implementing Coherence Hierarchies",​ MICRO 2011.]]** 
 +  * **[[http://​research.cs.wisc.edu/​multifacet/​papers/​hpca14_quick_release.pdf|Hechtman et al., "Quick Release: A Throughput-oriented Approach to Release Consistency on GPUs", HPCA 2014.]]** 
 +  * **[[http://​dl.acm.org/​citation.cfm?​id=2541982|Voskuilen et al., "​High-Performance Fractal Coherence",​ ASPLOS 2014.]]** 
  
 === 10/2/2014 === === 10/2/2014 ===
Line 12: Line 75:
  
 === 10/1/2014 === === 10/1/2014 ===
-=== 9/30/2014 === 
 Amirali'​s literature survey: Amirali'​s literature survey:
  
 +=== 9/30/2014 ===
 Doru's literature survey: Doru's literature survey:
   * **[[http://​users.elis.ugent.be/​~leeckhou/​papers/​isca13.pdf|Bois et al., "​Criticality Stacks: Identifying Critical Threads in Parallel Programs using Synchronization Behavior",​ ISCA 2013.]]**   * **[[http://​users.elis.ugent.be/​~leeckhou/​papers/​isca13.pdf|Bois et al., "​Criticality Stacks: Identifying Critical Threads in Parallel Programs using Synchronization Behavior",​ ISCA 2013.]]**
   * **[[http://​www.istc-cc.cmu.edu/​publications/​papers/​2013/​joao_isca13_preprint.pdf|Joao et al., "​Utility-Based Acceleration of Multithreaded Applications on Asymmetric CMPs", ISCA 2013.]]**   * **[[http://​www.istc-cc.cmu.edu/​publications/​papers/​2013/​joao_isca13_preprint.pdf|Joao et al., "​Utility-Based Acceleration of Multithreaded Applications on Asymmetric CMPs", ISCA 2013.]]**
     * [[http://​users.elis.ugent.be/​~leeckhou/​papers/​isca12-2.pdf|Craeynest et al., "​Scheduling Heterogeneous Multi-Cores through Performance Impact Estimation (PIE)",​ ISCA 2012.]]     * [[http://​users.elis.ugent.be/​~leeckhou/​papers/​isca12-2.pdf|Craeynest et al., "​Scheduling Heterogeneous Multi-Cores through Performance Impact Estimation (PIE)",​ ISCA 2012.]]
-    * Evelyn Duesterwald,​ Calin Cascaval,​ Sandhya Dwarkadas:​ Characterizing and Predicting Program Behavior and its Variability+    * [[http://​webdocs.cs.ualberta.ca/​~amaral/​courses/​605/​papers/​DuesterwaldEtAl.pdf|Evelyn Duesterwald ​et al."Characterizing and Predicting Program Behavior and its Variability", PACT 2003.]]
   * **[[http://​hps.ece.utexas.edu/​pub/​morphcore_micro2012.pdf|Khubaib et al., "​MorphCore:​ An Energy-Efficient Microarchitecture for High Performance ILP and High Throughput TLP", ISCA 2012.]]**   * **[[http://​hps.ece.utexas.edu/​pub/​morphcore_micro2012.pdf|Khubaib et al., "​MorphCore:​ An Energy-Efficient Microarchitecture for High Performance ILP and High Throughput TLP", ISCA 2012.]]**
-    * Andrew Lukefahr,​ Shruti Padmanabha,​ Reetuparna Das,​ Faissal MSleiman,​ Ronald GDreslinski,​ Thomas FWenisch,​ Scott AMahlke: Composite Cores: Pushing Heterogeneity Into a CoreMICRO 2012 +    * [[http://​cccp.eecs.umich.edu/​papers/​lukefahr_micro12.pdf|Andrew Lukefahr et al., "Composite Cores: Pushing Heterogeneity Into a Core", ​MICRO 2012.]] 
-    * Shruti Padmanabha,​ Andrew Lukefahr,​ Reetuparna Das, Scott A. Mahlke: Trace based phase prediction for tightly-coupled heterogeneous coresMICRO 2013 +    * [[http://​cccp.eecs.umich.edu/​papers/​shrupad_micro13.pdf|Shruti Padmanabha ​et al."Trace based phase prediction for tightly-coupled heterogeneous cores", ​MICRO 2013.]] 
-    * Core-fusion +    * [[http://​m3.csl.cornell.edu/​papers/​isca07.pdf|Engin Ipek et al., "Core fusion: accommodating software diversity in chip multiprocessors",​ ISCA 2007.]] 
-    * Heterogeneous-block ​architecture +    * [[http://​users.ece.cmu.edu/​~omutlu/​pub/​heterogeneous-block-architecture_iccd14.pdf|Chris Fallin et al., "The Heterogeneous Block Architecture",​ ICCD 2014.]] 
-    * Yoga architecture+    * [[http://​hps.ece.utexas.edu/​pub/​TR-HPS-2014-001.pdf|Carlos Villavieja et al., "Yoga: A Hybrid Dynamic VLIW/OoO Processor",​ HPS Tech Report 2014.]]
  
 Yang's literature survey: Yang's literature survey:
 +  * [[http://​plaza.ufl.edu/​chaol/​File/​Enabling-HPCA-2013.pdf|Chao Li et al., "​Enabling distributed generation powered sustainable high-performance data center",​ HPCA 2013.]]
 +  * [[https://​www.usenix.org/​system/​files/​conference/​icac14/​icac14-paper-li_chao.pdf|Chao Li et al., "​Managing Green Datacenters Powered by Hybrid Renewable Energy Systems",​ ICAC 2014.]]
 +  * [[http://​ieeexplore.ieee.org/​xpls/​abs_all.jsp?​arnumber=6672933&​tag=1|Sen Li et al., "Data center power control for frequency regulation",​ PES 2014.]]
 +
  
 === 9/25/2014 === === 9/25/2014 ===
Line 35: Line 102:
 === 9/24/2014 === === 9/24/2014 ===
 Kevin'​s literature survey: Kevin'​s literature survey:
-  * [[http://​users.ece.cmu.edu/​~omutlu/​pub/​raidr-dram-refresh_isca12.pdf|Liu et al., "​RAIDR:​ Retention-Aware Intelligent DRAM Refresh",​ ISCA 2012.]] +  ​* **[[http://​users.ece.cmu.edu/​~omutlu/​pub/​raidr-dram-refresh_isca12.pdf|Liu et al., "​RAIDR:​ Retention-Aware Intelligent DRAM Refresh",​ ISCA 2012.]]** 
-  * [[http://​users.ece.cmu.edu/​~omutlu/​pub/​dram-retention-time-characterization_isca13.pdf|Liu et al., "An Experimental Study of Data Retention Behavior in Modern DRAM Devices: Implications for Retention Time Profiling Mechanisms",​ ISCA 2013.]] +    * [[http://​arch.ece.gatech.edu/​pub/​micro40.pdf|Mrinmoy Ghosh et al., "Smart Refresh: An Enhanced Memory Controller Design for Reducing Energy in Conventional and 3D Die-Stacked DRAMs",​ MICRO 2007.]] 
-  * [[http://​www.ece.cmu.edu/​~safari/​pubs/​error-mitigation-for-intermittent-dram-failures_sigmetrics14.pdf|Khan et al., "The Efficacy of Error Mitigation Techniques for DRAM Retention Failures: A Comparative Experimental Study",​ SIGMETRICS 2014.]]+    * [[http://​www.cs.utah.edu/​events/​thememoryforum/​kang.pdf|Kang et al., "​Co-Architecting Controllers and DRAM to Enhance DRAM Process Scaling",​ Memory Forum 2014.]] 
 +    * [[http://​people.engr.ncsu.edu/​ericro/​publications/​conference_HPCA-12.pdf|Ravi K. Venkatesan et al, "​Retention-aware placement in DRAM (RAPID): software methods for quasi-non-volatile DRAM", HPCA 2006.]] 
 +    * [[http://​ieeexplore.ieee.org/​xpls/​abs_all.jsp?​arnumber=4154211&​tag=1|K. Ohyu et al, "​Quantitative identification for the physical origin of variable retention time: A vacancy-oxygen complex defect model",​ IEDM 2006.]] 
 +    * [[http://​ieeexplore.ieee.org/​xpls/​abs_all.jsp?​arnumber=5970110|Heesang Kim et al., "​Characterization of the Variable Retention Time in Dynamic Random Access Memory",​ TED 2011.]] 
 +  ​* **[[http://​users.ece.cmu.edu/​~omutlu/​pub/​dram-retention-time-characterization_isca13.pdf|Liu et al., "An Experimental Study of Data Retention Behavior in Modern DRAM Devices: Implications for Retention Time Profiling Mechanisms",​ ISCA 2013.]]** 
 +  ​* **[[http://​www.ece.cmu.edu/​~safari/​pubs/​error-mitigation-for-intermittent-dram-failures_sigmetrics14.pdf|Khan et al., "The Efficacy of Error Mitigation Techniques for DRAM Retention Failures: A Comparative Experimental Study",​ SIGMETRICS 2014.]]**
  
 === 9/23/2014 === === 9/23/2014 ===
readings.txt · Last modified: 2014/12/03 21:12 by yixinluo