Differences
This shows you the differences between two versions of the page.
readings [2010/11/12 23:56] vseshadr |
readings [2010/12/04 06:00] (current) vseshadr |
||
---|---|---|---|
Line 182: | Line 182: | ||
===== For Lecture 22 ===== | ===== For Lecture 22 ===== | ||
Same as previous lecture | Same as previous lecture | ||
+ | |||
+ | ===== For Lecture 23 ===== | ||
+ | Same as previous lecture | ||
+ | |||
+ | ===== For Lecture 24 ===== | ||
+ | == Required Readings == | ||
+ | * {{conbiningbranchpredictors.pdf|McFarling, "Combining Branch Predictors," DEC WRL TR, 1993}} | ||
+ | * {{increasingprocessorperformance.pdf|Carmean and Sprangle, "Increasing Processor Performance by Implementing Deeper Pipelines," ISCA 2002}} | ||
+ | |||
+ | == Recommended Readings == | ||
+ | * {{analysisofcorrelationandpredictability.pdf|Evers et al., "An Analysis of Correlation and Predictability: What Makes Two-Level Branch Predictors Work," ISCA 1998}} | ||
+ | * {{alternativeimplementationoftwolevelbp.pdf|Yeh and Patt, "Alternative Implementations of Two-Level Adaptive Branch Prediction," ISCA 1992}} | ||
+ | * {{availableilpforsuperscalar.pdf|Jouppi and Wall, "Available instruction-level parallelism for superscalar and superpipelined machines," ASPLOS 1989}} | ||
+ | * {{divergemergeprocessors.pdf|Kim et al., "Diverge-Merge Processor (DMP): Dynamic Predicated Execution of Complex Control-Flow Graphs Based on Frequently Executed Paths," MICRO 2006}} | ||
+ | * {{dynamicbranchpredictionwithperceptrons.pdf|Jimenez and Lin, "Dynamic Branch Prediction with Perceptrons," HPCA 2001}} | ||
+ | |||
+ | ===== For Lecture 25 ===== | ||
+ | Same as previous lecture | ||
+ | |||
+ | ===== For Lecture 26 ===== | ||
+ | |||
+ | === Control Flow III === | ||
+ | |||
+ | == Recommended Readings == | ||
+ | * {{wishbranches.pdf|Kim et al., "Wish Branches: Enabling Adaptive and Aggressive Predicated Execution," IEEE Micro Top Picks, Jan/Feb 2006}} | ||
+ | * {{divergemergeprocessors.pdf|Kim et al., "Diverge-Merge Processor: Generalized and Energy-Efficient Dynamic Predication," IEEE Micro Top Picks, Jan/Feb 2007}} | ||
+ | |||
+ | === Alternative Approaches to Concurrency === | ||
+ | == Required Readings == | ||
+ | * {{vliweli.pdf|Fisher, "Very Long Instruction Word architectures and the ELI-512," ISCA 1983}} | ||
+ | * {{introducingia64.pdf|Huck et al., "Introducing the IA-64 Architecture," IEEE Micro 2000}} | ||
+ | |||
+ | == Recommended Readings == | ||
+ | * {{cray1computersystem.pdf|Russell, "The CRAY-1 computer system," CACM 1978}} | ||
+ | * {{ilpprocessing.pdf|Rau and Fisher, "Instruction-level parallel processing: history,overview, and perspective," Journal of Supercomputing, 1993}} | ||
+ | * {{instructionschedulingforilpprocessors.pdf|Faraboschi et al., "Instruction Scheduling for Instruction Level Parallel Processors," Proc. IEEE, Nov. 2001}} | ||
+ | |||
+ | ===== For Lecture 26 ===== | ||
+ | Same as previous lecture (Alternative Approaches to Concurrency) | ||
+ | |||
+ | ===== For Lecture 27 ===== | ||
+ | == Required Readings == | ||
+ | * {{nvidiatesla.pdf|Lindholm et al., "NVIDIA Tesla: A Unified Graphics and Computing Architecture," IEEE Micro 2008}} | ||
+ | * {{cray1computersystem.pdf|Russell, "The CRAY-1 computer system," CACM 1978}} | ||
+ | |||
+ | == Recommended Readings == | ||
+ | * {{dynamicwarpformation.pdf|Fung et al., "Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow," MICRO 2007}} | ||
+ | * {{qilin.pdf|Luk et al., "Qilin: Exploiting Parallelism on Heterogeneous Multiprocessors with Adaptive Mapping," MICRO 2009}} |