This is an old revision of the document!


Literature Review

The literature survey instructions are here: pdf

Groups

Name(s) Literature Survey Papers
Jamie
Ben 1. “The Effectiveness of Multiple Hardware Contexts,” ASPLOS 1994.
2. “Fairness and Throughput in Switch on Event Multithreading,” MICRO 2006.
3. “Fast thread migration via cache working set prediction,” HPCA 2011.
- “Cache-Conscious Wavefront Scheduling,” MICRO 2012.
Donghyuk
Samihan 1. “Prefetch-Aware Shared Resource Management for Multi-Core Systems,” ISCA 2011.
2. “Feedback Directed Prefetching: Improving the Performance and Bandwidth-Efficiency of Hardware Prefetchers,” HPCA 2007.
3. “PACMan: Prefetch-Aware Cache Management for High Performance Caching,” MICRO 2011.
Rui, Tyler 1. “Fully associative software-based cache design,” ISCA 2000.
2. “The V-way cache: Demand based associativity via global replacement,” ISCA 2005.
3. “Utility-Based Cache Partitioning: A Low-Overhead, High-Performance, Runtime Mechanism to Partition Shared Caches,” MICRO 2006.
Jason, Brian 1. “Spatial memory streaming,” ISCA 2006.
2. “Feedback directed prefetching,” ISCA 2007.
3. “Interactions between compression and prefetching in chip multiprocessors,” HPCA 2007.
- “Memory-link compression schemes: a value locality perspective,” IEEE Transactions on Computers.
Hyoseung 1. “A Software Memory Partition Approach for Eliminating Bank-level Interference in Multicore System,” PACT 2012.
2. “Development and validation of a hierarchical memory model incorporating CPU- and memory-operation overlap model,” WOSP 1998.
3. “Understanding How Off-Chip Memory Bandwidth Partitioning in Chip Multiprocessors Affects System Performance,” HPCA 2010.
4. “An Analytical Performance Model for Co-Management of Last-Level Cache and Bandwidth Sharing,” MASCOTS 2011.
Joe, Paul 1. “Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow,” MICRO 2007.
2. “Dynamic Warp Subdivision for Integrated Branch and Memory Divergence Tolerance,” ISCA 2010.
3. “CAPRI: Prediction of Compaction-Adequacy for Handling Control-Divergence in GPGPU Architectures,” ISCA 2012.
- “Thread Block Compaction for Efficient SIMT Control Flow,” HPCA 2011.
- “Improving GPU Performance via Large Warps and Two-Level Warp Scheduling,” MICRO 2011.
Hongyi 1. “PatternHunter: faster and more sensitive homology search,” BioInformatics 2002.
2. “Efficient Large-Scale Sequence Comparison by Locality-Sensitive Hashing,” BioInformatics 2001.
3. “Alignment of whole genomes,” Nucl. Acids Res. 1999.
4. “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs,” Nucl. Acids Res. 1997.
Berkin 1. “Modeling critical sections in Amdahl's law and its implications for multicore design,” ISCA 2010.
2. “Amdahl's Law in the Multicore Era,” IEEE Computer 2008.
3. “Dark Silicon and the End of Multicore Scaling,” ISCA 2011.
- “Many-Core vs. Many-Thread Machines: Stay Away From the Valley,” CAL 2009.
Richard 1. “Algorithms for Constraint Satisfaction Problems: A Survey,” AI Mag. 1992.
2. “MINION: A Fast Scalable Constraint Solver,” ECAI 2006.
3. “Autotuning a Random Walk Boolean Satisfiability Solver,” ICCS 2011.