Materials
Resources
This is an old revision of the document!
Buzzwords are terms that are mentioned during lecture which are particularly important to understand thoroughly. This page tracks the buzzwords for each of the lectures and can be used as a reference for finding gaps in your understanding of course material.
* Predicate combining (combine predicate for a branch instruction) * Predicated execution (control dependence becomes data dependence) * Definition of basic blocks * Control flow graph * Delayed branching * benefit? * What does it eliminates? * downside? * Delayed branching in SPARC (with squashing) * Backward compatibility with the delayed slot * What should be filled in the delayed slot * How to ensure correctness * Fine-grained multithreading * fetch from different threads * What are the issues (what if the program doesn't have many threads) * CDC 6000 * Denelcor HEP * No dependency checking * Inst. from different thread can fill-in the bubbles * Cost? * Simulteneuos multithreading * Branch prediction * Guess what to fetch next. * Misprediction penalty * Need to guess the direction and target * How to perform the performance analysis? * Given the branch prediction accuracy and penalty cost, how to compute a cost of a branch misprediction. * Given the program/number of instructions, percent of branches, branch prediction accuracy and penalty cost, how to compute a cost coming from branch mispredictions. * How many extra instructions are being fetched? * What is the performance degredation? * How to reduce the miss penalty? * Predicting the next address (non PC+4 address) * Branch target buffer (BTB) * Predicting the address of the branch * Global branch history - for directions * Can use compiler to profile and get more info * Input set dictacts the accuracy * Add time to compilation * Heuristics that are common and doesn't require profiling. * Might be inaccurate * Does not require profiling * Static branch prediction * Pregrammer provides pragmas, hinting the likelihood of taken/not taken branch * For example, x86 has the hint bit * Dynamic branch prediction * Last time predictor * Two bits counter based prediction * One more bit for hysteresis