Differences

This shows you the differences between two versions of the page.

--- buzzword [2014/01/17 19:18] rachata
+++ buzzword [2015/02/04 19:23] rachata
@@ Line 1: / Line 1: @@
 ====== Buzzwords ======
-Buzzwords are terms that are mentioned during lecture which are particularly important to understand thoroughly.  This page tracks the buzzwords for each of the lectures and can be used as a reference for finding gaps in your understanding of course material.
+Buzzwords are terms that are mentioned during lecture which are particularly important to understand thoroughly. This page tracks the buzzwords for each of the lectures and can be used as a reference for finding gaps in your understanding of course material.
-===== Lecture 1 (1/13 Mon.) =====
+===== Lecture 1 (1/12 Mon.) =====
   * Level of transformation
     * Algorithm
@@ Line 10: / Line 9: @@
     * Compiler
   * Cross abstraction layers
-    * Expose an interface
   * Tradeoffs
   * Caches
-  * Multi-thread
+  * DRAM/memory controller
-  * Multi-core
+  * DRAM banks
-  * Unfairness
-  * DRAM controller/Memory controller
-  * Memory hog
   * Row buffer hit/miss
   * Row buffer locality
-  * Streaming access/ Random access
+  * Unfairness
-  * DRAM refresh
+  * Memory performance hog
-  * Retention time
+  * Shared DRAM memory system
-  * Profiling DRAM retention time
+  * Streaming access vs. random access
+  * Memory scheduling policies
+  * Scheduling priority
+  * Retention time of DRAM
+  * Process variation
+  * Retention time profile
   * Power consumption
-  * Wimpy cores
   * Bloom filter
-    * Pros/Cons
+  * Hamming code
-    * False Positive
+  * Hamming distance
-  * Simulation
+  * DRAM row hammer
-  * Memory performance attacks
-  * RTL design
-===== Lecture 2 (1/15 Wed.) =====
+===== Lecture 2 (1/14 Wed.) =====
-  * Optimizing for energy/ Optimizing for the performance
-    * Generally you should optimize for the users
-  * state-of-the-art
-  * RTL Simulation
-    * Long, slow and can be costly
-  * High level simulation
-    * What should be employed?
-	* Important to get the idea of how they are implemented in RTL
-	* Allows designer to filter out techniques that do not work well
-  * Design points
-    * Design processors to meet the design points
-  * Software stack
-  * Design decisions
-  * Datacenters
-  * MIPS R2000
-    * What are architectural techniques that improve the performance of a processor over MIPS 2000
   * Moore's Law
+  * Algorithm --> step-by-step procedure to solve a problem
   * in-order execution
   * out-of-order execution
@@ Line 65: / Line 46: @@
   * Scaling issue
     * Transistors are getting smaller
+  * Key components of a computer
+  * Design points
+    * Design processors to meet the design points
+  * Software stack
+  * Design decisions
+  * Datacenters
   * Reliability problems that cause errors
   * Analogies from Kuhn's "The Structure of Scientific Revolutions" (Recommended book)
@@ Line 73: / Line 60: @@
     * Computation
       * Communication
-        * Storage
+      * Storage
-          * DRAM
+        * DRAM
-          * NVRAM (Non-volatile memory): PCM, STT-MRAM
+        * NVRAM (Non-volatile memory): PCM, STT-MRAM
-          * Storage (Flash/Harddrive)
+        * Storage (Flash/Harddrive)
   * Von Neumann Model (Control flow model)
     * Stored program computer
-        * Properties of Von Neumann Model: Stored program, sequential instruction processing
+      * Properties of Von Neumann Model: Stored program, sequential instruction processing
-        * Unified memory
+      * Unified memory
-          * When does an instruction is being interpreted as an instruction (as oppose to a datum)?
+        * When is an instruction interpreted as an instruction (as opposed to a datum)?
-        * Program counter
+      * Program counter
-        * Examples: x86, ARM, Alpha, IBM Power series, SPARC, MIPS
+      * Examples: x86, ARM, Alpha, IBM Power series, SPARC, MIPS
   * Data flow model
     * Data flow machine
@@ Line 94: / Line 81: @@
   * Tradeoffs between control-driven and data-driven
     * Which is easier to program?
-	* Which are easy to compile?
+      * Which is easier to compile?
-	* What are more parallel (does that mean it is faster?)
+      * Which is more parallel (does that mean it is faster)?
-	* Which machines are more complex to design?
+      * Which machines are more complex to design?
     * In control flow, when a program is stopped, there is a pointer to the current state (precise state).
   * ISA vs. Microarchitecture
     * Semantics in the ISA
-	* uArch should obey the ISA
+      * uArch should obey the ISA
-	* Changing ISA is costly, can affect compatibility.
+      * Changing ISA is costly, can affect compatibility.
   * Instruction pointers
   * uArch techniques: common and powerful techniques break the Von Neumann model if done at the ISA level
@@ Line 109: / Line 96: @@
       * Out-of-order executions
       * etc.
-    * Design techniques
+        * Design techniques
-      * Adder implementation (Bit serial, ripple carry, carry lookahead)
+          * Adder implementation (Bit serial, ripple carry, carry lookahead)
-      * Connection machine (an example of a machine that use bit serial to tradeoff latency for more parallelism)
+          * Connection Machine (an example of a machine that uses bit-serial computation to trade off latency for more parallelism)
   * Microprocessor: ISA + uArch + circuits
   * What is part of the ISA? Instructions, memory, etc.
@@ Line 118: / Line 105: @@
     * Things that are not supposed to be visible to the programmer/software but typically make the processor faster, lower-power, and/or less complex
-===== Lecture 3 (1/17 Wed.) =====
+===== Lecture 3 (1/16 Fri.) =====
+   * Microarchitecture
+   * Three major tradeoffs of computer architecture
+   * Macro-architecture
+   * LC-3b ISA
+   * Unused instructions
+   * Bit steering
+   * Instruction processing style
+   * 0,1,2,3 address machines
+   * Stack machine
+   * Accumulator machine
+   * 2-operand machine
+   * 3-operand machine
+   * Tradeoffs between 0,1,2,3 address machines
+   * Postfix notation
+   * Instructions/Opcode/Operand specifiers (i.e. addressing modes)
+   * Simple vs. complex data types (and their tradeoffs)
+   * Semantic gap and level
+   * Translation layer
+   * Addressability
+   * Byte/bit addressable machines
+   * Virtual memory
+   * Big/little endian
+   * Benefits of having registers (data locality)
+   * Programmer visible (Architectural) state
+   * Programmers can access this directly
+   * What are the benefits?
+   * Microarchitectural state
+   * Programmers cannot access this directly
+   * Evolution of registers (from accumulators to registers)
+   * Different types of instructions
+   * Control instructions
+   * Data instructions
+   * Operation instructions
+   * Addressing modes
+   * Tradeoffs (complexity, flexibility, etc.)
+   * Orthogonal ISA
+   * Addressing modes that are orthogonal to instruction types
+   * I/O devices
+   * Vectored vs. non-vectored interrupts
+   * Complex vs. simple instructions
+   * Tradeoffs
+   * RISC vs. CISC
+   * Tradeoff
+   * Backward compatibility
+   * Performance
+   * Optimization opportunity
+   * Translation
+===== Lecture 4 (1/21 Wed.) =====
+  * Fixed vs. variable length instruction
+  * Huffman encoding
+  * Uniform vs. non-uniform decode
+  * Registers
+    * Tradeoffs between number of registers
+  * Alignments
+    * How does MIPS load words across the alignment boundary?
+===== Lecture 5 (1/26 Mon.) =====
+  * Tradeoffs in ISA: Instruction length
+    * Uniform vs. non-uniform
+  * Design point/Use cases
+    * What dictates the design point?
+  * Architectural states
+  * uArch
+    * How to implement the ISA in the uArch
+  * Different stages in the uArch
+  * Clock cycles
+  * Multi-cycle machine
+  * Datapath and control logic
+    * Control signals
+  * Execution time of instructions/program
+    * Metrics and what they mean
+  * Instruction processing
+    * Fetch
+    * Decode
+    * Execute
+    * Memory fetch
+    * Writeback
+  * Encoding and semantics
+  * Different types of instructions (I-type, R-type, etc.)
+  * Control flow instructions
+  * Non-control flow instructions
+  * Delayed slot/Delayed branch
+  * Single cycle control logic
+  * Lockstep
+  * Critical path analysis
+    * Critical path of a single cycle processor
+  * What is in the control signals?
+    * Combinational logic & Sequential logic
+  * Control store
+  * Tradeoffs of a single cycle uarch
+  * Design principles
+    * Common case design
+    * Critical path design
+    * Balanced designs
+    * Dynamic power/Static power
+      * Increases in power due to frequency
+===== Lecture 6 (1/28 Wed.) =====
+  * Design principles
+    * Common case design
+    * Critical path design
+    * Balanced designs
+  * Multi cycle design
+  * Microcoded/Microprogrammed machines
+    * States
+    * Transition from one state to another
+    * Microinstructions
+    * Microsequencing
+    * Control store - Produces control signals
+    * Microsequencer
+    * Control signal
+      * What do they have to control?
+  * Instruction processing cycle
+  * Latch signals
+  * State machine
+  * State variables
+  * Condition code
+  * Steering bits
+  * Branch enable logic
+  * Difference between gating and loading? (write enable vs. driving the bus)
+  * Memory mapped I/O
+  * Hardwired logic
+    * What control signals come from hardwired logic?
+  * Variable latency memory
+  * Handling interrupts
+  * Difference between interrupts and exceptions
+  * Emulator (i.e. uCode allows a minimal datapath to emulate the ISA)
+  * Updating machine behavior
+  * Horizontal microcode
+  * Vertical microcode
+  * Primitives
+===== Lecture 7 (1/30 Fri.) =====
+  * Emulator (i.e. uCode allows a minimal datapath to emulate the ISA)
+  * Updating machine behavior
+  * Horizontal microcode
+  * Vertical microcode
+  * Primitives
+  * nanocode and millicode
+    * what are the differences between nano/milli/microcode
+  * microprogrammed vs. hardwire control
+  * Pipelining
+  * Limitations of the multi-cycle design
+    * Idle resources
+  * Throughput of a pipelined design
+    * What dictates the throughput of a pipelined design?
+  * Latency of the pipelined design
+  * Dependency
+  * Overhead of pipelining
+    * Latch cost?
+  * Data forwarding/bypassing
+  * What is the ideal pipeline?
+  * External fragmentation
+  * Issues in pipeline designs
+    * Stalling
+      * Dependency (Hazard)
+        * Flow dependence
+        * Output dependence
+        * Anti dependence
+        * How to handle them?
+    * Resource contention
+    * Keeping the pipeline full
+    * Handling exception/interrupts
+    * Pipeline flush
+    * Speculation
+===== Lecture 8 (2/2 Mon.) =====
-  * Design tradeoffs
+  * Interlocking
-  * Macro Architectures
+  * Multipath execution
-  * Reconfiguribility vs. specialized designs
+  * Fine grain multithreading
-  * Parallelism (instructions, data parallel)
+  * No-op (Bubbles in the pipeline)
-  * Uniform decode (Example: Alpha)
+  * Valid bits in the instructions
-  * Steering bits (Sub-opcode)
+  * Branch prediction
-  * 0,1,2,3 address machines
+  * Different types of data dependence
-    * Stack machine
+  * Pipeline stalls
-    * Accumulator machine
+    * bubbles
-    * 2-operand machine
+    * How to handle stalls
-    * 3-operand machine
+    * Stall conditions
-    * Tradeoffs between 0,1,2,3 address machines
+    * Stall signals
-  * Instructions/Opcode/Operade specifiers (i.e. addressing modes)
+    * Dependences
-  * Simply vs. complex data type (and their tradeoffs)
+      * Distance between dependences
-  * Semantic gap
+    * Data forwarding/bypassing
-  * Translation layer
+    * Maintaining the correct dataflow
-  * Addressability
+  * Different ways to design data forwarding path/logic
-  * Byte/bit addressable machines
+  * Different techniques to handle interlocking
-  * Virtual memory
+    * SW based
-  * Big/little endian
+    * HW based
-  * Benefits of having registers (data locality)
+  * Profiling
-  * Programmer visible (Architectural) state
+    * Static profiling
-    * Programmers can access this directly
+    * Helps from the software (compiler)
-    * What are the benefits?
+      * Superblock optimization
-  * Microarchitectural state
+      * Analyzing basic blocks
-    * Programmers cannot access this directly
+  * How to deal with branches?
-  * Evolution of registers (from accumulators to registers)
+    * Branch prediction
-  * Different types of instructions
+    * Delayed branching (branch delay slot)
-    * Control instructions
+    * Forward control flow/backward control flow
-    * Data instructions
+    * Branch prediction accuracy
-    * Operation instructions
+  * Profile guided code positioning
-  * Addressing modes
+    * Position the code based on the profile info
-    * Tradeoffs (complexity, flexibility, etc.)
+    * Try to make the next sequential instruction be the next instruction to be executed
-  * Orthogonal ISA
+  * Predicate combining (combine predicate for a branch instruction)
-    * Addressing modes that are orthogonal to instructino types
+  * Predicated execution (control dependence becomes data dependence)
-  * Vectors vs. non vectored interrupts
-  * Complex vs. simple instructions
-    * Tradeoffs
+===== Lecture 9 (2/4 Wed.) =====
-  * RISC vs. CISC
-    * Tradeoff
+  * Predicate combining (combine predicate for a branch instruction)
-    * Backward compatibility
+  * Predicated execution (control dependence becomes data dependence)
-    * Performance
+  * Definition of basic blocks
-    * Optimization opportunity
+  * Control flow graph
+  * Delayed branching
+    * Benefit?
+    * What does it eliminate?
+    * Downside?
+    * Delayed branching in SPARC (with squashing)
+    * Backward compatibility with the delayed slot
+    * What should be filled in the delayed slot
+    * How to ensure correctness
+  * Fine-grained multithreading
+    * fetch from different threads
+    * What are the issues (what if the program doesn't have many threads)
+    * CDC 6000
+    * Denelcor HEP
+    * No dependency checking
+    * Instructions from different threads can fill in the bubbles
+    * Cost?
+  * Simultaneous multithreading
+  * Branch prediction
+    * Guess what to fetch next.
+    * Misprediction penalty
+    * Need to guess the direction and target
+    * How to perform the performance analysis?
+      * Given the branch prediction accuracy and penalty cost, how to compute the cost of a branch misprediction.
+      * Given the program/number of instructions, percent of branches, branch prediction accuracy and penalty cost, how to compute the total cost of branch mispredictions.
+        * How many extra instructions are being fetched?
+        * What is the performance degradation?
+    * How to reduce the miss penalty?
+    * Predicting the next address (non PC+4 address)
+    * Branch target buffer (BTB)
+      * Predicting the address of the branch
+    * Global branch history - for directions
+    * Can use compiler to profile and get more info
+      * Input set dictates the accuracy
+      * Add time to compilation
+    * Heuristics that are common and don't require profiling.
+      * Might be inaccurate
+      * Does not require profiling
+    * Static branch prediction
+      * Programmer provides pragmas, hinting the likelihood of taken/not taken branch
+      * For example, x86 has the hint bit
+    * Dynamic branch prediction
+      * Last time predictor
+      * Two-bit counter based prediction
+        * One more bit for hysteresis
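
The dynamic predictors and the misprediction-cost questions listed for Lecture 9 can be illustrated with a small sketch. This is not course material: the function names, the loop-branch trace, and the example numbers (base CPI of 1.0, 20% branches, 20-cycle penalty) are all assumptions chosen for illustration. The sketch models a single branch with a last-time predictor and a two-bit saturating counter, then plugs an accuracy into the common first-order estimate CPI = base CPI + branch fraction x (1 - accuracy) x penalty.

```python
def last_time_accuracy(outcomes, initial_prediction=True):
    """Last-time predictor: predict whatever the branch did last time."""
    if not outcomes:
        raise ValueError("trace must contain at least one branch outcome")
    prediction = initial_prediction
    correct = 0
    for taken in outcomes:
        if prediction == taken:
            correct += 1
        prediction = taken  # remember only the most recent outcome
    return correct / len(outcomes)


def two_bit_accuracy(outcomes, initial_state=2):
    """Two-bit saturating counter: states 0-1 predict not taken, 2-3 predict taken.

    The extra bit adds hysteresis: one surprising outcome moves the counter
    by one step but does not necessarily flip the prediction.
    """
    if not outcomes:
        raise ValueError("trace must contain at least one branch outcome")
    state = initial_state
    correct = 0
    for taken in outcomes:
        if (state >= 2) == taken:
            correct += 1
        # Saturate at 0 and 3 instead of wrapping around.
        state = min(state + 1, 3) if taken else max(state - 1, 0)
    return correct / len(outcomes)


def cpi_with_mispredictions(base_cpi, branch_fraction, accuracy, penalty_cycles):
    """First-order CPI estimate: every mispredicted branch costs penalty_cycles."""
    return base_cpi + branch_fraction * (1.0 - accuracy) * penalty_cycles


if __name__ == "__main__":
    # Hypothetical trace: a loop branch taken 9 times, then not taken once,
    # with the whole loop executed 10 times (100 dynamic branches).
    trace = ([True] * 9 + [False]) * 10
    for name, predictor in [("last-time", last_time_accuracy),
                            ("two-bit", two_bit_accuracy)]:
        accuracy = predictor(trace)
        cpi = cpi_with_mispredictions(1.0, 0.2, accuracy, 20)
        print(f"{name}: accuracy={accuracy:.2f}, estimated CPI={cpi:.2f}")
```

On this assumed trace the last-time predictor gets 81 of 100 branches right, because it mispredicts both the loop exit and the first iteration of the next loop execution, while the two-bit counter gets 90 of 100 right, because the hysteresis bit absorbs the single not-taken outcome. Real predictors keep one such counter per table entry indexed by branch address; that indexing is omitted here.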

18-447 Introduction to Computer Architecture – Spring 2015
