buzzword [2014/03/24 18:16] rachata
buzzword [2014/03/31 18:15] rachata
  * Caches missed cache blocks
  * Prevents ping-ponging
  * Pseudo associativity
    * Simpler way to implement an associative cache
  * Skewed assoc. cache
  * What information goes into the MSHR?
  * When do you access the MSHR?


===== Lecture 22 (3/26 Wed.) =====


  * Multi-porting
    * Virtual multi-porting
      * Time-shares the port; not very scalable, but cheap
    * True multiporting
    * Multiple cache copies
  * Banking
    * Can have bank conflicts
    * Needs extra interconnect across banks
    * Address mapping can mitigate bank conflicts
    * Common in main memory (note that the register file in a GPU is also banked, but mainly for the purpose of reducing complexity)
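The banking bullets above can be made concrete with a small sketch. This is a made-up example (the bank count, block size, and XOR mapping are assumptions, not from the notes) showing how the choice of bank-index bits decides whether a strided access stream serializes on one bank or spreads across all of them:

```python
NUM_BANKS = 8          # assumed bank count (power of two)
BLOCK_BITS = 6         # assumed 64-byte cache blocks

def bank_low_bits(addr):
    # Naive mapping: bank index = the low-order bits just above the block offset.
    return (addr >> BLOCK_BITS) & (NUM_BANKS - 1)

def bank_xor(addr):
    # XOR-based mapping: fold higher address bits into the bank index so
    # power-of-two strides spread across banks instead of hammering one.
    low = (addr >> BLOCK_BITS) & (NUM_BANKS - 1)
    high = (addr >> (BLOCK_BITS + 3)) & (NUM_BANKS - 1)
    return low ^ high

# A stride of NUM_BANKS * block size hits a single bank under the naive
# mapping (every access conflicts) but all banks under the XOR mapping.
stride = NUM_BANKS << BLOCK_BITS
addrs = [i * stride for i in range(8)]
print({bank_low_bits(a) for a in addrs})  # one bank -> accesses serialized
print({bank_xor(a) for a in addrs})       # eight banks -> accesses overlap
```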
  * Accessing DRAM
    * Row bits
    * Column bits
    * Addressability
    * DRAM has its own clock
  * DRAM (2T) vs. SRAM (6T)
    * Cost
    * Latency
  * Interleaving in DRAM
    * Effect of the address mapping on memory interleaving
    * Effect of the program's memory access patterns on interleaving
  * DRAM bank
    * Minimizes the cost of interleaving (banks share the data bus and the command bus)
  * DRAM rank
    * Minimizes chip cost (a bundle of chips operated together)
  * DRAM channel
    * An interface to DRAM, each with its own ranks/banks
  * DIMM
    * More DIMMs add interconnect complexity
  * List of commands to read/write data in DRAM
    * Activate -> read/write -> precharge
    * Activate moves data into the row buffer
    * Precharge prepares the bank for the next access
  * Row buffer hit
  * Row buffer conflict
  * Scheduling memory requests to lower row conflicts
  * Burst mode of DRAM
    * Prefetch 32 bits over an 8-bit interface if DRAM needs to read 32 bits
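The activate -> read/write -> precharge sequence and the row buffer hit vs. conflict cases above can be sketched as a toy latency model. The cycle counts are made up for illustration, not real datasheet timings:

```python
T_ACT, T_RW, T_PRE = 15, 15, 15   # assumed timings (illustrative only)

class Bank:
    def __init__(self):
        self.open_row = None   # row currently latched in the row buffer

    def access(self, row):
        if self.open_row == row:       # row buffer hit: just read/write
            return T_RW
        if self.open_row is None:      # bank is precharged: activate first
            self.open_row = row
            return T_ACT + T_RW
        # Row buffer conflict: precharge the old row, then activate the new one.
        self.open_row = row
        return T_PRE + T_ACT + T_RW

bank = Bank()
print(bank.access(5))  # 30: activate + read
print(bank.access(5))  # 15: row buffer hit
print(bank.access(9))  # 45: conflict -> precharge + activate + read
```

This is why schedulers try to lower row conflicts: a conflict pays all three steps, a hit pays only one.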
  * Address mapping
    * Row interleaved
    * Cache block interleaved
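A sketch of the two address mappings, with assumed field widths (64-byte blocks, 8 banks, 16 blocks per row; none of these numbers come from the notes). Row interleaving keeps a whole row in one bank (good row buffer locality); cache block interleaving sends consecutive blocks to consecutive banks (good bank-level parallelism):

```python
BLOCK_BITS, COL_BITS, BANK_BITS = 6, 4, 3   # assumed widths

def decode_row_interleaved(addr):
    # [row | bank | column | block offset]
    col  = (addr >> BLOCK_BITS) & ((1 << COL_BITS) - 1)
    bank = (addr >> (BLOCK_BITS + COL_BITS)) & ((1 << BANK_BITS) - 1)
    row  =  addr >> (BLOCK_BITS + COL_BITS + BANK_BITS)
    return row, bank, col

def decode_block_interleaved(addr):
    # [row | column | bank | block offset]
    bank = (addr >> BLOCK_BITS) & ((1 << BANK_BITS) - 1)
    col  = (addr >> (BLOCK_BITS + BANK_BITS)) & ((1 << COL_BITS) - 1)
    row  =  addr >> (BLOCK_BITS + BANK_BITS + COL_BITS)
    return row, bank, col

# Two consecutive cache blocks:
a, b = 0x0000, 0x0040
print(decode_row_interleaved(a), decode_row_interleaved(b))     # same bank
print(decode_block_interleaved(a), decode_block_interleaved(b)) # banks 0 and 1
```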
  * Memory controller
    * Sends DRAM commands
    * Periodically sends commands to refresh DRAM cells
      * Ensures correctness and data integrity
    * Where to place the memory controller
      * On the CPU chip vs. at the main memory
        * Higher BW on-chip
    * Determines the order in which requests are serviced in DRAM
      * Request queues that hold requests
      * Sends a request whenever it can be issued to the bank
      * Determines which command (across banks) should be sent to DRAM
      * Priority of demand vs. prefetch requests
  * Memory scheduling policies
    * FCFS
    * FR-FCFS
    * Capped FR-FCFS: FR-FCFS with a timeout
    * Usually done at the command level (read/write commands and precharge/activate commands)
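The scheduling policies above can be sketched at the request level (a simplification of the command-level reality the last bullet notes; the queue format and cap value are made up). FR-FCFS prefers row buffer hits, breaking ties by age; the capped variant falls back to pure FCFS once the oldest request has waited past a timeout:

```python
def pick_fr_fcfs(queue, open_row, now, cap=None):
    # queue: list of (arrival_time, row); smaller arrival_time = older
    oldest = min(queue, key=lambda r: r[0])
    if cap is not None and now - oldest[0] > cap:
        return oldest                  # capped: stop starving the oldest request
    hits = [r for r in queue if r[1] == open_row]
    # First ready (row hits) first; among them, first come first served.
    return min(hits, key=lambda r: r[0]) if hits else oldest

q = [(0, 3), (1, 7), (2, 7)]   # (arrival, row)
print(pick_fr_fcfs(q, open_row=7, now=5))           # (1, 7): oldest row hit wins
print(pick_fr_fcfs(q, open_row=7, now=50, cap=10))  # (0, 3): cap expired, FCFS
```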


===== Lecture 23 (3/28 Fri.) =====


  * DRAM design choices
    * Cost/density/latency/BW/yield
  * Sense amplifier
    * How does it work?
  * Dual data rate
  * Subarray
  * RowClone
    * Moving bulk data from one row to another
    * Lowers latency and bandwidth consumption when copying or zeroing out data
  * TL-DRAM
    * Far segment
    * Near segment
    * What causes the long latency?
    * Benefits of TL-DRAM
    * TL-DRAM vs. DRAM cache (adding a small cache in DRAM)



===== Lecture 24 (3/31 Mon.) =====


  * Memory controller
    * Different commands
  * Memory scheduler
    * Determines the order of requests issued to DRAM
      * Age/hit-miss status/type (load/store/prefetch, from GPU/from CPU)/criticality
  * Row buffer
    * Hit/conflict
    * Open/closed row
      * Open row policy
      * Closed row policy
    * Tradeoffs between open and closed row policies
      * If the program has high row buffer locality, an open row policy might benefit more
      * A closed row policy services miss requests faster
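The open vs. closed row tradeoff can be shown with the same kind of made-up timings (illustrative numbers, not from the notes). Open row keeps the row latched and gambles on the next access hitting it; closed row precharges eagerly, so the next access never pays a conflict but always pays an activate:

```python
T_ACT, T_RW, T_PRE = 15, 15, 15  # assumed timings (illustrative only)

def open_row_latency(next_is_same_row):
    # Open row policy: keep the row latched after each access.
    return T_RW if next_is_same_row else T_PRE + T_ACT + T_RW

def closed_row_latency(next_is_same_row):
    # Closed row policy: precharge right after the access, so the next
    # request pays activate + read/write regardless of which row it targets.
    return T_ACT + T_RW

print(open_row_latency(True), closed_row_latency(True))    # 15 vs 30: open wins on hits
print(open_row_latency(False), closed_row_latency(False))  # 45 vs 30: closed wins on misses
```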
  * Bank conflict
  * Interference from different applications/threads
    * Different programs/processes/threads interfere with each other
      * Introduces more row buffer/bank conflicts
    * The memory scheduler has to manage this interference
      * Memory hog problem
    * Interference in the data/command bus
  * FR-FCFS
    * Why does FR-FCFS make sense?
      * Row buffer hits have lower latency
    * Issues with FR-FCFS
      * Unfairness
  * STFM
    * Fairness issues in memory scheduling
    * How does STFM calculate fairness and slowdown?
    * How to estimate a thread's run time when it runs alone
    * Definition of fairness (based on STFM; different papers/areas define fairness differently)
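A sketch of the STFM-style metrics named above (the numbers are made up): slowdown is a thread's shared-mode runtime over its alone-mode runtime, and unfairness is the ratio of the worst slowdown to the best. The hard part STFM actually addresses, estimating the alone runtime while the thread runs shared, is not modeled here:

```python
def slowdown(t_shared, t_alone):
    # How much longer the thread takes when sharing memory vs. running alone.
    return t_shared / t_alone

def unfairness(slowdowns):
    # Max-over-min slowdown ratio: 1.0 means perfectly fair.
    return max(slowdowns) / min(slowdowns)

# Two threads; the "alone" times must be estimated at runtime, since a
# thread never actually runs by itself in a shared system.
s = [slowdown(300, 100), slowdown(120, 100)]
print(s, unfairness(s))  # [3.0, 1.2] 2.5 -> the first thread is the victim
```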
  * PAR-BS
    * Parallelism in programs
    * Interference across banks
    * How to form a batch
    * How to determine ranking between batches/within a batch
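A rough sketch of the PAR-BS ideas above, with assumed details (the per-thread-per-bank cap and the lightest-thread-first ranking heuristic are illustrative choices, not the exact paper mechanism): form a batch by marking the oldest few requests of each thread in each bank, then rank threads within the batch so light threads go first:

```python
from collections import defaultdict

def form_batch(queue, cap=2):
    # queue: list of (thread, bank, arrival); mark up to `cap` oldest
    # requests per (thread, bank) pair. Marked requests are serviced
    # before unmarked ones, bounding how long any thread can be starved.
    per = defaultdict(int)
    marked = []
    for req in sorted(queue, key=lambda r: r[2]):   # oldest first
        t, b, _ = req
        if per[(t, b)] < cap:
            per[(t, b)] += 1
            marked.append(req)
    return marked

def rank_threads(marked):
    # Within the batch, rank threads with fewer marked requests first
    # (a shortest-job-first style heuristic that preserves each thread's
    # bank-level parallelism by servicing its requests together).
    load = defaultdict(int)
    for t, _, _ in marked:
        load[t] += 1
    return sorted(load, key=lambda t: load[t])

q = [(0, 0, 0), (0, 0, 1), (0, 0, 2), (1, 1, 3)]
batch = form_batch(q)
print(batch)                # thread 0 is capped at 2 requests in bank 0
print(rank_threads(batch))  # [1, 0]: thread 1 has the lighter batch
```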