<span>General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems
General-purpose graphics processor architectures
✍ Scribed by Aamodt, Tor M.; Fung, Wilson Wai Lun; Rogers, Timothy G.
- Publisher
- Morgan & Claypool Publishers
- Year
- 2018
- Tongue
- English
- Leaves
- 142
- Series
- Synthesis lectures in computer architecture #44.
- Category
- Library
No coin nor oath required. For personal study only.
✦ Synopsis
Originally developed to support video games, graphics processor units (GPUs) are now increasingly used for general-purpose (non-graphics) applications ranging from machine learning to mining of cryptographic currencies. GPUs can achieve improved performance and efficiency versus central processing units (CPUs) by dedicating a larger fraction of hardware resources to computation. In addition, their general-purpose Read more...
✦ Table of Contents
- Introduction --
1.1 The landscape of computation accelerators --
1.2 GPU hardware basics --
1.3 A brief history of GPUs --
1.4 Book outline --
2. Programming model --
2.1 Execution model --
2.2 GPU instruction set architectures --
2.2.1 NVIDIA GPU instruction set architectures --
2.2.2 AMD graphics core next instruction set architecture --
3. The SIMT core: instruction and register data flow --
3.1 One-loop approximation --
3.1.1 SIMT execution masking --
3.1.2 SIMT deadlock and stackless SIMT architectures --
3.1.3 Warp scheduling --
3.2 Two-loop approximation --
3.3 Three-loop approximation --
3.3.1 Operand collector --
3.3.2 Instruction replay: handling structural hazards --
3.4 Research directions on branch divergence --
3.4.1 Warp compaction --
3.4.2 Intra-warp divergent path management --
3.4.3 Adding MIMD capability --
3.4.4 Complexity-effective divergence management --
3.5 Research directions on scalarization and affine execution --
3.5.1 Detection of uniform or affine variables --
3.5.2 Exploiting uniform or affine variables in GPU --
3.6 Research directions on register file architecture --
3.6.1 Hierarchical register file --
3.6.2 Drowsy state register file --
3.6.3 Register file virtualization --
3.6.4 Partitioned register file --
3.6.5 RegLess --
4. Memory system --
4.1 First-level memory structures --
4.1.1 Scratchpad memory and L1 data cache --
4.1.2 L1 texture cache --
4.1.3 Unified texture and data cache --
4.2 On-chip interconnection network --
4.3 Memory partition unit --
4.3.1 L2 cache --
4.3.2 Atomic operations --
4.3.3 Memory access scheduler --
4.4 Research directions for GPU memory systems --
4.4.1 Memory access scheduling and interconnection network design --
4.4.2 Caching effectiveness --
4.4.3 Memory request prioritization and cache bypassing --
4.4.4 Exploiting inter-warp heterogeneity --
4.4.5 Coordinated cache bypassing --
4.4.6 Adaptive cache management --
4.4.7 Cache prioritization --
4.4.8 Virtual memory page placement --
4.4.9 Data placement --
4.4.10 Multi-chip-module GPUs --
5. Crosscutting research on GPU computing architectures --
5.1 Thread scheduling --
5.1.1 Research on assignment of threadblocks to cores --
5.1.2 Research on cycle-by-cycle scheduling decisions --
5.1.3 Research on scheduling multiple kernels --
5.1.4 Fine-grain synchronization aware scheduling --
5.2 Alternative ways of expressing parallelism --
5.3 Support for transactional memory --
5.3.1 Kilo TM --
5.3.2 Warp TM and temporal conflict detection --
5.4 Heterogeneous systems --
Bibliography --
Authors' biographies.
✦ Subjects
Graphics processing units;Computer architecture;COMPUTERS / General;GPGPU;Computer architecture
📜 SIMILAR VOLUMES
<P>This book examines the issue of design of fully-integrated frequency synthesizers suitable for system-on-a-chip (SOC) processors. This book takes a more global design perspective in jointly examining the design space at the circuit level as well as at the architectural level. The coverage of the
This sequel to the best-selling, first volume of "GPU Gems" details the latest programming techniques for today's graphics processing units (GPUs). As GPUs find their way into mobile phones, handheld gaming devices, and consoles, GPU expertise is even more critical in today's competitive environment
Аннотация Сиквел оригинальной GPU Gems в деталях рассказывает о свежайших приемах программирования текущего поколения (прим. - на момент 6-й серии GeForce) графических процессоров. Программисты графики реального времени откроют для себя последние алгоритмы для создания усовершенствованных графически
The completely updated, illustrated bestseller on architectural graphics with over 500,000 copies sold Architectural Graphics presents a wide range of basic graphic tools and techniques designers use to communicate architectural ideas. Expanding upon the wealth of illustrations and informat
Chings classic book on communicating ideas in architectural design clearly and effectively has been updated and expanded to include a complete guide to drawing equipment and materials, discussions on arrangingarchitectural presentations, and instructions and explanations on how to freehand sketch, s