𝔖 Scriptorium
✦   LIBER   ✦

General-purpose graphics processor architectures

✍ Scribed by Aamodt, Tor M.; Fung, Wilson Wai Lun; Rogers, Timothy G.


Publisher
Morgan & Claypool Publishers
Year
2018
Tongue
English
Leaves
142
Series
Synthesis Lectures on Computer Architecture #44
Category
Library


✦ Synopsis


Originally developed to support video games, graphics processor units (GPUs) are now increasingly used for general-purpose (non-graphics) applications ranging from machine learning to mining of cryptographic currencies. GPUs can achieve improved performance and efficiency versus central processing units (CPUs) by dedicating a larger fraction of hardware resources to computation. In addition, their general-purpose ...

✦ Table of Contents


  1. Introduction
    1.1 The landscape of computation accelerators
    1.2 GPU hardware basics
    1.3 A brief history of GPUs
    1.4 Book outline
  2. Programming model
    2.1 Execution model
    2.2 GPU instruction set architectures
      2.2.1 NVIDIA GPU instruction set architectures
      2.2.2 AMD graphics core next instruction set architecture
  3. The SIMT core: instruction and register data flow
    3.1 One-loop approximation
      3.1.1 SIMT execution masking
      3.1.2 SIMT deadlock and stackless SIMT architectures
      3.1.3 Warp scheduling
    3.2 Two-loop approximation
    3.3 Three-loop approximation
      3.3.1 Operand collector
      3.3.2 Instruction replay: handling structural hazards
    3.4 Research directions on branch divergence
      3.4.1 Warp compaction
      3.4.2 Intra-warp divergent path management
      3.4.3 Adding MIMD capability
      3.4.4 Complexity-effective divergence management
    3.5 Research directions on scalarization and affine execution
      3.5.1 Detection of uniform or affine variables
      3.5.2 Exploiting uniform or affine variables in GPU
    3.6 Research directions on register file architecture
      3.6.1 Hierarchical register file
      3.6.2 Drowsy state register file
      3.6.3 Register file virtualization
      3.6.4 Partitioned register file
      3.6.5 RegLess
  4. Memory system
    4.1 First-level memory structures
      4.1.1 Scratchpad memory and L1 data cache
      4.1.2 L1 texture cache
      4.1.3 Unified texture and data cache
    4.2 On-chip interconnection network
    4.3 Memory partition unit
      4.3.1 L2 cache
      4.3.2 Atomic operations
      4.3.3 Memory access scheduler
    4.4 Research directions for GPU memory systems
      4.4.1 Memory access scheduling and interconnection network design
      4.4.2 Caching effectiveness
      4.4.3 Memory request prioritization and cache bypassing
      4.4.4 Exploiting inter-warp heterogeneity
      4.4.5 Coordinated cache bypassing
      4.4.6 Adaptive cache management
      4.4.7 Cache prioritization
      4.4.8 Virtual memory page placement
      4.4.9 Data placement
      4.4.10 Multi-chip-module GPUs
  5. Crosscutting research on GPU computing architectures
    5.1 Thread scheduling
      5.1.1 Research on assignment of threadblocks to cores
      5.1.2 Research on cycle-by-cycle scheduling decisions
      5.1.3 Research on scheduling multiple kernels
      5.1.4 Fine-grain synchronization aware scheduling
    5.2 Alternative ways of expressing parallelism
    5.3 Support for transactional memory
      5.3.1 Kilo TM
      5.3.2 Warp TM and temporal conflict detection
    5.4 Heterogeneous systems
  Bibliography
  Authors' biographies

✦ Subjects


Graphics processing units; Computer architecture; COMPUTERS / General; GPGPU


📜 SIMILAR VOLUMES


Performance Analysis and Tuning for Gene
✍ Hyesoon Kim, Richard Vuduc, Sara Baghsorkhi, Jee Choi, Wen-mei Hwu 📂 Library 📅 2012 🏛 Morgan & Claypool Publishers 🌐 English

General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems

Clock Generators for SOC Processors: Cir
✍ Amr Fahim 📂 Library 📅 2005 🏛 Springer 🌐 English

This book examines the issue of design of fully-integrated frequency synthesizers suitable for system-on-a-chip (SOC) processors. This book takes a more global design perspective in jointly examining the design space at the circuit level as well as at the architectural level. The coverage of the

GPU Gems 2: Programming Techniques for H
✍ Matt Pharr 📂 Library 📅 2005 🏛 Addison-Wesley Professional 🌐 English

This sequel to the best-selling, first volume of "GPU Gems" details the latest programming techniques for today's graphics processing units (GPUs). As GPUs find their way into mobile phones, handheld gaming devices, and consoles, GPU expertise is even more critical in today's competitive environment

GPU Gems 2: Programming Techniques for H
✍ Matt Pharr, Randima Fernando 📂 Library 📅 2005 🏛 Addison-Wesley Professional 🌐 English

Abstract: The sequel to the original GPU Gems describes in detail the latest programming techniques for the current generation of graphics processors (note: as of the GeForce 6 series). Real-time graphics programmers will discover the latest algorithms for creating advanced graphical

Architectural Graphics
✍ Francis D. K. Ching 📂 Library 📅 2002 🏛 Wiley 🌐 English

The completely updated, illustrated bestseller on architectural graphics with over 500,000 copies sold. Architectural Graphics presents a wide range of basic graphic tools and techniques designers use to communicate architectural ideas. Expanding upon the wealth of illustrations and informat

Architectural Graphics
✍ Francis D. Ching 📂 Library 📅 1985 🏛 Van Nostrand Reinhold Company 🌐 English

Ching's classic book on communicating ideas in architectural design clearly and effectively has been updated and expanded to include a complete guide to drawing equipment and materials, discussions on arranging architectural presentations, and instructions and explanations on how to freehand sketch, s