𝔖 Scriptorium
✦   LIBER   ✦

📁

Performance Analysis and Tuning for General Purpose Graphics Processing Units (Synthesis Lectures on Computer Architecture)

✍ Scribed by Hyesoon Kim, Richard Vuduc, Sara Baghsorkhi, Jee Choi, Wen-mei Hwu


Publisher
Morgan & Claypool Publishers
Year
2012
Tongue
English
Leaves
98
Series
Synthesis Lectures on Computer Architecture #20
Edition
Illustrated
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a return to wide vector units (several tens vs. 1-10), memory architectures that deliver higher peak memory bandwidth (hundreds of gigabytes per second vs. tens), and smaller caches/scratchpad memories (less than 1 megabyte vs. 1-10 megabytes). In this book, we provide a high-level overview of current GPGPU architectures and programming models. We review the principles that are used in previous shared memory parallel platforms, focusing on recent results in both the theory and practice of parallel algorithms, and suggest a connection to GPGPU platforms. We aim to provide hints to architects about understanding algorithm aspect to GPGPU. We also provide detailed performance analysis and guide optimizations from high-level algorithms to low-level instruction level optimizations. As a case study, we use n-body particle simulations known as the fast multipole method (FMM) as an example. We also briefly survey the state-of-the-art in GPU performance analysis tools and techniques. Table of Contents: GPU Design, Programming, and Trends / Performance Principles / From Principles to Practice: Analysis and Tuning / Using Detailed Performance Analysis to Guide Optimization


📜 SIMILAR VOLUMES


Introductory Tiling Theory for Computer
✍ Craig Kaplan 📂 Library 📅 2009 🏛 Morgan and Claypool Publishers 🌐 English

Tiling theory is an elegant branch of mathematics that has applications in several areas of computer science. The most immediate application area is graphics, where tiling theory has been used in the contexts of texture generation, sampling theory, remeshing, and of course the generation of decorati

GPU Gems 2: Programming Techniques for H
✍ Matt Pharr 📂 Library 📅 2005 🏛 Addison-Wesley Professional 🌐 English

This sequel to the best-selling, first volume of "GPU Gems" details the latest programming techniques for today's graphics processing units (GPUs). As GPUs find their way into mobile phones, handheld gaming devices, and consoles, GPU expertise is even more critical in today's competitive environment

GPU Gems 2: Programming Techniques for H
✍ Matt Pharr, Randima Fernando 📂 Library 📅 2005 🏛 Addison-Wesley Professional 🌐 English

Аннотация Сиквел оригинальной GPU Gems в деталях рассказывает о свежайших приемах программирования текущего поколения (прим. - на момент 6-й серии GeForce) графических процессоров. Программисты графики реального времени откроют для себя последние алгоритмы для создания усовершенствованных графически

GPU Gems 2: Programming Techniques for H
✍ Matt Pharr, Randima Fernando 📂 Library 📅 2005 🌐 English

Аннотация Сиквел оригинальной GPU Gems в деталях рассказывает о свежайших приемах программирования текущего поколения (прим. - на момент 6-й серии GeForce) графических процессоров. Программисты графики реального времени откроют для себя последние алгоритмы для создания усовершенствованных графическ

General-purpose graphics processor archi
✍ Aamodt, Tor M.; Fung, Wilson Wai Lun; Rogers, Timothy G. 📂 Library 📅 2018 🏛 Morgan & Claypool Publishers 🌐 English

Originally developed to support video games, graphics processor units (GPUs) are now increasingly used for general-purpose (non-graphics) applications ranging from machine learning to mining of cryptographic currencies. GPUs can achieve improved performance and efficiency versus central processing