Tiling theory is an elegant branch of mathematics that has applications in several areas of computer science. The most immediate application area is graphics, where tiling theory has been used in the contexts of texture generation, sampling theory, remeshing, and of course the generation of decorati
Performance Analysis and Tuning for General Purpose Graphics Processing Units (Synthesis Lectures on Computer Architecture)
✍ Scribed by Hyesoon Kim, Richard Vuduc, Sara Baghsorkhi, Jee Choi, Wen-mei Hwu
- Publisher
- Morgan & Claypool Publishers
- Year
- 2012
- Tongue
- English
- Leaves
- 98
- Series
- Synthesis Lectures on Computer Architecture #20
- Edition
- Illustrated
- Category
- Library
No coin nor oath required. For personal study only.
✦ Synopsis
General-purpose graphics processing units (GPGPU) have emerged as an important class of shared memory parallel processing architectures, with widespread deployment in every computer class from high-end supercomputers to embedded mobile platforms. Relative to more traditional multicore systems of today, GPGPUs have distinctly higher degrees of hardware multithreading (hundreds of hardware thread contexts vs. tens), a return to wide vector units (several tens vs. 1-10), memory architectures that deliver higher peak memory bandwidth (hundreds of gigabytes per second vs. tens), and smaller caches/scratchpad memories (less than 1 megabyte vs. 1-10 megabytes). In this book, we provide a high-level overview of current GPGPU architectures and programming models. We review the principles that are used in previous shared memory parallel platforms, focusing on recent results in both the theory and practice of parallel algorithms, and suggest a connection to GPGPU platforms. We aim to provide hints to architects about understanding algorithm aspect to GPGPU. We also provide detailed performance analysis and guide optimizations from high-level algorithms to low-level instruction level optimizations. As a case study, we use n-body particle simulations known as the fast multipole method (FMM) as an example. We also briefly survey the state-of-the-art in GPU performance analysis tools and techniques. Table of Contents: GPU Design, Programming, and Trends / Performance Principles / From Principles to Practice: Analysis and Tuning / Using Detailed Performance Analysis to Guide Optimization
📜 SIMILAR VOLUMES
This sequel to the best-selling, first volume of "GPU Gems" details the latest programming techniques for today's graphics processing units (GPUs). As GPUs find their way into mobile phones, handheld gaming devices, and consoles, GPU expertise is even more critical in today's competitive environment
Аннотация Сиквел оригинальной GPU Gems в деталях рассказывает о свежайших приемах программирования текущего поколения (прим. - на момент 6-й серии GeForce) графических процессоров. Программисты графики реального времени откроют для себя последние алгоритмы для создания усовершенствованных графически
Аннотация Сиквел оригинальной GPU Gems в деталях рассказывает о свежайших приемах программирования текущего поколения (прим. - на момент 6-й серии GeForce) графических процессоров. Программисты графики реального времени откроют для себя последние алгоритмы для создания усовершенствованных графическ
Originally developed to support video games, graphics processor units (GPUs) are now increasingly used for general-purpose (non-graphics) applications ranging from machine learning to mining of cryptographic currencies. GPUs can achieve improved performance and efficiency versus central processing