✦ LIBER ✦

Adaptive blocking in the QR factorization

✍ Scribed by Christian H. Bischof

Book ID: 104632731
Publisher: Springer US
Year: 1989
Tongue: English
Weight: 888 KB
Volume: 3
Category: Article
ISSN: 0920-8542
DOI: 10.1007/bf00127828

No coin nor oath required. For personal study only.

✦ Synopsis

On most high-performance architectures, data movement is slow compared to floating point (in particular, vector) performance. On these architectures block algorithms have been successful for matrix computations. By considering a matrix as a collection of submatrices (the so-called blocks), one naturally arrives at algorithms that require little data movement. The optimal blocking strategy, however, depends on the computing environment and on the problem parameters. On parallel machines, tradeoffs between individual floating point performance and overall system performance also come into play. Current approaches use fixed-width blocking strategies which are not optimal. This paper presents an adaptive blocking methodology for determining a good blocking strategy systematically. We demonstrate this technique on a block QR factorization routine on a distributed-memory machine. Using timing models for the high-level kernels of the algorithm, we can formulate in a recurrence relation a blocking strategy that avoids adding extra delays along the critical path of the algorithm. This recurrence relation predicts performance well since we base our timing models on observed data, not other simplistic measures. Experiments on the Intel iPSC/1 hypercube show that, in fact, the resulting blocking strategy is as good as any fixed-width blocking strategy, independent of problem size and the number of processors employed. So while we do not know the optimum fixed-width blocking strategy unless we rerun the same problem several times, adaptive blocking provides close to optimum performance in the first run. We also mention how adaptive blocking can result in performance portable code by automating the generation of the timing models.

📜 SIMILAR VOLUMES

Perturbation Analyses for the QR Factori

Perturbation Analyses for the QR Factorization

✍ Chang, Xiao-Wen; Paige, Christopher C.; Stewart, G. W. 📂 Article 📅 1997 🏛 Society for Industrial and Applied Mathematics 🌐 English ⚖ 359 KB

Perturbation analysis for the hyperbolic

Perturbation analysis for the hyperbolic QR factorization

✍ Hanyu Li; Hu Yang; Hua Shao 📂 Article 📅 2012 🏛 Elsevier Science 🌐 English ⚖ 269 KB

On perturbation bounds for the QR factor

On perturbation bounds for the QR factorization

✍ Ji-guang Sun 📂 Article 📅 1995 🏛 Elsevier Science 🌐 English ⚖ 630 KB

Componentwise perturbation analyses for

Componentwise perturbation analyses for the QR factorization

✍ Xiao-Wen Chang; Christopher C. Paige 📂 Article 📅 2001 🏛 Springer-Verlag 🌐 English ⚖ 199 KB

Perturbation bounds for the generalized

Perturbation bounds for the generalized QR factorization

✍ Anders Barrlund 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 872 KB

Block Householder transformation for par

Block Householder transformation for parallel QR factorization

✍ F. Rotella; I. Zambettakis 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 257 KB

A new form of the QR factorization procedure is presented which is based on a generalization of the Householder transformation. This extension is a block matrical form of the usual Householder procedure which leads to a dichotomic algorithm which allows parallel implementation.