✦ LIBER ✦

A class of Lanczos-like algorithms implemented on parallel computers

✍ Scribed by S.K. Kim; A.T. Chronopoulos

Publisher: Elsevier Science
Year: 1991
Tongue: English
Weight: 869 KB
Volume: 17
Category: Article
ISSN: 0167-8191
DOI: 10.1016/s0167-8191(05)80065-1

No coin nor oath required. For personal study only.

✦ Synopsis

Kim, S.K. and A.T. Chronopoulos, A class of Lanczos-like algorithms implemented on parallel Computers, Parallel Computing 17 (1991) 763-778.

The Lanczos algorithm is most commonly used in approximating a small number of extreme eigenvalues and eigenvectors for symmetric large sparse matrices. Main memory accesses for shared memory systems or global communications (synchronizations) in message passing systems decrease the computation speed. In this paper, the standard Lanczos algorithm is restructured so that only one synchronization point is required; that is, one global communication in a message passing distributed-memory machine or one global memory sweep in a shared-memory machine per each iteration is required.

We also introduce the s-step Lanczos method for finding a few eigenvalues of symmetric large sparse matrices in a similar way to the s-step Conjugate Gradient method [2], and we prove that the s-step method generates reduction matrices which are similar to reduction matrices generated by the standard method. One iteration of the s-step Lanczos algorithm corresponds to s iterations of the standard Lanczos algorithm. The s-step method has improved data locality, minimized global communication and has superior parallel properties to the standard method. These algorithms are implemented on a 64-node NCUBE/seven hypercube and a CRAY-2, and performance results are presented.

📜 SIMILAR VOLUMES

Implementation of the lanczos method for

Implementation of the lanczos method for structural vibration analysis on a parallel computer

✍ Susan W. Bostic; Robert E. Fulton 📂 Article 📅 1987 🏛 Elsevier Science 🌐 English ⚖ 916 KB

Parallel implementation of a ray tracing

Parallel implementation of a ray tracing algorithm for distributed memory parallel computers

✍ Lee, Tong-Yee; Raghavendra, C. S.; Nicholas, John B. 📂 Article 📅 1997 🏛 John Wiley and Sons 🌐 English ⚖ 145 KB 👁 3 views

Ray tracing is a well known technique to generate life-like images. Unfortunately, ray tracing complex scenes can require large amounts of CPU time and memory storage. Distributed memory parallel computers with large memory capacities and high processing speeds are ideal candidates to perform ray tr

On a class of preconditioned iterative m

On a class of preconditioned iterative methods on parallel computers

✍ O. Axelsson; G. Carey; G. Lindskog 📂 Article 📅 1989 🏛 John Wiley and Sons 🌐 English ⚖ 856 KB

The standard implementations of iteratike solvers for finitc elcment and finite difercnce mcthods frequently use a diagonal (Jacobi) preconditioncr, particularly for element-by-element schemes. However, for such methods the actual order of the condition number with respect to mesh size is not reduce

A class of parallel multiple-front algor

A class of parallel multiple-front algorithms on subdomains

✍ A. Bose; G. F. Carey; V. F. de Almeida 📂 Article 📅 2003 🏛 John Wiley and Sons 🌐 English ⚖ 177 KB

A new direct MP2 gradient algorithm with

A new direct MP2 gradient algorithm with implementation on a massively parallel computer

✍ Ida M.B. Nielsen 📂 Article 📅 1996 🏛 Elsevier Science 🌐 English ⚖ 464 KB

A new direct second-order Moller-Plesset gradient algorithm is presented. By avoiding generation of molecular orbital integrals with three virtual indices, the memory requirement for a calculation with n basis functions and o occupied orbitals is reduced to on 2, and overall computational savings ar

A compact algorithm for Gaussian elimina

A compact algorithm for Gaussian elimination over GF(2) implemented on highly parallel computers

✍ Dennis Parkinson; Marvin Wunderlich 📂 Article 📅 1984 🏛 Elsevier Science 🌐 English ⚖ 477 KB