๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

A class of Lanczos-like algorithms implemented on parallel computers

โœ Scribed by S.K. Kim; A.T. Chronopoulos


Publisher
Elsevier Science
Year
1991
Tongue
English
Weight
869 KB
Volume
17
Category
Article
ISSN
0167-8191

No coin nor oath required. For personal study only.

โœฆ Synopsis


Kim, S.K. and A.T. Chronopoulos, A class of Lanczos-like algorithms implemented on parallel Computers, Parallel Computing 17 (1991) 763-778.

The Lanczos algorithm is most commonly used in approximating a small number of extreme eigenvalues and eigenvectors for symmetric large sparse matrices. Main memory accesses for shared memory systems or global communications (synchronizations) in message passing systems decrease the computation speed. In this paper, the standard Lanczos algorithm is restructured so that only one synchronization point is required; that is, one global communication in a message passing distributed-memory machine or one global memory sweep in a shared-memory machine per each iteration is required.

We also introduce the s-step Lanczos method for finding a few eigenvalues of symmetric large sparse matrices in a similar way to the s-step Conjugate Gradient method [2], and we prove that the s-step method generates reduction matrices which are similar to reduction matrices generated by the standard method. One iteration of the s-step Lanczos algorithm corresponds to s iterations of the standard Lanczos algorithm. The s-step method has improved data locality, minimized global communication and has superior parallel properties to the standard method. These algorithms are implemented on a 64-node NCUBE/seven hypercube and a CRAY-2, and performance results are presented.


๐Ÿ“œ SIMILAR VOLUMES


Parallel implementation of a ray tracing
โœ Lee, Tong-Yee; Raghavendra, C. S.; Nicholas, John B. ๐Ÿ“‚ Article ๐Ÿ“… 1997 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 145 KB ๐Ÿ‘ 3 views

Ray tracing is a well known technique to generate life-like images. Unfortunately, ray tracing complex scenes can require large amounts of CPU time and memory storage. Distributed memory parallel computers with large memory capacities and high processing speeds are ideal candidates to perform ray tr

On a class of preconditioned iterative m
โœ O. Axelsson; G. Carey; G. Lindskog ๐Ÿ“‚ Article ๐Ÿ“… 1989 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 856 KB

The standard implementations of iteratike solvers for finitc elcment and finite difercnce mcthods frequently use a diagonal (Jacobi) preconditioncr, particularly for element-by-element schemes. However, for such methods the actual order of the condition number with respect to mesh size is not reduce

A new direct MP2 gradient algorithm with
โœ Ida M.B. Nielsen ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 464 KB

A new direct second-order Moller-Plesset gradient algorithm is presented. By avoiding generation of molecular orbital integrals with three virtual indices, the memory requirement for a calculation with n basis functions and o occupied orbitals is reduced to on 2, and overall computational savings ar