We present three parallel implementations of the Karatsuba algorithm for long integer multiplication on a distributed memory architecture and discuss the experimental results obtained on a Paragon computer. The first two implementations have both time complexity O(n) on n log 2 3 processors, but pre
The RSCG algorithm on distributed memory architectures
β Scribed by Lori Freitag; James Ortega
- Publisher
- John Wiley and Sons
- Year
- 1995
- Tongue
- English
- Weight
- 815 KB
- Volume
- 2
- Category
- Article
- ISSN
- 1070-5325
No coin nor oath required. For personal study only.
π SIMILAR VOLUMES
We describe an efficient implementation of the pseudospectral multi-reference single-and double-excitation configuration interaction method on a distributed memory parallel architecture. Near-linear speedups are achieved up to 16 processors for a single-reference test case, demonstrating that pseudo
This paper introduces an architecture-independent, hierarchical approach to algorithm design on distributed-memory architectures, in contrast to the current trend of tailoring algorithms towards specific architectures. We show that, rather surprisingly, this new approach can achieve uniformity witho
## Abstract We developed a novel parallel algorithm for largeβscale Fock matrix calculation with small locally distributed memory architectures, and named it the β__RT__ parallel algorithm.β The __RT__ parallel algorithm actively involves the concept of integral screening, which is indispensable fo
Automatic scheduling for directed acyclic graphs (DAG) and its applications for coarse-grained irregular problems such as large n-body simulation have been studied in the literature. However, solving irregular problems with mixed granularities such as sparse matrix factorization is challenging since