Fast matrix multiplication

✍ Scribed by Carlos F. Bunge; Gerardo Cisneros

Publisher: John Wiley and Sons
Year: 1987
Tongue: English
Weight: 358 KB
Volume: 8
Category: Article
ISSN: 0192-8651
DOI: 10.1002/jcc.540080705

✦ Synopsis

Several implementations of matrix multiplication (MMUL) in Fortran and VAX assembly language are discussed. On a VAX-11/780 computer, the most efficient MMUL is achieved through vector-scalar-multiply-and-add (VSMA) operations, rather than by means of dot products. We also discuss optimal MMUL algorithms for use in virtual memory machines when the data overflow the working set.
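
The distinction the synopsis draws between dot-product and VSMA formulations comes down to which loop is innermost. The following is a minimal sketch in Python, not the paper's Fortran or VAX assembly code; the function names `mmul_dot` and `mmul_vsma` are hypothetical, and the matrices are assumed to be row-major lists of lists (so the VSMA update runs along rows, whereas column-major Fortran would run it along columns). It illustrates the loop structure only and makes no claim about relative speed.

```python
def mmul_dot(a, b):
    """C = A * B with a dot product as the innermost loop."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0.0] * p for _ in range(n)]
    for i in range(n):
        for j in range(p):
            s = 0.0
            # Inner loop reduces row i of A against column j of B to one scalar.
            for k in range(m):
                s += a[i][k] * b[k][j]
            c[i][j] = s
    return c


def mmul_vsma(a, b):
    """C = A * B with a vector-scalar-multiply-and-add as the innermost loop."""
    n, m, p = len(a), len(b), len(b[0])
    c = [[0.0] * p for _ in range(n)]
    for i in range(n):
        row_c = c[i]
        for k in range(m):
            scalar = a[i][k]
            row_b = b[k]
            # Inner loop updates a whole vector: row_c += scalar * row_b.
            for j in range(p):
                row_c[j] += scalar * row_b[j]
    return c


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    print(mmul_dot(a, b))
    print(mmul_vsma(a, b))
```

Both functions perform the same multiplications and additions; only the loop order, and therefore the memory access pattern, differs.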

📜 SIMILAR VOLUMES

Rectangular Matrix Multiplication Revisited

✍ Don Coppersmith 📂 Article 📅 1997 🏛 Elsevier Science 🌐 English ⚖ 161 KB

SUMMA: scalable universal matrix multiplication algorithm

✍ Van De Geijn, R. A.; Watts, J. 📂 Article 📅 1997 🏛 John Wiley and Sons 🌐 English ⚖ 341 KB

In the paper we give a straightforward, highly efficient, scalable implementation of common matrix multiplication operations. The algorithms are much simpler than previously published methods, yield better performance, and require less work space. MPI implementations are given, as are performance re

A practical algorithm for faster matrix multiplication

✍ Igor Kaporin 📂 Article 📅 1999 🏛 John Wiley and Sons 🌐 English ⚖ 69 KB

The purpose of this paper is to present an algorithm for matrix multiplication based on a formula discovered by Pan [7]. For matrices of order up to 10 000, the nearly optimum tuning of the algorithm results in a rather clear non-recursive one- or two-level structure with the operation count comparab

Approximating Matrix Multiplication for Pattern Recognition Tasks

✍ Edith Cohen; David D Lewis 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 241 KB

Many pattern recognition tasks, including estimation, classification, and the finding of similar objects, make use of linear models. The fundamental operation in such tasks is the computation of the dot product between a query vector and a large database of instance vectors. Often we are interested

The Combinatorics of Cache Misses during Matrix Multiplication

✍ Philip J. Hanlon; Dean Chung; Siddhartha Chatterjee; Daniela Genius; Alvin R. Le 📂 Article 📅 2001 🏛 Elsevier Science 🌐 English ⚖ 333 KB

In this paper we construct an analytic model of cache misses during matrix multiplication. The analysis in this paper applies to square matrices of size 2^m where the array layout function is given in terms of a function 3 that interleaves the bits in the binary expansions of the row and column indi

Scalable Parallel Matrix Multiplication on Distributed Memory Parallel Computers

✍ Keqin Li 📂 Article 📅 2001 🏛 Elsevier Science 🌐 English ⚖ 392 KB

Consider any known sequential algorithm for matrix multiplication over an arbitrary ring with time complexity O(N^α), where 2 < α ≤ 3. We show that such an algorithm can be parallelized on a distributed memory parallel computer (DMPC) in O(log N) time by using N^α/log N processors. Such a parallel