We consider the scheduling of N jobs divided into G families for processing on a single machine. No set-up is necessary between jobs belonging to the same family. A set-up must be scheduled when switching from the processing of family i jobs to those of another family j, i = j, the duration of this
A fast and accurate method for determining a lower bound on execution time
β Scribed by G. Fursin; M. F. P. O'Boyle; O. Temam; G. Watts
- Publisher
- John Wiley and Sons
- Year
- 2004
- Tongue
- English
- Weight
- 212 KB
- Volume
- 16
- Category
- Article
- ISSN
- 1532-0626
- DOI
- 10.1002/cpe.774
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
In performance critical applications, memory latency is frequently the dominant overhead. In many cases, automatic compilerβbased optimizations to improve memory performance are limited and programmers frequently resort to manual optimization techniques. However, this process is tedious and timeβconsuming. Furthermore, as the potential benefit from optimization is unknown there is no way to judge the amount of effort worth expending, nor when the process can stop, i.e. when optimal memory performance has been achieved or sufficiently approached. Architecture simulators can provide such information but designing an accurate model of an existing architecture is difficult and simulation times are excessively long. In this article, we propose and implement a technique that is both fast and reasonably accurate for estimating a lower bound on execution time for scientific applications. This technique has been tested on a wide range of programs from the SPEC benchmark suite and two commercial applications, where it has been used to guide a manual optimization process and iterative compilation. We compare our technique with that of a simulator with an ideal memory behaviour and demonstrate that our technique provides comparable information on memory performance and yet is over two orders of magnitude faster. We further show that our technique is considerably more accurate than hardware counters. Copyright Β© 2004 John Wiley & Sons, Ltd.
π SIMILAR VOLUMES
## Abstract ChemInform is a weekly Abstracting Service, delivering concise information at a glance that was extracted from about 100 leading journals. To access a ChemInform Abstract of an article which was published elsewhere, please select a βFull Textβ option. The original article is trackable v
The nearest neighbor rule or k-nearest neighbor rule is a technique of nonparametric pattern recognition. Its algorithm is simple and the error is smaller than twice the Bayes error if there are enough training samples. However, it requires an enormous amount of computation, proportional to the numb