A System for Evaluating Performance and Cost of SIMD Array Designs
โ Scribed by Martin C. Herbordt; Jade Cravy; Renoy Sam; Owais Kidwai; Calvin Lin
- Publisher
- Elsevier Science
- Year
- 2000
- Tongue
- English
- Weight
- 424 KB
- Volume
- 60
- Category
- Article
- ISSN
- 0743-7315
No coin nor oath required. For personal study only.
โฆ Synopsis
SIMD arrays are likely to become increasingly important as coprocessors in domain specific systems as architects continue to leverage RAM technology in their design. The problem this work addresses is the efficient evaluation of SIMD arrays with respect to complex applications while accounting for operating frequency and chip area. The underlying issues include the size of the architecture space, the lack of portability of the test programs, and the inherent complexity of simulating up to hundreds of thousands of processing elements. The overall method we use is to combine architecture level and Electronic Design Automation (EDA) level modeling by using an EDA-based tool to calibrate architectural simulations. The resulting system retains much of the high throughput of the architecture level simulator but it also has accuracy similar to that of an early pass EDA synthesis and circuit simulation. The particular problem of computational cost of the architectural level simulation is addressed with a novel approach to trace-based simulation (we call it trace compilation), which we find to be one to two orders of magnitude faster than instruction level simulation while still retaining much of the accuracy of the model. Furthermore, traces must be generated for only a small fraction of the possible parameter combinations. Using trace compilation also addresses program portability by allowing the user to code in a single data parallel language with a single compiler, regardless of the target architecture. We have used our system to evaluate thousands of potential SIMD array designs with respect to real applications and present some sample results.
๐ SIMILAR VOLUMES
In this paper we propose the OPTNET, a novel optical network and associated coherence protocol for scalable multiprocessors. The network divides its channels into broadcast and point-to-point groups. The broadcast channels are used for memory block request, coherence, and synchronization transaction
This article describes the procedures for rapidly estimating the approximate manufacturing cost of a molded part through the utilization of a computer program developed specifically for this task. The estimated cost includes the costs of material, mold, and processing. The procedures involve identif
With rapid advances in audio and video technologies, more and more content will be encoded and delivered Today, vast amounts of text, images, graphics, animation, and even Java applets are being hosted and delivered by the by means of audio/video in addition to texts and images. WWW. With rapid adva
One approach to evaluating system reliability is the use of system based component test plans. Such plans have numerous advantages over complete system level tests, primarily in terms of time and cost savings. This paper considers one of the two basic building blocks of many complex systems, namely
This study investigated a parameter that determines an optimum condition of the content of the ionic group and the concentration of outer solution for highperformance electro-driven polymer hydrogel membranes. The optimum condition for quick bending was determined by a simple method that identified