Use of embedded DRAMs in video and image computing
β Scribed by Coskun Mermer; Donglok Kim; Stefan G. Berg; Robert Gove; Yongmin Kim
- Publisher
- Elsevier Science
- Year
- 2003
- Tongue
- English
- Weight
- 673 KB
- Volume
- 49
- Category
- Article
- ISSN
- 1383-7621
No coin nor oath required. For personal study only.
β¦ Synopsis
We have evaluated the role of embedded dynamic random access memory (eDRAM) in the performance of programmable mediaprocessors, focusing on video/image computing. eDRAMΓs contributions to improving the total system performance can be assessed by measuring the number of CPU stall cycles caused by the memory transactions. We decomposed the CPU stall cycles into three components: latency due to row access, latency due to the pipeline of memory transactions, and burst transfer time. We used a cycle-accurate cache and eDRAM model to measure the system performance in executing selected low-level video/image computing functions on a mediaprocessor core. We simulated various values for data bus width, page size, and row-access time of eDRAM, pipeline delay of a memory transaction, and data cache line size. While the wider data width of eDRAM does reduce the burst transfer time, the actual reduction in the total stall cycles when the width was expanded from 8 to 16 bytes was lower than expected, ranging from 6.2% to 18.9%. Instead, we found that the row-access latency and memory transaction pipeline delay represent the major portion of the CPU stall cycles. For example, in case of 32-byte wide data bus, they account for 85.3-95.1% of the memory busy time during which data cache misses are serviced. We show how to lower the CPU stall time further, e.g., using no-write-allocate data cache to reduce the total burst transfer time, efficient memory banking to reduce the number of eDRAM page misses, and various software/hardware methods to bring data to the cache before they are needed by the CPU. In particular, the regular memory access pattern in video/image computing allows several methods to enhance the memory performance in using eDRAM, e.g., enlarging the cache line size and data prefetching. This paper presents our methodology, experimental results, and findings, which would be useful to the design of highly integrated systems on a chip with eDRAM in the future.
π SIMILAR VOLUMES
The aim of this study was to evaluate the effects of various cell culture conditions on cell morphology. Cell morphology was estimated by means of video recording and computer-assisted image analysis. Cell contours from the stored images of either live cells or fixed and stained cells were determine
A microcomputer-based system for copying, storing, retrieving, and displaying color still images was assembled from commercially available components. The system was tested against 35-mm color photographic transparencies by measuring the performance of 91 medical students in a task of identifying th
generation; i.e., instead of generating an image from a given formula, image compression searches for sets of frac-Iterated function systems (IFS) have been used to compress image data. Because of difficulty in finding IFS in natural tals in a digitized image which describe and represent the images,