𝔖 Bobbio Scriptorium
✦   LIBER   ✦

[ACM Press Proceeding of the 2nd international workshop - Waikiki, Honolulu, HI, USA (2011.05.22-2011.05.22)] Proceeding of the 2nd international workshop on Software engineering for cloud computing - SECLOUD '11 - A MapReduce workflow system for architecting scientific data intensive applications

✍ Scribed by Nguyen, Phuong; Halem, Milton


Book ID
121277457
Publisher
ACM Press
Year
2011
Tongue
English
Weight
424 KB
Category
Article
ISBN
1450305822

No coin nor oath required. For personal study only.

✦ Synopsis


MapReduce is promising for developing both scalable business and scientific data intensive applications. However, there are few existing scientific workflow systems which can benefit from the MapReduce programming model. We propose a workflow system for integrating structure, and orchestrating MapReduce jobs for scientific data intensive workflows. The system consists of a simple workflow design C++ API, a job scheduler, and a runtime support system for Hadoop or Sector/Sphere frameworks. A climate satellite data intensive processing and analysis application is developed as a use case and an evaluation for the workflow system. The evaluation shows that it is possible to make the steps in the climate data intensive application automatically from data gridding to complex data analysis using the workflow system. The performance of the climate analysis application is significantly improved by the enabled MapReduce workflow system compared with the sequential embarrassing parallel methods. The overhead of the workflow system is negligible. However, the graphic user interface is still under development for the workflow system.


πŸ“œ SIMILAR VOLUMES


[ACM Press Proceeding of the 2nd interna
✍ Yoshioka, Nobukazu; Yokoyama, Shigetoshi; Tanabe, Yoshionori; Honiden, Shinichi πŸ“‚ Article πŸ“… 2011 πŸ› ACM Press 🌐 English βš– 297 KB

Education for cloud engineers is crucial in terms of innovation in the development of cloud technologies. We propose a new cloud platform based on open-source software that uses multi-clouds for the education.