[ACM Press Proceeding of the 2nd international workshop - Waikiki, Honolulu, HI, USA (2011.05.22-2011.05.22)] Proceeding of the 2nd international workshop on Software engineering for cloud computing - SECLOUD '11 - A MapReduce workflow system for architecting scientific data intensive applications
β Scribed by Nguyen, Phuong; Halem, Milton
- Book ID
- 121277457
- Publisher
- ACM Press
- Year
- 2011
- Tongue
- English
- Weight
- 424 KB
- Category
- Article
- ISBN
- 1450305822
No coin nor oath required. For personal study only.
β¦ Synopsis
MapReduce is promising for developing both scalable business and scientific data intensive applications. However, there are few existing scientific workflow systems which can benefit from the MapReduce programming model. We propose a workflow system for integrating structure, and orchestrating MapReduce jobs for scientific data intensive workflows. The system consists of a simple workflow design C++ API, a job scheduler, and a runtime support system for Hadoop or Sector/Sphere frameworks. A climate satellite data intensive processing and analysis application is developed as a use case and an evaluation for the workflow system. The evaluation shows that it is possible to make the steps in the climate data intensive application automatically from data gridding to complex data analysis using the workflow system. The performance of the climate analysis application is significantly improved by the enabled MapReduce workflow system compared with the sequential embarrassing parallel methods. The overhead of the workflow system is negligible. However, the graphic user interface is still under development for the workflow system.
π SIMILAR VOLUMES
Education for cloud engineers is crucial in terms of innovation in the development of cloud technologies. We propose a new cloud platform based on open-source software that uses multi-clouds for the education.