An Empirical Analysis of Scheduling Techniques for Real-Time Cloud-Based Data Processing

Loading...
Thumbnail Image

Related Collections

Degree type

Discipline

Subject

CPS Real-Time
Computer Sciences

Funder

Grant number

License

Copyright date

Distributor

Related resources

Author

Zhang, Zhuoyao
Zheng, Qi

Contributor

Abstract

In this paper, we explore the challenges and needs of current cloud infrastructures, to better support cloud-based data-intensive applications that are not only latency-sensitive but also require strong timing guarantees. These applications have strict deadlines (e.g., to perform time-dependent mission critical tasks or to complete real-time control decisions using a human-in-the-loop), and deadline misses are undesirable. To highlight the challenges in this space, we provide a case study of the online scheduling of MapReduce jobs executed by Hadoop. Our evaluations on Amazon EC2 show that the existing Hadoop scheduler is ill-equipped to handle jobs with deadlines. However, by adapting existing multiprocessor scheduling techniques for the cloud environment, we observe significant performance improvements in minimizing missed deadlines and tardiness. Based on our case study, we discuss a range of challenges in this domain posed by virtualization and scale, and propose our research agenda centered around the application of advanced real-time scheduling techniques in the cloud environment.

Advisor

Date of presentation

2011-12-01

Conference name

Departmental Papers (CIS)

Conference dates

2023-05-17T07:17:41.000

Conference location

Date Range for Data Collection (Start Date)

Date Range for Data Collection (End Date)

Digital Object Identifier

Series name and number

Volume number

Issue number

Publisher

Publisher DOI

relationships.isJournalIssueOf

Comments

2011 IEEE International Conference on Service-Oriented Computing and Applications, Dec. 2011.

Recommended citation

Collection