A preemption-based meta-scheduling system for distributed computing
Abstract (Summary)
This research aims at designing and building a scheduling framework for distributed
computing systems with the primary objectives of providing fast response times to the
users, delivering high system throughput and accommodating maximum number of applications
into the systems. The author claims that the above mentioned objectives are
the most important objectives for scheduling in recent distributed computing systems,
especially Grid computing environments.
In order to achieve the objectives of the scheduling framework, the scheduler employs
arbitration of application-level schedules and preemption of executing jobs under
certain conditions. In application-level scheduling, the user develops a schedule for his
application using an execution model that simulates the execution behavior of the application.
Since application-level scheduling can seriously impede the performance of
the system, the scheduling framework developed in this research arbitrates between different
application-level schedules corresponding to different applications to provide fair
system usage for all applications and balance the interests of different applications. In
this sense, the scheduling framework is not a classical scheduling system, but a metascheduling
system that interacts with the application-level schedulers.
Due to the large system dynamics involved in Grid computing systems, the ability
to preempt executing jobs becomes a necessity. The meta-scheduler described in this
dissertation employs well defined scheduling policies to preempt and migrate executing
applications. In order to provide the users with the capability to make their applications
vi
preemptible, a user-level checkpointing library called SRS(Stop-Restart Software) was
also developed by this research. The SRS library is different from many user-level
checkpointing libraries since it allows reconfiguration of applications between migrations.
This reconfiguration can be achieved by changing the processor configuration and/or
data distribution.
The experimental results provided in this dissertation demonstrates the utility of the
metascheduling framework for distributed computing systems. And lastly, the metascheduling
framework was put to practical use by building a Grid computing system
called GradSolve. GradSolve is a flexible system and it allows the application library
writers to upload applications with different capabilities into the system. GradSolve is
also unique with respect to maintaining traces of the execution of the applications and
using the traces for subsequent executions of the application.
vii
Bibliographical Information:
Advisor:
School:The University of Tennessee at Chattanooga
School Location:USA - Tennessee
Source Type:Master's Thesis
Keywords:
ISBN:
Date of Publication: