Back to Results
First PageMeta Content
Computer architecture / Failover / Job scheduler / Scheduling / High-availability cluster / Computer cluster / Globus Toolkit / Replication / Job queue / Computing / Fault-tolerant computer systems / Concurrent computing


Job-Site Level Fault Tolerance for Cluster and Grid environments* Kshitij Limaye1, Box Leangsuksun1, Zeno Greenwood1, Stephen L. Scott2, Christian Engelmann2,3, Richard Libby4 and Kasidit Chanchio5
Add to Reading List

Document Date: 2009-07-03 20:22:36


Open Document

File Size: 891,58 KB

Share Result on Facebook

City

Santa Fe / Ruston / Ottawa / Boston / Reading / /

Company

Smart Failover Job / Checkpoint / Grid Workflow / USA 2 Oak Ridge National Laboratory / MIT Press / Intel Corporation / High Availability Distributed Systems / UT-Battelle LLC / /

Country

Thailand / United States / Canada / United Kingdom / /

/

EntertainmentAwardEvent

OSCAR / The OSCAR / /

Event

Company Expansion / /

Facility

University of Virginia / The University of Reading / Checkpoint /Restart Scheme / Louisiana Tech University / Thammasat University / /

Holiday

Mardi Gras Day / /

IndustryTerm

standby server / job-site / updater algorithm / resource management software / stateless services / excellent solution / backup updater algorithm / client side algorithm / job site / update algorithm / cluster job-site / cluster computing environment / remote server / remote site / software stack / pbs_server / stateful services / check-point-aware algorithm / mission critical applications / job management / computing / backup server / /

OperatingSystem

UNIX / Linux / /

Organization

U. S. Department of Energy / MIT / Department of Energy / University of Reading / Berkeley Lab / office of Advanced Scientific Computing Research / Thammasat University / Louisiana Tech University / office of Science / University of Virginia / Mathematics / Information and Computational Sciences Office / Department of Computer Science / Center for Entrepreneurship and Information Technology / Virtual Organization / /

Person

Kshitij Limaye / Derek Wright / Miron Livny / Thomas Naughton / Robert L. Henderson / Karen Miller / Jason Maassen / Algorithms Figure / Gosia Wrzesinska / Backup Updater / Beowulf / John Mugler / Job-Site Level Fault Tolerance / Thilo Kielmann / Henri E. Bal / Rob V. van Nieuwport / Ian Foster / Todd Tannenbaum / Jie Xu / Paul Townend / /

Position

site manager / local cluster scheduler / manager / local scheduler / PBS scheduler / manager / http /

ProgrammingLanguage

Python / /

ProvinceOrState

Virginia / Louisiana / Tennessee / New Mexico / Massachusetts / /

PublishedMedium

Condor / /

Technology

5 Backup Server Update Algorithm / remaining algorithm / promising technology / updater algorithm / check-point-aware algorithm / UNIX / Linux / Information Technology / client side algorithm / 6 Backup Server Update Algorithm / operating system / html / backup updater algorithm / updating algorithm / pdf / update algorithm / Virtual Organization / Server-Side Algorithms / 3 Failover Client Algorithm / GUI / /

URL

http /

SocialTag