19th Workshop on Job Scheduling Strategies for Parallel Processing

19th Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP)

In Conjunction with IPDPS 2015
Hyderabad, India
29 May 2015

Workshop organizers

Walfredo Cirne, Google
Narayan Desai, Ericsson

Program Committee

Henri Casanova, University of Hawaii at Manoa
Julita Corbalan, Technical University of Catalonia
Dick Epema, Delft University of Technology
Hyeonsang Eom, Seoul National University
Dror Feitelson, The Hebrew University
Liana Fong, IBM T. J. Watson Research Center
Eitan Frachtenberg, Facebook
Alfredo Goldman, University of Sao Paulo
Allan Gottlieb, New York University
Alexandru Iosup, Delft University of Technology
Morris Jette, SchedMD LLC
Srikanth Kandula, Microsoft
Rajkumar Kettimuthu, Argonne National Laboratory
Dalibor Klusáček, Masaryk University
Madhukar Korupolu, Google
Zhiling Lan, Illinois Institute of Technology
Bill Nitzberg, Altair Engineering
P-O Östberg, Umeå University
Larry Rudolph, Two Sigma Investments
Uwe Schwiegelshohn, Technical University Dortmund
Leonel Sousa, Universidade Técnica de Lisboa
Mark Squillante, IBM T. J. Watson Research Center
Wei Tang, Argonne National Laboratory
Ramin Yahyapour, GWDG - University of Göttingen

Technical Program:

Session 1 (9:00-10:00) – Opening and keynote

Keynote: Practical Considerations for Multi-Level Schedulers.
Benjamin Hindman (Mesosphere)

Abstract: Existing research has shown the benefits of running multi-level schedulers, either for single node parallel computation or multi-node distributed computation. But, there are some important practical considerations that must be addressed in order to use these multi-level scheduling architectures in multi-user production environments. In this presentation we'll discuss these practical considerations through lessons learned deploying Apache Mesos, a 2-level distributed scheduling system that has been used in organizations such as Twitter, Netflix, and Apple. We'll first highlight the multi-level scheduling systems that influenced Mesos as well as describe the 2-level Mesos architecture in detail. We'll then focus on the 1st-level scheduler of Mesos and the efficient multi-resource fair-sharing algorithm that it employs. Finally, we'll discuss the extensions that have been added over the years (or are being added today) driven by practical needs, from weights, to reservations, to quotas, to optimistic allocations, and deallocation.

Biography: Benjamin Hindman is a co-founder of Mesosphere and co-creator of the Apache Mesos project. Ben was a PhD student at UC Berkeley before bringing Mesos to Twitter where it now runs on tens of thousands of machines powering Twitter's datacenters. He is now Chief Architect at Mesosphere, where they are building the Mesosphere Datacenter Operating System (DCOS). An academic at heart, his research in programming languages and distributed systems has been published in leading academic conferences.

Session 2 (10:30-12:00) – Distributed Parallelism

Real-life Experience with Major Reconfiguration of Job Scheduling System.
Dalibor Klusáček, Šimon Tóth, and Gabriela Podolníková (Masaryk University, Czech Republic)

Influence of Dynamic Think Times on Parallel Job Scheduler Performances in Generative Simulations.
Stephan Schlagkamp (TU Dortmund, Germany)

Evalix: Classification and Prediction of Job Resource Consumption on HPC Platforms.
Joseph Emeras, Sébastien Varrette, Mateusz Guzek, and Pascal Bouvry (University of Luxembourg, Luxemburg)

Session 3 (13:00-15:00) – Node Level Scheduling

Controlled Duplication Scheduling of Real-Time Precedence Tasks on Heterogeneous Multiprocessors.
Jagpreet Singh (Indian Institute of Information Technology Allahabad, India) and Nitin Auluck (Indian Institute of Technology Ropar, India)

Data Driven Scheduling Approach for the Multi-Node Multi-GPU Cholesky Decomposition.
Yuki Tsujita and Toshio Endo (Tokyo Institute of Technology, Japan)

Scheduling for Better Energy Efficiency on Many-core Chips.
Chanseok Kang, Seungyul Lee, Yong-Jun Lee, Jaejin Lee, and Bernhard Egger (Seoul National University, Korea)

On the Design and Implementation of an Efficient Lock-Free Scheduler.
Florian Negele, Felix Friedrich (ETH Zürich), Suwon Oh, and Bernhard Egger (Seoul National University, Korea)

Session 4 (15:30)– Panel and Discussion

Panel on The Evolution of Parallel Scheduling

Registration

Registration will be part of the IPDPS process and is handled by the IEEE. For details, see the IPDPS web site.

Proceedings

Interim proceedings containing a collection of the papers presented will be distributed at the workshop in electronic form. It is planned to also publish a post-workshop proceedings in the Springer Lecture Notes on Computer Science series, as was done in previous years (pending approval from Springer).

Back to parallel job scheduling workshops home page