Date: Hands on Tutorial on Tuesday, May 17th, 2016
Talk on Pegasus – Enhancing LIGO DAGMan Experience on Friday May 20th, 2016
Location: Town Center, The Wisconsin Institutes for Discovery.
Instructor: The Pegasus team from USC Information Sciences Institute
Tutorial Exercises: http://pegasus.isi.edu/tutorial/isi/
The Pegasus Team is hosting a hands on tutorial on Pegasus on May 17th, 2016 at HTCondor Week 2016.
Executing Computing Pipelines using Pegasus Workflow Management System
In this tutorial, we will focus on how to model scientific analysis as a workflow that can be executed on the Open Science Grid using Pegasus WMS (http://pegasus.isi.edu). Pegasus allows users to design workflows at a high-level of abstraction, that is independent of the resources available to execute them and the location of data and executables. It compiles these abstract workflows to executable workflows that can be deployed onto distributed resources such local campus clusters, computational clouds and grids such as XSEDE and Open Science Grid. During the compilation process, Pegasus WMS does data discovery, whereby it determines the locations of input data files and executables. Data transfer tasks are added to the executable workflow that are responsible for staging in the input files to the cluster, and the generated output files back to a user specified location. In addition to the data transfers tasks, data cleanup (cleanup data that is no longer required) and data registration tasks (catalog the output files) are be added to the pipeline.
Through hands-on exercises, we will cover issues of workflow composition, how to design a workflow in a portable way, run the workflow efficiently and reliably on OSG. An important component of the tutorial will be how to monitor, debug and analyze workflows using Pegasus-provided tools.