News

Pegasus 4.2.2 Released

Pegasus 4.2.2 is a minor release, that has minor enhancements and fixes bugs to Pegasus 4.2.0 release. Improvements in 4.2.2 include 

  • support for sever side pagination for pegasus-dashboard
  • support for lcg-utils command line clients to retrieve and push data to SRM servers
  • installation of Pegasus python libraries  in standard system locations
  • examples for using CREAMCE and glite submissions

Managing HTC workflows with Pegasus - recorded webinar

This is a recording of a Open Science Grid - Campus Infrastructures Community webinar on 4/26/13. Topics covered include:

  • What is Pegasus?
  • Components of a Pegasus workflow (abstract workflow description, replica, transformation and site catalogs)
  • Common workflow transformations
  • Debugging and statistics

The recording also contains a command line demonstation of how to plan, monitor, and debug workflows.

Full recording and slides can be found at http://www.campusgrids.org/#!webinar-htc-pegasus/c1620

Seminar by Rafael Ferreira da Silva: Online and non-clairvoyant self-healing of workflow executions on grids

Abstract: Distributed computing infrastructures are commonly used through scientific gateways, but operating these gateways requires important human intervention to handle operational incidents. In this work, we present a self-healing process that quantifies incident degrees of workflow executions from metrics measuring long-tail effect, task granularity, and fairness among workflows. These metrics are simple enough to be computed online and they make little assumptions on the application or resource characteristics. Incidents are classified in levels and associated to sets of healing actions that are selected based on association rules modeling correlations between incident level. The healing process is parametrized on real application traces acquired in production on the European Grid Infrastructure.

Seminar by Bartosz Baliś: HyperFlow: Scientific workflow as a DEDS (Discrete-Event Dynamic System)

Bartosz Baliś from AGH University of Science and Technology gave a seminar at ISI on April 12th titled "HyperFlow: Scientific workflow as a DEDS (Discrete-Event Dynamic System)".

Seminar by Maciej Malawski: Cloud Platform for VPH Applications

Maciej Malawski from AGH University of Science and Technology gave a seminar at ISI on April 11th titled "Cloud Platform for VPH Applications".

How can we use HPC platforms to help dig out new exoplanets?

In this week's ISGTW our collaborators G. Bruce Berriman and Peter Plavchan describe how they are analyzing data from NASA's Kepler mission to find exoplanets using Pegasus.

Precip 0.2 released

We are proud to announce new changes to Precip - Pegasus Repeatable Experiments for the Cloud in Python

Two features were added to the precip API:

First of them is the retry feature. Some of the cloud instances fail to boot correctly and they will reach timeout in the middle of the experiments. So in the provisioning phase of the precip whenever an instance has booting problems, it will retry the booting phase according to the times user asked for.

The second feature is the change to the way the SSH keypair is stored. In order to use precip from multiple machines or accounts at the same time the way the keys are stored is changed. Now at the beginning of every connection between the submit host and the cloud a unique id is generated for each account and the keys are stored in "precip_id". In this case different machines with different ids can use it at the same time.

Project ADAMANT Wins CENIC's 2013 Innovations in Networking Award for Experimental/Developmental Applications

March 13, 2013 — La Mirada, CA — Project ADAMANT, a collaborative effort of the University of Southern California Information Sciences Institute (USC/ISI), RENCI/UNC Chapel Hill, and Duke University, has been honored by the Corporation for Education Network Initiatives in California (CENIC) as the recipient of the 2013 Innovations in Networking Award for Experimental/Developmental Applications.

Viterbi Ranked in Top Ten

The upcoming edition of the US News and World Report, which will become public on March 12, 2013, ranks the USC Viterbi School of Engineering in the top ten graduate programs in engineering.

Ewa Deelman talks about the ADAMANT project

Ewa Deelman talks about the ADAMANT project at The 100G and Beyond Workshop, sponsored by the California Institute of Telecommunications and Information Technology (Calit2), the Energy Sciences Network (ESnet), and the Corporation for Education Network Initiatives in California (CENIC)

Seminar by James Howison: Scientific software production systems: incentives and collaboration

 

Title: Scientific software production systems: incentives and collaboration
Speaker: James Howison, University of Texas at Austin
 
Abstract: Software plays an increasingly critical role in science, including data analysis, simulations, and managing workflows. Unlike other technologies supporting science, software can be copied and distributed at essentially no cost, potentially opening the door to unprecedented levels of sharing and collaborative innovation. Yet we do not have a clear picture of how software development for science fits into the day-to-day practice of science, or how well the methods and incentives of its production facilitate realization of this potential. We report the results of a multiple-case study of software development in three fields: high energy physics, structural biology, and microbiology. In each case, we identify a typical publication, and use qualitative methods to explore the production of the software used in the science represented by the publication. We identify several different production systems, characterized primarily by differences in incentive structures. We identify ways in which incentives are matched and mismatched with the needs of the science fields, especially with respect to collaboration.
 

Amazon Web Services at ISI

Ann Merrihew, Jamie Kinney, and Larry Gilreath from Amazon Web Services visited ISI on February 13th to describe how AWS can be used for science. Jamie gave examples of how scientists from NASA and CMS are using AWS for data storage and processing.

Pegasus 4.2 Released

Pegasus 4.2 is a major release of Pegasus which contains 

  • several improvements on data management capabilities,
  • a new web based monitoring dashboard
  • job submission interfaces supported. CREAM CE is now supported
  • new replica catalog backends. 
  • support for PMC only workflows and IO forwarding for PMC clustered jobs
  • anonymous usage metrics reporting

New award: dV/dT - Accelerating the Rate of Progress towards Extreme Scale Collaborative Science

In recognition of the importance of collaborative science, the US Department of Energy solicited proposals aimed at advancing the state-of-the-art in extreme scale scientific collaborations. Miron Livny (University of Wisconsin at Madison) and his colleagues: William Allcock (University of Chicago, Argonne National Laboratory), Ewa Deelman (University of Southern California), Douglas Thain (University of Notre Dame), and Frank Wuerthwein (University of California San Diego) have received an award under that program entitled “dV/dt - Accelerating the Rate of Progress towards Extreme Scale Collaborative Science”.

New Award: Transforming Computational Science with ADAMANT (Adaptive Data-Aware Multi-Domain Application Network Topologies)

Workflows, especially data-driven workflows and workflow ensembles are becoming a centerpiece of modern computational science. However, scientists lack the tools that integrate the operation of workflow-driven science applications on top of dynamic infrastructures that link campus, institutional and national resources into connected arrangements targeted at solving a specific problem. These tools must (a) orchestrate the infrastructure in response to application demands, (b) manage application lifetime on top of the infrastructure by monitoring various workflow steps and modifying slices in response to application demands, and (c) integrate data movement with the workflows to optimize performance.

Introducing Precip - Pegasus Repeatable Experiments for the Cloud in Python

We are proud to announce a new member of the Pegsaus projects family:

Precip -  Pegasus Repeatable Experiments for the Cloud in Python 

Precip is a flexible experiment management API for running experiments on clouds. Precip was developed for use on FutureGrid infrastructures such as OpenStack, Eucalyptus (>=3.2), Nimbus, and at the same time commercial clouds such as Amazon EC2. The API allows you to easily provision resources, which you can then can run commands on and copy files to/from subsets of instances identified by tags. The goal of the API is to be flexible and simple to use in Python scripts to control your experiments.

Major features of Precip include support for vanilla images which can be bootstrapped at runtime, flexible tagging of instances for group manipulation such as running commands or copy files, and automatic handling of ssh keys and security groups.

Pegasus at SC'12

Next week the annual Supercomputing conference will be held in Salt Lake City, Utah. If you are planning to attend the conference we encourage you to stop by one of the Pegasus-related talks that are scheduled during the week.

Several of the talks will be given at the workshop on Workflows in Support of Large-Scale Science (WORKS) on Monday. More information can be found on the WORKS website:  http://works.cs.cardiff.ac.uk/program.php

If you would like to discuss the Pegasus Workflow Managment System or other projects such as running workflows on the Cloud, resource provisioning, or our new Precip: Pegasus Repeatable Experiments for the Cloud in Python software, please come to the USC booth during Ewa's talks.

The complete list of Pegasus-related talks is as follows:

Monday 10:50AM: "Peer-to-Peer Data Sharing for Scientific Workflows on Amazon EC2", Gideon Juve, WORKS Workshop, Room 155-A

A Tale of 160 Scientists, Three Applications, One Workshop and A Cloud

Our collaborator Bruce Berriman has posted a presentation and a paper about hosting a workshop with hands-on sessions being run on an infrastructure set up in Amazon EC2:

 The NASA Exoplanet Science Center (NEXScI) hosts the Sagan Workshops, annual themed conferences aimed at introducing the latest techniques in exoplanet astronomy to young researchers. The workshops emphasize interaction with data, and include hands-on sessions where participants use their laptops to follow step-by-step tutorials given by experts.
...
Rather than attempt to run these diverse applications on the inevitable wide range of environments on attendees’ laptops, the conference organizers, in consultation with the Virtual Astronomical Observatory, chose instead to run the applications on the Amazon Elastic Cloud 2 (EC2). This paper summarizes the system architecture, the Amazon resources consumed, and lessons learned and best practices.

ISGTW article about Pegasus and HUBzero collaboration

From the article:

Over the past several years, the US National Science Foundation has been funding the development of collaborative web sites or ‘collaboratories’ and scientific workflow technologies. Many communities have adopted the HUBzero platform to create collaboratories called ‘hubs’ where they can share ideas, models, experiences, publications, and data in pursuit of research and education. Scientists have also been using the Pegasus Workflow Management System (WMS) to manage complex analyses running on their campus and on large-scale cyberinfrastructure, such as the Open Science Grid, DiaGrid, and XSEDE.

Read the full article at the ISGTW website

Pegasus now available in HUBZero

A tutorial on using Pegasus in HUBZero is available at https://hubzero.org/resources/pegtut. To learn about how to generate Pegasus workflows, please see the Pegasus User Guide. For help using Pegasus please contact pegasus-users@isi.edu

Pages

Subscribe to Pegasus News