Applications

Pegasus is being used in a broad range of applications.  This pages shows some of the examples.
We are looking for new applications willing to leverage our workflow technologies. If you are interested please contact us at pegasus at isi dot edu


Pegasus is used in astronomy, and in particular in the Montage application which delivers science-grade mosaics of the sky.  Our technologies were used to transform a single-processor Montage code into a complex workflow and parallelized computations to process larger-scale images. Montage workflows mapped by Pegasus to the NSF CyberInfrastructure are characterized by tens of thousands of executable tasks and the processing of thousands of images.  The image on the right (Beaton et al. Ap J Lett in press) was recently created to verify a bar in the spiral galaxy M31.  Eleven major projects and surveys worldwide, such as the Spitzer Space Telescope Legacy teams have integrated Montage and therefore Pegasus into their pipelines and processing environments to generate science and browse products for dissemination to the astronomy community.

For other Montage success stories please visit:

http://montage.ipac.caltech.edu/applications.html

Montage Scientists: Bruce Berriman, John Good,  (IPAC) Dan Katz (LSU), and Joe Jacobs (Caltech)


The Southern California Earthquake Center (SCEC) uses our workflow technologies to produce more accurate seismic hazard maps. These maps, generated as part of the SCEC CyberShake project, indicate the maximum amount of shaking expected at a particular geographic location over a certain period of time. The hazard maps are used by civil engineers to determine building design tolerances. Pegasus maps the CyberShake workflows onto SCEC and NSF CyberInfrastructure resources.  The figure on the left shows the results of running CyberShake on the TeraGrid in the fall of 2005. The workflows ran over a period of 23 days and processed 20TB of data using 1.8 CPU Years. The total number of tasks in the workflows was 261,823. CyberShake delivers new insights into how rupture directivity and sedimentary basin effects contribute to the shaking experienced at different geographic locations. As a result more accurate hazard maps can be created.

SCEC is also using Pegasus and DAGMan in the Earthworks Portal, a TeraGrid Science Gateway, hosted at Washington University that allows users to configure and execute earthquake wave propagation simulations structured as workflows through a simple portal interface.

David Okaya's of SCEC has an interesting slide on Benefits of Scientifc Workflows for an application scientists perspective.

SCEC Scientists:  Thomas H. Jordan, Scott Callaghan, Edward Field, Hunter Francoeur, Robert Graves, Nitin Gupta, Vipin Gupta, Philip Maechling, John Mehringer, David Okaya, Li Zhao


Pegasus is used in the Laser Gravitational Wave Observatory (LIGO) project to map binary inspiral analysis workflows onto the Open Science Grid. A month of LIGO data requires many thousands of jobs, running for days on hundreds of CPUs. The figure on the right  illustrates the use of OSG for the LIGO workflows over the period of November 2006 to early January 2007. The figure was created using the Monalisa monitoring software used on OSG. The workflows were run across several OSG sites and used a total of 2.5 CPU years of computing over the period 2 months.

LIGO Scientists: Kent Blackburn, David Meyers, Michael Samidi (Caltech)


GADU - the Genome Analysis and Database Update system, has been using Pegasus for the past 2 years to perform high-throughout analysis and annotation of the genomics information that it regularly fuses from multiple public information sources, providing an integrated facility that supports research programs within DOE as well as public visitors to its web portal. GADU workflows are being run across the Open Science Grid and TeraGrid, applying tools such as BLAST, BLOCKS and PFAM to enrich the warehouse.

The graph on the left shows the number of GADU workflow jobs run on OSG for the past year April 2006-January 2007. (source OSG's Monalisa)

GADU scientists:  Natalia Maltsev, Alex Rodriguez, Dinanath Sulakhe, Elizabeth Marland, Veronika Nefedova (ANL)


A climate modeling application has used out tools to reduce the amount of time computations take.  Simulations which used to take 2.5 months to run manually, took only 2.5 days to run using our tools. In Automating "Climate Science: Large Ensemble Simulations on the TeraGrid with the GriPhyN Virtual Data System," Veronika Nefedova, Robert Jacob, Ian Foster, Zhengyu Liu, Yun Liu, Ewa Deelman, Gaurang Mehta, Mei-Hui Su, Karan Vahi, e-Science 2006, Amsterdam, December 4-6, 2006


Pegasus is also used in the Telescience project and portal to support 3D reconstruction of electron tomography images. The UCSD scientists plan to continue to rely on our workflow technologies to expand the set of Grid applications they support within their portal environment and to develop new techniques that can provide real-time feedback from the 3D reconstruction to the scientists manipulating the instrument.

Telescience Scientists: Mark Ellisman, Steven Peltier, Abel Lin (UCSD)


Data mining and natural language processing applications at USC/ISI are new user communities that are exploring the use of our workflow technologies to manage the large-scale computations on today’s cyberinfrastructure.