Applications
Pegasus is being used in a broad range of
applications. This pages shows some of the examples.
We are looking for new applications willing to leverage our workflow technologies. If you are interested please contact us at pegasus at isi dot edu
Pegasus
is used in astronomy, and in particular in the
Montage application which
delivers science-grade mosaics of the sky. Our technologies were used to
transform a single-processor Montage code into a complex workflow and
parallelized computations to process larger-scale images. Montage workflows
mapped by Pegasus to the NSF CyberInfrastructure are characterized by tens of
thousands of executable tasks and the processing of thousands of images. The
image on the right (Beaton
et al. Ap J Lett in press) was recently created to verify a bar in
the spiral galaxy M31. Eleven major projects and surveys worldwide, such as
the Spitzer Space Telescope Legacy teams have integrated Montage and therefore
Pegasus into their pipelines and processing environments to generate science and
browse products for dissemination to the astronomy community.
For other Montage success stories please visit:
http://montage.ipac.caltech.edu/applications.html
Montage Scientists: Bruce Berriman, John Good, (IPAC) Dan Katz (LSU), and Joe Jacobs (Caltech)
The
Southern California Earthquake Center (SCEC)
uses our workflow technologies to produce more accurate seismic hazard maps.
These maps, generated as part of the SCEC
CyberShake
project, indicate the maximum amount of shaking expected at a particular
geographic location over a certain period of time. The hazard maps are used by
civil engineers to determine building design tolerances. Pegasus maps the
CyberShake workflows onto SCEC and NSF CyberInfrastructure resources. The
figure on the left shows the results of running CyberShake on the
TeraGrid in the fall of 2005. The
workflows ran over a period of 23 days and processed 20TB of data using 1.8 CPU
Years. The total number of tasks in the workflows was 261,823. CyberShake
delivers new insights into how rupture directivity and sedimentary basin
effects contribute to the shaking experienced at different geographic locations.
As a result more accurate hazard maps can be created.
SCEC is also using Pegasus and DAGMan in the Earthworks Portal, a TeraGrid Science Gateway, hosted at Washington University that allows users to configure and execute earthquake wave propagation simulations structured as workflows through a simple portal interface.
David Okaya's of SCEC has an interesting slide on Benefits of Scientifc Workflows for an application scientists perspective.
SCEC Scientists: Thomas H. Jordan, Scott Callaghan, Edward Field, Hunter Francoeur, Robert Graves, Nitin Gupta, Vipin Gupta, Philip Maechling, John Mehringer, David Okaya, Li Zhao
Pegasus
is used in the Laser Gravitational
Wave Observatory (LIGO) project to map binary inspiral analysis
workflows onto the Open Science Grid. A month of LIGO data requires many
thousands of jobs, running for days on hundreds of CPUs. The figure on the
right illustrates the use of OSG for the LIGO workflows over the period of
November 2006 to early January 2007. The figure was created using the
Monalisa
monitoring software used on OSG. The workflows were run across several OSG sites
and used a total of 2.5 CPU years of computing over the period 2 months.
LIGO Scientists: Kent Blackburn, David Meyers, Michael Samidi (Caltech)
GADU
- the Genome Analysis and Database Update system, has been using Pegasus for the
past 2 years to perform high-throughout analysis and annotation of the genomics
information that it regularly fuses from multiple public information sources,
providing an integrated facility that supports research programs within DOE as
well as public visitors to its web portal. GADU workflows are being run across
the Open Science Grid and TeraGrid, applying tools such as BLAST, BLOCKS and
PFAM to enrich the warehouse.
The graph on the left shows the number of GADU workflow jobs run on OSG for the past year April 2006-January 2007. (source OSG's Monalisa)
GADU scientists: Natalia Maltsev, Alex Rodriguez, Dinanath Sulakhe, Elizabeth Marland, Veronika Nefedova (ANL)
A climate modeling application has used out tools to reduce the amount of time computations take. Simulations which used to take 2.5 months to run manually, took only 2.5 days to run using our tools. In Automating "Climate Science: Large Ensemble Simulations on the TeraGrid with the GriPhyN Virtual Data System," Veronika Nefedova, Robert Jacob, Ian Foster, Zhengyu Liu, Yun Liu, Ewa Deelman, Gaurang Mehta, Mei-Hui Su, Karan Vahi, e-Science 2006, Amsterdam, December 4-6, 2006
Pegasus is also used in the Telescience project and portal to support 3D reconstruction of electron tomography images. The UCSD scientists plan to continue to rely on our workflow technologies to expand the set of Grid applications they support within their portal environment and to develop new techniques that can provide real-time feedback from the 3D reconstruction to the scientists manipulating the instrument.
Telescience Scientists: Mark Ellisman, Steven Peltier, Abel Lin (UCSD)
Data mining and natural language processing applications at USC/ISI are new user communities that are exploring the use of our workflow technologies to manage the large-scale computations on today’s cyberinfrastructure.
