Workflow gallery
The workflow gallery is a collection of archived workflow runs on distributed resources. It is divided into three main pages.
The main page lists the various types of workflows and provides a description of each workflow type. Selecting a particular
workflow type takes you to the workflow type page, which shows all the runs of that workflow type along with a short summary of the workflow.
The summary contains the following details. For workflows having sub workflow jobs (i.e. SUBDAG and SUBDAX jobs), each value is computed
across all sub workflows, so it includes jobs from the sub workflows as well.
Workflow runtime(min,sec) : | the walltime from the start of the workflow execution to the end, as reported by DAGMan. If a rescue DAG was used, the value is the cumulative total over all retries. |
---|---|
Cumulative workflow runtime(min,sec): | the sum of the walltime of all jobs, as reported by DAGMan. If jobs were retried, the value is the cumulative total over all retries. |
Total jobs run : | the total number of jobs run during the workflow run. For a failed workflow, the number of jobs run can be less than the total number of jobs in the planned workflow. This is a runtime view of the workflow. |
# jobs succeeded : | the total number of succeeded jobs during the workflow run |
# jobs failed : | the total number of failed jobs during the workflow run |
# jobs unsubmitted : | the total number of unsubmitted jobs during the workflow run |
# jobs unknown : | the total number of unknown jobs during the workflow run |
# Total tasks succeeded : | the total number of succeeded tasks |
# Total tasks failed : | the total number of failed tasks |
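The difference between the two runtime values can be illustrated with a minimal sketch. It assumes the start and end timestamps of each DAGMan run and of each job attempt are already available as plain numbers in seconds; the record layout, function names, and sample values are hypothetical and are not taken from the gallery's own code.

```python
from dataclasses import dataclass


@dataclass
class JobAttempt:
    """One attempt (first try or retry) of a job. Hypothetical record layout."""
    job: str
    start: float  # seconds
    end: float    # seconds


def workflow_runtime(dagman_runs):
    """Sum the walltime of every DAGMan run (the initial run plus rescue DAG retries)."""
    return sum(end - start for start, end in dagman_runs)


def cumulative_runtime(attempts):
    """Sum the walltime of every job attempt, retries included."""
    return sum(a.end - a.start for a in attempts)


def as_min_sec(seconds):
    """Convert seconds to a (min, sec) pair, as shown in the summary."""
    return divmod(int(round(seconds)), 60)


# Illustrative data: the workflow ran once, failed, and was resumed from a rescue DAG.
dagman_runs = [(0, 120), (300, 500)]
attempts = [
    JobAttempt("a", 0, 60),
    JobAttempt("b", 60, 100),    # first try of b
    JobAttempt("b", 300, 350),   # retry of b in the rescue run
    JobAttempt("c", 300, 440),   # runs in parallel with the retry of b
]

print(as_min_sec(workflow_runtime(dagman_runs)))   # (5, 20)
print(as_min_sec(cumulative_runtime(attempts)))    # (4, 50)
```

Because jobs can run in parallel, the cumulative value can exceed the workflow runtime; in this example it is smaller only because the workflow spent time with no job running.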
Selecting a particular run takes you to the page that contains all the information about that workflow. The workflow page contains the following details.
Workflow execution details
For workflows having sub workflow jobs (i.e. SUBDAG and SUBDAX jobs), each sub workflow job is counted as a single job; the parent workflow does not recursively include job information from its sub workflows.
Workflow runtime(min,sec) : | the walltime from the start of the workflow execution to the end, as reported by DAGMan. If a rescue DAG was used, the value is the cumulative total over all retries. |
---|---|
Cumulative workflow runtime(min,sec): | the sum of the walltime of all jobs, as reported by DAGMan. If jobs were retried, the value is the cumulative total over all retries. |
Total jobs run : | the total number of jobs run during the workflow run. For a failed workflow, the number of jobs run can be less than the total number of jobs in the planned workflow. This is a runtime view of the workflow. |
# jobs succeeded : | the total number of succeeded jobs during the workflow run |
# jobs failed : | the total number of failed jobs during the workflow run |
# jobs unsubmitted : | the total number of unsubmitted jobs during the workflow run |
# jobs unknown : | the total number of unknown jobs during the workflow run |
# Total tasks succeeded : | the total number of succeeded tasks |
# Total tasks failed : | the total number of failed tasks |
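The workflow type summary aggregates across sub workflows, while the workflow page treats each sub workflow job as a single job. The sketch below contrasts the two ways of counting. The `Job` and `Workflow` classes and the `count_states` function are hypothetical, and counting the sub workflow job itself in the aggregated view is an assumption made for illustration.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Job:
    name: str
    state: str  # "succeeded", "failed", "unsubmitted" or "unknown"
    sub_workflow: Optional["Workflow"] = None  # set for SUBDAG/SUBDAX jobs


@dataclass
class Workflow:
    jobs: List[Job] = field(default_factory=list)


def count_states(workflow, recurse):
    """Count jobs per state; descend into sub workflows only if recurse is True."""
    counts = {"succeeded": 0, "failed": 0, "unsubmitted": 0, "unknown": 0}
    for job in workflow.jobs:
        counts[job.state] += 1
        if recurse and job.sub_workflow is not None:
            # Add the counts of the nested workflow to the parent's totals.
            for state, n in count_states(job.sub_workflow, True).items():
                counts[state] += n
    return counts


inner = Workflow([Job("x", "succeeded"), Job("y", "failed")])
outer = Workflow([Job("prep", "succeeded"), Job("sub", "failed", inner)])

page_view = count_states(outer, recurse=False)    # sub workflow job counted once
summary_view = count_states(outer, recurse=True)  # jobs of the sub workflow included
print(page_view)
print(summary_view)
```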
Workflow execution environment
The workflow execution environment section contains the details from the braindump file. It includes information such as the DAX label, DAG label, submit directory, and Pegasus home environment variables.
Job statistics
Job statistics contains the following details about the jobs in the workflow.
Job : | the name of the job |
---|---|
Site : | the site where the job ran. |
Kickstart(sec.) : | the actual duration of the job in seconds on the remote compute node. In case of retries, the value is the cumulative total over all retries. |
Post(sec.) : | the postscript time as reported by DAGMan. In case of retries, the value is the cumulative total over all retries. |
DAGMan(sec.) : | the time between the completion of the job's last parent job and the submission of the job. In case of retries, the value of the last retry is used for the calculation. |
CondorQTime(sec.) : | the time between submission by DAGMan and the remote Grid submission. It is an estimate of the time spent in the Condor queue on the submit node. In case of retries, the value is the cumulative total over all retries. |
Resource(sec.) : | the time between the remote Grid submission and the start of remote execution. It is an estimate of the time the job spent in the remote queue. In case of retries, the value is the cumulative total over all retries. |
Runtime(sec.) : | the time spent on the resource as seen by Condor DAGMan. It is always >= the Kickstart value. In case of retries, the value is the cumulative total over all retries. |
Seqexec(sec.) : | the time taken for the completion of a clustered job. In case of retries, the value is the cumulative total over all retries. |
Seqexec-Delay(sec.) : | the difference between the completion time of a clustered job and the sum of the kickstart times of all its individual tasks. In case of retries, the value is the cumulative total over all retries. |
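The delay columns above are differences between consecutive events in the life of a job. The following sketch shows that arithmetic for a single attempt, assuming the event timestamps are available in seconds. The `JobEvents` fields and function names are hypothetical; for a retried job, the per-attempt values would be summed, except for the DAGMan delay, which uses the last retry.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class JobEvents:
    """Timestamps (seconds) for one attempt of a job. Hypothetical field names."""
    last_parent_end: float  # completion of the job's last parent
    submit: float           # submission by DAGMan
    grid_submit: float      # remote Grid submission
    execute: float          # start of remote execution
    terminated: float       # job termination as seen by DAGMan


def job_delays(ev):
    """Derive the per-job delay columns from one attempt's events."""
    return {
        "DAGMan": ev.submit - ev.last_parent_end,
        "CondorQTime": ev.grid_submit - ev.submit,
        "Resource": ev.execute - ev.grid_submit,
        "Runtime": ev.terminated - ev.execute,
    }


def seqexec_delay(seqexec: float, kickstart_times: List[float]) -> float:
    """Clustered job completion time minus the sum of its tasks' kickstart times."""
    return seqexec - sum(kickstart_times)


delays = job_delays(JobEvents(100, 105, 112, 140, 200))
print(delays)  # {'DAGMan': 5, 'CondorQTime': 7, 'Resource': 28, 'Runtime': 60}
print(seqexec_delay(50, [10, 15, 20]))  # 5
```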
Task statistics
Task statistics contains the following details about the transformations in the workflow.
Transformation : | name of the transformation. |
---|---|
Count : | the number of times the transformation was executed. |
Succeeded : | the number of times the transformation execution succeeded. |
Failed : | the number of times the transformation execution failed. |
Mean(sec.) : | the mean of the transformation runtime. |
Variance(sec.) : | the variance of the transformation runtime. Variance is calculated using the on-line algorithm by Knuth (http://en.wikipedia.org/wiki/Algorithms_for_calculating_variance). |
Min(sec.) : | the minimum transformation runtime value. |
Max(sec.) : | the maximum transformation runtime value. |
Total(sec.) : | the cumulative transformation runtime. |
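The on-line variance algorithm referenced above (commonly called Welford's algorithm, as presented by Knuth) updates the mean and variance one value at a time, without storing all runtimes. The sketch below is a generic version of that technique, not the gallery's own code; the class name is hypothetical, and the use of the sample variance (dividing by n - 1) is an assumption, since the text does not say which form is reported.

```python
class RunningStats:
    """Single-pass statistics for the runtimes of one transformation."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0   # running sum of squared deviations from the mean
        self.min = None
        self.max = None
        self.total = 0.0

    def add(self, runtime):
        """Fold one runtime (seconds) into the statistics."""
        self.count += 1
        delta = runtime - self.mean
        self.mean += delta / self.count
        # Uses the deviation from the old mean times the deviation from the new mean.
        self._m2 += delta * (runtime - self.mean)
        self.min = runtime if self.min is None else min(self.min, runtime)
        self.max = runtime if self.max is None else max(self.max, runtime)
        self.total += runtime

    @property
    def variance(self):
        """Sample variance (n - 1 denominator, an assumption); 0.0 for fewer than two values."""
        return self._m2 / (self.count - 1) if self.count > 1 else 0.0


stats = RunningStats()
for runtime in [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]:
    stats.add(runtime)

print(stats.count, stats.mean, stats.min, stats.max, stats.total)
print(stats.variance)
```

Compared with the textbook sum-of-squares formula, this update avoids subtracting two large, nearly equal numbers, which is why it is preferred for numerical stability.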
Dax graph
Graph image of the DAX file.
Dag graph
Graph image of the DAG file.
Workflow execution gantt chart
The toolbar at the top provides zoom in/out, pan left/right/up/down, and show/hide job name functionality. The toolbar at the bottom can be used to show or hide job states. A failed job is shown with a red border. Clicking on a sub workflow job takes you to the corresponding sub workflow.
Host over time chart
The toolbar at the top provides zoom in/out, pan left/right/up/down, and show/hide host name functionality. The toolbar at the bottom can be used to show or hide job states. A failed job is shown with a red border. Clicking on a sub workflow job takes you to the corresponding sub workflow.