Revealing the Detailed Lineage of Script Outputs Using Hybrid ProvenanceCitation formats

  • Authors:
  • Qian Zhang
  • Yang Cao
  • Qiwen Wang
  • Duc Vu
  • Priyaa Thavasimani
  • And 6 others
  • External authors:
  • Timothy Mcphillips
  • Paolo Missier
  • Peter Slaughter
  • Christopher Jones
  • Mathew B. Jones
  • Bertram Ludäscher

Standard

Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance. / Zhang, Qian; Cao, Yang; Wang, Qiwen; Vu, Duc; Thavasimani, Priyaa; Mcphillips, Timothy; Missier, Paolo; Slaughter, Peter; Jones, Christopher; Jones, Mathew B.; Ludäscher, Bertram.

In: International Journal of Digital Curation, Vol. 12, No. 2, 13.08.2018, p. 390-408.

Research output: Contribution to journalArticlepeer-review

Harvard

Zhang, Q, Cao, Y, Wang, Q, Vu, D, Thavasimani, P, Mcphillips, T, Missier, P, Slaughter, P, Jones, C, Jones, MB & Ludäscher, B 2018, 'Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance', International Journal of Digital Curation, vol. 12, no. 2, pp. 390-408. https://doi.org/10.2218/ijdc.v12i2.585

APA

Zhang, Q., Cao, Y., Wang, Q., Vu, D., Thavasimani, P., Mcphillips, T., Missier, P., Slaughter, P., Jones, C., Jones, M. B., & Ludäscher, B. (2018). Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance. International Journal of Digital Curation, 12(2), 390-408. https://doi.org/10.2218/ijdc.v12i2.585

Vancouver

Zhang Q, Cao Y, Wang Q, Vu D, Thavasimani P, Mcphillips T et al. Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance. International Journal of Digital Curation. 2018 Aug 13;12(2):390-408. https://doi.org/10.2218/ijdc.v12i2.585

Author

Zhang, Qian ; Cao, Yang ; Wang, Qiwen ; Vu, Duc ; Thavasimani, Priyaa ; Mcphillips, Timothy ; Missier, Paolo ; Slaughter, Peter ; Jones, Christopher ; Jones, Mathew B. ; Ludäscher, Bertram. / Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance. In: International Journal of Digital Curation. 2018 ; Vol. 12, No. 2. pp. 390-408.

Bibtex

@article{97e24568531e42c0ba71f310d87dc0a0,
title = "Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance",
abstract = "We illustrate how combining retrospective and prospectiveprovenance can yield scientifically meaningfulhybrid provenance representations of the computational histories of data produced during a script run. Weuse scripts from multiple disciplines (astrophysics, climate science, biodiversity data curation, and social network analysis), implemented in Python, R, and MATLAB, to highlight the usefulness of diverse forms of retrospective provenance when coupled with prospective provenance. Users provide prospective provenance, i.e., the conceptual workflows latent in scripts, via simple YesWorkflow annotations, embedded as script comments. Runtime observables can be linked to prospective provenance via relational views and queries. These observables could be found hidden in filenames or folder structures, be recorded in log files, or they can be automatically captured using tools such as noWorkflow or the DataONE RunManagers. The YesWorkflow toolkit, example scripts, and demonstration code are available via an open source repository.",
author = "Qian Zhang and Yang Cao and Qiwen Wang and Duc Vu and Priyaa Thavasimani and Timothy Mcphillips and Paolo Missier and Peter Slaughter and Christopher Jones and Jones, {Mathew B.} and Bertram Lud{\"a}scher",
year = "2018",
month = aug,
day = "13",
doi = "10.2218/ijdc.v12i2.585",
language = "English",
volume = "12",
pages = "390--408",
journal = "International Journal on Data Curation",
issn = "1746-8256",
publisher = "University of Edinburgh",
number = "2",

}

RIS

TY - JOUR

T1 - Revealing the Detailed Lineage of Script Outputs Using Hybrid Provenance

AU - Zhang, Qian

AU - Cao, Yang

AU - Wang, Qiwen

AU - Vu, Duc

AU - Thavasimani, Priyaa

AU - Mcphillips, Timothy

AU - Missier, Paolo

AU - Slaughter, Peter

AU - Jones, Christopher

AU - Jones, Mathew B.

AU - Ludäscher, Bertram

PY - 2018/8/13

Y1 - 2018/8/13

N2 - We illustrate how combining retrospective and prospectiveprovenance can yield scientifically meaningfulhybrid provenance representations of the computational histories of data produced during a script run. Weuse scripts from multiple disciplines (astrophysics, climate science, biodiversity data curation, and social network analysis), implemented in Python, R, and MATLAB, to highlight the usefulness of diverse forms of retrospective provenance when coupled with prospective provenance. Users provide prospective provenance, i.e., the conceptual workflows latent in scripts, via simple YesWorkflow annotations, embedded as script comments. Runtime observables can be linked to prospective provenance via relational views and queries. These observables could be found hidden in filenames or folder structures, be recorded in log files, or they can be automatically captured using tools such as noWorkflow or the DataONE RunManagers. The YesWorkflow toolkit, example scripts, and demonstration code are available via an open source repository.

AB - We illustrate how combining retrospective and prospectiveprovenance can yield scientifically meaningfulhybrid provenance representations of the computational histories of data produced during a script run. Weuse scripts from multiple disciplines (astrophysics, climate science, biodiversity data curation, and social network analysis), implemented in Python, R, and MATLAB, to highlight the usefulness of diverse forms of retrospective provenance when coupled with prospective provenance. Users provide prospective provenance, i.e., the conceptual workflows latent in scripts, via simple YesWorkflow annotations, embedded as script comments. Runtime observables can be linked to prospective provenance via relational views and queries. These observables could be found hidden in filenames or folder structures, be recorded in log files, or they can be automatically captured using tools such as noWorkflow or the DataONE RunManagers. The YesWorkflow toolkit, example scripts, and demonstration code are available via an open source repository.

U2 - 10.2218/ijdc.v12i2.585

DO - 10.2218/ijdc.v12i2.585

M3 - Article

VL - 12

SP - 390

EP - 408

JO - International Journal on Data Curation

JF - International Journal on Data Curation

SN - 1746-8256

IS - 2

ER -