Publications

2017

Glossary, A. Rokem and F. Chirigati. In J. Kitzes, D. Turek, and F. Deniz (Eds.), The Practice of Reproducible Research: Case Studies and Lessons from the Data-Intensive Sciences, 2017
To Be Published
[GitBook]

Provenance and Reproducibility, F. Chirigati and J. Freire. In L. Liu and M. T. Özsu (Eds.), Encyclopedia of Database Systems, 2017
[entry]

Querying and Exploring Polygamous Relationships in Urban Spatio-Temporal Data Sets, Y. Chan, F. Chirigati, H. Doraiswamy, C. Silva and J. Freire. In Proceedings of the 2017 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 1643-1646, 2017
Honorable Mention, SIGMOD Best Demonstration Award
[paper] [preprint]

HESML: A Scalable Ontology-based Semantic Similarity Measures Library with a Set of Reproducible Experiments and a Replication Dataset, J. Lastra-Díaz, A. García-Serrano, M. Batet, M. Fernández, and F. Chirigati. In Information Systems, vol. 66, pp. 97-118, 2017
Reproducibility Paper, Reviewer
[paper]

2016

ReproZip: The Reproducibility Packer, R. Rampin, F. Chirigati, D. Shasha, J. Freire, and V. Steeves. In Journal of Open Source Software (JOSS), 2016
[paper]

Knowledge Exploration Using Tables on the Web, F. Chirigati, J. Liu, F. Korn, Y. Wu, C. Yu, and H. Zhang. In Proceedings of the VLDB Endowment (PVLDB), 10(3), pp. 193-204, 2016
[paper]

Data Polygamy: The Many-Many Relationships among Urban Spatio-Temporal Data Sets, F. Chirigati, H. Doraiswamy, T. Damoulas, and J. Freire. In Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 1011-1025, 2016
SIGMOD Most Reproducible Paper Award
We have made the plots reproducible using ReproZip -- check it out!
[paper] [preprint] [arXiv] [presentation] [poster] [source code]

ReproZip: Computational Reproducibility With Ease, F. Chirigati, R. Rampin, D. Shasha, and J. Freire. In Proceedings of the 2016 ACM SIGMOD International Conference on Management of Data (SIGMOD), pp. 2085-2088, 2016
[paper] [preprint] [poster]

Virtual Lightweight Snapshots for Consistent Analytics in NoSQL Stores, F. Chirigati, J. Siméon, M. Hirzel, and J. Freire. In Proceedings of the 32nd International Conference on Data Engineering (ICDE), pp. 1310-1321, 2016
We have made the plots reproducible using ReproZip -- check it out!
[paper] [preprint] [presentation] [source code]

Exploring What not to Clean in Urban Data: A Study Using New York City Taxi Trips, J. Freire, A. Bessa, F. Chirigati, H. T. Vo, and K. Zhao. In IEEE Data Engineering Bulletin, 39(2), pp. 63-77, 2016
[paper]

A Collaborative Approach to Computational Reproducibility, F. Chirigati, R. Capone, R. Rampin, J. Freire, and D. Shasha. In Information Systems, vol. 59, pp. 95-97, 2016
[editorial]

Reproducible Experiments on Dynamic Resource Allocation in Cloud Data Centers, A. Wolke, M. Bichler, F. Chirigati, and V. Steeves. In Information Systems, vol. 59, pp. 98-101, 2016
Reproducibility Paper, Reviewer
[paper] [arXiv]

2015

noWorkflow: Capturing and Analyzing Provenance of Scripts, L. Murta, V. Braganholo, F. Chirigati, D. Koop, and J. Freire. In Provenance and Annotation of Data and Processes, vol. 8628, Lecture Notes in Computer Science (LNCS), pp. 71-83, Springer International Publishing, 2015
[paper]

YesWorkflow: A User-Oriented, Language-Independent Tool for Recovering Workflow Information from Scripts, T. McPhillips, T. Song, T. Kolisnik, S. Aulenbach, K. Belhajjame, R. Kyle Bocinsky, Y. Cao, J. Cheney, F. Chirigati, S. Dey, J. Freire, C. Jones, J. Hanken, K. W. Kintigh, T. A. Kohler, D. Koop, J. A. Macklin, P. Missier, M. Schildhauer, C. Schwalm, Y. Wei, M. Bieda, B. Ludäscher. In International Journal of Digital Curation (IJDC), 10(1), pp. 298-313, 2015
[paper] [arXiv]

2014

The More the Merrier: Efficient Multi-Source Graph Traversal, M. Then, M. Kaufmann, F. Chirigati, T. Hoang-Vu, K. Pham, A. Kemper, T. Neumann, and H. T. Vo. In Proceedings of the VLDB Endowment (PVLDB), 8(4), pp. 449-460, 2014
[paper]

Provenance Storage, Querying, and Visualization in PBase, V. Cuevas-Vicenttín, P. Kianmajd, B. Ludäscher, P. Missier, F. Chirigati, Y. Wei, D. Koop, and S. Dey. In Proceedings of the International Provenance and Annotation Workshop 2014 (IPAW), Poster Session, 2014
[paper] [poster]

Reproducibility Using VisTrails, J. Freire, D. Koop, F. Chirigati, and C. Silva. In V. Stodden, F. Leisch, and R. Peng (Eds.), Implementing Reproducible Computational Research (The R Series), 2014

The PBase Scientific Workflow Provenance Repository, V. Cuevas-Vicenttín, P. Kianmajd, B. Ludäscher, P. Missier, F. Chirigati, Y. Wei, D. Koop, and S. Dey. In International Journal of Digital Curation (IJDC), 9(2), pp. 28-38, 2014
[paper]

2013

A Computational Reproducibility Benchmark, F. Chirigati, M. Troyer, D. Shasha, and J. Freire. In IEEE Data Engineering Bulletin, 36(4), pp. 54-59, 2013
[paper]

Chiron: A Parallel Engine for Algebraic Scientific Workflows, E. Ogasawara, J. Dias, V. Souza, F. Chirigati, D. Oliveira, F. Porto, P. Valduriez, and M. Mattoso. In Journal of Concurrency and Computation: Practice and Experience, 25(16), pp. 2327-2341, 2013
[paper]

Packing Experiments for Sharing and Publication, F. Chirigati, D. Shasha, and J. Freire. In Proceedings of the 2013 International Conference on Management of Data (SIGMOD), pp. 977-980, 2013
[paper] [poster]

ReproZip: Using Provenance to Support Computational Reproducibility, F. Chirigati, D. Shasha, and J. Freire. In Proceedings of the 5th USENIX conference on Theory and Practice of Provenance (TaPP), 2013
[paper] [presentation]

VisTrails Provenance Traces for Benchmarking, F. Chirigati, D. Koop, J. Freire, and C. Silva. In Proceedings of the 2013 Joint EDBT/ICDT Workshops, pp. 323-324, 2013
[paper]

2012

Towards Integrating Workflow and Database Provenance, F. Chirigati and J. Freire. In Provenance and Annotation of Data and Processes, vol. 7525, Lecture Notes in Computer Science (LNCS), pp. 11-23, Springer Berlin / Heidelberg, 2012
[paper] [presentation]

Evaluating Parameter Sweep Workflows in High Performance Computing, F. Chirigati, V. Souza, E. Ogasawara, D. Oliveira, J. Dias, F. Porto, P. Valduriez, and M. Mattoso. In Proceedings of the 1st International Workshop on Scalable Workflow Enactment Engines and Technologies (SWEET), article 2, 2012
[paper] [presentation]

2011

Similarity-Based Workflow Clustering, V. Souza, F. Chirigati, K. Maia, E. Ogasawara, D. Oliveira, V. Braganholo, L. Murta, and M. Mattoso. In Journal of Computational Interdisciplinary Sciences, vol. 2, pp. 23-35, 2011

2010

GExpLine: A Tool for Supporting Experiment Composition, D. Oliveira, E. Ogasawara, F. Chirigati, V. Souza, L. Murta, and M. Mattoso. In Provenance and Annotation of Data and Processes, vol. 6378, Lecture Notes in Computer Science (LNCS), pp. 251-259, 2010

2009

A Conception Process for Abstract Workflows: An Example on Deep Water Oil Exploitation Domain, W. Martinho, E. Ogasawara, D. Oliveira, F. Chirigati, F. Correa, B. Jacob, I. Santos, G. H. Travassos, and M. Mattoso. In Proceedings of the 5th IEEE International Conference on e-Science – Poster Session, 2009

Scientific Workflow Management System Applied to Uncertainty Quantification in Large Eddy Simulation, G. Guerra, F. Rochinha, R. Elias, A. Coutinho, V. Braganholo, D. Oliveira, E. Ogasawara, F. Chirigati, and M. Mattoso. In Proceedings of the 30th Iberian-Latin-American Congress on Computational Methods in Engineering (CILAMCE), 2009

Exploring Many Task Computing in Scientific Workflows, E. Ogasawara, D. Oliveira, F. Chirigati, C. E. Barbosa, R. Elias, V. Braganholo, A. Coutinho, and M. Mattoso. In Proceedings of the 2nd Workshop on Many-Task Computing on Grids and Supercomputers, International Conference for High Performance, Networking, Storage and Analysis (SC), 2009
[paper]

2008

Using Explicit Control Processes in Distributed Workflows to Gather Provenance, S. M. S. Cruz, F. Chirigati, R. Dahis, M. L. M. Campos, and M. Mattoso. In Provenance and Annotation of Data and Processes, vol. 5272, Lecture Notes in Computer Science (LNCS), pp. 186-199, Springer Berlin / Heidelberg, 2008