I. Altintas, O. Barney, and E. Jaeger-frank, Provenance collection support in the kepler scientific workflow system, International Provenance and Annotation Workshop, pp.118-132, 2006.

S. Artzet, N. Brichet, J. Chopard, M. Mielewczik, C. Fournier et al., Openalea.phenomenal: A workflow for plant phenotyping, 2018.

,

S. P. Callahan, J. Freire, E. Santos, C. E. Scheidegger, C. T. Silva et al., Vistrails: visualization meets data management, ACM SIGMOD Int. Conf. on Management of Data (SIGMOD), pp.745-747, 2006.

S. Crago, K. Dunn, P. Eads, L. Hochstein, D. I. Kang et al., Heterogeneous cloud computing, 2011 IEEE International Conference on Cluster Computing, pp.378-385, 2011.

D. Garijo, P. Alper, K. Belhajjame, O. Corcho, Y. Gil et al., Common motifs in scientific workflows: An empirical analysis, Future Generation Computer Systems (FGCS), vol.36, pp.338-351, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01342933

G. Heidsieck, D. De-oliveira, E. Pacitti, C. Pradal, F. Tardieu et al., Adaptive caching for data-intensive scientific workflows in the cloud, Int. Conf. on Database and Expert Systems Applications (DEXA), pp.452-466, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02174445

S. Kelling, W. M. Hochachka, D. Fink, M. Riedewald, R. Caruana et al., Data-intensive science: a new paradigm for biodiversity studies, BioScience, vol.59, issue.7, pp.613-620, 2009.

J. Liu, L. P. Morales, E. Pacitti, A. Costan, P. Valduriez et al., Efficient scheduling of scientific workflows using hot metadata in a multisite cloud, IEEE Trans. on Knowledge and Data Engineering, pp.1-20, 2018.
URL : https://hal.archives-ouvertes.fr/lirmm-01620231

J. Liu, E. Pacitti, P. Valduriez, and M. Mattoso, A survey of data-intensive scientific workflow management, Journal of Grid Computing, vol.13, issue.4, pp.457-493, 2015.
URL : https://hal.archives-ouvertes.fr/lirmm-01144760

J. Liu, E. Pacitti, P. Valduriez, D. De-oliveira, and M. Mattoso, Multi-objective scheduling of scientific workflows in multisite clouds, Future Generation Computer Systems(FGCS), vol.63, pp.76-95, 2016.
URL : https://hal.archives-ouvertes.fr/lirmm-01342203

K. Maheshwari, E. Jung, J. Meng, V. Vishwanath, and R. Kettimuthu, Improving multisite workflow performance using model-based scheduling, IEEE nt. Conf. on Parallel Processing (ICPP), pp.131-140, 2014.

D. De-oliveira, F. A. Baião, and M. Mattoso, Towards a taxonomy for cloud computing from an e-science perspective, pp.47-62, 2010.

M. T. Özsu and P. Valduriez, Principles of Distributed Database Systems, Fourth Edition, 2020.

C. Pradal, C. Fournier, P. Valduriez, and S. Cohen-boulakia, Openalea: scientific workflows combining data analysis and simulation, Int. Conf. on Scientific and Statistical Database Management (SSDBM), vol.11, pp.1-11, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01166298

F. Tardieu, L. Cabrera-bosquet, T. Pridmore, and M. Bennett, Plant phenomics, from sensors to knowledge, Current Biology, vol.27, issue.15, pp.770-783, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01608414

D. Yuan, Y. Yang, X. Liu, W. Li, L. Cui et al., A highly practical approach toward achieving minimum data sets storage cost in the cloud, IEEE Trans. on Parallel and Distributed Systems, vol.24, issue.6, pp.1234-1244, 2013.

J. Zhang, J. Luo, and F. Dong, Scheduling of scientific workflow in non-dedicated heterogeneous multicluster platform, Journal of Systems and Software, vol.86, issue.7, pp.1806-1818, 2013.