Towards a Quality Model for Effective Data Selection in Collaboratories

  • Yogesh Simmhan ,
  • Dennis Gannon

International Conference on Data Engineering Workshops (ICDEW) - Scientific Workflows and Dataflows (SciFlow) |

Published by IEEE Computer Society

Data-driven scientific applications utilize workflow frameworks to execute complex dataflows, resulting in derived data products of unknown quality. We discuss our on-going research on a quality model that provides users with an integrated estimate of the data quality that is tuned to their application needs and is available as a numerical quality score that enables uniform comparison of datasets, providing a way for the community to trust derived