Can we automate data quality to support machine learning?