Responsible data management involves incorporating ethical and legal considerations across the life cycle of data collection, analysis, and use in all data-intensive systems, whether they involve machine learning and AI or not. Decisions during data collection and preparation profoundly impact the robustness, fairness, and interpretability of data-intensive systems. Experts must consider these earlier life cycle stages to improve data quality, control for bias, and allow humans to oversee the operation of these systems. Data alone is insufficient to distinguish between a distorted reflection of a perfect world, a perfect reflection of a distorted world, or a combination of both. The assumed or externally verified nature of the distortions must be explicitly stated to allow experts to decide whether and how to mitigate their effects.
ASJC Scopus subject areas
- Computer Science(all)