Imputing missing values using the mean, median, or mode is a common technique to fill blanks in datasets. However, according to the AAIA™ Study Guide, this method introduces a significant risk of information distortion, especially if the data is not normally distributed or if imputed values disproportionately impact underrepresented groups.
“Simple imputation can reduce data variability and reinforce existing biases. It may also misrepresent the true distribution of the data, leading to skewed model outputs.”
Other options (B, C, D) may involve transformations, but they do not inherently cause as much bias or data misrepresentation as A.
[Reference: ISACA Advanced in AI Audit™ (AAIA™) Study Guide, Section: “AI Fundamentals and Technologies,” Subsection: “Data Imputation and Transformation Risks”, ]
Submit