The correct answer is C: a CSV file that consists of measurement data, which is a classic example of structured data. According to AWS documentation, structured data is organized according to a predefined schema and is typically stored in tabular formats with defined rows, columns, and data types. CSV files, relational databases, and spreadsheets fall into this category because their structure allows deterministic querying and processing. Measurement data in a CSV file can be analyzed directly with SQL, Amazon Athena, Amazon Redshift, or SageMaker Data Wrangler.

In contrast, options A and D (text comments and transcriptions) are unstructured text, which typically requires NLP techniques to process. Option B (video files) is unstructured multimedia data, which typically requires computer vision. Structured data is widely used for supervised ML models, business analytics, and statistical modeling. AWS emphasizes that structured data provides the simplest path to feature engineering and model training because of its consistent format.
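To illustrate why a measurement CSV counts as structured data, here is a minimal sketch using only the Python standard library. The column names (`sensor_id`, `timestamp`, `temperature_c`) and the sample rows are hypothetical and do not come from the question; the point is only that a fixed set of columns lets you run a deterministic, SQL-like aggregation without any NLP or computer vision step.

```python
import csv
import io
from statistics import mean

# Hypothetical measurement data; in practice this would be read from a file.
CSV_TEXT = """sensor_id,timestamp,temperature_c
s1,2024-01-01T00:00:00,20.5
s2,2024-01-01T00:00:00,22.0
s1,2024-01-01T01:00:00,21.5
"""


def mean_temperature_by_sensor(csv_text):
    """Group rows by sensor_id and average temperature_c (like a SQL GROUP BY)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    readings = {}
    for row in reader:
        # Every row has the same named columns, so each field can be
        # addressed by name and converted to a known type.
        readings.setdefault(row["sensor_id"], []).append(float(row["temperature_c"]))
    return {sensor: mean(values) for sensor, values in readings.items()}


result = mean_temperature_by_sensor(CSV_TEXT)
print(result)
```

The same aggregation could be written as a `GROUP BY` query in a SQL engine. Free-text comments, transcriptions, or video files offer no comparable column to group or average on until features have been extracted from them.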
Referenced AWS Documentation:
AWS Data Analytics Whitepaper – Structured vs. Unstructured Data
Amazon SageMaker Data Wrangler Documentation