PMI Cognitive Project Management in AI CPMAI v7 - Training & Certification CPMAI_v7 Question # 17 Topic 2 Discussion
CPMAI_v7 Exam Topic 2 Question 17 Discussion:
Question #: 17
Topic #: 2
Clean, well-labeled datasets used for machine learning are partitioned into three subsets: Training sets, Validation sets, and Test sets. As your team is doing this, what’s the best way to split up this data?
CPMAI’s glossary defines data splitting as “dividing a data set into subsets (e.g., training, validation, test) for model development and evaluation,” typically achieved via random subsampling to ensure each subset is representative of the underlying distribution and to prevent sampling bias.
Contribute your Thoughts:
Chosen Answer:
This is a voting comment (?). You can switch to a simple comment. It is better to Upvote an existing comment if you don't have anything to add.
Submit