Amazon Web Services AWS Certified Machine Learning - Specialty MLS-C01 Question # 23 Topic 3 Discussion

Amazon Web Services AWS Certified Machine Learning - Specialty MLS-C01 Question # 23 Topic 3 Discussion

MLS-C01 Exam Topic 3 Question 23 Discussion:
Question #: 23
Topic #: 3

A credit card company wants to build a credit scoring model to help predict whether a new credit card applicant

will default on a credit card payment. The company has collected data from a large number of sources with

thousands of raw attributes. Early experiments to train a classification model revealed that many attributes are

highly correlated, the large number of features slows down the training speed significantly, and that there are

some overfitting issues.

The Data Scientist on this project would like to speed up the model training time without losing a lot of

information from the original dataset.

Which feature engineering technique should the Data Scientist use to meet the objectives?


A.

Run self-correlation on all features and remove highly correlated features


B.

Normalize all numerical values to be between 0 and 1


C.

Use an autoencoder or principal component analysis (PCA) to replace original features with new features


D.

Cluster raw data using k-means and use sample data from each cluster to build a new dataset


Get Premium MLS-C01 Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.