Pass the Amazon Web Services AWS Certified Specialty MLS-C01 Questions and answers with CertsForce

Viewing page 6 out of 10 pages
Viewing questions 51-60 out of questions
Questions # 51:

A Machine Learning team uses Amazon SageMaker to train an Apache MXNet handwritten digit classifier model using a research dataset. The team wants to receive a notification when the model is overfitting. Auditors want to view the Amazon SageMaker log activity report to ensure there are no unauthorized API calls.

What should the Machine Learning team do to address the requirements with the least amount of code and fewest steps?

Options:

A.

Implement an AWS Lambda function to long Amazon SageMaker API calls to Amazon S3. Add code to push a custom metric to Amazon CloudWatch. Create an alarm in CloudWatch with Amazon SNS to receive a notification when the model is overfitting.


B.

Use AWS CloudTrail to log Amazon SageMaker API calls to Amazon S3. Add code to push a custom metric to Amazon CloudWatch. Create an alarm in CloudWatch with Amazon SNS to receive a notification when the model is overfitting.


C.

Implement an AWS Lambda function to log Amazon SageMaker API calls to AWS CloudTrail. Add code to push a custom metric to Amazon CloudWatch. Create an alarm in CloudWatch with Amazon SNS to receive a notification when the model is overfitting.


D.

Use AWS CloudTrail to log Amazon SageMaker API calls to Amazon S3. Set up Amazon SNS to receive a notification when the model is overfitting.


Expert Solution
Questions # 52:

A Machine Learning Specialist is working for a credit card processing company and receives an unbalanced dataset containing credit card transactions. It contains 99,000 valid transactions and 1,000 fraudulent transactions The Specialist is asked to score a model that was run against the dataset The Specialist has been advised that identifying valid transactions is equally as important as identifying fraudulent transactions

What metric is BEST suited to score the model?

Options:

A.

Precision


B.

Recall


C.

Area Under the ROC Curve (AUC)


D.

Root Mean Square Error (RMSE)


Expert Solution
Questions # 53:

An e-commerce company needs a customized training model to classify images of its shirts and pants products The company needs a proof of concept in 2 to 3 days with good accuracy Which compute choice should the Machine Learning Specialist select to train and achieve good accuracy on the model quickly?

Options:

A.

m5 4xlarge (general purpose)


B.

r5.2xlarge (memory optimized)


C.

p3.2xlarge (GPU accelerated computing)


D.

p3 8xlarge (GPU accelerated computing)


Expert Solution
Questions # 54:

A manufacturing company needs to identify returned smartphones that have been damaged by moisture. The company has an automated process that produces 2.000 diagnostic values for each phone. The database contains more than five million phone evaluations. The evaluation process is consistent, and there are no missing values in the data. A machine learning (ML) specialist has trained an Amazon SageMaker linear learner ML model to classify phones as moisture damaged or not moisture damaged by using all available features. The model's F1 score is 0.6.

What changes in model training would MOST likely improve the model's F1 score? (Select TWO.)

Options:

A.

Continue to use the SageMaker linear learner algorithm. Reduce the number of features with the SageMaker principal component analysis (PCA) algorithm.


B.

Continue to use the SageMaker linear learner algorithm. Reduce the number of features with the scikit-iearn multi-dimensional scaling (MDS) algorithm.


C.

Continue to use the SageMaker linear learner algorithm. Set the predictor type to regressor.


D.

Use the SageMaker k-means algorithm with k of less than 1.000 to train the model


E.

Use the SageMaker k-nearest neighbors (k-NN) algorithm. Set a dimension reduction target of less than 1,000 to train the model.


Expert Solution
Questions # 55:

A Machine Learning Specialist is preparing data for training on Amazon SageMaker The Specialist is transformed into a numpy .array, which appears to be negatively affecting the speed of the training

What should the Specialist do to optimize the data for training on SageMaker'?

Options:

A.

Use the SageMaker batch transform feature to transform the training data into a DataFrame


B.

Use AWS Glue to compress the data into the Apache Parquet format


C.

Transform the dataset into the Recordio protobuf format


D.

Use the SageMaker hyperparameter optimization feature to automatically optimize the data


Expert Solution
Questions # 56:

A retail company intends to use machine learning to categorize new products A labeled dataset of current products was provided to the Data Science team The dataset includes 1 200 products The labeled dataset has 15 features for each product such as title dimensions, weight, and price Each product is labeled as belonging to one of six categories such as books, games, electronics, and movies.

Which model should be used for categorizing new products using the provided dataset for training?

Options:

A.

An XGBoost model where the objective parameter is set to multi: softmax


B.

A deep convolutional neural network (CNN) with a softmax activation function for the last layer


C.

A regression forest where the number of trees is set equal to the number of product categories


D.

A DeepAR forecasting model based on a recurrent neural network (RNN)


Expert Solution
Questions # 57:

A company sells thousands of products on a public website and wants to automatically identify products with potential durability problems. The company has 1.000 reviews with date, star rating, review text, review summary, and customer email fields, but many reviews are incomplete and have empty fields. Each review has already been labeled with the correct durability result.

A machine learning specialist must train a model to identify reviews expressing concerns over product durability. The first model needs to be trained and ready to review in 2 days.

What is the MOST direct approach to solve this problem within 2 days?

Options:

A.

Train a custom classifier by using Amazon Comprehend.


B.

Build a recurrent neural network (RNN) in Amazon SageMaker by using Gluon and Apache MXNet.


C.

Train a built-in BlazingText model using Word2Vec mode in Amazon SageMaker.


D.

Use a built-in seq2seq model in Amazon SageMaker.


Expert Solution
Questions # 58:

An ecommerce company has observed that customers who use the company's website rarely view items that the website recommends to customers. The company wants to recommend items to customers that customers are more likely to want to purchase.

Which solution will meet this requirement in the SHORTEST amount of time?

Options:

A.

Host the company's website on Amazon EC2 Accelerated Computing instances to increase the website response speed.


B.

Host the company's website on Amazon EC2 GPU-based instances to increase the speed of the website's search tool.


C.

Integrate Amazon Personalize into the company's website to provide customers with personalized recommendations.


D.

Use Amazon SageMaker to train a Neural Collaborative Filtering (NCF) model to make product recommendations.


Expert Solution
Questions # 59:

Which of the following metrics should a Machine Learning Specialist generally use to compare/evaluate machine learning classification models against each other?

Options:

A.

Recall


B.

Misclassification rate


C.

Mean absolute percentage error (MAPE)


D.

Area Under the ROC Curve (AUC)


Expert Solution
Questions # 60:

IT leadership wants Jo transition a company's existing machine learning data storage environment to AWS as a temporary ad hoc solution The company currently uses a custom software process that heavily leverages SOL as a query language and exclusively stores generated csv documents for machine learning

The ideal state for the company would be a solution that allows it to continue to use the current workforce of SQL experts The solution must also support the storage of csv and JSON files, and be able to query over semi-structured data The following are high priorities for the company:

• Solution simplicity

• Fast development time

• Low cost

• High flexibility

What technologies meet the company's requirements?

Options:

A.

Amazon S3 and Amazon Athena


B.

Amazon Redshift and AWS Glue


C.

Amazon DynamoDB and DynamoDB Accelerator (DAX)


D.

Amazon RDS and Amazon ES


Expert Solution
Viewing page 6 out of 10 pages
Viewing questions 51-60 out of questions