Summer Certification Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: force70

Pass the Amazon Web Services AWS Certified Associate MLA-C01 Questions and answers with CertsForce

Viewing page 6 out of 8 pages
Viewing questions 51-60 out of questions
Questions # 51:

Case study

An ML engineer is developing a fraud detection model on AWS. The training dataset includes transaction logs, customer profiles, and tables from an on-premises MySQL database. The transaction logs and customer profiles are stored in Amazon S3.

The dataset has a class imbalance that affects the learning of the model ' s algorithm. Additionally, many of the features have interdependencies. The algorithm is not capturing all the desired underlying patterns in the data.

The training dataset includes categorical data and numerical data. The ML engineer must prepare the training dataset to maximize the accuracy of the model.

Which action will meet this requirement with the LEAST operational overhead?

Options:

A.

Use AWS Glue to transform the categorical data into numerical data.


B.

Use AWS Glue to transform the numerical data into categorical data.


C.

Use Amazon SageMaker Data Wrangler to transform the categorical data into numerical data.


D.

Use Amazon SageMaker Data Wrangler to transform the numerical data into categorical data.


Expert Solution
Questions # 52:

A company is using an Amazon Redshift database as its single data source. Some of the data is sensitive.

A data scientist needs to use some of the sensitive data from the database. An ML engineer must give the data scientist access to the data without transforming the source data and without storing anonymized data in the database.

Which solution will meet these requirements with the LEAST implementation effort?

Options:

A.

Configure dynamic data masking policies to control how sensitive data is shared with the data scientist at query time.


B.

Create a materialized view with masking logic on top of the database. Grant the necessary read permissions to the data scientist.


C.

Unload the Amazon Redshift data to Amazon S3. Use Amazon Athena to create schema-on-read with masking logic. Share the view with the data scientist.


D.

Unload the Amazon Redshift data to Amazon S3. Create an AWS Glue job to anonymize the data. Share the dataset with the data scientist.


Expert Solution
Questions # 53:

An ML engineer is using Amazon SageMaker Canvas to build a custom ML model from an imported dataset. The model must make continuous numeric predictions based on 10 years of data.

Which metric should the ML engineer use to evaluate the model’s performance?

Options:

A.

Accuracy


B.

InferenceLatency


C.

Area Under the ROC Curve (AUC)


D.

Root Mean Square Error (RMSE)


Expert Solution
Questions # 54:

A company has a Retrieval Augmented Generation (RAG) application that uses a vector database to store embeddings of documents. The company must migrate the application to AWS and must implement a solution that provides semantic search of text files. The company has already migrated the text repository to an Amazon S3 bucket.

Which solution will meet these requirements?

Options:

A.

Use an AWS Batch job to process the files and generate embeddings. Use AWS Glue to store the embeddings. Use SQL queries to perform the semantic searches.


B.

Use a custom Amazon SageMaker notebook to run a custom script to generate embeddings. Use SageMaker Feature Store to store the embeddings. Use SQL queries to perform the semantic searches.


C.

Use the Amazon Kendra S3 connector to ingest the documents from the S3 bucket into Amazon Kendra. Query Amazon Kendra to perform the semantic searches.


D.

Use an Amazon Textract asynchronous job to ingest the documents from the S3 bucket. Query Amazon Textract to perform the semantic searches.


Expert Solution
Questions # 55:

A company is using Amazon SageMaker AI to build an ML model to predict customer behavior. The company needs to explain the bias in the model to an auditor. The explanation must focus on demographic data of the customers.

Which solution will meet these requirements?

Options:

A.

Use SageMaker Clarify to generate a bias report. Send the report to the auditor.


B.

Use AWS Glue DataBrew to create a job to detect drift in the model ' s data quality. Send the job output to the auditor.


C.

Use Amazon QuickSight integration with SageMaker AI to generate a bias report. Send the report to the auditor.


D.

Use Amazon CloudWatch metrics from the SageMaker AI namespace to create a bias dashboard. Share the dashboard with the auditor.


Expert Solution
Questions # 56:

A gaming company needs to deploy a natural language processing (NLP) model to moderate a chat forum in a game. The workload experiences heavy usage during evenings and weekends but minimal activity during other hours.

Which solution will meet these requirements MOST cost-effectively?

Options:

A.

Use an Amazon SageMaker AI batch transform job with fixed capacity.


B.

Use Amazon SageMaker Serverless Inference.


C.

Use a single Amazon EC2 GPU instance with reserved capacity.


D.

Use Amazon SageMaker Asynchronous Inference.


Expert Solution
Questions # 57:

An ML engineer needs to organize a large set of text documents into topics. The ML engineer will not know what the topics are in advance. The ML engineer wants to use built-in algorithms or pre-trained models available through Amazon SageMaker AI to process the documents.

Which solution will meet these requirements?

Options:

A.

Use the BlazingText algorithm to identify the relevant text and to create a set of topics based on the documents.


B.

Use the Sequence-to-Sequence algorithm to summarize the text and to create a set of topics based on the documents.


C.

Use the Object2Vec algorithm to create embeddings and to create a set of topics based on the embeddings.


D.

Use the Latent Dirichlet Allocation (LDA) algorithm to process the documents and to create a set of topics based on the documents.


Expert Solution
Questions # 58:

A company stores training data as a .csv file in an Amazon S3 bucket. The company must encrypt the data and must control which applications have access to the encryption key.

Which solution will meet these requirements?

Options:

A.

Create a new SSH access key and use the AWS Encryption CLI to encrypt the file.


B.

Create a new API key by using Amazon API Gateway and use it to encrypt the file.


C.

Create a new IAM role with permissions for kms:GenerateDataKey and use the role to encrypt the file.


D.

Create a new AWS Key Management Service (AWS KMS) key and use the AWS Encryption CLI with the KMS key to encrypt the file.


Expert Solution
Questions # 59:

An ML engineer wants to re-train an XGBoost model at the end of each month. A data team prepares the training data. The training dataset is a few hundred megabytes in size. When the data is ready, the data team stores the data as a new file in an Amazon S3 bucket.

The ML engineer needs a solution to automate this pipeline. The solution must register the new model version in Amazon SageMaker Model Registry within 24 hours.

Which solution will meet these requirements?

Options:

A.

Create an AWS Lambda function that runs one time each week to poll the S3 bucket for new files. Invoke the Lambda function asynchronously. Configure the Lambda function to start the pipeline if the function detects new data.


B.

Create an Amazon CloudWatch rule that runs on a schedule to start the pipeline every 30 days.


C.

Create an S3 Lifecycle rule to start the pipeline every time a new object is uploaded to the S3 bucket.


D.

Create an Amazon EventBridge rule to start an AWS Step Functions TrainingStep every time a new object is uploaded to the S3 bucket.


Expert Solution
Questions # 60:

An ML engineer is using Amazon Quick Suite (previously known as Amazon QuickSight) anomaly detection to detect very high or very low machine operating temperatures compared to normal. The ML engineer sets the Severity parameter to Low and above. The ML engineer sets the Direction parameter to All.

What effect will the ML engineer observe in the anomaly detection results if the ML engineer changes the Direction parameter to Lower than expected?

Options:

A.

Increased anomaly identification frequency and increased recall


B.

Decreased anomaly identification frequency and decreased recall


C.

Increased anomaly identification frequency and decreased recall


D.

Decreased anomaly identification frequency and increased recall


Expert Solution
Viewing page 6 out of 8 pages
Viewing questions 51-60 out of questions