Winter Sale Limited Time 65% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: pass65

Amazon Web Services AWS Certified Data Engineer - Associate (DEA-C01) Data-Engineer-Associate Question # 20 Topic 3 Discussion

Amazon Web Services AWS Certified Data Engineer - Associate (DEA-C01) Data-Engineer-Associate Question # 20 Topic 3 Discussion

Data-Engineer-Associate Exam Topic 3 Question 20 Discussion:
Question #: 20
Topic #: 3

A company wants to use Apache Spark jobs that run on an Amazon EMR cluster to process streaming data. The Spark jobs will transform and store the data in an Amazon S3 bucket. The company will use Amazon Athena to perform analysis.

The company needs to optimize the data format for analytical queries.

Which solutions will meet these requirements with the SHORTEST query times? (Select TWO.)


A.

Use Avro format. Use AWS Glue Data Catalog to track schema changes.


B.

Use ORC format. Use AWS Glue Data Catalog to track schema changes.


C.

Use Apache Parquet format. Use an external Amazon DynamoDB table to track schema changes.


D.

Use Apache Parquet format. Use AWS Glue Data Catalog to track schema changes.


E.

Use ORC format. Store schema definitions in separate files in Amazon S3.


Get Premium Data-Engineer-Associate Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.