Databricks Certified Machine Learning Associate Exam Databricks-Machine-Learning-Associate Question # 17 Topic 2 Discussion

Databricks Certified Machine Learning Associate Exam Databricks-Machine-Learning-Associate Question # 17 Topic 2 Discussion

Databricks-Machine-Learning-Associate Exam Topic 2 Question 17 Discussion:
Question #: 17
Topic #: 2

A data scientist has written a data cleaning notebook that utilizes the pandas library, but their colleague has suggested that they refactor their notebook to scale with big data.

Which of the following approaches can the data scientist take to spend the least amount of time refactoring their notebook to scale with big data?


A.

They can refactor their notebook to process the data in parallel.


B.

They can refactor their notebook to use the PySpark DataFrame API.


C.

They can refactor their notebook to use the Scala Dataset API.


D.

They can refactor their notebook to use Spark SQL.


E.

They can refactor their notebook to utilize the pandas API on Spark.


Get Premium Databricks-Machine-Learning-Associate Questions

Contribute your Thoughts:


Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.