A data analyst wants to use compression on an analyzed data set and send it to a new destination for further processing. Which of the following issues will most likely occur?
→ Compression is a CPU-intensive process because it requires encoding data into a smaller format, often involving complex algorithms. While memory use is usually moderate, CPU usage can spike significantly, especially during real-time compression or large dataset processing.
Why the other options are incorrect:
A: Library issues are possible but not the most likely issue in compression.
C: Most operating systems support common compression formats (e.g., .zip, .gz).
D: Memory usage is generally lower than CPU usage during compression.
Official References:
CompTIA DataX (DY0-001) Official Study Guide – Section 5.4:“Compression is compute-intensive and may result in increased CPU utilization, particularly on shared servers or during large batch processes.”
Cloud Data Engineering Guide, Chapter 9:“High CPU usage is a common bottleneck in data compression and decompression processes, especially at scale.”
—
Contribute your Thoughts:
Chosen Answer:
This is a voting comment (?). You can switch to a simple comment. It is better to Upvote an existing comment if you don't have anything to add.
Submit