Associate-Developer-Apache-Spark-3.5 Exam Question 16

A Spark DataFramedfis cached using theMEMORY_AND_DISKstorage level, but the DataFrame is too large to fit entirely in memory.
What is the likely behavior when Spark runs out of memory to store the DataFrame?
  • Associate-Developer-Apache-Spark-3.5 Exam Question 17

    A data engineer is working ona Streaming DataFrame streaming_df with the given streaming data:

    Which operation is supported with streaming_df?
  • Associate-Developer-Apache-Spark-3.5 Exam Question 18

    A data engineer replaces the exact percentile() function with approx_percentile() to improve performance, but the results are drifting too far from expected values.
    Which change should be made to solve the issue?
  • Associate-Developer-Apache-Spark-3.5 Exam Question 19

    A Spark application is experiencing performance issues in client mode because the driver is resource- constrained.
    How should this issue be resolved?
  • Associate-Developer-Apache-Spark-3.5 Exam Question 20

    An engineer has a large ORC file located at/file/test_data.orcand wants to read only specific columns to reduce memory usage.
    Which code fragment will select the columns, i.e.,col1,col2, during the reading process?