Associate-Developer-Apache-Spark-3.5 Exam Question 51

How does the function date_sub(start, days) behave when a negative value is passed for the days parameter?
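The Spark SQL function reference describes date_sub as returning the date that is days days before start, with a negative days value being added to start instead. The sketch below models that arithmetic in plain Python so it can be checked without a cluster. `date_sub_model` and `spark_date_sub_demo` are hypothetical helper names, and the sample date is an assumption made for illustration.

```python
from datetime import date, timedelta


def date_sub_model(start: date, days: int) -> date:
    """Plain-Python model of date_sub: move `start` back by `days` days.

    Subtracting a negative number of days moves the date forward.
    """
    return start - timedelta(days=days)


def spark_date_sub_demo(spark):
    """Same idea through PySpark; needs an active SparkSession.

    PySpark is imported lazily so the model above runs without it.
    """
    from pyspark.sql import functions as F

    # Hypothetical one-row DataFrame with a single date column.
    df = spark.createDataFrame([(date(2024, 1, 10),)], ["start"])
    return df.select(F.date_sub("start", -3).alias("shifted"))
```

Under this model, date_sub_model(date(2024, 1, 10), -3) gives the same result as adding three days to the start date.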
  • Associate-Developer-Apache-Spark-3.5 Exam Question 52

    A data engineer is working on a num_df DataFrame and has a Python UDF defined as:
    def cube_func(val):
        return val * val * val
    Which code fragment registers and uses this UDF as a Spark SQL function to work with the DataFrame num_df?
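One common way to expose a Python function to Spark SQL is spark.udf.register, which makes it callable by name from SQL expressions. The sketch below is a minimal illustration and does not reproduce the answer options of the question. The column name "num", the "long" return type, and the helper `cube_with_sql` are assumptions not given in the source, and the column is assumed to hold non-null integers.

```python
def cube_func(val):
    return val * val * val


def cube_with_sql(spark, num_df, column="num"):
    """Register cube_func as a SQL function and apply it to num_df.

    `column` is a hypothetical column name; the question does not
    state the schema of num_df.
    """
    # Make the Python function callable from SQL under the name "cube_func".
    # "long" is an assumed return type for an integer input column.
    spark.udf.register("cube_func", cube_func, "long")
    # selectExpr parses SQL expressions, so the registered name resolves here.
    return num_df.selectExpr(column, f"cube_func({column}) AS cubed")
```

With an active SparkSession, cube_with_sql(spark, num_df).show() would display each value next to its cube.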
  • Associate-Developer-Apache-Spark-3.5 Exam Question 53

    A data engineer writes the following code to join two DataFrames df1 and df2:
    df1 = spark.read.csv("sales_data.csv") # ~10 GB
    df2 = spark.read.csv("product_data.csv") # ~8 MB
    result = df1.join(df2, df1.product_id == df2.product_id)

    Which join strategy will Spark use?
  • Associate-Developer-Apache-Spark-3.5 Exam Question 54

    What is the benefit of Adaptive Query Execution (AQE)?
  • Associate-Developer-Apache-Spark-3.5 Exam Question 55

    A data engineer is building a Structured Streaming pipeline and wants it to recover from failures or intentional shutdowns by continuing where it left off.
    How can this be achieved?