Online Access Free Databricks.Associate-Developer-Apache-Spark-3.5.v2025-11-26.q35 Practice Test (Page 8)

Associate-Developer-Apache-Spark-3.5 Exam Question 31

A data engineer writes the following code to join two DataFrames, df1 and df2:
df1 = spark.read.csv("sales_data.csv") # ~10 GB
df2 = spark.read.csv("product_data.csv") # ~8 MB
result = df1.join(df2, df1.product_id == df2.product_id)

Which join strategy will Spark use?

A. Shuffle join, because AQE is not enabled, and Spark uses a static query plan

B. Broadcast join, as df2 is smaller than the default broadcast threshold

C. Shuffle join, as the size difference between df1 and df2 is too large for a broadcast join to work efficiently

D. Shuffle join, because no broadcast hints were provided
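
As a rough aid for reasoning about this question, the sketch below compares a table's estimated size with the default value of spark.sql.autoBroadcastJoinThreshold, which is 10 MB (10485760 bytes). The helper fits_broadcast_threshold is hypothetical and written in plain Python. Spark's planner works from its own size statistics and also takes join type and hints into account, so this is a simplification and not the actual planning logic.

```python
# Default value of spark.sql.autoBroadcastJoinThreshold, in bytes (10 MB).
DEFAULT_BROADCAST_THRESHOLD = 10 * 1024 * 1024


def fits_broadcast_threshold(estimated_bytes, threshold=DEFAULT_BROADCAST_THRESHOLD):
    """Hypothetical helper: True if the estimated size is within the threshold.

    In Spark, setting the threshold to -1 disables automatic broadcasting;
    that behaviour is mirrored here.
    """
    if threshold < 0:
        return False
    return estimated_bytes <= threshold


MB = 1024 * 1024
sales_bytes = 10 * 1024 * MB   # ~10 GB, as stated in the question
product_bytes = 8 * MB         # ~8 MB, as stated in the question

sales_fits = fits_broadcast_threshold(sales_bytes)
product_fits = fits_broadcast_threshold(product_bytes)
```

Under these assumptions only the smaller table falls within the default threshold.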

Associate-Developer-Apache-Spark-3.5 Exam Question 32

An MLOps engineer is building a Pandas UDF that applies a language model that translates English strings into Spanish. The initial code is loading the model on every call to the UDF, which is hurting the performance of the data pipeline.
The initial code is:

def in_spanish_inner(df: pd.Series) -> pd.Series:
    model = get_translation_model(target_lang='es')
    return df.apply(model)

in_spanish = sf.pandas_udf(in_spanish_inner, StringType())
How can the MLOps engineer change this code to reduce how many times the language model is loaded?

A. Convert the Pandas UDF to a PySpark UDF

B. Convert the Pandas UDF from a Series → Series UDF to a Series → Scalar UDF

C. Run the in_spanish_inner() function in a mapInPandas() function call

D. Convert the Pandas UDF from a Series → Series UDF to an Iterator[Series] → Iterator[Series] UDF
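
To make the loading cost concrete, the following plain-Python sketch counts how often a model is loaded when a function is called once per batch versus once per iterator of batches. It is only an analogy: lists of strings stand in for pandas Series, and get_translation_model here is a stand-in that tags text instead of translating it. In PySpark, the iterator form would take and return Iterator[pd.Series].

```python
from typing import Callable, Iterator, List

load_count = 0  # counts how often the stand-in model is loaded


def get_translation_model(target_lang: str) -> Callable[[str], str]:
    """Stand-in for the question's loader; NOT a real translation model."""
    global load_count
    load_count += 1
    return lambda text: f"[{target_lang}] {text}"


def per_batch(batch: List[str]) -> List[str]:
    # Series -> Series shape: the model is loaded on every call (every batch).
    model = get_translation_model(target_lang="es")
    return [model(s) for s in batch]


def per_iterator(batches: Iterator[List[str]]) -> Iterator[List[str]]:
    # Iterator -> Iterator shape: load once, then reuse the model per batch.
    model = get_translation_model(target_lang="es")
    for batch in batches:
        yield [model(s) for s in batch]


batches = [["hello", "world"], ["good morning"], ["thanks"]]

load_count = 0
per_batch_out = [per_batch(b) for b in batches]
loads_per_batch = load_count        # one load for each of the three batches

load_count = 0
per_iterator_out = list(per_iterator(iter(batches)))
loads_per_iterator = load_count     # a single load for the whole iterator
```

Both variants produce the same output; only the number of model loads differs.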

Associate-Developer-Apache-Spark-3.5 Exam Question 33

A DataFrame df has columns name, age, and salary. The developer needs to sort the DataFrame by age in ascending order and salary in descending order.
Which code snippet meets the requirement of the developer?

A. df.orderBy(col("age").asc(), col("salary").asc()).show()

B. df.sort("age", "salary", ascending=[True, True]).show()

C. df.sort("age", "salary", ascending=[False, True]).show()

D. df.orderBy("age", "salary", ascending=[True, False]).show()
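
The required ordering can be sanity-checked without Spark. The sketch below sorts a few made-up records in plain Python by age ascending and salary descending. The Employee type and the sample values are assumptions for illustration only and do not come from the question.

```python
from collections import namedtuple

# Hypothetical record type and sample data, for illustration only.
Employee = namedtuple("Employee", ["name", "age", "salary"])

employees = [
    Employee("Ana", 30, 5000),
    Employee("Ben", 25, 4000),
    Employee("Cy", 30, 7000),
    Employee("Di", 25, 6000),
]

# Ascending age, then descending salary within each age group.
# Negating salary works here because salary is numeric.
ordered = sorted(employees, key=lambda e: (e.age, -e.salary))
names = [e.name for e in ordered]
```

Within each age group the higher salary comes first, which is the result a correct Spark snippet should reproduce.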

Associate-Developer-Apache-Spark-3.5 Exam Question 34

A data engineer needs to persist a file-based data source to a specific location. However, by default, Spark writes to the warehouse directory (e.g., /user/hive/warehouse). To override this, the engineer must explicitly define the file path.
Which line of code ensures the data is saved to a specific location?
Options:

A. users.write(path="/some/path").saveAsTable("default_table")

B. users.write.saveAsTable("default_table").option("path", "/some/path")

C. users.write.option("path", "/some/path").saveAsTable("default_table")

D. users.write.saveAsTable("default_table", path="/some/path")

Associate-Developer-Apache-Spark-3.5 Exam Question 35

Given this view definition:
df.createOrReplaceTempView("users_vw")
Which approach can be used to query the users_vw view after the session is terminated?
Options:

A. Query the users_vw using Spark

B. Persist the users_vw data as a table

C. Recreate the users_vw and query the data using Spark

D. Save the users_vw definition and query using Spark
