Online Access Free Databricks.Associate-Developer-Apache-Spark-3.5.v2026-03-02.q60 Practice Test (Page 4)

Associate-Developer-Apache-Spark-3.5 Exam Question 11

Given a CSV file with the content:

And the following code:
from pyspark.sql.types import *
schema = StructType([
StructField("name", StringType()),
StructField("age", IntegerType())
])
spark.read.schema(schema).csv(path).collect()
What is the resulting output?

A.[Row(name='bambi'), Row(name='alladin', age=20)]

B.[Row(name='alladin', age=20)]

C.[Row(name='bambi', age=None), Row(name='alladin', age=20)]

D.The code throws an error due to a schema mismatch.

Associate-Developer-Apache-Spark-3.5 Exam Question 12

An MLOps engineer is building a Pandas UDF that applies a language model that translates English strings into Spanish. The initial code is loading the model on every call to the UDF, which is hurting the performance of the data pipeline.
The initial code is:

def in_spanish_inner(df: pd.Series) -> pd.Series:
model = get_translation_model(target_lang='es')
return df.apply(model)
in_spanish = sf.pandas_udf(in_spanish_inner, StringType())
How can the MLOps engineer change this code to reduce how many times the language model is loaded?

A.Convert the Pandas UDF to a PySpark UDF

B.Convert the Pandas UDF from a Series → Series UDF to a Series → Scalar UDF

C.Run the in_spanish_inner() function in a mapInPandas() function call

D.Convert the Pandas UDF from a Series → Series UDF to an Iterator[Series] → Iterator[Series] UDF

Associate-Developer-Apache-Spark-3.5 Exam Question 13

Which UDF implementation calculates the length of strings in a Spark DataFrame?

A.df.withColumn("length", spark.udf("len", StringType()))

B.df.select(length(col("stringColumn")).alias("length"))

C.spark.udf.register("stringLength", lambda s: len(s))

D.df.withColumn("length", udf(lambda s: len(s), StringType()))

Associate-Developer-Apache-Spark-3.5 Exam Question 14

A Spark engineer is troubleshooting a Spark application that has been encountering out-of-memory errors during execution. By reviewing the Spark driver logs, the engineer notices multiple "GC overhead limit exceeded" messages.
Which action should the engineer take to resolve this issue?

A.Optimize the data processing logic by repartitioning the DataFrame.

B.Modify the Spark configuration to disable garbage collection

C.Increase the memory allocated to the Spark Driver.

D.Cache large DataFrames to persist them in memory.

Associate-Developer-Apache-Spark-3.5 Exam Question 15

26 of 55.
A data scientist at an e-commerce company is working with user data obtained from its subscriber database and has stored the data in a DataFrame df_user.
Before further processing, the data scientist wants to create another DataFrame df_user_non_pii and store only the non-PII columns.
The PII columns in df_user are name, email, and birthdate.
Which code snippet can be used to meet this requirement?

A.df_user_non_pii = df_user.drop("name", "email", "birthdate")

B.df_user_non_pii = df_user.dropFields("name", "email", "birthdate")

C.df_user_non_pii = df_user.select("name", "email", "birthdate")

D.df_user_non_pii = df_user.remove("name", "email", "birthdate")

Other Version: 478Databricks.Associate-Developer-Apache-Spark-3.5.v2025-11-26.q35

Latest Upload: 122Microsoft.AB-731.v2026-06-19.q23; 223IIA.IIA-CIA-Part2.v2026-06-19.q308; 149DAMA.MD-1220.v2026-06-19.q66; 125ISTQB.CT-AI.v2026-06-18.q68; 242IIA.IIA-CIA-Part3.v2026-06-17.q220; 165WGU.Introduction-to-IT.v2026-06-17.q67; 206CompTIA.220-1202.v2026-06-16.q110; 138TheInstitutes.CPCU-500.v2026-06-16.q25; 219ACAMS.CAMS7-CN.v2026-06-16.q170; 271CBIC.CIC.v2026-06-15.q123

Associate-Developer-Apache-Spark-3.5 Exam Question 11

Associate-Developer-Apache-Spark-3.5 Exam Question 12

Associate-Developer-Apache-Spark-3.5 Exam Question 13

Associate-Developer-Apache-Spark-3.5 Exam Question 14

Associate-Developer-Apache-Spark-3.5 Exam Question 15

Download PDF File