Online Access Free Databricks.Associate-Developer-Apache-Spark.v2022-05-26.q61 Practice Test (Page 10)

Associate-Developer-Apache-Spark Exam Question 41

A.itemsDf.select(explode("attributes").alias("attributes_exploded")).filter(attributes_exploded.contains("i"))

B.itemsDf.explode(attributes).alias("attributes_exploded").filter(col("attributes_exploded").contains("i"))

C.itemsDf.select(explode("attributes")).filter("attributes_exploded".contains("i"))

D.itemsDf.select(explode("attributes").alias("attributes_exploded")).filter(col("attributes_exploded").contain

E.itemsDf.select(col("attributes").explode().alias("attributes_exploded")).filter(col("attributes_exploded").co

Associate-Developer-Apache-Spark Exam Question 42

Which of the following code blocks adds a column predErrorSqrt to DataFrame transactionsDf that is the square root of column predError?

A.transactionsDf.withColumn("predErrorSqrt", sqrt(predError))

B.transactionsDf.select(sqrt(predError))

C.transactionsDf.withColumn("predErrorSqrt", col("predError").sqrt())

D.transactionsDf.withColumn("predErrorSqrt", sqrt(col("predError")))

E.transactionsDf.select(sqrt("predError"))

Correct Answer: D

Explanation
transactionsDf.withColumn("predErrorSqrt", sqrt(col("predError")))
Correct. DataFrame.withColumn() adds a new column to a DataFrame. It takes two arguments: the name of the new column (here: predErrorSqrt) and a Column expression that defines its values. In PySpark, you can build a Column expression with col("predError") or by other means, for example transactionsDf.predError. Functions such as sqrt() also accept just the column name as a string, "predError".
The question asks for the square root. sqrt() is a function in pyspark.sql.functions that calculates the square root. It takes a Column or a column name as input. Here it is the predError column of DataFrame transactionsDf, expressed through col("predError").
transactionsDf.withColumn("predErrorSqrt", sqrt(predError))
Incorrect. In this expression, sqrt(predError) does not refer to a column. Python reads predError as a variable name, and since no such variable exists, the call fails with a NameError before Spark is involved.
You could pass transactionsDf.predError, col("predError") (as in the correct solution), or even just "predError" instead.
transactionsDf.select(sqrt(predError))
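Wrong. The same problem applies here: predError is an undefined Python variable. In addition, select() would return only the square-root column instead of adding a column to transactionsDf.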
transactionsDf.select(sqrt("predError"))
No. While this is correct syntax, it will return a single-column DataFrame only containing a column showing the square root of column predError. However, the question asks for a column to be added to the original DataFrame transactionsDf.
transactionsDf.withColumn("predErrorSqrt", col("predError").sqrt())
No. The issue with this statement is that column col("predError") has no sqrt() method. sqrt() is a member of pyspark.sql.functions, but not of pyspark.sql.Column.
More info: pyspark.sql.DataFrame.withColumn and pyspark.sql.functions.sqrt (PySpark 3.1.2 documentation)

Associate-Developer-Apache-Spark Exam Question 43

The code block displayed below contains an error. The code block should arrange the rows of DataFrame transactionsDf using information from two columns in an ordered fashion, arranging first by column value, showing smaller numbers at the top and greater numbers at the bottom, and then by column predError, for which all values should be arranged in the inverse way of the order of items in column value. Find the error.
Code block:
transactionsDf.orderBy('value', asc_nulls_first(col('predError')))

A.Two orderBy statements with calls to the individual columns should be chained, instead of having both columns in one orderBy statement.

B.Column value should be wrapped by the col() operator.

C.Column predError should be sorted in a descending way, putting nulls last.

D.Column predError should be sorted by desc_nulls_first() instead.

E.Instead of orderBy, sort should be used.

Associate-Developer-Apache-Spark Exam Question 44

Which of the following code blocks creates a new 6-column DataFrame by appending the rows of the 6-column DataFrame yesterdayTransactionsDf to the rows of the 6-column DataFrame todayTransactionsDf, ignoring that both DataFrames have different column names?

A.union(todayTransactionsDf, yesterdayTransactionsDf)

B.todayTransactionsDf.unionByName(yesterdayTransactionsDf, allowMissingColumns=True)

C.todayTransactionsDf.unionByName(yesterdayTransactionsDf)

D.todayTransactionsDf.concat(yesterdayTransactionsDf)

E.todayTransactionsDf.union(yesterdayTransactionsDf)

Associate-Developer-Apache-Spark Exam Question 45

The code block shown below should return a two-column DataFrame with columns transactionId and supplier, with combined information from DataFrames itemsDf and transactionsDf. The code block should merge rows in which column productId of DataFrame transactionsDf matches the value of column itemId in DataFrame itemsDf, but only where column storeId of DataFrame transactionsDf does not match column itemId of DataFrame itemsDf. Choose the answer that correctly fills the blanks in the code block to accomplish this.
Code block:
transactionsDf.__1__(itemsDf, __2__).__3__(__4__)

A.1. join
2. transactionsDf.productId==itemsDf.itemId, how="inner"
3. select
4. "transactionId", "supplier"

B.1. select
2. "transactionId", "supplier"
3. join
4. [transactionsDf.storeId!=itemsDf.itemId, transactionsDf.productId==itemsDf.itemId]

C.1. join
2. [transactionsDf.productId==itemsDf.itemId, transactionsDf.storeId!=itemsDf.itemId]
3. select
4. "transactionId", "supplier"

D.1. filter
2. "transactionId", "supplier"
3. join
4. "transactionsDf.storeId!=itemsDf.itemId, transactionsDf.productId==itemsDf.itemId"

E.1. join
2. transactionsDf.productId==itemsDf.itemId, transactionsDf.storeId!=itemsDf.itemId
3. filter
4. "transactionId", "supplier"

