Online Access Free Databricks.Associate-Developer-Apache-Spark.v2022-06-21.q62 Practice Test (Page 10)

Associate-Developer-Apache-Spark Exam Question 41

The code block displayed below contains an error. The code block should merge the rows of DataFrames transactionsDfMonday and transactionsDfTuesday into a new DataFrame, matching column names and inserting null values where column names do not appear in both DataFrames. Find the error.
Sample of DataFrame transactionsDfMonday:
+-------------+---------+-----+-------+---------+----+
|transactionId|predError|value|storeId|productId|   f|
+-------------+---------+-----+-------+---------+----+
|            5|     null| null|   null|        2|null|
|            6|        3|    2|     25|        2|null|
+-------------+---------+-----+-------+---------+----+
Sample of DataFrame transactionsDfTuesday:
+-------+-------------+---------+-----+
|storeId|transactionId|productId|value|
+-------+-------------+---------+-----+
|     25|            1|        1|    4|
|      2|            2|        2|    7|
|      3|            4|        2| null|
|   null|            5|        2| null|
+-------+-------------+---------+-----+
Code block:
sc.union([transactionsDfMonday, transactionsDfTuesday])

A.The DataFrames' RDDs need to be passed into the sc.union method instead of the DataFrame variable names.

B.Instead of union, the concat method should be used, making sure to not use its default arguments.

C.Instead of the Spark context, transactionsDfMonday should be called with the join method instead of the union method, making sure to use its default arguments.

D.Instead of the Spark context, transactionsDfMonday should be called with the union method.

E.Instead of the Spark context, transactionsDfMonday should be called with the unionByName method instead of the union method, making sure to not use its default arguments.

Correct Answer: E

Explanation
Correct code block:
transactionsDfMonday.unionByName(transactionsDfTuesday, True)
Output of correct code block:
+-------------+---------+-----+-------+---------+----+
|transactionId|predError|value|storeId|productId|   f|
+-------------+---------+-----+-------+---------+----+
|            5|     null| null|   null|        2|null|
|            6|        3|    2|     25|        2|null|
|            1|     null|    4|     25|        1|null|
|            2|     null|    7|      2|        2|null|
|            4|     null| null|      3|        2|null|
|            5|     null| null|   null|        2|null|
+-------------+---------+-----+-------+---------+----+
To solve this question, you need to know the difference between the DataFrame.union() and DataFrame.unionByName() methods. union() matches columns by position, regardless of their names. unionByName() matches columns by name, which is what the question asks for. It also takes an optional argument, allowMissingColumns, which defaults to False. Setting it to True lets you merge DataFrames whose sets of columns differ, filling the missing columns with null, just like in this example.
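The difference between matching by position and matching by name can be sketched without a Spark cluster. The block below is a plain-Python model of the behavior described above, not Spark's implementation: a "frame" is simply a (columns, rows) pair, and the helper names union_by_position and union_by_name are hypothetical. It assumes, as Spark does, that a positional union requires the same number of columns on both sides.

```python
# Plain-Python model of positional vs. by-name union (not Spark's implementation).
# A "frame" here is a (columns, rows) pair; both helper names are hypothetical.

def union_by_position(left, right):
    """Mimic DataFrame.union(): keep the left column names, append rows as-is."""
    left_cols, left_rows = left
    right_cols, right_rows = right
    if len(left_cols) != len(right_cols):
        raise ValueError("positional union needs the same number of columns")
    return left_cols, left_rows + right_rows


def union_by_name(left, right, allow_missing_columns=False):
    """Mimic DataFrame.unionByName(): align the right rows to the left names."""
    left_cols, left_rows = left
    right_cols, right_rows = right
    if set(left_cols) != set(right_cols) and not allow_missing_columns:
        raise ValueError("column names differ; set allow_missing_columns=True")
    # Left columns first, then any columns that only exist on the right.
    out_cols = list(left_cols) + [c for c in right_cols if c not in left_cols]

    def realign(cols, rows):
        index = {name: i for i, name in enumerate(cols)}
        # A column missing from this side becomes None (null in Spark).
        return [
            tuple(row[index[c]] if c in index else None for c in out_cols)
            for row in rows
        ]

    return out_cols, realign(left_cols, left_rows) + realign(right_cols, right_rows)


monday = (
    ["transactionId", "predError", "value", "storeId", "productId", "f"],
    [(5, None, None, None, 2, None), (6, 3, 2, 25, 2, None)],
)
tuesday = (
    ["storeId", "transactionId", "productId", "value"],
    [(25, 1, 1, 4), (2, 2, 2, 7), (3, 4, 2, None), (None, 5, 2, None)],
)

merged_cols, merged_rows = union_by_name(monday, tuesday, allow_missing_columns=True)
for row in merged_rows:
    print(row)
```

In this model the first Tuesday row, (25, 1, 1, 4), is realigned to (1, None, 4, 25, 1, None), which corresponds to the third row of the output table above. Calling union_by_position(monday, tuesday) raises an error instead, because the two frames have six and four columns.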
sc stands for SparkContext and is automatically provided when executing code on Databricks. While sc.union() combines RDDs, it is not the right choice for combining DataFrames. The question hints away from sc.union() where it asks for the rows to be merged "into a new DataFrame".
concat is a function in pyspark.sql.functions. It is great for concatenating values from different columns into one column, but has no place when trying to append the rows of one DataFrame to another.
Finally, the join method is a contender here. However, a join places the columns of matching rows side by side rather than appending rows, and its default join type is an inner join. That does not get us closer to the goal of merging the two DataFrames as instructed, especially since with the default arguments no join condition is defined.
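To see why a join does not stack rows, here is a minimal plain-Python sketch of an inner join. It is a model only, not Spark code, and the helper name inner_join is hypothetical. It also assumes an explicit key, transactionId, which goes beyond the answer option (that option uses join's default arguments), purely to show the shape of the result.

```python
# Plain-Python model of an inner join on one key (hypothetical helper, not Spark code).

def inner_join(left, right, key):
    left_cols, left_rows = left
    right_cols, right_rows = right
    li, ri = left_cols.index(key), right_cols.index(key)
    # The result gets wider, not longer: left columns followed by right columns.
    out_cols = ["left." + c for c in left_cols] + ["right." + c for c in right_cols]
    # Keep only the row pairs whose key values match (null keys never match).
    out_rows = [
        l + r
        for l in left_rows
        for r in right_rows
        if l[li] is not None and l[li] == r[ri]
    ]
    return out_cols, out_rows


monday = (
    ["transactionId", "predError", "value", "storeId", "productId", "f"],
    [(5, None, None, None, 2, None), (6, 3, 2, 25, 2, None)],
)
tuesday = (
    ["storeId", "transactionId", "productId", "value"],
    [(25, 1, 1, 4), (2, 2, 2, 7), (3, 4, 2, None), (None, 5, 2, None)],
)

joined_cols, joined_rows = inner_join(monday, tuesday, "transactionId")
print(len(joined_cols), len(joined_rows))  # prints: 10 1
```

Only transactionId 5 appears in both samples, so the model returns a single ten-column row instead of the six stacked rows the question asks for.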
More info:
- pyspark.sql.DataFrame.unionByName - PySpark 3.1.2 documentation
- pyspark.SparkContext.union - PySpark 3.1.2 documentation
- pyspark.sql.functions.concat - PySpark 3.1.2 documentation

Associate-Developer-Apache-Spark Exam Question 42

Which of the following DataFrame methods is classified as a transformation?

A.DataFrame.count()

B.DataFrame.show()

C.DataFrame.select()

D.DataFrame.foreach()

E.DataFrame.first()

Associate-Developer-Apache-Spark Exam Question 43

Which of the following statements about DAGs is correct?

A.DAGs help direct how Spark executors process tasks, but are a limitation to the proper execution of a query when an executor fails.

B.DAG stands for "Directing Acyclic Graph".

C.Spark strategically hides DAGs from developers, since the high degree of automation in Spark means that developers never need to consider DAG layouts.

D.In contrast to transformations, DAGs are never lazily executed.

E.DAGs can be decomposed into tasks that are executed in parallel.

Associate-Developer-Apache-Spark Exam Question 44

Which of the following describes the characteristics of accumulators?

A.Accumulators are used to pass around lookup tables across the cluster.

B.All accumulators used in a Spark application are listed in the Spark UI.

C.Accumulators can be instantiated directly via the accumulator(n) method of the pyspark.RDD module.

D.Accumulators are immutable.

E.If an action including an accumulator fails during execution and Spark manages to restart the action and complete it successfully, only the successful attempt will be counted in the accumulator.

Associate-Developer-Apache-Spark Exam Question 45

Which of the following code blocks returns a single-column DataFrame of all entries in the Python list throughputRates, which contains only float-type values?

A.spark.createDataFrame((throughputRates), FloatType)

B.spark.createDataFrame(throughputRates, FloatType)

C.spark.DataFrame(throughputRates, FloatType)

D.spark.createDataFrame(throughputRates)

E.spark.createDataFrame(throughputRates, FloatType())
