Online Access Free Databricks.Associate-Developer-Apache-Spark.v2022-05-26.q61 Practice Test (Page 13)

Associate-Developer-Apache-Spark Exam Question 56

Which of the following describes characteristics of the Dataset API?

A.The Dataset API does not support unstructured data.

B.In Python, the Dataset API mainly resembles Pandas' DataFrame API.

C.In Python, the Dataset API's schema is constructed via type hints.

D.The Dataset API is available in Scala, but it is not available in Python.

E.The Dataset API does not provide compile-time type safety.

Associate-Developer-Apache-Spark Exam Question 57

Which of the following describes Spark's way of managing memory?

A.Spark uses a subset of the reserved system memory.

B.Storage memory is used for caching partitions derived from DataFrames.

C.As a general rule for garbage collection, Spark performs better on many small objects than few big objects.

D.Disabling serialization potentially greatly reduces the memory footprint of a Spark application.

E.Spark's memory usage can be divided into three categories: Execution, transaction, and storage.

Associate-Developer-Apache-Spark Exam Question 58

The code block displayed below contains an error. The code block should produce a DataFrame with color as the only column and three rows with color values of red, blue, and green, respectively.
Find the error.
Code block:
1.spark.createDataFrame([("red",), ("blue",), ("green",)], "color")
Instead of calling spark.createDataFrame, just DataFrame should be called.

A.The commas in the tuples with the colors should be eliminated.

B.The colors red, blue, and green should be expressed as a simple Python list, and not a list of tuples.

C.Instead of color, a data type should be specified.

D.The "color" expression needs to be wrapped in brackets, so it reads ["color"].

Associate-Developer-Apache-Spark Exam Question 59

Which of the following code blocks returns a 2-column DataFrame that shows the distinct values in column productId and the number of rows with that productId in DataFrame transactionsDf?

A.transactionsDf.count("productId").distinct()

B.transactionsDf.groupBy("productId").agg(col("value").count())

C.transactionsDf.count("productId")

D.transactionsDf.groupBy("productId").count()

E.transactionsDf.groupBy("productId").select(count("value"))

Associate-Developer-Apache-Spark Exam Question 60

Which of the following code blocks creates a new DataFrame with two columns season and wind_speed_ms where column season is of data type string and column wind_speed_ms is of data type double?

A.spark.DataFrame({"season": ["winter","summer"], "wind_speed_ms": [4.5, 7.5]})

B.spark.createDataFrame([("summer", 4.5), ("winter", 7.5)], ["season", "wind_speed_ms"])

C.1. from pyspark.sql import types as T
2. spark.createDataFrame((("summer", 4.5), ("winter", 7.5)), T.StructType([T.StructField("season",

D.CharType()), T.StructField("season", T.DoubleType())]))

E.spark.newDataFrame([("summer", 4.5), ("winter", 7.5)], ["season", "wind_speed_ms"])

F.spark.createDataFrame({"season": ["winter","summer"], "wind_speed_ms": [4.5, 7.5]})

Correct Answer: B

Explanation
spark.createDataFrame([("summer", 4.5), ("winter", 7.5)], ["season", "wind_speed_ms"]) Correct. This command uses the Spark Session's createDataFrame method to create a new DataFrame. Notice how rows, columns, and column names are passed in here: The rows are specified as a Python list. Every entry in the list is a new row. Columns are specified as Python tuples (for example ("summer", 4.5)). Every column is one entry in the tuple.
The column names are specified as the second argument to createDataFrame(). The documentation (link below) shows that "when schema is a list of column names, the type of each column will be inferred from data" (the first argument). Since values 4.5 and 7.5 are both float variables, Spark will correctly infer the double type for column wind_speed_ms. Given that all values in column
"season" contain only strings, Spark will cast the column appropriately as string.
Find out more about SparkSession.createDataFrame() via the link below.
spark.newDataFrame([("summer", 4.5), ("winter", 7.5)], ["season", "wind_speed_ms"]) No, the SparkSession does not have a newDataFrame method.
from pyspark.sql import types as T
spark.createDataFrame((("summer", 4.5), ("winter", 7.5)), T.StructType([T.StructField("season",
T.CharType()), T.StructField("season", T.DoubleType())]))
No. pyspark.sql.types does not have a CharType type. See link below for available data types in Spark.
spark.createDataFrame({"season": ["winter","summer"], "wind_speed_ms": [4.5, 7.5]}) No, this is not correct Spark syntax. If you have considered this option to be correct, you may have some experience with Python's pandas package, in which this would be correct syntax. To create a Spark DataFrame from a Pandas DataFrame, you can simply use spark.createDataFrame(pandasDf) where pandasDf is the Pandas DataFrame.
Find out more about Spark syntax options using the examples in the documentation for SparkSession.createDataFrame linked below.
spark.DataFrame({"season": ["winter","summer"], "wind_speed_ms": [4.5, 7.5]}) No, the Spark Session (indicated by spark in the code above) does not have a DataFrame method.
More info: pyspark.sql.SparkSession.createDataFrame - PySpark 3.1.1 documentation and Data Types - Spark 3.1.2 Documentation Static notebook | Dynamic notebook: See test 1

Other Version: 2179Databricks.Associate-Developer-Apache-Spark.v2022-08-12.q63; 2794Databricks.Associate-Developer-Apache-Spark.v2022-06-21.q62; 98Databricks.Validbraindumps.Associate-Developer-Apache-Spark.v2022-04-02.by.doreen.61q.pdf

Latest Upload: 170NREMT.EMT.v2026-06-06.q125; 124Juniper.JN0-232.v2026-06-06.q60; 160Oracle.1D0-1057-25-D.v2026-06-03.q29; 292NAHQ.CPHQ.v2026-06-03.q396; 269CompTIA.220-1201.v2026-06-03.q196; 176GIAC.GCFE.v2026-06-03.q78; 169HIMSS.CPHIMS.v2026-06-03.q45; 257Google.Professional-Cloud-Architect.v2026-06-03.q165; 172HP.HPE7-A09.v2026-06-02.q48; 189ACDIS.CCDS-O.v2026-06-02.q56

Associate-Developer-Apache-Spark Exam Question 56

Associate-Developer-Apache-Spark Exam Question 57

Associate-Developer-Apache-Spark Exam Question 58

Associate-Developer-Apache-Spark Exam Question 59

Associate-Developer-Apache-Spark Exam Question 60

Download PDF File