Databricks-Certified-Professional-Data-Engineer Exam Question 86

A junior developer complains that the code in their notebook isn't producing the correct results in the development environment. A shared screenshot reveals that while they're using a notebook versioned with Databricks Repos, they're using a personal branch that contains old logic. The desired branch nameddev-2.3.9is not available from the branch selection dropdown.
Which approach will allow this developer to review the current logic for this notebook?
  • Databricks-Certified-Professional-Data-Engineer Exam Question 87

    What is the main difference between the below two commands?
    1.INSERT OVERWRITE table_name
    2.SELECT * FROM table
    1.CREATE OR REPLACE TABLE table_name
    2.AS SELECT * FROM table
  • Databricks-Certified-Professional-Data-Engineer Exam Question 88

    A newly joined team member John Smith in the Marketing team currently has access read access to sales tables but does not have access to update the table, which of the following commands help you accomplish this?
  • Databricks-Certified-Professional-Data-Engineer Exam Question 89

    Question-3: In machine learning, feature hashing, also known as the hashing trick (by analogy to the kernel
    trick), is a fast and space-efficient way of vectorizing features (such as the words in a language), i.e., turning
    arbitrary features into indices in a vector or matrix. It works by applying a hash function to the features and
    using their hash values modulo the number of features as indices directly, rather than looking the indices up in
    an associative array. So what is the primary reason of the hashing trick for building classifiers?
  • Databricks-Certified-Professional-Data-Engineer Exam Question 90

    A data engineer has configured a Structured Streaming job to read from a table, manipulate the data, and then
    perform a streaming write into a new table. The code block used by the data engineer is below:
    1. (spark.table("sales")
    2. .withColumn("avg_price", col("sales") / col("units"))
    3. .writeStream
    4. .option("checkpointLocation", checkpointPath)
    5. .outputMode("complete")
    6. ._____
    7. .table("new_sales")
    8.)
    If the data engineer only wants the query to execute a single micro-batch to process all of the available data,
    which of the following lines of code should the data engineer use to fill in the blank?