Online Access Free Databricks.Databricks-Certified-Professional-Data-Engineer.v2023-03-10.q102 Practice Test (Page 18)

Databricks-Certified-Professional-Data-Engineer Exam Question 81

You are building a data pipeline and have noticed that the data source has many data quality issues. You need to monitor data quality and enforce it as part of the data ingestion process. Which of the following tools can be used to address this problem?

A.AUTO LOADER

B.DELTA LIVE TABLES

C.JOBS and TASKS

D.UNITY Catalog and Data Governance

E.STRUCTURED STREAMING with MULTI HOP

Databricks-Certified-Professional-Data-Engineer Exam Question 82

The sample input data below contains two columns: cartId, also known as the session id, and items. Every time a customer makes a change to the cart, the change is stored as an array in the table. The marketing team has asked you to create a unique list of items that were ever added to the cart by each customer. Fill in the blanks by choosing the appropriate array functions so that the query produces the expected result shown below.
Schema: cartId INT, items Array<INT>
Sample Data

SELECT cartId, ___ (___(items)) as items
FROM carts GROUP BY cartId
Expected result:
cartId items
1 [1,100,200,300,250]

A.FLATTEN, COLLECT_UNION

B.ARRAY_UNION, FLATTEN

C.ARRAY_UNION, ARRAY_DISTINCT

D.ARRAY_UNION, COLLECT_SET

E.ARRAY_DISTINCT, ARRAY_UNION
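
For readers unfamiliar with the Spark SQL array functions, the following is a minimal sketch of one way to build a de-duplicated item list per cart. It uses three functions rather than the two blanks in the question, so it is not a statement of which option the exam intends. The sample rows are hypothetical, because the original sample data is not shown here.

```sql
-- Hypothetical sample rows standing in for the missing sample data.
WITH carts AS (
  SELECT * FROM VALUES
    (1, array(1, 100, 200, 300)),
    (1, array(1, 250, 300))
  AS t(cartId, items)
)
SELECT cartId,
       -- collect_list gathers every array for the cart into an array of arrays,
       -- flatten merges them into a single array,
       -- array_distinct removes the duplicate item ids.
       array_distinct(flatten(collect_list(items))) AS items
FROM carts
GROUP BY cartId;
```

The order of elements in the result is not guaranteed, since collect_list does not promise a deterministic order after a shuffle.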

Databricks-Certified-Professional-Data-Engineer Exam Question 83

Which of the following techniques does Structured Streaming use to ensure recovery from failures during stream processing?

A.Checkpointing and Watermarking

B.Write ahead logging and watermarking

C.Checkpointing and write-ahead logging

D.Delta time travel

E.The stream will failover to available nodes in the cluster

F.Checkpointing and Idempotent sinks

Databricks-Certified-Professional-Data-Engineer Exam Question 84

How can the VACUUM and OPTIMIZE commands be used to manage Delta Lake?

A.VACUUM command can be used to compact small parquet files, and the OPTIMIZE command can be used to delete parquet files that are marked for deletion/unused.

B.VACUUM command can be used to delete empty/blank parquet files in a delta table. OPTIMIZE command can be used to update stale statistics on a delta table.

C.VACUUM command can be used to compress the parquet files to reduce the size of the table, OPTIMIZE command can be used to cache frequently used delta tables for better performance.

D.VACUUM command can be used to delete empty/blank parquet files in a delta table, OPTIMIZE command can be used to cache frequently used delta tables for better performance.

E.OPTIMIZE command can be used to compact small parquet files, and the VACUUM command can be used to delete parquet files that are marked for deletion/unused.
(Correct)
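
As a rough illustration of the option marked correct above, the two commands might be run as follows. The table name sales_events is hypothetical, and the 168-hour retention is written out only for clarity (it equals Delta Lake's default retention threshold of 7 days).

```sql
-- `sales_events` is a hypothetical Delta table name used for illustration.

-- Compact many small Parquet files into fewer, larger ones.
OPTIMIZE sales_events;

-- Preview which unreferenced files would be deleted, without deleting them.
VACUUM sales_events RETAIN 168 HOURS DRY RUN;

-- Delete data files that the table no longer references
-- and that are older than the retention threshold.
VACUUM sales_events RETAIN 168 HOURS;
```

Running OPTIMIZE first leaves the old small files in place but unreferenced; a later VACUUM removes them once they pass the retention window.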

Databricks-Certified-Professional-Data-Engineer Exam Question 85

You have written a notebook to generate a summary data set for reporting. The notebook was scheduled using a job cluster, but you realized it takes 8 minutes to start the cluster. What feature can be used to start the cluster in a timely fashion so your job can run immediately?

A.Set up an additional job to run ahead of the actual job so the cluster is running when the second job starts

B.Use the Databricks cluster pools feature to reduce the startup time

C.Use Databricks Premium edition instead of Databricks standard edition

D.Pin the cluster in the cluster UI page so it is always available to the jobs

E.Disable auto termination so the cluster is always running
