Databricks-Certified-Professional-Data-Engineer Exam Question 6

Which statement describes Delta Lake Auto Compaction?
  • Databricks-Certified-Professional-Data-Engineer Exam Question 7

    An upstream source writes Parquet data as hourly batches to directories named with the current date. A nightly batch job runs the following code to ingest all data from the previous day as indicated by the date variable:

    Assume that the fields customer_id and order_id serve as a composite key to uniquely identify each order.
    If the upstream system is known to occasionally produce duplicate entries for a single order hours apart, which statement is correct?
  • Databricks-Certified-Professional-Data-Engineer Exam Question 8

    Which configuration parameter directly affects the size of a Spark partition upon ingestion of data into Spark?
  • Databricks-Certified-Professional-Data-Engineer Exam Question 9

    Which statement is true regarding the retention of job run history?
  • Databricks-Certified-Professional-Data-Engineer Exam Question 10

    The downstream consumers of a Delta Lake table have been complaining about data quality issues impacting performance in their applications. Specifically, they have complained that invalid latitude and longitude values in the activity_details table have been breaking their ability to use other geolocation processes.
    A junior engineer has written the following code to add CHECK constraints to the Delta Lake table:

    A senior engineer has confirmed the above logic is correct and the valid ranges for latitude and longitude are provided, but the code fails when executed.
    Which statement explains the cause of this failure?
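
When reasoning about a scenario like Question 10, it can help to remember that Delta Lake checks existing rows against a CHECK constraint at the moment the constraint is added, so pre-existing bad records are one thing worth ruling out. The plain-Python sketch below only illustrates that idea and is not the junior engineer's code, which is not reproduced in this listing. The sample rows, the helper names is_valid and find_violations, and the ranges (latitude from -90 to 90, longitude from -180 to 180) are assumptions made for illustration.

```python
# Hypothetical sample rows standing in for records already stored in
# activity_details; the real table contents are not known here.
rows = [
    {"id": 1, "latitude": 37.77, "longitude": -122.42},
    {"id": 2, "latitude": 95.0, "longitude": 10.0},    # latitude out of range
    {"id": 3, "latitude": -45.0, "longitude": 181.5},  # longitude out of range
]


def is_valid(row):
    """Return True when both coordinates fall inside the assumed ranges."""
    return -90 <= row["latitude"] <= 90 and -180 <= row["longitude"] <= 180


def find_violations(rows):
    """Return the ids of rows that would violate the assumed range checks."""
    return [row["id"] for row in rows if not is_valid(row)]


violations = find_violations(rows)
print(violations)
```

If a check of this kind finds any violating rows in the real table, adding an equivalent CHECK constraint would be expected to fail until those rows are corrected or removed.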