DP-203 Exam Question 11

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You plan to create an Azure Databricks workspace that has a tiered structure. The workspace will contain the following three workloads:
* A workload for data engineers who will use Python and SQL.
* A workload for jobs that will run notebooks that use Python, Scala, and SOL.
* A workload that data scientists will use to perform ad hoc analysis in Scala and R.
The enterprise architecture team at your company identifies the following standards for Databricks environments:
* The data engineers must share a cluster.
* The job cluster will be managed by using a request process whereby data scientists and data engineers provide packaged notebooks for deployment to the cluster.
* All the data scientists must be assigned their own cluster that terminates automatically after 120 minutes of inactivity. Currently, there are three data scientists.
You need to create the Databricks clusters for the workloads.
Solution: You create a Standard cluster for each data scientist, a High Concurrency cluster for the data engineers, and a Standard cluster for the jobs.
Does this meet the goal?
  • DP-203 Exam Question 12

    You are planning the deployment of Azure Data Lake Storage Gen2.
    You have the following two reports that will access the data lake:
    Report1: Reads three columns from a file that contains 50 columns.
    Report2: Queries a single record based on a timestamp.
    You need to recommend in which format to store the data in the data lake to support the reports. The solution must minimize read times.
    What should you recommend for each report? To answer, select the appropriate options in the answer are a.
    NOTE: Each correct selection is worth one point.

    DP-203 Exam Question 13

    You are designing an Azure Stream Analytics job to process incoming events from sensors in retail environments.
    You need to process the events to produce a running average of shopper counts during the previous 15 minutes, calculated at five-minute intervals.
    Which type of window should you use?
  • DP-203 Exam Question 14

    You are designing an inventory updates table in an Azure Synapse Analytics dedicated SQL pool. The table will have a clustered columnstore index and will include the following columns:
    * EventDate: 1 million per day
    * EventTypelD: 10 million per event type
    * WarehouselD: 100 million per warehouse
    * ProductCategoryTypeiD: 25 million per product category type
    You identify the following usage patterns:
    Analyst will most commonly analyze transactions for a warehouse.
    Queries will summarize by product category type, date, and/or inventory event type.
    You need to recommend a partition strategy for the table to minimize query times.
    On which column should you recommend partitioning the table?
  • DP-203 Exam Question 15

    You have an Azure subscription that contains a logical Microsoft SQL server named Server1. Server1 hosts an Azure Synapse Analytics SQL dedicated pool named Pool1.
    You need to recommend a Transparent Data Encryption (TDE) solution for Server1. The solution must meet the following requirements:
    Track the usage of encryption keys.
    Maintain the access of client apps to Pool1 in the event of an Azure datacenter outage that affects the availability of the encryption keys.
    What should you include in the recommendation? To answer, select the appropriate options in the answer are a.
    NOTE: Each correct selection is worth one point.