DP-203 Exam Question 76

You are designing a date dimension table in an Azure Synapse Analytics dedicated SQL pool. The date dimension table will be used by all the fact tables.
Which distribution type should you recommend to minimize data movement?
  • DP-203 Exam Question 77

    You have an Azure Synapse Analytics dedicated SQL pool named pool1.
    You plan to implement a star schema in pool1 and create a new table named DimCustomer by using the following code.

    You need to ensure that DimCustomer has the necessary columns to support a Type 2 slowly changing dimension (SCD). Which two columns should you add? Each correct answer presents part of the solution. NOTE: Each correct selection is worth one point.
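As background on what a Type 2 SCD does, here is a minimal Python sketch of the row-versioning behaviour: a change to a tracked attribute closes the current row and appends a new one, so history is preserved. The field names (`customer_sk`, `customer_id`, `city`, `valid_from`, `valid_to`) are hypothetical illustrations. They are not taken from the DimCustomer code referenced in the question, and the sketch is not the exam's answer key.

```python
from dataclasses import dataclass, replace
from datetime import date
from typing import List, Optional


@dataclass(frozen=True)
class CustomerRow:
    customer_sk: int           # surrogate key, unique per row version (hypothetical name)
    customer_id: str           # business key from the source system (hypothetical name)
    city: str                  # the tracked attribute in this toy example
    valid_from: date           # start of the version's validity (hypothetical name)
    valid_to: Optional[date]   # None marks the current version (hypothetical name)


def apply_change(rows: List[CustomerRow], customer_id: str,
                 new_city: str, change_date: date) -> List[CustomerRow]:
    """Close the current version for customer_id and append a new version."""
    next_sk = max((r.customer_sk for r in rows), default=0) + 1
    updated = []
    for r in rows:
        if r.customer_id == customer_id and r.valid_to is None:
            # Expire the old version instead of overwriting it.
            r = replace(r, valid_to=change_date)
        updated.append(r)
    updated.append(CustomerRow(next_sk, customer_id, new_city, change_date, None))
    return updated


history = [CustomerRow(1, "C001", "Seattle", date(2020, 1, 1), None)]
history = apply_change(history, "C001", "Portland", date(2022, 6, 1))
for row in history:
    print(row)
```

The business key repeats across versions, which is why each version needs its own surrogate key and some way to tell which version was valid at a given time.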
  • DP-203 Exam Question 78

    Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
    After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
    You plan to create an Azure Databricks workspace that has a tiered structure. The workspace will contain the following three workloads:
    A workload for data engineers who will use Python and SQL.
    A workload for jobs that will run notebooks that use Python, Scala, and SQL.
    A workload that data scientists will use to perform ad hoc analysis in Scala and R.
    The enterprise architecture team at your company identifies the following standards for Databricks environments:
    The data engineers must share a cluster.
    The job cluster will be managed by using a request process whereby data scientists and data engineers provide packaged notebooks for deployment to the cluster.
    All the data scientists must be assigned their own cluster that terminates automatically after 120 minutes of inactivity. Currently, there are three data scientists.
    You need to create the Databricks clusters for the workloads.
    Solution: You create a Standard cluster for each data scientist, a Standard cluster for the data engineers, and a High Concurrency cluster for the jobs.
    Does this meet the goal?
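The auto-termination standard in the scenario can be sketched as cluster specifications, one per data scientist. This is a hedged illustration only: the keys follow the field names used by the Databricks Clusters API (`cluster_name`, `spark_version`, `node_type_id`, `num_workers`, `autotermination_minutes`), while the cluster names, the worker count, and the placeholder values in angle brackets are assumptions. The sketch does not address which cluster mode supports which language, which is what the question is testing.

```python
import json


def scientist_cluster_spec(name: str) -> dict:
    """Build a per-user cluster spec that shuts down after 120 idle minutes."""
    return {
        "cluster_name": f"ds-{name}",           # hypothetical naming scheme
        "spark_version": "<runtime-version>",   # placeholder, fill in for your workspace
        "node_type_id": "<node-type>",          # placeholder, fill in for your workspace
        "num_workers": 1,                       # assumption, not stated in the scenario
        "autotermination_minutes": 120,         # from the stated standard
    }


# The scenario states there are currently three data scientists.
scientists = ["scientist-1", "scientist-2", "scientist-3"]
specs = [scientist_cluster_spec(n) for n in scientists]

print(json.dumps(specs[0], indent=2))
```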
  • DP-203 Exam Question 79

    You have data stored in thousands of CSV files in Azure Data Lake Storage Gen2. Each file has a header row followed by a properly formatted carriage return (\r) and line feed (\n).
    You are implementing a pattern that batch loads the files daily into an enterprise data warehouse in Azure Synapse Analytics by using PolyBase.
    You need to skip the header row when you import the files into the data warehouse. Before building the loading pattern, you need to prepare the required database objects in Azure Synapse Analytics.
    Which three actions should you perform in sequence? To answer, move the appropriate actions from the list of actions to the answer area and arrange them in the correct order.
    NOTE: Each correct selection is worth one point.

  • DP-203 Exam Question 80

    You have an on-premises data warehouse that includes the following fact tables. Both tables have the following columns: DateKey, ProductKey, RegionKey. There are 120 unique product keys and 65 unique region keys.

    Queries that use the data warehouse take a long time to complete.
    You plan to migrate the solution to use Azure Synapse Analytics. You need to ensure that the Azure-based solution optimizes query performance and minimizes processing skew.
    What should you recommend? To answer, select the appropriate options in the answer area.
    NOTE: Each correct selection is worth one point.
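To make "processing skew" concrete, the following toy Python sketch counts how many distinct key values land on each of the 60 distributions that a dedicated SQL pool uses. It rests on loud assumptions: the engine's real hash function is not public, so a simple modulo stands in for it, and every key is assumed to carry the same number of rows. It illustrates the concept and is not a statement of the exam's answer.

```python
from collections import Counter

DISTRIBUTIONS = 60  # a dedicated SQL pool spreads each table over 60 distributions


def keys_per_distribution(distinct_keys: int) -> Counter:
    """Assign keys 1..distinct_keys to distributions with a toy modulo hash."""
    counts = Counter({d: 0 for d in range(DISTRIBUTIONS)})
    for key in range(1, distinct_keys + 1):
        counts[key % DISTRIBUTIONS] += 1  # stand-in for the real hash function
    return counts


def skew_ratio(counts: Counter) -> float:
    """Ratio of the busiest distribution to the least busy one (1.0 = even)."""
    return max(counts.values()) / min(counts.values())


region = keys_per_distribution(65)    # 65 unique region keys, from the question
product = keys_per_distribution(120)  # 120 unique product keys, from the question

print("RegionKey skew ratio:", skew_ratio(region))
print("ProductKey skew ratio:", skew_ratio(product))
```

Under these assumptions, a column with only slightly more distinct values than distributions leaves some distributions with twice the work of others, while a column whose distinct count divides evenly spreads the work uniformly. Real data rarely has equal rows per key, so actual skew should be measured on the loaded table.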