DP-201 Exam Question 106

A company stores large datasets in Azure, including sales transactions and customer account information.
You must design a solution to analyze the data. You plan to create the following HDInsight clusters:
You need to ensure that the clusters support the query requirements.
Which cluster types should you recommend? To answer, select the appropriate configuration in the answer area.
NOTE: Each correct selection is worth one point.

DP-201 Exam Question 107

You design data engineering solutions for a company.
You must integrate on-premises SQL Server data into an Azure solution that performs Extract-Transform-Load (ETL) operations. The solution has the following requirements:
* Develop a pipeline that can integrate data and run notebooks.
* Develop notebooks to transform the data.
* Load the data into a massively parallel processing database for later analysis.
You need to recommend a solution.
What should you recommend? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.

DP-201 Exam Question 108

You are planning a design pattern based on the Lambda architecture as shown in the exhibit.

Which Azure services should you use for the cold path? To answer, drag the appropriate services to the correct layers. Each service may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.
NOTE: Each correct selection is worth one point.

DP-201 Exam Question 109

You need to design storage for the solution.
Which storage services should you recommend? To answer, select the appropriate configuration in the answer area.
NOTE: Each correct selection is worth one point.

DP-201 Exam Question 110

You are planning a solution that combines log data from multiple systems. The log data will be downloaded from an API and stored in a data store.
You plan to keep a copy of the raw data as well as some transformed versions of the data. You expect that there will be at least 2 TB of log files. The data will be used by data scientists and applications.
You need to recommend a solution to store the data in Azure. The solution must minimize costs.
What storage solution should you recommend?