Online Access Free 70-775 Exam Questions

Exam Code:70-775
Exam Name:Perform Data Engineering on Microsoft Azure HDInsight
Certification Provider:Microsoft
Free Question Number:63
Posted:May 31, 2026
Rating
100%

Question 1

Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You need to deploy a domain-joined HDInsight cluster for query and analytics batch jobs. The cluster will be managed by using HDInsight applications. Apache Oozie will be used for scheduling jobs.
What should you do?

Question 2

Note: This question is part of a series of questions that use the same scenario. For your convenience, the scenario is repeated in each question. Each question presents a different goal and answer choices, but the text of the scenario is exactly the same in each question in this series.
You have an initial dataset that contains the crime data from major cities.
You plan to build training models from the training dat
a. You plan to automate the process of adding more data to the training models and to constantly tune the models by using the additional data, including data that is collected in near real-time. The system will be used to analyze event data gathered from many different sources, such as Internet of Things (IoT) devices, live video surveillance, and traffic activities, and to generate predictions of an increased crime risk at a particular time and place.
You have an incoming data stream from Twitter and an incoming data stream from Facebook, which are event-based only, rather than time-based. You also have a time interval stream every 10 seconds.
The data is in a key/value pair format. The value field represents a number that defines how many times a hashtag occurs within a Facebook post, or how many times a Tweet that contains a specific hashtag is retweeted.
You must use the appropriate data storage, stream analytics techniques, and Azure HDInsight cluster types for the various tasks associated to the processing pipeline.
You are designing the real-time portion of the input stream processing. The input will be a continuous stream of data and each record will be processed one at a time. The data will come from an Apache Kafka producer.
You need to identify which HDInsight cluster to use for the final processing of the input data. This will be used to generate continuous statistics and real-time analytics. The latency to process each record must be less than one millisecond and tasks must be performed in parallel.
Which type of cluster should you identify?

Question 3

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this sections, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You have an Apache Pig table named Sales in Apache HCatalog.
You need to make the data in the table accessible from Apache Pig.
Solution: You use the following script.

Does this meet the goal?

Question 4

DRAG DROP
You have a customer who recently created a new Azure subscription.
You need to create an Azure HDInsight cluster for the customer.
Which four components should you create in sequence? To answer, move the appropriate components from the list of components to the answer area and arrange them in the correct order.

Question 5

Note: This question is part of a series of questions that use the same or similar answer choices. An answer choice may be correct for more than one question in the series. Each question is independent of the other questions in this series. Information and details provided in a question apply only to that question.
You need to deploy an HDInsight cluster. The cluster will host a low-latency NoSQL database that uses the key/value pair format in a columnar model. The cluster has the following requirements:
Supports SSH access
Supports autosharding
Uses Azure Data Lake Store as the default storage
What should you do?

Add Comments

Your email address will not be published. Required fields are marked *

insert code
Type the characters from the picture.