DP-100 Exam Question 101

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.
After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.
You are analyzing a numerical dataset which contains missing values in several columns.
You must clean the missing values using an appropriate operation without affecting the dimensionality of the feature set.
You need to analyze a full dataset to include all values.
Solution: Calculate the column median value and use the median value as the replacement for any missing value in the column.
Does the solution meet the goal?
  • DP-100 Exam Question 102

    You are solving a classification task.
    The dataset is imbalanced.
    You need to select an Azure Machine Learning Studio module to improve the classification accuracy.
    Which module should you use?
  • DP-100 Exam Question 103

    You are conducting feature engineering to prepuce data for further analysis.
    The data includes seasonal patterns on inventory requirements.
    You need to select the appropriate method to conduct feature engineering on the data.
    Which method should you use?
  • DP-100 Exam Question 104

    You use the Azure Machine Learning SDK to run a training experiment that trains a classification model and calculates its accuracy metric.
    The model will be retrained each month as new data is available.
    You must register the model for use in a batch inference pipeline.
    You need to register the model and ensure that the models created by subsequent retraining experiments are registered only if their accuracy is higher than the currently registered model.
    What are two possible ways to achieve this goal? Each correct answer presents a complete solution.
    NOTE: Each correct selection is worth one point.
  • DP-100 Exam Question 105

    You use the Azure Machine Learning Python SDK to create a batch inference pipeline.
    You must publish the batch inference pipeline so that business groups in your organization can use the pipeline. Each business group must be able to specify a different location for the data that the pipeline submits to the model for scoring.
    You need to publish the pipeline.
    What should you do?