Online Access Free Google.Professional-Machine-Learning-Engineer.v2025-07-22.q137 Practice Test (Page 13)

Professional-Machine-Learning-Engineer Exam Question 56

You work for a bank and are building a random forest model for fraud detection. You have a dataset that includes transactions, of which 1% are identified as fraudulent. Which data transformation strategy would likely improve the performance of your classifier?

A.Write your data in TFRecords.

B.Z-normalize all the numeric features.

C.Oversample the fraudulent transaction 10 times.

D.Use one-hot encoding on all categorical features.

Professional-Machine-Learning-Engineer Exam Question 57

While performing exploratory data analysis on a dataset, you find that an important categorical feature has 5% null values. You want to minimize the bias that could result from the missing values. How should you handle the missing values?

A.Remove the rows with missing values, and upsample your dataset by 5%.

B.Replace the missing values with the feature's mean.

C.Replace the missing values with a placeholder category indicating a missing value.

D.Move the rows with missing values to your validation dataset.

Correct Answer: C

The best option for handling missing values in a categorical feature is to replace them with a placeholder category indicating a missing value. This is a type of imputation, which is a method of estimating the missing values based on the observed data. Imputing the missing values with a placeholder category preserves the information that the data is missing, and avoids introducing bias or distortion in the feature distribution. It also allows the machine learning model to learn from the missingness pattern, and potentially use it as a predictor for the target variable. The other options are not suitable for handling missing values in a categorical feature, because:
* Removing the rows with missing values and upsampling the dataset by 5% would reduce the size of the dataset and potentially lose important information. It would also introduce sampling bias and overfitting, as the upsampling process would create duplicate or synthetic observations that do not reflect the true population.
* Replacing the missing values with the feature's mean would not make sense for a categorical feature, as the mean is a numerical measure that does not capture the mode or frequency of the categories. It would also create a new category that does not exist in the original data, and might confuse the machine learning model.
* Moving the rows with missing values to the validation dataset would compromise the validity and reliability of the model evaluation, as the validation dataset would not be representative of the test or production data. It would also reduce the amount of data available for training the model, and might introduce leakage or inconsistency between the training and validation datasets. References:
* Imputation of missing values
* Effective Strategies to Handle Missing Values in Data Analysis
* How to Handle Missing Values of Categorical Variables?
* Google Cloud launches machine learning engineer certification
* Google Professional Machine Learning Engineer Certification
* Professional ML Engineer Exam Guide
* Preparing for Google Cloud Certification: Machine Learning Engineer Professional Certificate

Professional-Machine-Learning-Engineer Exam Question 58

You are creating a deep neural network classification model using a dataset with categorical input values.
Certain columns have a cardinality greater than 10,000 unique values. How should you encode these categorical values as input into the model?

A.Convert each categorical value into an integer value.

B.Convert the categorical string data to one-hot hash buckets.

C.Map the categorical variables into a vector of boolean values.

D.Convert each categorical value into a run-length encoded string.

Professional-Machine-Learning-Engineer Exam Question 59

You need to use TensorFlow to train an image classification model. Your dataset is located in a Cloud Storage directory and contains millions of labeled images Before training the model, you need to prepare the data.
You want the data preprocessing and model training workflow to be as efficient scalable, and low maintenance as possible. What should you do?

A.1 Create a Dataflow job that creates sharded TFRecord files in a Cloud Storage directory.
2 Reference tf .data.TFRecordDataset in the training script.
3. Train the model by using Vertex Al Training with a V100 GPU.

B.1 Create a Dataflow job that moves the images into multiple Cloud Storage directories, where each directory is named according to the corresponding label.
2 Reference tfds.fclder_da-asst.imageFclder in the training script.
3. Train the model by using Vertex AI Training with a V100 GPU.

C.1 Create a Jupyter notebook that uses an n1-standard-64, V100 GPU Vertex Al Workbench instance.
2 Write a Python script that creates sharded TFRecord files in a directory inside the instance
3. Reference tf. da-a.TFRecrrdDataset in the training script.
4. Train the model by using the Workbench instance.

D.1 Create a Jupyter notebook that uses an n1-standard-64, V100 GPU Vertex Al Workbench instance.
2 Write a Python scnpt that copies the images into multiple Cloud Storage directories, where each directory is named according to the corresponding label.
3 Reference tf ds. f older_dataset. imageFolder in the training script.
4. Train the model by using the Workbench instance.

Professional-Machine-Learning-Engineer Exam Question 60

You need to design a customized deep neural network in Keras that will predict customer purchases based on their purchase history. You want to explore model performance using multiple model architectures, store training data, and be able to compare the evaluation metrics in the same dashboard. What should you do?

A.Create multiple models using AutoML Tables

B.Automate multiple training runs using Cloud Composer

C.Run multiple training jobs on Al Platform with similar job names

D.Create an experiment in Kubeflow Pipelines to organize multiple runs

Premium Bundle

Newest Professional-Machine-Learning-Engineer Exam PDF Dumps shared by Actual4test.com for Helping Passing Professional-Machine-Learning-Engineer Exam! Actual4test.com now offer the updated Professional-Machine-Learning-Engineer exam dumps, the Actual4test.com Professional-Machine-Learning-Engineer exam questions have been updated and answers have been corrected get the latest Actual4test.com Professional-Machine-Learning-Engineer pdf dumps with Exam Engine here:

Access Professional-Machine-Learning-Engineer Premium Version

(300 Q&As Dumps, 30%OFF Special Discount: Freepdfdumps)

Other Version: 781Google.Professional-Machine-Learning-Engineer.v2025-08-30.q138; 1705Google.Professional-Machine-Learning-Engineer.v2025-07-21.q129; 1289Google.Professional-Machine-Learning-Engineer.v2023-04-15.q71; 1006Google.Professional-Machine-Learning-Engineer.v2023-01-24.q55; 2607Google.Professional-Machine-Learning-Engineer.v2022-08-16.q71; 120Google.Braindumpquiz.Professional-Machine-Learning-Engineer.v2022-06-03.by.dorothy.67q.pdf; 1636Google.Professional-Machine-Learning-Engineer.v2022-02-28.q67; 1841Google.Professional-Machine-Learning-Engineer.v2021-12-10.q53; 55Google.Exam4labs.Professional-Machine-Learning-Engineer.v2021-09-02.by.giles.53q.pdf

Latest Upload: 141Oracle.1D0-1057-25-D.v2026-06-03.q29; 273NAHQ.CPHQ.v2026-06-03.q396; 254CompTIA.220-1201.v2026-06-03.q196; 156GIAC.GCFE.v2026-06-03.q78; 153HIMSS.CPHIMS.v2026-06-03.q45; 233Google.Professional-Cloud-Architect.v2026-06-03.q165; 156HP.HPE7-A09.v2026-06-02.q48; 167ACDIS.CCDS-O.v2026-06-02.q56; 149Microsoft.AB-730.v2026-06-02.q31; 212ASQ.CSSBB.v2026-06-02.q130