Free Microsoft DP-100 Exam Actual Questions

The questions for DP-100 were last updated On May 2, 2024

Question No. 1

You need to resolve the local machine learning pipeline performance issue. What should you do?

Show Answer Hide Answer
Correct Answer: A

Question No. 2

You need to select an environment that will meet the business and data requirements.

Which environment should you use?

Show Answer Hide Answer
Correct Answer: D

Question No. 3

You need to implement a scaling strategy for the local penalty detection data.

Which normalization type should you use?

Show Answer Hide Answer
Correct Answer: C

Post batch normalization statistics (PBN) is the Microsoft Cognitive Toolkit (CNTK) version of how to evaluate the population mean and variance of Batch Normalization which could be used in inference Original Paper.

In CNTK, custom networks are defined using the BrainScriptNetworkBuilder and described in the CNTK network description language 'BrainScript.'

Scenario:

Local penalty detection models must be written by using BrainScript.


https://docs.microsoft.com/en-us/cognitive-toolkit/post-batch-normalization-statistics

Question No. 4

You need to implement a feature engineering strategy for the crowd sentiment local models.

What should you do?

Show Answer Hide Answer
Correct Answer: D

The linear discriminant analysis method works only on continuous variables, not categorical or ordinal variables.

Linear discriminant analysis is similar to analysis of variance (ANOVA) in that it works by comparing the means of the variables.

Scenario:

Data scientists must build notebooks in a local environment using automatic feature engineering and model building in machine learning pipelines.

Experiments for local crowd sentiment models must combine local penalty detection data.

All shared features for local models are continuous variables.

Incorrect Answers:

B: The Pearson correlation coefficient, sometimes called Pearson's R test, is a statistical value that measures the linear relationship between two variables. By examining the coefficient values, you can infer something about the strength of the relationship between the two variables, and whether they are positively correlated or negatively correlated.

C: Spearman's correlation coefficient is designed for use with non-parametric and non-normally distributed data. Spearman's coefficient is a nonparametric measure of statistical dependence between two variables, and is sometimes denoted by the Greek letter rho. The Spearman's coefficient expresses the degree to which two variables are monotonically related. It is also called Spearman rank correlation, because it can be used with ordinal variables.


https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/fisher-linear-discriminant-analysis

https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/compute-linear-correlation

Question No. 5

You need to implement a model development strategy to determine a user's tendency to respond to an ad.

Which technique should you use?

Show Answer Hide Answer
Correct Answer: A

Split Data partitions the rows of a dataset into two distinct sets.

The Relative Expression Split option in the Split Data module of Azure Machine Learning Studio is helpful when you need to divide a dataset into training and testing datasets using a numerical expression.

Relative Expression Split: Use this option whenever you want to apply a condition to a number column. The number could be a date/time field, a column containing age or dollar amounts, or even a percentage. For example, you might want to divide your data set depending on the cost of the items, group people by age ranges, or separate data by a calendar date.

Scenario:

Local market segmentation models will be applied before determining a user's propensity to respond to an advertisement.

The distribution of features across training and production data are not consistent


https://docs.microsoft.com/en-us/azure/machine-learning/studio-module-reference/split-data