Free IIBA CBDA Exam Actual Questions

The questions for CBDA were last updated On May 2, 2024

Question No. 1

An analyst is performing regression analysis and reviewing the results. They would like to rescale the variables in the model to more clearly reflect the relationship between the regression coefficients. Which technique could be used to rescale the variables?

Show Answer Hide Answer
Correct Answer: C

Question No. 2

After completing their data analysis, an analyst is drawing out the results, explaining the methods and processes used, and identifying any limitations or weaknesses in the data or methods applied. While performing these steps, which recommended practice would the analyst apply?

Show Answer Hide Answer
Correct Answer: B

Question No. 3

An analyst is working through data on comparing performance scores in different schools across the state, for ranking purposes. Since there is a lot of data and some extreme outliers, the analyst is trying to determine which type of statistical average would best represent the results. Which of the following is a concern when relying too heavily on summary statistics during data analysis?

Show Answer Hide Answer
Correct Answer: A

Summary statistics are numerical measures that describe certain characteristics of a data set, such as the mean, median, mode, standard deviation, range, or quartiles. Summary statistics can help simplify and communicate complex data, but they can also obscure or distort important information, such as the distribution, shape, outliers, or trends of the data. Contextualization is the process of providing relevant background information, assumptions, limitations, or explanations for the data analysis and its results. Contextualization can help avoid misinterpretation, confusion, or bias when using summary statistics. Contextualization can also help connect the data analysis to the business problem, objectives, and stakeholders.


Question No. 4

An analyst is doing a clinical study on the value of analyte among a large population of healthy people. The analyst is going to use a Gaussian Distribution to share the results. Which of the following represents a Gaussian Distribution?

Show Answer Hide Answer
Correct Answer: B

The Gaussian distribution, also known as the normal distribution, is a probability distribution that is symmetric about the mean, showing that data near the mean are more frequent in occurrence than data far from the mean. In graph form, the Gaussian distribution will appear as a bell curve, which is the case with option A. It is characterized by its bell-shaped curve and is defined by the mean () and the standard deviation (). It is a common assumption for the distribution of independent, randomly generated variables.


Question No. 5

A data scientist is analyzing a dataset to determine if there is a strong relationship between two variables. A measure of covariance is done. Which of the following graphs indicate Zero Covariance between variables?

Show Answer Hide Answer
Correct Answer: C

Covariance measures the directional relationship between the returns on two assets. A positive covariance means that asset returns move together while a negative covariance means they move inversely. Zero covariance indicates that the returns on the two assets move independently of each other. In the context of a scatter plot, zero covariance is represented by a plot where the points do not show any upward or downward trend but are rather scattered randomly on the graph with no discernible pattern.

Graph 4 displays such a pattern where there is no apparent relationship between the variables on the x and y axes, indicating that there is zero covariance between them.