Question 1

What is the mean of a dataset?

Accepted Answer

The mean is the arithmetic average, calculated by summing all values and dividing by the number of values. Formula: x̄ = Σxᵢ / n. It is sensitive to outliers and is the most commonly used measure of central tendency.
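As a small illustration of the formula above, here is a minimal Python sketch. The function name `mean` and the sample data are made up for this example.

```python
def mean(values):
    """Arithmetic mean: sum of the values divided by how many there are."""
    if not values:
        raise ValueError("mean of an empty dataset is undefined")
    return sum(values) / len(values)


scores = [2, 4, 4, 5, 10]        # hypothetical data
print(mean(scores))              # 5.0

# A single extreme value pulls the mean far from the bulk of the data.
print(mean(scores + [100]))      # about 20.83
```

The second call shows the sensitivity to outliers: five of the six values are at most 10, yet the mean is above 20.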

Question 2

What is the median and when is it preferred over the mean?

Accepted Answer

The median is the middle value when the data is sorted. For an odd number of observations, it is the single middle value; for an even number, it is the average of the two middle values. It is preferred over the mean when data is skewed or contains outliers, as it is more robust.
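The odd and even cases can be sketched in a few lines of Python. The function name `median` and the income figures are illustrative assumptions, not part of the original answer.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if n is even."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty dataset is undefined")
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


incomes = [30, 32, 35, 38, 40]     # hypothetical data, odd count
print(median(incomes))             # 35

# Adding one extreme value barely moves the median (even count now).
print(median(incomes + [1000]))    # 36.5
```

The outlier of 1000 shifts the median by only 1.5, which is the robustness the answer refers to.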

Question 3

What is the mode of a dataset?

Accepted Answer

The mode is the value that appears most frequently in a dataset. A dataset can be unimodal (one mode), bimodal (two modes), or multimodal (more than two modes). If no value repeats, the dataset has no mode.
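A minimal sketch of finding the mode or modes, assuming the convention stated above that a dataset with no repeated value has no mode. The function name `modes` is hypothetical.

```python
from collections import Counter


def modes(values):
    """Return all most-frequent values, or an empty list if no value repeats."""
    counts = Counter(values)
    if not counts:
        return []
    top = max(counts.values())
    if top == 1:
        return []                      # no value repeats: no mode
    return sorted(v for v, c in counts.items() if c == top)


print(modes([1, 2, 2, 3]))       # [2]      unimodal
print(modes([1, 1, 2, 2, 3]))    # [1, 2]   bimodal
print(modes([1, 2, 3]))          # []       no mode
```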

Question 4

What is standard deviation and what does it measure?

Accepted Answer

Standard deviation measures how far data points typically lie from the mean. A low standard deviation indicates that data points cluster close to the mean, while a high standard deviation indicates that they are spread out. Formula for a population: σ = √(Σ(xᵢ - μ)² / N), where μ is the population mean and N is the population size.
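The population formula can be sketched directly in Python. The function name `population_std` and both datasets are made up for illustration.

```python
import math


def population_std(values):
    """Square root of the mean squared deviation from the mean."""
    n = len(values)
    center = sum(values) / n
    return math.sqrt(sum((x - center) ** 2 for x in values) / n)


tight = [9, 10, 10, 11]     # hypothetical data clustered near the mean of 10
wide = [0, 5, 15, 20]       # same mean of 10, but far more spread out

print(population_std(tight))   # about 0.71
print(population_std(wide))    # about 7.91
```

Both datasets share the same mean, so the difference in the two results comes entirely from how far the points sit from it.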

Question 5

What is the difference between population and sample standard deviation?

Accepted Answer

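Both versions take the square root of the sum of squared deviations from the mean divided by a count; they differ only in the divisor. Population standard deviation divides by N (the total population size), while sample standard deviation divides by n - 1 (the degrees of freedom). Dividing by n - 1 (Bessel's correction) makes the sample variance s² an unbiased estimate of the population variance.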
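To make the difference in divisors concrete, the sketch below computes both versions by hand and compares them with Python's standard `statistics` module (`pstdev` for the population version, `stdev` for the sample version). The helper `std_dev` and its `ddof` parameter name are illustrative assumptions.

```python
import math
import statistics


def std_dev(values, ddof=0):
    """ddof=0 divides by n (population); ddof=1 divides by n - 1 (sample)."""
    n = len(values)
    center = sum(values) / n
    squared = sum((x - center) ** 2 for x in values)
    return math.sqrt(squared / (n - ddof))


data = [2, 4, 4, 4, 5, 5, 7, 9]     # hypothetical data, mean 5

print(std_dev(data, ddof=0))        # 2.0, divides 32 by 8
print(std_dev(data, ddof=1))        # about 2.14, divides 32 by 7

# The standard library exposes the same two conventions.
print(statistics.pstdev(data), statistics.stdev(data))
```

The sample version is always the larger of the two for the same data, and the gap shrinks as n grows.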

Question 6

What is variance and how does it relate to standard deviation?

Accepted Answer

Variance is the average of the squared differences from the mean. For a population: σ² = Σ(xᵢ - μ)² / N. Standard deviation is the square root of variance. Because variance is expressed in squared units of the original data, standard deviation, which is in the original units, is easier to interpret.
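A short sketch of the relationship, using made-up heights in centimetres so the units are visible. The function name `population_variance` is hypothetical.

```python
import math


def population_variance(values):
    """Mean of the squared deviations from the mean."""
    n = len(values)
    center = sum(values) / n
    return sum((x - center) ** 2 for x in values) / n


heights_cm = [150, 160, 170, 180, 190]      # hypothetical data

variance = population_variance(heights_cm)  # 200.0, in cm squared
std = math.sqrt(variance)                   # about 14.14, in cm

print(variance, std)
```

A spread of about 14 cm can be read directly against the data, while 200 cm² cannot, which is why the standard deviation is usually the figure reported.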

Question 7

What is the range of a dataset?

Accepted Answer

The range is the difference between the maximum and minimum values: Range = Max - Min. It is the simplest measure of dispersion but is highly sensitive to outliers and does not reflect how data is distributed between the extremes.
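The two weaknesses mentioned above can be seen in a small sketch. The function name `value_range` and the datasets are illustrative.

```python
def value_range(values):
    """Difference between the largest and smallest value."""
    return max(values) - min(values)


clustered = [1, 5, 5, 5, 5, 5, 9]     # hypothetical: most values at the centre
even = [1, 2, 3, 5, 7, 8, 9]          # hypothetical: values spread evenly

# Same range, very different distributions between the extremes.
print(value_range(clustered), value_range(even))    # 8 8

# One outlier changes the range completely.
print(value_range([1, 5, 5, 5, 5, 5, 90]))          # 89
```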

Question 8

What are quartiles and the interquartile range (IQR)?

Accepted Answer

Quartiles are the three cut points that divide sorted data into four equal parts: Q1 (25th percentile), Q2 (the median, 50th percentile), and Q3 (75th percentile). The IQR is Q3 - Q1 and measures the spread of the middle 50% of the data. It is robust to outliers.
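Quartiles can be computed with `statistics.quantiles` from Python's standard library (Python 3.8+). Several conventions exist for interpolating between data points, so other tools may return slightly different values; the `method="inclusive"` choice below is an assumption made for this sketch, and the data is made up.

```python
import statistics

data = [1, 2, 3, 4, 5, 6, 7, 8, 9]     # hypothetical data

# n=4 asks for the three cut points that split the data into quarters.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")
iqr = q3 - q1

print(q1, q2, q3)    # 3.0 5.0 7.0
print(iqr)           # 4.0
```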

Question 9

How do you detect outliers using the IQR method?

Accepted Answer

A data point is considered an outlier if it falls below Q1 - 1.5 × IQR or above Q3 + 1.5 × IQR. This method is used in box plots to identify unusually extreme values in a dataset.
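The rule above can be sketched as follows. The function name `iqr_outliers` is hypothetical, and the quartiles come from `statistics.quantiles` with `method="inclusive"`, which is one of several quartile conventions; a different convention can move the fences slightly.

```python
import statistics


def iqr_outliers(values):
    """Return the values below Q1 - 1.5 * IQR or above Q3 + 1.5 * IQR."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low = q1 - 1.5 * iqr
    high = q3 + 1.5 * iqr
    return [x for x in values if x < low or x > high]


data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 30]     # hypothetical data

# Here Q1 = 3.25 and Q3 = 7.75, so the fences are -3.5 and 14.5.
print(iqr_outliers(data))                  # [30]
```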

Question 10

What is a skewed distribution?

Accepted Answer

A distribution is skewed when it is not symmetric. In a right-skewed (positive skew) distribution, the tail extends to the right and the mean is typically greater than the median. In a left-skewed (negative skew) distribution, the tail extends to the left and the mean is typically less than the median.
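A rough way to see skew in practice is to compare the mean with the median and to check the sign of a skewness coefficient. The sketch below uses made-up data; the helper `moment_skewness` (the third central moment divided by the second central moment raised to the power 1.5) is an addition for illustration and is not part of the original answer.

```python
import statistics


def moment_skewness(values):
    """Moment-based skewness: positive for a right tail, negative for a left tail."""
    n = len(values)
    center = sum(values) / n
    m2 = sum((x - center) ** 2 for x in values) / n
    m3 = sum((x - center) ** 3 for x in values) / n
    return m3 / m2 ** 1.5


right = [1, 2, 2, 3, 3, 3, 4, 20]       # hypothetical data with a long right tail
left = [21 - x for x in right]          # mirror image: long left tail

print(statistics.mean(right), statistics.median(right))   # 4.75 3.0
print(statistics.mean(left), statistics.median(left))     # 16.25 18.0
print(moment_skewness(right) > 0, moment_skewness(left) < 0)   # True True
```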

Question 11

What is the addition rule of probability?

Accepted Answer

For any two events A and B: P(A ∪ B) = P(A) + P(B) - P(A ∩ B). The subtraction of the intersection prevents double-counting. For mutually exclusive events, P(A ∩ B) = 0, so it simplifies to P(A ∪ B) = P(A) + P(B).
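The rule can be checked by enumerating a small sample space. The sketch below assumes one roll of a fair six-sided die; the events and the helper `prob` are made up for illustration.

```python
from fractions import Fraction

sample_space = set(range(1, 7))      # one roll of a fair six-sided die


def prob(event):
    """Probability of an event when all outcomes are equally likely."""
    return Fraction(len(event & sample_space), len(sample_space))


A = {2, 4, 6}        # roll is even
B = {4, 5, 6}        # roll is greater than 3

direct = prob(A | B)                          # count the union directly
by_rule = prob(A) + prob(B) - prob(A & B)     # 1/2 + 1/2 - 1/3
print(direct, by_rule)                        # 2/3 2/3

# Mutually exclusive events: the intersection is empty, so nothing is subtracted.
C, D = {1}, {2}
print(prob(C | D), prob(C) + prob(D))         # 1/3 1/3
```

Without the subtraction, the outcomes 4 and 6 would be counted twice and the sum would be 1 rather than 2/3.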

Question 12

What is the multiplication rule of probability?

Accepted Answer

For two events A and B: P(A ∩ B) = P(A) × P(B|A). If A and B are independent, this simplifies to P(A ∩ B) = P(A) × P(B). This rule calculates the probability that both events occur.
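A classic illustration, sketched here under the assumption of a standard 52-card deck with 4 aces: the probability of drawing two aces in a row, without replacement (dependent events) and with replacement (independent events). The brute-force count is included only as a sanity check on the rule.

```python
from fractions import Fraction
from itertools import permutations

deck = ["ace"] * 4 + ["other"] * 48     # simplified 52-card deck

# Without replacement: P(A and B) = P(A) * P(B|A)
p_first_ace = Fraction(4, 52)
p_second_ace_given_first = Fraction(3, 51)
p_both = p_first_ace * p_second_ace_given_first
print(p_both)                           # 1/221

# Sanity check: count ordered pairs of distinct cards that are both aces.
draws = list(permutations(range(52), 2))
both_aces = sum(1 for i, j in draws if deck[i] == "ace" and deck[j] == "ace")
p_enum = Fraction(both_aces, len(draws))
print(p_enum)                           # 1/221

# With replacement the draws are independent: P(A and B) = P(A) * P(B)
p_with_replacement = Fraction(4, 52) * Fraction(4, 52)
print(p_with_replacement)               # 1/169
```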

Statistics And Probability
