Weeks 5 & 6 · Statistics for Accounting

Probability
Distributions

Discrete and continuous models for real business decisions

Running scenario: Vantage Retail Co.
Australian e-commerce and retail analytics firm — modelling returns, defects, orders, delivery times and customer spend.

2

Week 5 — Learning Objectives

Discrete Random Variables

Find expected value, variance and standard deviation for a discrete random variable
Calculate and interpret covariance between two random variables

Binomial Distribution

Identify when the binomial distribution is appropriate
Calculate and interpret binomial probabilities

Poisson Distribution

Identify when the Poisson distribution is appropriate
Calculate and interpret Poisson probabilities
Scale the Poisson rate across different time intervals

Vantage Retail Co. — Week 5 Contexts

Product returns per order — discrete RV
Defective items in a shipment — binomial
Online orders per hour — Poisson

3

Section 1 — Week 5

Discrete Random Variables

Assigning numbers to uncertain outcomes — and measuring what we expect and how much things vary.

1.1

1.1 What Is a Random Variable?

Definition

A random variable $X$ is a numerical outcome of a random experiment. It assigns a single real number to each outcome in the sample space.

Discrete Random Variable

Takes a countable set of distinct values — typically whole numbers.

Number of items a customer returns
Number of defective units in a batch
Number of support calls received per day

Continuous Random Variable

Takes any real value in an interval — measurements on a continuous scale.

Customer transaction value (\$)
Delivery time (hours)
Product weight (grams)

Scope of Weeks 5 & 6

Week 5 covers discrete random variables — general, binomial, and Poisson. Week 6 shifts to continuous distributions — normal, uniform, and exponential.

1.2

1.2 Discrete Probability Distributions

A probability distribution specifies every value $X$ can take and its associated probability.

Vantage Retail Co. — Returns per Online Order

Analysis of 10,000 orders yields the following distribution for $X$ = number of items returned:

$x$	0	1	2	3
$P(X=x)$	0.55	0.25	0.15	0.05

Rule 1 — Valid Probabilities

$0 \le P(X = x) \le 1$ for every value of $x$.

Rule 2 — Probabilities Sum to 1

$\displaystyle\sum_{\text{all } x} P(X = x) = 1$

Verification: $0.55 + 0.25 + 0.15 + 0.05 = 1.00$ ✓

1.3

1.3 Expected Value $E(X)$

Formula

The expected value (mean) is the probability-weighted average of all possible outcomes: $$E(X) = \mu = \sum_{x} x \cdot P(X = x)$$

Vantage Retail Co. — Expected Returns per Order

$x$	$P(X=x)$	$x \cdot P(X=x)$
0	0.55	0.00
1	0.25	0.25
2	0.15	0.30
3	0.05	0.15
Total	1.00	0.70

Operations Manager — Interpretation

$E(X) = 0.70$ items returned per order. Over 10,000 orders, Vantage should plan for approximately 7,000 return items — directly informing logistics staffing and returns-processing budgets.

1.4

1.4 Variance and Standard Deviation

Variance

$$\sigma^2 = \text{Var}(X) = \sum_{x} (x - \mu)^2 \cdot P(X = x)$$

Standard Deviation

$$\sigma = \sqrt{\text{Var}(X)}$$

Vantage Retail Co. — Variance of Returns ($\mu = 0.70$)

$x$	$(x - 0.70)^2$	$P(X=x)$	$(x-\mu)^2 \cdot P(x)$
0	0.4900	0.55	0.2695
1	0.0900	0.25	0.0225
2	1.6900	0.15	0.2535
3	5.2900	0.05	0.2645
Variance $\sigma^2$			0.8100

$\sigma = \sqrt{0.81} = 0.90$ items. Returns typically vary by 0.90 items around the mean of 0.70.

1.5

1.5 Covariance

Formula

The covariance measures the direction and magnitude of the linear relationship between two random variables $X$ and $Y$: $$\text{Cov}(X, Y) = \sigma_{XY} = \sum_{x,y}(x - \mu_X)(y - \mu_Y) \cdot P(X=x, Y=y)$$

Interpreting the Sign

Value	Relationship
$\sigma_{XY} > 0$	X and Y tend to increase together
$\sigma_{XY} < 0$	One increases as the other decreases
$\sigma_{XY} = 0$	No linear association

Limitation

Covariance is unit-dependent. To compare relationships across variables, standardise using the correlation coefficient: $\rho = \sigma_{XY} / (\sigma_X \cdot \sigma_Y)$

Vantage Retail Co.

Let $X$ = number of returns, $Y$ = refund cost per order (\$).

A positive covariance here is expected — orders with more returns generate higher refund costs.

This quantifies the relationship needed to forecast the total refund liability as a function of return volume.

1.Q

Knowledge Check — Section 1

Q1: A discrete distribution gives $P(X=0)=0.30$, $P(X=1)=0.50$, $P(X=2)=0.20$. What is $E(X)$?

$E(X) = 0(0.30) + 1(0.50) + 2(0.20) = 0 + 0.50 + 0.40 = 0.90$

Q2: Vantage finds that as the number of returns per order increases, average customer satisfaction decreases. The covariance $\sigma_{XY}$ between returns and satisfaction is:

Covariance is negative when one variable tends to increase as the other decreases. Higher returns → lower satisfaction means $\sigma_{XY} < 0$.

10

Section 2 — Week 5

The Binomial Distribution

Counting successes across a fixed number of independent trials — the most widely used discrete distribution in auditing and quality control.

2.1

2.1 When Is the Binomial Distribution Appropriate?

All four conditions must hold simultaneously:

1. Fixed trials

The number of trials $n$ is determined in advance and does not change.

2. Binary outcomes

Each trial produces exactly one of two outcomes: success or failure.

3. Constant probability

The probability of success $p$ is the same on every trial.

4. Independence

The outcome of one trial does not affect any other trial.

Vantage Retail Co. — Checking the Conditions

Scenario A: A shipment of 20 electronics units arrives. Each unit is independently defective with probability 5%. Count the defectives. ✓ All four conditions hold — binomial applies.

Scenario B: Count the total revenue from 20 orders. Revenue is continuous and not binary. ✗ Binomial does not apply.

2.2

2.2 The Binomial Probability Formula

Formula

$$P(X = x) = \binom{n}{x} p^x (1-p)^{n-x}, \qquad x = 0, 1, 2, \ldots, n$$

where $\displaystyle\binom{n}{x} = \frac{n!}{x!\,(n-x)!}$ counts the number of distinct ways to arrange $x$ successes among $n$ trials.

Vantage Retail Co. — Defective Items in a Shipment

Shipment: $n = 20$ units, defect probability $p = 0.05$. What is the probability that exactly 2 units are defective?

$$P(X = 2) = \binom{20}{2}(0.05)^2(0.95)^{18} = 190 \times 0.0025 \times 0.3972 \approx \mathbf{0.1887}$$

There is an 18.9% probability that exactly 2 units in this shipment are defective.

Reading the Formula

$\binom{n}{x}$ counts the arrangements; $p^x$ is the probability of $x$ successes; $(1-p)^{n-x}$ is the probability of the remaining $n-x$ failures. Multiply all three together.

2.3

2.3 Mean and Variance of the Binomial

Mean (Expected Value)

$$\mu = E(X) = np$$

Variance and Standard Deviation

$$\sigma^2 = np(1-p) \qquad \sigma = \sqrt{np(1-p)}$$

Vantage Retail Co. — Defective Items ($n=20,\; p=0.05$)

$\mu = 20 \times 0.05 = \mathbf{1.0}$ defective unit per shipment on average

$\sigma^2 = 20 \times 0.05 \times 0.95 = 0.95$

$\sigma = \sqrt{0.95} \approx \mathbf{0.97}$ units

Quality Manager View

On average, 1 defective unit per shipment. Finding 0–3 defectives is well within one standard deviation — entirely normal variation. A shipment with 5+ defectives should prompt a supplier review.

Figure 2.3: Binomial(20, 0.05) — right-skewed with peak at $x=1$, matching $\mu=1.0$

2.4

2.4 Cumulative Binomial Probabilities

Many business decisions require probabilities over a range of outcomes, not just a single value.

Additive Rule for Discrete Distributions

$$P(X \le k) = \sum_{x=0}^{k} P(X = x) \qquad P(X > k) = 1 - P(X \le k)$$

Vantage Retail Co. — Acceptable Defect Threshold

Shipment: $n=20, p=0.05$. Vantage's quality policy states a shipment is acceptable if it has at most 2 defectives. What is the probability a shipment is accepted?

$$P(X \le 2) = P(X=0) + P(X=1) + P(X=2)$$ $$= 0.3585 + 0.3774 + 0.1887 = \mathbf{0.9246}$$

There is a 92.5% probability any given shipment passes the quality check.

Complement: $P(X > 2) = 1 - 0.9246 = \mathbf{0.0754}$ — only a 7.5% chance of a shipment being rejected.

Audit Application

Auditors use cumulative binomial probabilities to assess whether the number of errors found in a sample is consistent with a claimed low error rate — a key tool in audit risk assessment and sampling plans.

2.Q

Knowledge Check — Section 2

Q1: A survey finds 30% of Vantage customers use a loyalty card. In a random sample of 10 customers, what is the expected number who use a loyalty card?

$\mu = np = 10 \times 0.30 = 3$. The binomial mean is simply the product of the number of trials and the probability of success.

Q2: Which scenario does not satisfy the conditions for a binomial distribution?

Option B describes a geometric distribution — you count trials until the first success, so $n$ is not fixed in advance. This violates the binomial's fixed-$n$ condition.

16

Section 3 — Week 5

The Poisson Distribution

Modelling the number of events in a fixed interval — ideal for website traffic, call volumes, and transaction counts.

3.1

3.1 When Is the Poisson Distribution Appropriate?

1. Count of events

$X$ counts how many times an event occurs in a fixed interval of time, space, or volume.

2. Independence

Each event occurs independently — one event does not affect the probability of another.

3. Constant rate

Events occur at a constant average rate $\lambda$ throughout the interval.

4. Rare in small intervals

Two events cannot occur at exactly the same instant (probability of simultaneous events is negligible).

Vantage Retail Co. — Valid Poisson Scenarios

Number of online orders per hour during peak trading
Number of customer support calls per day
Number of payment processing errors per week
Number of product page visits per minute during a flash sale

Single Parameter

The Poisson distribution is completely specified by one parameter: $\lambda$ — the average number of events per interval.

3.2

3.2 The Poisson Probability Formula

Formula

$$P(X = x) = \frac{e^{-\lambda} \lambda^{x}}{x!}, \qquad x = 0, 1, 2, 3, \ldots$$

where $\lambda > 0$ is the average number of events per interval, and $e \approx 2.71828$.

Vantage Retail Co. — Peak Hour Orders ($\lambda = 12$ orders/hour)

What is the probability of receiving exactly 10 orders in a given peak hour?

$$P(X = 10) = \frac{e^{-12} \cdot 12^{10}}{10!} = \frac{6.144 \times 10^{-6} \times 61{,}917{,}364{,}224}{3{,}628{,}800} \approx \mathbf{0.1048}$$

There is approximately a 10.5% probability of receiving exactly 10 orders in any given peak hour.

3.3

3.3 Mean and Variance of the Poisson Distribution

Mean

$$E(X) = \mu = \lambda$$

Variance

$$\text{Var}(X) = \sigma^2 = \lambda$$

Remarkable Property

For the Poisson distribution, mean = variance = $\lambda$. This is unique among distributions. In practice, if empirical data show mean ≈ variance, the Poisson model is plausible. If variance is much larger than the mean, consider the negative binomial instead.

Vantage Retail Co. — Staffing the Fulfilment Centre ($\lambda = 12$)

$\sigma = \sqrt{12} \approx 3.46$ orders

Within $\pm 2\sigma$ of the mean:

$12 - 2(3.46) = 5.1$ to $12 + 2(3.46) = 18.9$

So roughly 5 to 19 orders per peak hour covers ~95% of cases.

Operations Manager Decision

Staff the fulfilment centre to handle up to 19–20 orders per hour during peak periods. Demand above 20 is rare but should be covered by an overflow escalation protocol.

3.4

3.4 Scaling the Poisson Rate Across Intervals

The Poisson rate $\lambda$ is always tied to a specific interval. When the question uses a different interval length, scale $\lambda$ proportionally first.

Scaling Rule

$$\lambda_{\text{new}} = \lambda_{\text{original}} \times \frac{\text{new interval length}}{\text{original interval length}}$$

Vantage Retail Co. — 30-Minute Planning Window

Orders average $\lambda = 12$ per hour. The warehouse manager needs to know: what is the probability of fewer than 5 orders arriving in a 30-minute window?

Step 1 — Scale: $\lambda_{30\text{min}} = 12 \times \tfrac{30}{60} = 6$

Step 2 — Calculate cumulative probability:

$$P(X < 5) = P(X \le 4) = \sum_{x=0}^{4} \frac{e^{-6} \cdot 6^x}{x!} \approx 0.2851$$

There is a 28.5% probability of fewer than 5 orders arriving in any 30-minute window — a lull that could be used for restocking or staff breaks.

3.Q

Knowledge Check — Section 3

Q1: Vantage's call centre receives an average of 4 customer service calls per hour. What are the mean and variance of calls per hour?

For the Poisson distribution, mean = variance = $\lambda$. With $\lambda = 4$, both the mean and variance equal 4. This equality is a defining characteristic of the Poisson.

Q2: If the call centre receives 4 calls per hour on average, what is the correct $\lambda$ for a 15-minute window?

$\lambda_{15\text{min}} = 4 \times (15/60) = 4 \times 0.25 = 1$. Always scale $\lambda$ proportionally to the new interval length before calculating any probabilities.

22

Week 5 — Summary

General Discrete RV

$E(X) = \sum x \cdot P(x)$

$\text{Var}(X) = \sum (x-\mu)^2 P(x)$

Covariance measures direction of linear association. Correlation standardises it.

Binomial$(n, p)$

$P(X=x)=\binom{n}{x}p^x(1-p)^{n-x}$

$\mu = np$, $\sigma = \sqrt{np(1-p)}$

Fixed $n$, binary outcomes, constant $p$, independence.

Poisson$(\lambda)$

$P(X=x) = \dfrac{e^{-\lambda}\lambda^x}{x!}$

$\mu = \sigma^2 = \lambda$

Count in a fixed interval. Scale $\lambda$ when the interval changes.

Selecting the Right Model

If you observe…	Consider
A fixed number of trials with binary outcomes	Binomial
A count of events occurring in a continuous interval	Poisson
A general discrete distribution not fitting the above	E(X) and Var(X) directly

23

Week 6 · Statistics for Accounting

Continuous Probability
Distributions

Normal, Uniform, and Exponential — modelling measured outcomes in retail and operations

Continuing with Vantage Retail Co.
Customer spend distributions, delivery time windows, and inter-order gaps.

24

Week 6 — Learning Objectives

Normal Distribution

Calculate areas under the standard normal curve
Solve and interpret normal distribution problems
Check assumptions of normality

Uniform Distribution

Calculate probabilities using the uniform distribution
Solve and interpret uniform distribution problems

Exponential Distribution

Calculate probabilities using the exponential distribution
Solve and interpret exponential distribution problems

A Fundamental Shift: Discrete → Continuous

For continuous distributions, probabilities are areas under a curve — not sums. Crucially, $P(X = \text{exact value}) = 0$. We always compute $P(a \le X \le b)$ by finding the area between $a$ and $b$. This is a conceptual break from Week 5.

25

Section 4 — Week 6

The Normal Distribution

The bell curve — the foundation of statistical inference, quality control, and financial modelling.

4.1

4.1 Properties of the Normal Distribution

Notation and Density

$X \sim N(\mu,\, \sigma^2)$. The probability density function is: $$f(x) = \frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{1}{2}\left(\frac{x-\mu}{\sigma}\right)^2}$$

Symmetric about $\mu$ (mean = median = mode)
Bell-shaped, unimodal
Total area under curve = 1
Tails approach but never touch the x-axis

Empirical Rule

68% within $\pm1\sigma$ · 95% within $\pm2\sigma$ · 99.7% within $\pm3\sigma$

Figure 4.1: Bell curve — symmetric, 68% within ±1σ

4.2

4.2 The Standard Normal Distribution and Z-Scores

Standardising — The Z-Score

Any $X \sim N(\mu, \sigma^2)$ can be converted to a standard normal $Z \sim N(0,1)$ using: $$Z = \frac{X - \mu}{\sigma}$$

$Z$ measures how many standard deviations $X$ lies above (positive) or below (negative) the mean.

Vantage Retail Co. — Customer Transaction Values

Transaction values follow $X \sim N(\mu = 85,\; \sigma = 20)$ dollars. Calculate the Z-score for a transaction of \$115:

$$Z = \frac{115 - 85}{20} = \frac{30}{20} = 1.50$$

A \$115 transaction is 1.5 standard deviations above the mean.

Why This Matters

Once standardised to $Z$, we use a single standard normal (Z) table — which gives $P(Z \le z)$ — to find any probability for any normal distribution, regardless of $\mu$ and $\sigma$.

4.3

4.3 Finding Areas Under the Normal Curve

The Z-table gives $P(Z \le z)$ — the cumulative area to the left of any $z$ value.

Left-tail $P(Z \le z)$

Read directly from the Z-table.

$P(Z \le 1.50) = 0.9332$

Right-tail $P(Z > z)$

Use the complement:

$P(Z > 1.50) = 1 - 0.9332$

$\phantom{P(Z > 1.50)} = 0.0668$

Between $P(a \le Z \le b)$

Subtract cumulative areas:

$P(-1 \le Z \le 1.5)$

$= 0.9332 - 0.1587 = 0.7745$

Vantage Retail Co. — Identifying High-Value Transactions

What proportion of transactions exceed \$115? ($X \sim N(85, 20)$, so $Z = 1.50$)

$$P(X > 115) = P(Z > 1.50) = 1 - 0.9332 = \mathbf{0.0668}$$

6.7% of transactions exceed \$115 — Vantage's high-value customer segment, worth targeting for premium loyalty offers.

4.4

4.4 A Step-by-Step Framework for Normal Probabilities

Five Steps

Identify the parameters $\mu$ and $\sigma$, and the target value(s).
Convert to Z-scores: $Z = (X - \mu)/\sigma$.
Sketch the standard normal curve and shade the required region.
Look up the Z-table; apply complement or subtraction as needed.
Interpret in context — what does this probability mean for the business decision?

Vantage Retail Co. — Transactions Between \$65 and \$105

Transaction values: $X \sim N(85, 20)$. What proportion of transactions fall between \$65 and \$105?

$$Z_1 = \frac{65 - 85}{20} = -1.00 \qquad Z_2 = \frac{105 - 85}{20} = 1.00$$ $$P(65 \le X \le 105) = P(-1 \le Z \le 1) = 0.8413 - 0.1587 = \mathbf{0.6827}$$

About 68.3% of transactions fall in the \$65–\$105 range — the empirical rule ($\pm1\sigma$) confirmed.

4.5

4.5 Working Backwards: Finding $X$ from a Probability

Sometimes the probability is known and we need the corresponding value of $X$. This is the inverse normal problem.

Inverse Formula

$$X = \mu + Z \cdot \sigma$$

Step 1: Find the required $Z$ from the Z-table. Step 2: Convert back to the original scale using the formula above.

Vantage Retail Co. — VIP Loyalty Programme Threshold

Vantage wants to identify the top 10% of transactions for a VIP loyalty tier. Transaction values: $X \sim N(85, 20)$. What is the minimum qualifying spend?

Step 1: Find $z$ such that $P(Z \le z) = 0.90$. From the Z-table: $z = 1.28$.

Step 2: Convert: $X = 85 + 1.28 \times 20 = 85 + 25.60 = \mathbf{\$110.60}$

Decision: Customers spending more than \$110.60 per transaction qualify for VIP status, capturing the top 10% of spenders by transaction value.

4.6

4.6 Checking Assumptions of Normality

Before applying the normal distribution, verify the data are approximately normal. Never assume without checking.

Visual Methods

Histogram

Should be roughly bell-shaped and symmetric. Severe skew, heavy tails, or bimodality indicate non-normality.

Normal Q-Q Plot

Plot sample quantiles against theoretical normal quantiles. Points should track closely along a diagonal straight line if the data are normal.

Formal Tests

Shapiro-Wilk Test

Preferred for small samples ($n < 50$). A p-value above 0.05 means we cannot reject normality.

Practical Reality

No real dataset is perfectly normal. The question is whether departures are large enough to invalidate the analysis. For large samples, the Central Limit Theorem often rescues us — sample means are approximately normal regardless of the population shape.

4.Q

Knowledge Check — Section 4

Q1: Vantage's delivery lead times follow $X \sim N(\mu=3.5 \text{ days},\; \sigma=0.5)$. What is the Z-score for a lead time of 4.5 days?

$Z = (4.5 - 3.5)/0.5 = 1.0/0.5 = 2.0$. A 4.5-day delivery is 2 standard deviations above the mean — only the slowest ~2.3% of deliveries are this late.

Q2: Using the same distribution, what is $P(X \le 3.0 \text{ days})$? (Hint: $Z = -1.0$ gives $P(Z \le -1.0) = 0.1587$.)

$Z = (3.0 - 3.5)/0.5 = -1.0$. $P(Z \le -1.0) = 0.1587 = 15.87\%$. About 16% of deliveries arrive in under 3 days — useful for marketing an "early delivery" promise.

33

Section 5 — Week 6

The Uniform Distribution

When every outcome across an interval is equally likely — the simplest continuous distribution, and a natural model for guaranteed delivery windows.

5.1

5.1 The Continuous Uniform Distribution

Definition and PDF

$X \sim U(a,\, b)$ assigns equal probability density to all values in $[a, b]$: $$f(x) = \frac{1}{b - a}, \qquad a \le x \le b$$ $$f(x) = 0 \qquad \text{otherwise}$$

Mean and Variance

$$\mu = \frac{a + b}{2} \qquad \sigma^2 = \frac{(b-a)^2}{12}$$

Figure 5.1: Uniform density — perfectly flat between $a$ and $b$

5.2

5.2 Calculating Probabilities — Uniform Distribution

Probability Formula

$$P(c \le X \le d) = \frac{d - c}{b - a} \qquad \text{for } a \le c \le d \le b$$

This is simply the area of the rectangle between $c$ and $d$ — width $(d-c)$ times height $\frac{1}{b-a}$.

Vantage Retail Co. — Guaranteed Delivery Window

Vantage promises delivery between 2 and 6 hours after dispatch. Actual delivery time: $X \sim U(2,\, 6)$.

Mean delivery time:

$$\mu = \frac{2 + 6}{2} = 4 \text{ hours}$$

Standard deviation:

$$\sigma = \sqrt{\frac{(6-2)^2}{12}} = \sqrt{\frac{16}{12}} \approx 1.15 \text{ hours}$$

P(delivery within 3 hours):

$$P(X \le 3) = \frac{3-2}{6-2} = \frac{1}{4} = 0.25$$

P(delivery between 3 and 5 hours):

$$P(3 \le X \le 5) = \frac{5-3}{4} = 0.50$$

5.3

5.3 Applying the Uniform Distribution to Business Decisions

Vantage Retail Co. — SLA Compliance Analysis

Vantage's Service Level Agreement (SLA) requires delivery within 5 hours of dispatch. Delivery time: $X \sim U(2,\, 6)$.

$$P(X \le 5) = \frac{5 - 2}{6 - 2} = \frac{3}{4} = \mathbf{0.75}$$

Vantage meets its SLA 75% of the time. One in four deliveries breaches the 5-hour promise.

Operations Director — Decision Brief

Problem: A 25% breach rate exposes Vantage to customer compensation claims and reputational risk.

Three options to evaluate:

Option A: Revise SLA to 6 hours — achieves 100% compliance but weakens the customer promise.
Option B: Renegotiate courier contracts to cap delivery at 5.5 hours ($P \le 5) = 0.875$).
Option C: Priority dispatch for high-value orders, reducing variance for that segment.

The uniform distribution provides an analytically tractable framework for comparing each option's compliance probability before committing to a contract change.

5.Q

Knowledge Check — Section 5

Q1: Time for a Vantage warehouse worker to pack an order is $X \sim U(3,\, 9)$ minutes. What is the probability packing takes more than 7 minutes?

$P(X > 7) = (9-7)/(9-3) = 2/6 \approx 0.333$. The uniform formula is the ratio of the target interval length to the total interval length.

Q2: For the same packing time distribution $U(3, 9)$, what is the mean packing time and standard deviation?

$\mu = (3+9)/2 = 6$ min. $\sigma = \sqrt{(9-3)^2/12} = \sqrt{36/12} = \sqrt{3} \approx 1.73$ min. The mean is the midpoint; the variance uses $(b-a)^2/12$.

38

Section 6 — Week 6

The Exponential Distribution

Modelling the time between events in a Poisson process — the natural continuous companion to the Poisson distribution from Week 5.

6.1

6.1 The Exponential Distribution

Definition and PDF

If events occur at a Poisson rate of $\lambda$ per unit time, the time between consecutive events follows: $$f(x) = \lambda\, e^{-\lambda x}, \qquad x \ge 0$$ Written $X \sim \text{Exp}(\lambda)$.

Mean and Variance

$$\mu = \frac{1}{\lambda} \qquad \sigma^2 = \frac{1}{\lambda^2}$$

Higher rate $\lambda$ → shorter mean waiting time.

Figure 6.1: Exponential density — strongly right-skewed, most waiting times are short

6.2

6.2 Calculating Exponential Probabilities

Cumulative Distribution Function (CDF)

$$P(X \le x) = 1 - e^{-\lambda x} \qquad P(X > x) = e^{-\lambda x}$$ $$P(a \le X \le b) = e^{-\lambda a} - e^{-\lambda b}$$

No integration required — these closed-form expressions make exponential probability calculations quick.

Vantage Retail Co. — Time Between Online Orders ($\lambda = 12$/hour)

Mean time between orders: $\mu = 1/12$ hour = 5 minutes.

P(next order within 3 minutes = 0.05 hr):

$$P(X \le 0.05) = 1 - e^{-12 \times 0.05}$$ $$= 1 - e^{-0.6} \approx 1 - 0.549 = \mathbf{0.451}$$

About a 45% chance the next order arrives within 3 minutes.

P(gap exceeds 10 minutes = 1/6 hr):

$$P(X > 1/6) = e^{-12 \times 1/6} = e^{-2} \approx \mathbf{0.135}$$

Only a 13.5% chance of a 10-minute lull — staff should stay ready.

6.3

6.3 The Memoryless Property

Formal Statement

For an exponential random variable: $$P(X > s + t \;\mid\; X > s) = P(X > t), \qquad \text{for all } s, t \ge 0$$

The probability of waiting at least another $t$ units is the same regardless of how long you have already been waiting. The past waiting time is irrelevant.

Vantage Retail Co.

It has been 8 minutes since the last order arrived. What is the probability of waiting at least 5 more minutes?

By the memoryless property, this equals $P(X > 5\text{ min})$ — exactly as if no time had elapsed. The 8 minutes already passed are irrelevant.

When Is This Property Plausible?

The memoryless property is appropriate when the event rate is truly constant and random — each moment is independent of all prior moments. It would not be appropriate for machine failures, where parts "wear out" over time (use the Weibull distribution instead).

6.4

6.4 The Poisson–Exponential Connection

Key Relationship

If events occur as a Poisson process with rate $\lambda$ per unit time, then the time between consecutive events follows Exponential($\lambda$). The same parameter $\lambda$ describes both.

Vantage Retail Co. — Two Sides of the Same Process

Question	Distribution	Formula Used
How many orders arrive in 30 minutes?	Poisson ($\lambda = 6$ per 30 min)	$P(X=x) = e^{-6}6^x/x!$
How long until the next order arrives?	Exponential ($\lambda = 12$ per hour)	$P(T \le t) = 1-e^{-12t}$

Practical Insight for Operations

A manager who knows the Poisson arrival rate can immediately derive the inter-arrival distribution — no additional parameters. This relationship is foundational in queuing theory, which underpins staffing decisions, warehouse throughput planning, and customer service capacity modelling.

6.Q

Knowledge Check — Section 6

Q1: Vantage's customer support line receives calls at an average rate of 6 per hour. What is the mean time between consecutive calls?

$\mu = 1/\lambda = 1/6$ hour $= 10$ minutes. The mean inter-arrival time is the reciprocal of the Poisson rate. Be careful with units: 1/6 hour is 10 minutes, not 1/6 minutes.

Q2: Which unique property distinguishes the exponential distribution from all other continuous distributions?

The memoryless property — $P(X > s+t \mid X > s) = P(X > t)$ — is unique to the exponential among continuous distributions. It means the process has no "memory" of how long it has already been running.

44

Week 6 — Summary

Normal $N(\mu, \sigma^2)$

Bell-shaped, symmetric. Standardise: $Z = (X-\mu)/\sigma$. Use Z-table for areas.

Inverse: $X = \mu + Z\sigma$

Check normality: histogram, Q-Q plot, Shapiro-Wilk.

Uniform $U(a, b)$

Equal density across $[a,b]$.

$$P(c \le X \le d) = \frac{d-c}{b-a}$$ $$\mu = \frac{a+b}{2}, \quad \sigma^2 = \frac{(b-a)^2}{12}$$

Exponential($\lambda$)

Time between Poisson events.

$$P(X \le x) = 1 - e^{-\lambda x}$$ $$\mu = 1/\lambda, \quad \sigma^2 = 1/\lambda^2$$

Memoryless: past waiting does not predict future waiting.

Selecting the Right Continuous Model

Situation	Distribution
Natural variation in a measurement; symmetric, bell-shaped	Normal
All outcomes equally likely within a bounded range	Uniform
Waiting time between events in a Poisson process	Exponential

45

Weeks 5 & 6 — Final Recap: Vantage Retail Co.

Week 5: Discrete Distributions

Distribution	$\mu$	$\sigma^2$
General discrete	$\sum xP(x)$	$\sum(x{-}\mu)^2P(x)$
Binomial$(n,p)$	$np$	$np(1{-}p)$
Poisson$(\lambda)$	$\lambda$	$\lambda$

Week 6: Continuous Distributions

Distribution	$\mu$	$\sigma^2$
Normal$(\mu,\sigma^2)$	$\mu$	$\sigma^2$
Uniform$(a,b)$	$(a{+}b)/2$	$(b{-}a)^2/12$
Exponential$(\lambda)$	$1/\lambda$	$1/\lambda^2$

Vantage Retail Co. — Six Distributions, Six Business Questions

Business Question	Distribution Applied
Expected returns per order and return-cost covariance	Discrete RV — E(X), Var(X), Cov
Probability of $x$ defective items in a 20-unit shipment	Binomial$(20, 0.05)$
Number of online orders per peak hour; staffing decisions	Poisson$(\lambda=12)$
Proportion of customers spending above \$110.60	Normal$(85, 20^2)$
SLA compliance for delivery inside a 2–6 hour window	Uniform$(2, 6)$
Time between consecutive order arrivals; lull planning	Exponential$(\lambda=12)$

Distribution	\(\mu\)	\(\sigma^2\)
General discrete	\(\sum xP(x)\)	\(\sum(x{-}\mu)^2P(x)\)
Binomial\((n,p)\)	\(np\)	\(np(1{-}p)\)
Poisson\((\lambda)\)	\(\lambda\)	\(\lambda\)

Distribution	\(\mu\)	\(\sigma^2\)
Normal\((\mu,\sigma^2)\)	\(\mu\)	\(\sigma^2\)
Uniform\((a,b)\)	\((a{+}b)/2\)	\((b{-}a)^2/12\)
Exponential\((\lambda)\)	\(1/\lambda\)	\(1/\lambda^2\)

Value	Relationship
\(\sigma_{XY} > 0\)	X and Y tend to increase together
\(\sigma_{XY} < 0\)	One increases as the other decreases
\(\sigma_{XY} = 0\)	No linear association

Question	Distribution	Formula Used
How many orders arrive in 30 minutes?	Poisson (\(\lambda = 6\) per 30 min)	\(P(X=x) = e^{-6}6^x/x!\)
How long until the next order arrives?	Exponential (\(\lambda = 12\) per hour)	\(P(T \le t) = 1-e^{-12t}\)

Business Question	Distribution Applied
Expected returns per order and return-cost covariance	Discrete RV — E(X), Var(X), Cov
Probability of \(x\) defective items in a 20-unit shipment	Binomial\((20, 0.05)\)
Number of online orders per peak hour; staffing decisions	Poisson\((\lambda=12)\)
Proportion of customers spending above \$110.60	Normal\((85, 20^2)\)
SLA compliance for delivery inside a 2–6 hour window	Uniform\((2, 6)\)
Time between consecutive order arrivals; lull planning	Exponential\((\lambda=12)\)

Table of Contents

ProbabilityDistributions

Week 5 — Learning Objectives

Discrete Random Variables

Binomial Distribution

Poisson Distribution

Discrete Random Variables

1.1 What Is a Random Variable?

Discrete Random Variable

Continuous Random Variable

1.2 Discrete Probability Distributions

1.3 Expected Value \(E(X)\)

1.4 Variance and Standard Deviation

1.5 Covariance

Interpreting the Sign

Knowledge Check — Section 1

The Binomial Distribution

2.1 When Is the Binomial Distribution Appropriate?

2.2 The Binomial Probability Formula

2.3 Mean and Variance of the Binomial

2.4 Cumulative Binomial Probabilities

Knowledge Check — Section 2

The Poisson Distribution

3.1 When Is the Poisson Distribution Appropriate?

3.2 The Poisson Probability Formula

3.3 Mean and Variance of the Poisson Distribution

3.4 Scaling the Poisson Rate Across Intervals

Knowledge Check — Section 3

Week 5 — Summary

Continuous ProbabilityDistributions

Week 6 — Learning Objectives

Normal Distribution

Uniform Distribution

Exponential Distribution

The Normal Distribution

4.1 Properties of the Normal Distribution

4.2 The Standard Normal Distribution and Z-Scores

4.3 Finding Areas Under the Normal Curve

4.4 A Step-by-Step Framework for Normal Probabilities

4.5 Working Backwards: Finding \(X\) from a Probability

4.6 Checking Assumptions of Normality

Visual Methods

Formal Tests

Knowledge Check — Section 4

The Uniform Distribution

5.1 The Continuous Uniform Distribution

5.2 Calculating Probabilities — Uniform Distribution

5.3 Applying the Uniform Distribution to Business Decisions

Knowledge Check — Section 5

The Exponential Distribution

6.1 The Exponential Distribution

6.2 Calculating Exponential Probabilities

6.3 The Memoryless Property

6.4 The Poisson–Exponential Connection

Knowledge Check — Section 6

Week 6 — Summary

Weeks 5 & 6 — Final Recap: Vantage Retail Co.

Week 5: Discrete Distributions

Week 6: Continuous Distributions

Probability
Distributions

Continuous Probability
Distributions