DATA4400: Data-Driven Forecasting

Week 11: Adaptive Forecasting

Matching Methods to Business Problems

Learning Outcomes

By the end of this lesson, you will be able to:

Decode stakeholder requests to identify the real forecasting problem
Match forecasting methods to data patterns and business contexts
Decide how much historical data is actually needed
Handle disruptions in time series data appropriately

            Key Focus: Understanding what Business Analysts actually do with forecasting methods, not just the technical details of the methods themselves.
        

2

DATA4400 Roadmap

Week	Topic
1-4	Foundations: Data preparation, moving averages, stationarity, basic methods
5	Assessment 1: Group presentation
6-8	Advanced Methods: Prophet, ARIMA/SARIMA, VAR models
9	Regression models + Assessment 2
10	End-to-End Solutions: Building complete forecasting systems
11	Adaptive Forecasting: Making intelligent method choices
12	Assessment 3: Individual forecasting project

Today's Focus: You've learned the methods. Now learn how to choose and apply them like a professional Business Analyst.

3

PART 1: UNDERSTAND THE BUSINESS PROBLEM

What Does a Business Analyst Actually Do?

Before we can choose a forecasting method, we need to understand what stakeholders are really asking for.

4

The Business Analyst's Role in Forecasting

What Stakeholders Say:

"Give me a forecast"
"How accurate can you be?"
"Use the best model"
"Tell me what will happen"

What BAs Must Discover:

Purpose: What decision depends on this?
Cost structure: What errors hurt most?
Constraints: Time, data, resources available?
Context: What's changing in the business?

            Critical Insight: Stakeholders rarely ask for what they actually need. Your job is to translate vague requests into specific forecasting problems with clear business outcomes.
        

5

Decoding Stakeholder Requests

What They Say	What They Might Really Mean	Your Follow-Up Questions
"Forecast customer churn"	• Identify at-risk customers • Budget retention costs • Predict revenue impact • Test intervention strategies	• What action will you take with the forecast? • What's the cost of losing a customer? • How far ahead do you need to know?
"Predict sales for next quarter"	• Set production targets • Plan inventory • Allocate marketing budget • Report to investors	• Is overproduction or stockout worse? • Can we adjust after seeing trends? • Are there upcoming campaigns/changes?
"How accurate can you be?"	• Is this forecast reliable enough? • Should we automate the decision? • Do we need more data?	• What decision changes at different accuracy levels? • What's your current decision process? • What's the cost of being wrong?

→ The same words can mean completely different things. Always clarify before building models.

6

Case Study: "Fix the Customer Churn Problem"

Scenario: A streaming service stakeholder says: "We're losing customers. Can you investigate churn so we can fix it?"

Interpretation A:

Identify who will churn next month

→ Need: Classification model on customer attributes
→ Data: Customer behavior, demographics, engagement
→ Output: List of at-risk customers for retention calls

Interpretation B:

Forecast total churn rate over time

→ Need: Time series forecast
→ Data: Historical monthly churn rates
→ Output: Expected churn % for revenue planning

Interpretation C:

Find what causes churn

→ Need: Causal analysis (regression/survival)
→ Data: Customer journey events, service quality
→ Output: Actionable drivers to fix

Interpretation D:

Test if marketing reduces churn

→ Need: A/B test or intervention analysis
→ Data: Control vs. treatment groups
→ Output: ROI of retention campaigns

→ Same request. Four completely different projects. Your job: Ask questions before building anything.

7

Quiz 1: Understanding Business Questions

Question 1: A retail manager says "I need to know if we'll have enough staff for Black Friday." What's the FIRST question you should ask?

A) What forecasting methods have you used before?

B) What's worse: being understaffed or overstaffed that day?

C) How many years of Black Friday data do you have?

D) What accuracy level do you need?

8

Quiz 1: Understanding Business Questions

Question 2: A stakeholder asks "Can you predict which products will be popular next season?" This is ambiguous because it could mean:

A) Forecast sales volume for existing products

B) Identify trending product categories for new designs

C) Determine optimal pricing for maximum revenue

D) All of the above - you need to ask clarifying questions

9

PART 2: MATCH METHODS TO PROBLEMS

The Decision Framework

Now that you understand the business problem, how do you choose the right forecasting method?

10

The BA's Method Selection Framework

When choosing a forecasting method, consider these factors in this order:

1. BUSINESS CONTEXT

What decision will this forecast drive? What costs are associated with errors?

2. DATA PATTERN

Does your data show trend? Seasonality? Are there external drivers? Multiple variables?

3. EXPLAINABILITY NEEDS

Does the stakeholder need to understand WHY, or just WHAT will happen?

4. PRACTICAL CONSTRAINTS

How much data do you have? How often will you update? Do you need automation?

→ Never start with "Which method is most accurate?" Start with "Which method fits the business need?"

11

Matching Methods to Data Patterns

Data Pattern	Suitable Methods	Business Example	When to Use
Level only (no trend/season)	Moving Average, SES	ATM withdrawals, daily sales in stable business	Quick baseline, short-term tactical decisions
Trend, no seasonality	Holt, AR(p), ARIMA	Subscription growth, market expansion	Business is scaling but cycles aren't established
Trend + seasonality	Holt-Winters, SARIMA, Prophet	Retail sales, airline passengers, energy demand	Established business with clear cycles
Multiple interacting variables	VAR, Multiple Regression	GDP/inflation/interest rates, price-volume dynamics	System-level forecasts, scenario planning
External drivers + events	Prophet, Regression with dummies	Promotions, holidays, weather effects	Business has controllable/known external factors

→ This table is your starting point, not a strict rule. Business context may override pattern matching.

12

Method Profiles: When Simple Methods Win

Moving Average & Single Exponential Smoothing (SES)

Best for: Stable operations with random fluctuations, no clear trend/seasonality

Business advantage: Fast, transparent, easy to explain to non-technical stakeholders

Use when:

You need a quick answer for tactical planning (next week's inventory)
The business is stable and mature (not rapidly growing/changing)
Stakeholders need to "see" how the forecast was made

Real example: A convenience store forecasting daily milk purchases (level data, high noise, need quick updates)

BA Decision Rule: If your data looks like random wiggling around a stable average, and you need an answer today, start here. Don't overcomplicate.

13

Method Profiles: Capturing Growth

Holt (Double Exponential Smoothing) & ARIMA

Best for: Data with clear trend but no strong seasonality

Business advantage: Captures momentum in growth/decline businesses

Choose Holt when:

You want something simple and fast
The trend is relatively smooth
Business context: startup growth, market penetration, product lifecycle stages

Choose ARIMA when:

You need short-term accuracy over explainability
The data requires differencing to become stationary
Business context: financial forecasting, short-term demand shocks

BA Decision Rule: If the business is scaling but seasonal patterns aren't yet clear or stable, use trend methods. Think: "We're growing, but how fast will growth continue?"

14

Method Profiles: Handling Cycles

Holt-Winters, SARIMA, Prophet

Best for: Established businesses with predictable seasonal patterns

Choose Holt-Winters when:

Seasonality is stable and regular
You need to decide: additive (constant seasonal swings) or multiplicative (seasonal swings grow with trend)
Business context: Monthly retail, quarterly reporting cycles

Choose SARIMA when:

You have strong statistical expertise available
Seasonal patterns visible in ACF/PACF plots at seasonal lags
Business context: Complex seasonal patterns requiring fine-tuning

Choose Prophet when:

You have holidays, events, or known changepoints to incorporate
Missing data or outliers are present
Stakeholders need interpretable components (trend + seasonality breakdown)
Business context: Marketing campaigns, tourism, e-commerce with promotions

15

Method Profiles: Multiple Variables

VAR (Vector Autoregression) & Regression

Best for: When multiple time series influence each other, or external factors drive your target

Choose VAR when:

You have multiple series that interact (no clear "cause → effect" direction)
You want to test relationships with Granger causality
You need system-level forecasts and scenario simulations
Business context: Macroeconomic indicators, price-volume interactions, marketing spend → sales → web traffic

Choose Regression when:

You have clear predictors (price, marketing spend, temperature, day of week)
Explainability is critical ("each $1 in marketing adds X sales")
You want to quantify impact of specific drivers
Business context: Promotional effectiveness, pricing strategies, external factor analysis

BA Decision Rule: Use these when the question is "What drives our outcome?" not just "What will happen?"

16

Quiz 2: Choosing the Right Method

Question 1: An e-commerce company needs to forecast monthly sales. The data shows clear yearly seasonality (holiday spikes) and steady growth. They also run frequent promotional campaigns. Which method is MOST appropriate?

A) Simple Moving Average - it's fast and interpretable

B) Holt - it captures the growth trend

C) Prophet - it handles trend, seasonality, and event effects (promotions)

D) Simple Linear Regression - it's the most explainable

17

Quiz 2: Choosing the Right Method

Question 2: A financial analyst needs to forecast next quarter's GDP and wants to understand how inflation and interest rates interact with it. The stakeholder needs to run "what-if" scenarios. Which approach is best?

A) ARIMA on GDP alone - highest short-term accuracy

B) VAR model - captures interactions between GDP, inflation, and interest rates

C) Prophet - it's user-friendly and handles multiple patterns

D) Holt-Winters - it's the standard for economic data

18

Quiz 2: Choosing the Right Method

Question 3: A stakeholder says "I need to know if our marketing spend actually increases sales, and by how much per dollar spent." What should you build?

A) SARIMA - it's the most sophisticated time series method

B) Regression with lagged marketing spend as predictor - provides coefficient showing $ impact

C) VAR model - it handles multiple variables

D) Moving Average - it's simple and stakeholders will understand it

19

PART 3: CRITICAL DECISIONS FOR BAs

Data Volume & Disruptions

Two critical questions every BA faces: "How much data do I actually need?" and "What do I do about that disruption?"

20

How Much Data Do You Really Need?

Common assumption: "More data is always better." Reality: Sometimes more data makes forecasts worse.

When MORE is better:

Stable business with consistent patterns
Clear seasonal cycles (need 2-3+ full cycles)
No structural changes over time

Example: Electricity demand patterns haven't fundamentally changed in 10 years → use all data

When LESS is better:

Business model has changed
Market conditions shifted
New products/services launched
Post-disruption "new normal"

Example: Company went digital in 2020 → pre-2020 data may hurt accuracy

            BA Decision Rule: Ask "Is my business today fundamentally the same as it was X years ago?" If no, exclude old data or weight recent data more heavily.
        

21

Real Examples: When Old Data Hurts

Example 1: Apple Stock (AAPL)

Stock price from 1980 vs. recent 5 years:

1980-2024: Includes pre-iPhone era, near-bankruptcy periods, CEO changes → fundamentally different company
2020-2024: Current product mix, market position, competitive landscape
BA Decision: For short-term trading, recent data is more relevant. Old patterns don't apply.

Example 2: Retail Store Footfall

Pre-COVID (2015-2019) vs. Post-COVID (2021-2024):

Pre-COVID: Normal seasonal patterns, in-store shopping dominant
Post-COVID: Permanently higher online mix, different weekend patterns, reduced lunch-hour traffic
BA Decision: 2015-2019 data may mislead. Focus on post-2021 patterns for current planning.

→ Just because you have the data doesn't mean it's valid for forecasting.

22

Handling Disruptions in Time Series

Disruptions are sudden, significant changes that break historical patterns (COVID-19, regulatory changes, major events).

Option 1: Exclude It

When: Disruption was temporary and won't repeat

Method: Remove the disruption period from training data

Example: One-time supply chain shock in Q2 2023 → exclude that quarter

Option 2: Model It

When: Disruption changed behavior permanently or semi-permanently

Method: Add intervention variables (dummy/phase variables in Prophet/Regression)

Example: COVID-19 created "pre," "during," "post" phases with different patterns

Intervention Variables: How They Work

Create binary (0/1) or phase (1, 2, 3) indicators:

Binary: covid = 1 during pandemic, 0 otherwise → allows model to adjust level during disruption
Multi-phase: phase1 (pre-COVID), phase2 (lockdown), phase3 (recovery) → captures evolving patterns
In Prophet: Add as regressor; in Regression: Include as dummy variable

23

BA Decision Tree: Disruption Handling

Step 1: Is the disruption one-time or structural?

One-time (temporary spike/drop) → Consider excluding
Structural (permanent change) → Must model it

Step 2: Will similar disruptions happen again?

No → Exclude or isolate period
Yes (e.g., future pandemics, policy changes) → Model with intervention variables so future instances can be handled

Step 3: Does your method support intervention variables?

Prophet, Regression → Yes, add as regressors
ARIMA, Holt-Winters → Limited support; may need to pre-process or switch methods

            Critical: Ignoring disruptions typically produces terrible forecasts. Either remove the period or explicitly model it. Don't pretend it didn't happen.
        

24

Quiz 3: Data & Disruption Decisions

Question 1: You're forecasting sales for a company that switched from retail-only to omnichannel (online + stores) in 2021. You have data from 2015-2024. What should you do?

A) Use all data (2015-2024) - more data is always better

B) Use only 2021-2024 data - the business model fundamentally changed

C) Use 2015-2020 data only - it's more stable

D) Average forecasts from both periods

25

Quiz 3: Data & Disruption Decisions

Question 2: Your monthly sales data shows a huge drop in March 2020 (COVID lockdown) followed by gradual recovery to a "new normal" level that's different from pre-COVID. You're using Prophet. What's the best approach?

A) Delete March 2020 data point

B) Exclude all 2020 data and forecast from 2021 onwards

C) Add phase regressors: pre-COVID, lockdown, post-COVID to capture evolving patterns

D) Use Holt-Winters instead - it's simpler

26

Quiz 3: Data & Disruption Decisions

Question 3: A factory had a one-time equipment failure in July 2023 causing a production drop. This won't happen again (new equipment installed). You're forecasting future production. What should you do with July 2023?

A) Keep it - the model needs to see all historical variation

B) Exclude it - it's a non-repeating outlier that will distort the forecast

C) Add an intervention variable for equipment_failure = 1 in July 2023

D) Replace July 2023 value with the average of June and August

27

Summary: The BA's Forecasting Workflow

1. UNDERSTAND the business problem

→ Decode stakeholder requests. Ask: What decision? What costs? What constraints?

2. MATCH method to context

→ Consider: Business need → Data pattern → Explainability → Constraints
→ Use the decision framework, not just "most accurate"

3. DECIDE on data scope

→ Ask: Is old data still relevant? Has the business fundamentally changed?
→ More ≠ better if patterns have shifted

4. HANDLE disruptions explicitly

→ Never ignore them. Either exclude or model with intervention variables
→ Choose based on: one-time vs. structural, repeatable vs. unique

→ You're not just running models. You're translating business problems into technical solutions that drive real decisions.

28

Preparing for Assessment 3

Assessment 3 (40% of grade) requires you to demonstrate everything we've covered:

Choose appropriate methods based on data characteristics and business context
Justify your choices with business reasoning, not just statistical metrics
Calculate dollar-based ROI and business impact of your forecasts
Handle real-world complexities like disruptions, data quality issues, changing patterns
Communicate findings concisely (strict word limits!)

Key Success Factors:

Start with the business problem, not the method
Be concise - quality over quantity in your write-up
Show business value in dollars and specific timelines, not just statistical accuracy
Document your decision process: "I chose X because the business needs Y"

Next: Discussion time with your facilitator about A3 questions and concerns.

29

Assessment 3 Discussion

Questions? Concerns? Let's discuss.

30