Assessment 1: Skills-Building Project

Predictive and Prescriptive Analytics for Business Decision Making

DATA5000 - Artificial Intelligence Programming in Business Analytics

25 Marks | 25% of Final Grade | Due Week 5

Webinar Objectives

By the end of this session, you will understand:

Assessment Structure

What you need to submit
How marks are allocated
Report structure (6 sections)
Word limits and formatting

Success Strategies

What markers are looking for
Common mistakes to avoid
Tips for high marks
Using GenAI appropriately

Key Message: This assessment tests your ability to translate technical AI analysis into business recommendations that executives can act upon.

Assessment Overview

Component	Details
Type	Individual Business Report
Word Limit	1,200 words maximum (+/- 10% acceptable)
Marks	25 marks total (25% of final grade)
Due Date	Week 5, Tuesday 23:55 AEST
Submission	Turnitin via MyKBS
GenAI Policy	Level 2 - Optional use with documentation

Important: Only the business report (.docx) is marked. The notebook is for reference only.

The Business Scenario

CloudMetrics Inc. - SaaS Analytics Platform

You are a Business Analytics Consultant hired by CloudMetrics, a growing SaaS company experiencing customer retention challenges.

The Business Problem

Customer churn affecting growth
Need to predict at-risk customers
Want to understand churn drivers
Testing retention campaign effectiveness

Executive Questions

Which customers will churn?
What factors drive churn?
Does our retention campaign work?
What actions should we take?

Your Task: Analyse their customer data using AI techniques and provide actionable business recommendations.

What You Submit

Google Colab Notebook

Reference Only - Not Marked

Pre-built code provided
Run all cells to generate outputs
Saves 11 visualisations
Share link with your report

Time needed: 30-45 minutes

Business Report (.docx)

This Is What Gets Marked!

1,200 words maximum
6 structured sections
3-5 visualisations from notebook
Professional executive format

Time needed: 3-4 hours

Workflow: Complete the notebook FIRST, then write your report using the outputs and insights generated.

The Google Colab Notebook

The notebook contains all the code you need. Your job is to run it and interpret the results.

Section 1A: Data Prep

Load 15,000 customers
Summary statistics
4 visualisations
Handle missing values

Section 1B: Prediction

LightGBM churn model
TFT revenue forecasting
Performance metrics
Confusion matrix

Section 1C: SHAP

Feature importance
Summary plot
Waterfall plot
Dependence plots

Section 1D: Causal Analysis

EconML DRLearner
ATE calculation
CATE by segments
Treatment effects

Section 2: Summary

Key findings checklist
List of saved images
Report writing tips
Submission reminders

Report Structure: 6 Sections

Section 1

Business Problem & Data

4 marks

~250 words

Section 2

Predictive Analytics

4 marks

~400 words

Section 3

Explainable AI

4 marks

~200 words

Section 4

Causal Analysis

4 marks

~150 words

Section 5

Recommendations

5 marks

~250 words

Section 6

Communication

4 marks

Overall quality

Total: 25 marks | 1,200 words maximum | 3-5 visualisations required

Section 1: Business Problem & Data Overview 4 marks

Word target: ~250 words | Purpose: Set the business context

What Markers Want

Clear problem statement
Specific data statistics (e.g., "15,000 customers, 7.7% churn rate")
Key variables identified
Business impact stated

Common Mistakes

Generic descriptions without numbers
Not mentioning churn rate
Copying dataset description verbatim
Too much technical jargon

Example of Good Opening:

"CloudMetrics faces a significant retention challenge with a 7.7% churn rate across its 15,000-customer base. Analysis of 26 customer attributes including engagement metrics, contract types, and payment patterns reveals opportunities for targeted intervention..."

Section 2: Predictive Analytics Findings 4 marks

Word target: ~400 words | Purpose: Present prediction results with business meaning

LightGBM Churn Model

Report accuracy, precision, recall
Explain what metrics mean for business
Include confusion matrix insights
Identify false positive/negative trade-offs

TFT Revenue Forecasting

Report MAE, RMSE, MAPE
Explain forecast accuracy
Describe revenue trends
Business planning implications

Critical Requirement:

You must cover BOTH models (LightGBM AND TFT). Reports discussing only one model will lose marks.

Tip: Don't just list numbers. Explain what "85% accuracy" means: "The model correctly identifies 85 out of 100 customers' churn status, enabling targeted retention efforts."

Section 3: Explainable AI Insights 4 marks

Word target: ~200 words | Purpose: Explain what drives churn using SHAP

Required: Top 5 SHAP Features

You MUST identify and explain the top 5 features from your SHAP analysis. Example:

Customer Health Score - Lower scores strongly predict churn
Days Since Last Login - Longer gaps increase churn risk
Contract Type - Month-to-month customers churn more
Payment Failures - Each failure increases churn probability
Feature Adoption Score - Low adoption signals disengagement

Good Approach

"SHAP analysis reveals customer_health_score as the strongest predictor, with scores below 40 increasing churn probability by 35%."

Weak Approach

"SHAP shows various features are important for predicting churn in the model."

Section 4: Causal Analysis & Treatment Effects 4 marks

Word target: ~150 words | Purpose: Show if the retention campaign actually works

Key Concepts to Report

ATE (Average Treatment Effect)

Overall impact of retention campaign on churn. Example: "The campaign reduces churn by 18% on average."

CATE (Conditional Treatment Effect)

How effect varies by segment. Example: "Enterprise customers show 30% reduction vs 12% for SMB."

Correlation vs Causation

Critical Distinction: SHAP tells us what predicts churn (correlation). EconML tells us what causes changes in churn (causation). Your recommendations should focus on causal factors because those are actionable.

Section 5: Recommendations 5 marks

Word target: ~250 words | Purpose: Synthesise all findings into actionable strategy

This is the highest-weighted section! It tests whether you can connect all your analysis into coherent business strategy.

Strong Recommendation

"Based on CATE analysis showing 30% churn reduction for high-value customers, CloudMetrics should prioritise retention campaigns for Enterprise accounts with health scores below 50. Expected ROI: $240,000 annually."

Weak Recommendation

"CloudMetrics should focus on reducing churn and improving customer satisfaction through better service."

Strong recommendations include:

Specific actions (who, what, when)
Evidence from your analysis (cite specific findings)
Expected business impact (quantified where possible)
Prioritisation based on causal effects, not just correlations

Section 6: Business Communication 4 marks

Purpose: Overall report quality, structure, and professional presentation

Writing Quality

Clear, concise language
No jargon without explanation
Logical flow between sections
Executive-appropriate tone

Visual Elements

3-5 charts from notebook
Charts support key points
Proper labels/captions
Professional formatting

Structure

All 6 sections present
Within word limit
Notebook link included
Proper .docx format

Visualisation Selection Tips

Choose charts that directly support your narrative. Good choices: SHAP summary plot (for Section 3), CATE by segments (for Section 4), confusion matrix or feature importance (for Section 2).

Understanding the Mark Distribution

Grade Band	Mark Range	What It Looks Like
High Distinction	21.25 - 25	Exceptional synthesis, specific evidence, quantified business impact
Distinction	18.75 - 21.24	Strong analysis with clear business application, good evidence use
Credit	16.25 - 18.74	Competent analysis, some business insight, adequate evidence
Pass	12.50 - 16.24	Basic understanding, limited business application, some missing elements
Fail	0 - 12.49	Missing sections, no evidence from data, generic statements

Key to High Marks: Every claim must be supported by specific data from YOUR notebook analysis. Generic statements without evidence will not score well.

Tips for High Marks

Do This

Be specific: "7.7% churn rate" not "high churn"
Cite your analysis: "SHAP shows..." or "The ATE of -0.18 indicates..."
Explain business meaning: What does 85% accuracy mean for CloudMetrics?
Connect sections: Recommendations should flow from your findings
Quantify impact: "Expected to save $X annually"
Choose visuals wisely: 3-5 charts that support your narrative

Avoid This

Generic statements: "Churn is a problem for businesses"
Theory without application: Explaining what SHAP is without using results
Missing sections: Skipping TFT or causal analysis
Over word limit: Penalties apply for exceeding 1,320 words
Technical jargon: Write for executives, not data scientists
Disconnected recommendations: Suggestions not based on your analysis

Top 10 Common Mistakes to Avoid

Only covering LightGBM - You need BOTH models
Fewer than 3 visuals - Minimum required
No specific numbers - Every section needs data
Confusing correlation with causation - SHAP vs EconML
Vague recommendations - "Improve customer service"

Missing notebook link - Required for reference
Wrong file format - Must be .docx
Over word limit - 5% penalty per 10% over
Not running all notebook cells - Missing outputs
Copy-pasting technical output - Interpret, don't copy

                    Word Count Penalties
                    
                        Over LimitPenalty
1-10% (1,201-1,320 words)-0.5 marks
11-20% (1,321-1,440 words)-1.0 marks
>20% (1,441+ words)-2.0 marks

                

Over Limit	Penalty
1-10% (1,201-1,320 words)	-0.5 marks
11-20% (1,321-1,440 words)	-1.0 marks
>20% (1,441+ words)	-2.0 marks

Using Generative AI (GenAI Level 2)

Level 2: Optional Use with Documentation

You MAY use GenAI tools (ChatGPT, Claude, etc.) to help with this assessment, but you MUST document your use.

Appropriate GenAI Use

Explaining technical concepts
Improving grammar and clarity
Structuring your report
Understanding error messages
Generating initial drafts (then edit significantly)

Inappropriate GenAI Use

Generating entire report without editing
Not documenting GenAI use
Using GenAI output without verification
Submitting AI analysis of different data
Asking GenAI to interpret YOUR notebook results*

*Critical: GenAI cannot see your notebook. If you ask "what does my SHAP plot show?", it will make up an answer. Always interpret your own results.

Recommended Timeline

Week 4

Review materials

Read brief & rubric

→

Week 4-5

Complete notebook

30-45 minutes

→

Week 5

Write report

3-4 hours

→

Before Due

Review & submit

Check all requirements

                    Pre-Submission Checklist
                    All 6 sections complete
Word count under 1,200 (check in Word)
3-5 visualisations included
Notebook link in report
Saved as .docx format
GenAI use documented (if applicable)

                

Resources Available to You

Assessment Materials

Assessment Brief (PDF)
Student Marking Rubric
Google Colab Notebook
Dataset files (CSV)

Course Materials

Week 2: LightGBM slides
Week 3: Deep Learning
Week 4: Causal ML
Workshop notebooks

Support

Workshop Q&A time
Discussion forums
Email your facilitator
Academic Success Centre

Where to Find Materials

All assessment materials are available on MyKBS under the Assessment 1 section. The notebook link and datasets will be provided in Week 4.

Questions?

                Key Takeaways
                Only the business report is marked (not the notebook)
Every claim needs specific evidence from your analysis
Section 5 (Recommendations) is worth the most - synthesise everything
Write for executives, not data scientists
Run the notebook first, then write your report

            

Due: Week 5, Tuesday 23:55 AEST

Good luck with your assessment!