DATA5000 Comprehensive Review Quiz

Artificial Intelligence Programming in Business Analytics - Final Preparation

Time Remaining:
30:00

Student Information

Question 1: AI vs ML Relationship

Below is a diagram showing the relationship between different AI concepts:

AI Technology Hierarchy Artificial Intelligence Machine Learning Deep Learning Expert Systems Robotics Rule-based AI Computer Vision

Based on this diagram, which statement is most accurate?

Question 2: Four Types of Business Analytics

The chart below shows the progression of business analytics types:

Business Analytics Progression Descriptive: "What happened?" Diagnostic: "Why did it happen?" Predictive: "What will happen?" Prescriptive: "What should we do?" Increasing Value & Complexity

A retail company wants to decide whether to increase inventory for winter products. Which type of analytics should they primarily use?

Question 3: Machine Learning Workflow

The flowchart below shows the typical ML workflow:

Machine Learning Workflow Data Collection Data Preprocessing Train/Test Split Model Training Model Evaluation Deployment

In the ML workflow, what is the primary purpose of the Train/Test Split step?

Question 4: LightGBM vs NeuralProphet

Comparison of two algorithms learned in Week 2:

Algorithm Best Use Case Data Type Key Strength
LightGBM Tabular data classification/regression Structured data with features High accuracy, fast training
NeuralProphet Time series forecasting Sequential time-based data Captures complex seasonality

A company wants to predict customer purchase amounts based on their demographics, past purchases, and website behavior. Which algorithm would be most appropriate?

Question 5: Neural Network Architecture

Below is a simplified neural network diagram:

Neural Network Architecture Input Layer Hidden Layer Output Layer Features Pattern Recognition Prediction

What is the primary purpose of the hidden layer in this neural network?

Question 6: Transformer Attention Mechanism

Below is a visualization of how attention works in transformers:

Transformer Attention Mechanism The cat sat on mat 0.1 0.6 0.2 0.05 0.05 cat* *Enhanced representation of "cat" considering context

What does the attention mechanism in transformers primarily accomplish?

Question 7: Treatment Effects Visualization

Below is a chart showing treatment effects for a marketing campaign across different customer segments:

Marketing Campaign Treatment Effects Young Middle-aged Seniors All Groups Customer Segments 0% 5% 10% 15% 20% Sales Increase (%) 12% 6% 4% 8% CATE CATE CATE ATE

What does this data suggest about the marketing campaign's effectiveness?

Question 8: SHAP Values Interpretation

SHAP values for a house price prediction model:

SHAP Values: House Price Prediction Base: $300K Location: +$50K Size: +$30K Garden: +$10K Age: -$25K Prediction: $365K $300K + $50K + $30K + $10K - $25K = $365K

Based on these SHAP values, which feature has the strongest impact on increasing the house price?

Question 9: True/False

Deep Learning is always better than traditional Machine Learning for business applications.

Question 10: True/False

Correlation always implies causation in business data analysis.

Question 11: Analytics Integration

A retail company has the following business scenario:

  • They want to understand customer purchasing patterns
  • Predict future sales for inventory planning
  • Determine the causal impact of marketing campaigns
  • Decide optimal marketing budget allocation

Which combination of analytics types would best address these needs?

Question 12: Algorithm Selection

A streaming service wants to:

  1. Predict how many users will watch a new show (based on viewer demographics, genre preferences, time of release)
  2. Forecast daily viewing hours for capacity planning (using historical viewing data)
  3. Understand what factors drive user engagement (interpretable model needed)

Which algorithms would be most appropriate for these three tasks respectively?

Question 13: Cross-Validation Calculation

A machine learning model is evaluated using 5-fold cross-validation with the following accuracy scores for each fold:

Fold 1 Fold 2 Fold 3 Fold 4 Fold 5
0.87 0.92 0.85 0.89 0.91

Calculate the average cross-validation accuracy (round to 2 decimal places):

Question 14: Treatment Effect Calculation

A marketing campaign analysis shows the following results:

Group Average Purchase Before ($) Average Purchase After ($)
Treatment Group (received campaign) 45.20 67.80
Control Group (no campaign) 44.50 48.90

Calculate the Difference-in-Differences (DiD) treatment effect (round to 2 decimal places):

Formula: DiD = (Treatment_After - Treatment_Before) - (Control_After - Control_Before)

$

Question 15: SHAP Values Calculation

A house price prediction model has the following components:

Component Value
Base Value (average prediction) $280,000
Location SHAP value +$45,000
Size SHAP value +$32,000
Age SHAP value -$18,000
Condition SHAP value +$12,000

Calculate the final model prediction for this house:

Formula: Prediction = Base Value + Sum of all SHAP values

$

Question 16: Neural Network Forward Propagation

A simple neural network neuron receives the following inputs and has learned these weights:

Input Value Weight Input × Weight
x₁ (house size) 0.8 0.6 0.48
x₂ (location score) 0.9 0.4 0.36
x₃ (house age) 0.3 -0.2 -0.06
Bias 1.0 0.1 0.10
x₁ 0.8 x₂ 0.9 x₃ 0.3 b 1.0 w₁=0.6 w₂=0.4 w₃=-0.2 w_b=0.1 Σ +bias → σ y Sigmoid: σ(z) = 1/(1+e⁻ᶻ)

Calculate the final output of this neuron after applying the sigmoid activation function (round to 2 decimal places):

Step 1: z = Σ(input × weight) + bias = 0.48 + 0.36 + (-0.06) + 0.10 = ?

Step 2: output = σ(z) = 1/(1+e⁻ᶻ)

Question 17: Comprehensive Understanding

You are hired as an AI consultant for a healthcare system. The client asks:

"We want to use AI to improve patient outcomes and reduce costs. We have 5 years of patient data including demographics, treatments, outcomes, and costs. What's your approach?"

Which comprehensive approach best demonstrates your DATA5000 knowledge?

Quiz Results

Your score: 0/17