CP3501 · Week 4 Neural Networks & Gradient Descent Quiz
0 / 30
Week 4  ·  Assessment Quiz

Neural Networks & Gradient Descent

25 multiple-choice questions on neurons, backpropagation, PyTorch and gradient descent from scratch, plus 5 short-answer questions.

📋 30 questions total ⭐ 30 marks 🕐 No time limit 🔒 Answers not revealed
PART A

Multiple Choice  (25 marks)

Select the single best answer for each question. Each question is worth 1 mark.

Q1

In a neural network, a single neuron computes:

Q2

The purpose of an activation function after a linear layer is to:

Q3

The ReLU activation function outputs:

Q4

Backpropagation is used to:

Q5

Gradient descent updates each weight by:

Q6

The vanishing gradient problem occurs when:

Q7

A fully connected layer with 10 inputs and 5 output neurons contains how many weight parameters (excluding biases)?

Q8

The learning rate controls:

Q9

A bias term in a neuron allows:

Q10

"Depth" in a deep neural network refers to:

Q11

Which activation function is most commonly used in hidden layers of modern deep networks?

Q12

Stochastic Gradient Descent (SGD) updates weights:

Q13

The forward pass through a neural network computes:

Q14

Mean Squared Error (MSE) loss measures:

Q15

If a model's training loss increases consistently over epochs, this most likely indicates:

Q16

The chain rule in backpropagation allows:

Q17

In PyTorch, tensor.requires_grad_(True) tells PyTorch to:

Q18

loss.backward() in PyTorch:

Q19

optimizer.step() in a PyTorch training loop:

Q20

optimizer.zero_grad() must be called each iteration to:

Q21

Momentum in gradient descent helps by:

Q22

The difference between a parameter and a hyperparameter is:

Q23

In the Excel-to-PyTorch analogy taught in Week 4, Excel formulas correspond to:

Q24

Weight initialisation matters because:

Q25

A computational graph in PyTorch records:

PART B

Short Answer  (5 marks — marked by lecturer)

Answer each question in 2–4 sentences. Precise technical language is expected. Code snippets are welcome where relevant.

Q26

Explain gradient descent in your own words. Why do we need iterative optimisation rather than solving directly for the optimal weights?written

Your answer
0 / 700
Q27

Describe the forward pass and backward pass of a neural network. What is computed in each, and how do they work together during training?written

Your answer
0 / 700
Q28

What is the vanishing gradient problem and why does ReLU help address it compared to sigmoid or tanh?written

Your answer
0 / 700
Q29

Write out the steps of a single training iteration (one batch) in PyTorch and explain what each line does: forward pass, loss computation, zero_grad, backward, step.written

Your answer
0 / 700
Q30

Explain the bias-variance tradeoff in the context of neural network depth. How does adding more layers affect a model's bias and variance?written

Your answer
0 / 700

Complete all 30 questions then click Submit. Your MCQ score (25/25) will be shown. Short answers are marked separately.

MCQ Score
0 / 25
✏️ Your 5 short-answer responses are recorded for your lecturer.
Full total: MCQ + short-answer marks = / 30