25 multiple-choice questions on neurons, backpropagation, PyTorch and gradient descent from scratch, plus 5 short-answer questions.
Select the single best answer for each question. Each question is worth 1 mark.
In a neural network, a single neuron computes:
The purpose of an activation function after a linear layer is to:
The ReLU activation function outputs:
Backpropagation is used to:
Gradient descent updates each weight by:
The vanishing gradient problem occurs when:
A fully connected layer with 10 inputs and 5 output neurons contains how many weight parameters (excluding biases)?
The learning rate controls:
A bias term in a neuron allows:
"Depth" in a deep neural network refers to:
Which activation function is most commonly used in hidden layers of modern deep networks?
Stochastic Gradient Descent (SGD) updates weights:
The forward pass through a neural network computes:
Mean Squared Error (MSE) loss measures:
If a model's training loss increases consistently over epochs, this most likely indicates:
The chain rule in backpropagation allows:
In PyTorch, tensor.requires_grad_(True) tells PyTorch to:
loss.backward() in PyTorch:
optimizer.step() in a PyTorch training loop:
optimizer.zero_grad() must be called each iteration to:
Momentum in gradient descent helps by:
The difference between a parameter and a hyperparameter is:
In the Excel-to-PyTorch analogy taught in Week 4, Excel formulas correspond to:
Weight initialisation matters because:
A computational graph in PyTorch records:
Answer each question in 2–4 sentences. Precise technical language is expected. Code snippets are welcome where relevant.
Explain gradient descent in your own words. Why do we need iterative optimisation rather than solving directly for the optimal weights?written
Describe the forward pass and backward pass of a neural network. What is computed in each, and how do they work together during training?written
What is the vanishing gradient problem and why does ReLU help address it compared to sigmoid or tanh?written
Write out the steps of a single training iteration (one batch) in PyTorch and explain what each line does: forward pass, loss computation, zero_grad, backward, step.written
Explain the bias-variance tradeoff in the context of neural network depth. How does adding more layers affect a model's bias and variance?written
Complete all 30 questions then click Submit. Your MCQ score (25/25) will be shown. Short answers are marked separately.