Description
Q1. Regularization
[15 points] We use polynomial regression for the prediction task of a dataset. The given dataset includes a train set (train.csv) and a test set (test.csv). To illustrate the effect of regularization, please first implement the following regression models using python language (third-party packages are allowed). Then, plot the data points of the train set and the regression lines of the trained models. Finally, compute the RMSE of the trained models using the test set and make a comparative discussion about underfitting and overfitting.
โข Polynomial regression without regularization (polynomial to 5th power)
โข L1 Regularized polynomial regression: ๐ = 1 and ๐ = 100 โข L2 Regularized polynomial regression: ๐ = 1 and ๐ = 100
The given datasets can be downloaded at:
https://drive.google.com/drive/folders/1LSZNIEWf6XKnQtRw8L01tS6yAB67Aad2?usp=sharing
โข The initial values for parameters ๐ฅ = [0.77 0.48 0.19 0.43 0.31] and ๐๐ = [0.68
0.44 0.51 0.18
0.36 0.62
0.78]
0.08
0.92
Q2. Recommender System
Build up a collaborative filtering-based recommender system to provide effective hotel recommendation. The training dataset as shown in the table below contains the ratings from 4 users to 3 hotels. The ratings range from 1 point to 5 points.
Hotel 1 Hotel 2 Hotel 3
User 1 5 1 ?
User 2 4 ? 3
User 3 ? 4 5
User 4 3 3 4
We use the gradient descent algorithm to solve cost minimization in the collaborative filtering model. Some settings are as follows.
โข The constant learning rate ๐ผ = 0.0002
โข The regularization parameter ๐ = 0.02
โข The dimension for user/item feature vectors ๐พ = 2
a) [5 points] If we finally obtain ๐ฅ(1) = [1.268 0.994]๐ and ๐(3) = [0.271 0.694]๐ after the training procedure, what is the rating of user 3 on hotel 1?
b) [10 points] Calculate the values of ๐ฅ1(1) (i.e., the first element in the item feature vector of hotel 1) and ๐1(2) (i.e., the first element in the user feature vector of user 2) after the first iteration.
c) [5 point] Implement the gradient descent algorithm to update the parameters ๐ฅ and ๐ using python language. Please calculate the ratings of user 2 on hotel 2 after 50 rounds and upload the source code file.
ps. For a) and b), the detailed calculation process is required and the intermediate and final results should be rounded to 3 decimal places.
Q3. Neural Network
[10 points] Consider the following neural network:
Where ๐๐ = โ๐ ๐ค๐๐๐ง๐ ๐ง๐ = ๐๐(๐๐) for ๐ = 1,2,3,4 ๐ง0 = ๐0 (an input neuron) ๐3(๐ฅ) = relu(๐ฅ) and ๐1(๐ฅ) = ๐2(๐ฅ) = ๐4(๐ฅ) = sigmoid(๐ฅ). relu(๐ฅ) corresponds to a rectifier linear unit transfer function defined as: relu(๐ฅ) = max {0,๐ฅ}. The cost function is defined as ๐ฝ(๐ค) = 12 (๐ง4 โ ๐ฆ)2.
(a) Write a function ๐น to simulate the neural network.
๐๐ฝ
(b) Assume that we are given a training data ๐ฅ = 1.0,๐ฆ = 0.1 what is the value of ๐๐ค34?
Reviews
There are no reviews yet.