Collaborative filtering has been used to generate recommendations for users. From Listenbrainz data dump, 56011220 listens from 2005-01 to 2019-12 have been used to train, validate and test the model. Approximately, 65% data has been used to train the model, and the remaining data has been used to validate and test the model.
This refers to the number of factors in our ALS model, that is,the number of hidden features in our low-rank approximation matrices.
This parameter controls the regularization of our model.Thus, lambda controls over fitting.
This refers to the number of iterations to run.
The alpha parameter controls the baseline level of confidence weighting applied.A higher level of alpha tends to make the model more confident about the fact that missing data equates to no preference for the relevant user-item pair.
The following table gives information about the parameters fed to the model in every iteration
Note: Here, iteration does not refer to the parameter "iteration", but the number of times the whole process of training is carried out.
rank | lmbda | iterations | validation rmse |
---|---|---|---|
8 | 0.1 | 10 | 8.46 |
8 | 0.1 | 20 | 8.01 |
8 | 10.0 | 10 | 7.60 |
8 | 10.0 | 20 | 8.07 |
12 | 0.1 | 10 | 8.59 |
12 | 0.1 | 20 | 8.53 |
12 | 10.0 | 10 | 8.19 |
12 | 10.0 | 20 | 7.14 |
Value of alpha used is 3.0
Best model has error = 7.14, rank = 12, lmbda = 10.0, iteration = 20.
Best model trained in 6501.89s