course-recommendation

this is the final project of the course 2022 ADL Fall, topic: how how grand challenge

Introduction

In this challenge, there are 2 main domains, seen and unseen. Each domain contains 2 sub-domain which is course-prediction and topic-prediction. Literally, we are going to recommend the courses and the topics of course for the customers based on different level of information.

For both domain we have the customers' personal information such as the gender, occupation, interests and recreation. There are also some information related to the courses such as the course desciptions, the name of instructors, introduction of instructors, topic of the coures and also some brief introduciton of courses' chapter. Theres is seen custormes' purchase history, howerver, there isn't any purchase history of unseen custormers and that's basically the "unseen" stands for.

In terms of the methodology, in seen domain, I apply the traditional collaborative filtering(CF) method and also conduct few trials applying neural network framework. As for the unseen domain, I take advantage of the BM25 algorithm and neural network as well.

Methods

The idea of CF is that it's reasonable to recommend based on a bunch of items purchased by the customers who share similarity with the customer. Hence, it's feasible for the seen customers to use CF since we can constuct the customer-course matrix based on the purchase history. I adopt three algorithms: alternating least squares(ALS), bayesian personalized ranking(BPR) and k nearest neighbors. The ALS and BPR are kind of tricky in that the assumption contains that there are latent representaion for both customers and courses. The goal is to find the best latent representation to approximate the customer-course matrix.

With regard to BM25, it's a mature techique in the Field of information retrieval. The concept is given query and list of documents, find the best match based on the frequency.This concept can be adopted in both seen and unseen domain in that it doesn't depend on the purchase history.

At last, there are various deep neural network framework, and I first try out multi-layer perceptron(MLP) with and without dropout layer, and also the "two tower" latent model. All the experiments result will be present.

Experiments

For course-prediction task, I adopt various algorithms. While for topic-prediction task, I wheter borrow the course-prediciotion results and find their corresponding topics as the predicted topics or directyly apply the algorithm for topic-prediction task. Moreover, the evaluation metric is the mean average precesion at 50.

Collaborative Filtering(seen domain)

After try out the ALS, BPR and KNN, I further do the rearrangement wishing to improve the order of the recommendation further. The rearrangement idea is why not consider the recommendation result of three based method simultaneously? There are two rearrangement strategy, one is combing two of them and the other is combining all of them. The key difference among the rearrangement stratety is to choose the base result and one or two of the reference results since we have three kinds of CF methods. Take BPR_KNN for example, the BPR is the based method and the KNN is the reference while in terms of the BPR_ALS_KNN, BPR is still the based method and both ALS and KNN are referenced.

ALS

There are three major hyperparamers, alpha, regularizations, factors in ALS and in out case the alpha have minor impact. Hence, I first fix the number of iterations to 30. After find the optimal regularizations and factors, I further inspect the impact of number of iterations.

task\hparam	factors	regularization	iterations	val_score
course	60	51.02939	312	0.07909
topic from course	370	489.7961	30	0.25506
topic	40	51.02939	425	0.2437019

BPR

There are two major hyperparamers, factors, learning rate in BPR. Hence, I first fix the number of iterations to 300. After find the optimal factors and learning rate, I further inspect the impact of number of iterations.

task\hparam	factors	learning rate	iterations	val_score
course	30	0.001437	300	0.078738
topic from course	430	0.000418	300	0.25742
topic	250	0.00062	389	0.20595

KNN

The only hpyerparameter for KNN algorithm is the numbor of neighbors. Hence, I inspect the interval of k_nighbor from 100 to 2000 getting 100 values to reach the best validation score.

task\hparam	k_neighbor	val_score
course	2000	0.061138
topic from course	1980	0.247165
topic	1980	0.236282

MIX2

task\rearrangement_strategy	ALS_BPR	ALS_KNN	BPR_ALS	BPR_KNN	KNN_ALS	KNN_BPR
course	0.07802	0.07357	0.08092	0.07258	0.07130	0.07091
topic from course	0.25587	0.25682	0.26285	0.25527	0.25639	0.25342
topic	0.23413	0.25247	0.23144	0.216	0.24102	0.21328

MIX3

task\regarrangement_strategy	ALS_BPR_KNN	BPR_ALS_KNN	KNN_ALS_BPR
course	0.07602	0.07643	0.07041
topic from course	0.26129	0.25386	0.26140
topic	0.24405	0.23817	0.23609

BM25(unseen domain)

seen

task\model	BM25_Okapi	BM25L	BM25+
course	0.02904	0.02883	0.02591
topic from course	0.22820	0.18742	0.21973

unseen

task\model	BM25_Okapi	BM25L	BM25+
course	0.05158	0.04905	0.04765
topic from course	0.29233	0.26805	0.28547

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
BM25		BM25
CF_KNN		CF_KNN
DNN		DNN
preprocess		preprocess
report/picture		report/picture
.gitignore		.gitignore
README.md		README.md
download.sh		download.sh
metric.py		metric.py
preprocess.sh		preprocess.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

course-recommendation

Introduction

Methods

Experiments

Collaborative Filtering(seen domain)

ALS

BPR

KNN

MIX2

MIX3

BM25(unseen domain)

seen

unseen

nerual network in seen domain

MLP(without dropout)

MLP(with dropout)

Two Tower

nerual network in seen domain

conclusion

About

Languages

githubjacky/course-recommendation

Folders and files

Latest commit

History

Repository files navigation

course-recommendation

Introduction

Methods

Experiments

Collaborative Filtering(seen domain)

ALS

BPR

KNN

MIX2

MIX3

BM25(unseen domain)

seen

unseen

nerual network in seen domain

MLP(without dropout)

MLP(with dropout)

Two Tower

nerual network in seen domain

conclusion

About

Topics

Resources

Stars

Watchers

Forks

Languages