The world of movies contains enormous information and is worth digging for prediction and recommendation. IMDB database is one good example for extracting the relevant features, including both the item-based information (only related to the movies), for example, the title, the poster, the trailer, the genre, the cast, the director, etc., and the user-related information, such as name, gender, age, occupation, review, number of watching (the same movie), etc. Our project plans to benefit both the movie production corporations and the audience. For movie production corporations, our goals include better predicting the movie ratings given the item-based information and the user-related information. Data mining and supervised machine learning techniques are applied for labelling and classification to the nominal features. Text mining and sentimental analysis techniques are used for the review context in the database. For the audience, the hybrid recommendation system algorithm is introduced. The missing patterns of the features in the dataset are researched via visualizations and analyses. And special analysis on the “cold start” phenomenon (recommendation for new items or from insufficient information) on the database is also conducted.
Publisher
Conference on Statistical Learning and Data Science / Nonparametric Statistics
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.