Extension version of previous IMDB classification project. This project tries to predict the IMDB score of movies with different regressors.
The dataset used in this project is 5000 IMDB movie. Here is the link of original dataset from dataworld: https://data.world/data-society/imdb-5000-movie-dataset.
Technique and model used:
- Data cleaning & analysis: Numpy, Pandas
- Data visualization: Plotly, Seaborn
- Data preprocessing: OneHotEnconder, Feature Selection, Polynomial Feature
- Models:
- Linear Regressor, RANSAC, SGDRegressor
- SVC
- DecisionTreeRegressor
- Ensemble: VotingRegressor, Bagging, RandomForest, Adaboost, XGBoost