Fake Review Detection on Yelp Dataset Using Classification Techniques in Machine Learning








Abstract

This paper provides a summary of our research, which aims to build a machine learning model that can detect whether the reviews on Yelp's dataset are true or fake. In particular, we applied and compared different classification techniques in machine learning to find out which one would give the best result. Brief descriptions for each of the classification techniques are provided to aid understanding of why some methods are better than others in some cases. The best result was achieved by using the XGBoost classification technique, with F-1 score reaching 0.99 in prediction.


Modules


Algorithms

Machine learning algorithms


Software And Hardware

• Hardware: Processor: i3 ,i5 RAM: 4GB Hard disk: 16 GB • Software: operating System : Windws2000/XP/7/8/10 Anaconda,jupyter,spyder,flask Frontend :-python Backend:- MYSQL