Data mining of popular science books based on web crawler









Abstract

Big data, artificial intelligence, mobile internet and other information technologies has provided increasingly rich technical means for science popularization, improved the information source of science popularization statistical data, and provided better data support for the research work in the field of science popularization. Using web crawler to grab the data of Taobao, using excel and spss to clean, summarize and structure the data, such as the distribution of sales, the relationship between price and sales, the relationship between price ranking and sales, the relationship between price and comment number, etc., and then carry out data visualization processing and relevant analysis to summarize the public's consumption interest and attention of popular science books.


Modules


Algorithms


Software And Hardware

• Hardware: Processor: i3 ,i5 RAM: 4GB Hard disk: 16 GB • Software: operating System : Windws2000/XP/7/8/10 Anaconda,jupyter,spyder,flask Frontend :-python Backend:- MYSQL