A question about a Python crawler and big data.

My graduation project is a system that crawls the Douban website, and my classmate's graduation project is on a big data topic. Our teacher Bishu asked me to use the crawler to collect and store the data, then build an interface to transmit it to my classmate (from my computer to his), so that he can process the data on the platform he built and display the results. My crawler system is now finished and his platform is set up. How should I design the interface for transmitting the data, and what do I need to learn to do it?

Mar.03,2021

My colleagues and I have cooperated in the same way. After crawling the data, I store it directly in MySQL, and it can be queried there directly. We agreed on the database table structure in advance.
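
As a rough illustration of the shared-table approach, here is a minimal sketch. It uses Python's built-in sqlite3 module as a stand-in for MySQL so that it runs without a server; the table name `movies` and its columns are assumptions, not a schema from this thread. With MySQL the idea is the same, but the driver, the placeholder syntax, and the upsert statement differ.

```python
import sqlite3

# Hypothetical schema that both sides agree on in advance.
SCHEMA = """
CREATE TABLE IF NOT EXISTS movies (
    douban_id TEXT PRIMARY KEY,
    title     TEXT NOT NULL,
    rating    REAL
)
"""


def save_movies(conn, movies):
    """Crawler side: insert crawled records, replacing rows with the same id."""
    conn.execute(SCHEMA)
    conn.executemany(
        "INSERT OR REPLACE INTO movies (douban_id, title, rating) VALUES (?, ?, ?)",
        [(m["douban_id"], m["title"], m["rating"]) for m in movies],
    )
    conn.commit()


def load_movies(conn, min_rating=0.0):
    """Consumer side: read the rows back with an ordinary query."""
    cursor = conn.execute(
        "SELECT douban_id, title, rating FROM movies "
        "WHERE rating >= ? ORDER BY douban_id",
        (min_rating,),
    )
    return cursor.fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # in-memory database, for the demo only
    save_movies(conn, [
        {"douban_id": "1", "title": "Example Movie A", "rating": 9.1},
        {"douban_id": "2", "title": "Example Movie B", "rating": 7.0},
    ])
    print(load_movies(conn, min_rating=8.0))
```

The point of agreeing on the schema first is that neither side needs to know how the other's code works, only what the table looks like.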


Design the interface data format around what the other platform needs. It is much the same as writing an API for an app: return the data as JSON.
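
For example, a JSON response might wrap one page of records in a small envelope. This is only a sketch: the field names (`code`, `total`, `page`, `page_size`, `data`) are assumptions chosen for illustration, and the real format should be whatever the two of you agree on.

```python
import json


def build_response(items, page=1, page_size=20):
    """Wrap one page of crawled records in a JSON envelope.

    The envelope field names are hypothetical, not a standard.
    """
    start = (page - 1) * page_size
    payload = {
        "code": 0,                 # 0 means success in this sketch
        "total": len(items),       # total number of records available
        "page": page,
        "page_size": page_size,
        "data": items[start:start + page_size],
    }
    # ensure_ascii=False keeps non-ASCII text (e.g. Chinese titles) readable.
    return json.dumps(payload, ensure_ascii=False)


if __name__ == "__main__":
    records = [{"id": i, "title": f"Example Movie {i}"} for i in range(5)]
    print(build_response(records, page=2, page_size=2))
```

Returning the data in pages keeps each response small, which matters once the crawl grows beyond what fits comfortably in one request.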


You can simply design a database structure together: you write to it on your side and he reads from it on his. A relational database such as MySQL or a document database such as MongoDB would both work.


Your crawler is a WinForms project. You can create a separate web project: whatever interface your classmate needs, write a corresponding Web API endpoint that sends him the data as JSON.
If you want to do it inside the crawler's WinForms project, you can write a socket server on your side and a socket client on your classmate's side to request the data. However, writing the server in WinForms is not recommended. The Web API mentioned above is the preferred option, followed by a web service and so on.
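
Since the question is about a Python crawler, the Web API idea can be sketched with Python's standard library alone. This is an assumption-laden illustration, not a production design: the endpoint path `/api/movies`, the helper names `make_server` and `fetch_movies`, and the sample record are all hypothetical, and a real project would more likely use a web framework.

```python
import json
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer
from urllib.request import urlopen


def make_server(records, host="127.0.0.1", port=0):
    """Build an HTTP server that returns `records` as JSON at /api/movies."""

    class Handler(BaseHTTPRequestHandler):
        def do_GET(self):
            if self.path != "/api/movies":  # hypothetical endpoint path
                self.send_error(404, "unknown endpoint")
                return
            body = json.dumps({"data": records}, ensure_ascii=False).encode("utf-8")
            self.send_response(200)
            self.send_header("Content-Type", "application/json; charset=utf-8")
            self.send_header("Content-Length", str(len(body)))
            self.end_headers()
            self.wfile.write(body)

        def log_message(self, fmt, *args):
            pass  # suppress per-request logging to keep the demo quiet

    # port=0 asks the operating system for any free port.
    return ThreadingHTTPServer((host, port), Handler)


def fetch_movies(base_url):
    """Client side: request the data and decode the JSON body."""
    with urlopen(base_url + "/api/movies", timeout=5) as resp:
        return json.loads(resp.read().decode("utf-8"))["data"]


if __name__ == "__main__":
    server = make_server([{"title": "Example Movie", "rating": 8.5}])
    threading.Thread(target=server.serve_forever, daemon=True).start()
    host, port = server.server_address
    print(fetch_movies(f"http://{host}:{port}"))
    server.shutdown()
    server.server_close()
```

Here both ends run on one machine for the demo. To serve a second computer, the server would bind to an address reachable on the shared network with a fixed port, and the client would use that address as `base_url`.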


1. What is the order of magnitude of your data: MB, GB, or PB?
2. Is the crawled data stored in files or in a database?
3. How do you plan to expose the data to the other party: a web API, FTP, or direct database access?
Once you have thought these questions through, you will naturally know what to do and which materials to study.
