Crawlers crawl web pages, a single crawler can crawl, multi-threaded crawlers can not open the url?

as mentioned above, if a single crawler can crawl, the multithreaded crawler cannot open the url. Is the request time between the multithreaded crawlers too short, which triggers the anti-crawler mechanism of the website?

May.22,2021

pay attention to the delay request. I usually only start batch downloading when downloading images.


you can try to change the proxy every time. The IP, is most likely pulled into the blacklist after the access rate is too fast.

MySQL Query : SELECT * FROM `codeshelper`.`v9_news` WHERE status=99 AND catid='6' ORDER BY rand() LIMIT 5
MySQL Error : Disk full (/tmp/#sql-temptable-64f5-1b40d97-2c53a.MAI); waiting for someone to free some space... (errno: 28 "No space left on device")
MySQL Errno : 1021
Message : Disk full (/tmp/#sql-temptable-64f5-1b40d97-2c53a.MAI); waiting for someone to free some space... (errno: 28 "No space left on device")
Need Help?