selenium is good, although inefficient
articles are obtained through ajax
, why don't you just use this interface?
finally, I chose puppeteer
. I think that the retro combination of scrapy and bs4 will not fail to apply
dynamic web pages loaded through ajax. It is recommended to use selenium
.Please tell me why chrome debugging js file breakpoint, then why walk to other js files inside? how to avoid this? ...
now there is a requirement that begins with 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 145, 147, 150, 151, 152, 153, 155, 156, 157, 158, 159, 173, 175, 176, 177, 178, 180, 181, 182, 183, 184, 184, 185, 186, 187, 188, 189, 166, 198, 199 ...