use python3 bs4 to climb the latest movie of movie paradise http://www.dytt8.net/
, but crawl out is web data, very messy, you can use soup.findAll to directly find the link tag to extract import urllib.request
from bs4 import BeautifulSoup
html = urllib.request.urlopen (" http://www.dytt8.net/")
bsObj = BeautifulSoup (html,"html.parser")
a = bsObj.findAll ("div", {" class":"co_content8"})
list1 = []
for i in a:
j = i.findAll("a")
print(type(j))
print("-sharp-sharp-sharp")
print(list1.append(str(j)))
print (" list1 is:", list1)
print (type (list1)
print (len (list1)
for n in list1:
print(n.split(","))
part of the source code of the web page is as follows: