When crawling, the fetched content contains a lot of a links.
Among them, the ones I want to get are these:
<ul class="list_box">
<li><a href="xxxx.html">xxxxx</a></li>
<li><a href="xxxx.html">xxxxx</a></li>
<li><a href="xxxx.html">xxxxx</a></li>
<li><a href="xxxx.html">xxxxx</a></li>
<li><a href="xxxx.html">xxxxx</a></li>
<li><a href="xxxx.html">xxxxx</a></li>
</ul>
But I can't get every address. Is there something wrong with my pattern?
$patten = "/<ul\s*class=\"list_box\">\s*<li><a\s*href=\"(.*)\">(.*)<\/a><\/li>\s*<\/ul>/s";
Why does it sometimes fail to match at all?
Sometimes it does match, but it returns only one result.
What I want is to loop over the list and collect the addresses of all the a links.
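For reference, one possible approach is sketched below. The pattern above describes exactly one <li> between <ul> and </ul>, and the greedy (.*) with the /s flag can run across several items, so a single call gives at most one match. The sketch first isolates the list_box block and then runs preg_match_all over it. The sample markup is shortened to two items, and the href values page1.html and page2.html, the link texts, and the $html, $block and $links variable names are assumptions made for illustration, not values from the real page.

```php
<?php
// Hypothetical sample markup: same shape as the list in the question,
// shortened to two items with made-up href values.
$html = <<<HTML
<ul class="list_box">
<li><a href="page1.html">first</a></li>
<li><a href="page2.html">second</a></li>
</ul>
HTML;

$links = [];

// Step 1: isolate the list body. The lazy (.*?) stops at the first </ul>.
if (preg_match('/<ul\s+class="list_box">(.*?)<\/ul>/s', $html, $block)) {
    // Step 2: collect every link inside that block.
    // [^"]* cannot run past the closing quote of the href value.
    preg_match_all(
        '/<a\s+href="([^"]*)"[^>]*>(.*?)<\/a>/s',
        $block[1],
        $matches,
        PREG_SET_ORDER
    );
    foreach ($matches as $m) {
        $links[] = ['href' => $m[1], 'text' => $m[2]];
    }
}

print_r($links);
```

If the real markup varies more than this (extra attributes, single quotes, nested tags), an HTML parser such as DOMDocument is probably more robust than a regular expression.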