Using re.findall to create a link grabber in Python -
so going try , create scrapper friend. basically, want take links website. that's it.
i know around lines of:
links = re.findall() print links
i found re.finall on web, not sure how use it! pointers in right direction lot!
try beautifulsoup instead. handles crappy html , presents nice interface parsing html. plus, it's easy use. here's scraper (straight from docs):
for link in soup.find_all('a'): print(link.get('href'))
Comments
Post a Comment