HTML Parsing
Reference
- “HOWTO Fetch Internet Resources Using The urllib Package” - file:///Library/Frameworks/Python.framework/Versions/3.2/Resources/English.lproj/Documentation/howto/urllib2.html?highlight=urlopen (local copy of the Python 3.2 documentation)
- “Python and HTML Processing” - http://www.boddie.org.uk/python/HTML.html (written for Python 2.x)
Fetching URLs
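    import urllib.request

    # Build a request object for the target URL, then open it.
    req = urllib.request.Request('http://www.voidspace.org.uk')
    response = urllib.request.urlopen(req)
    # read() returns the response body as bytes, not str.
    the_page = response.read()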
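The fetch above stops at the raw bytes. As a minimal sketch of the next step (turning the page into text and pulling something out of it), the following uses only the standard library. The names `LinkCollector`, `fetch_text`, and `extract_links` are hypothetical helpers made up for this note, and falling back to UTF-8 when the server declares no charset is an assumption, not something the HOWTO prescribes.

```python
import urllib.error
import urllib.request
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Hypothetical helper: collects the href value of every <a> tag."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names in lowercase.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value is not None:
                    self.links.append(value)


def fetch_text(url, timeout=10):
    """Return the decoded body of url, or None if the request fails."""
    req = urllib.request.Request(url)
    try:
        with urllib.request.urlopen(req, timeout=timeout) as response:
            raw = response.read()
            # Assumption: fall back to UTF-8 if no charset is declared.
            charset = response.headers.get_content_charset() or "utf-8"
    except urllib.error.URLError as exc:  # HTTPError is a subclass
        print("request failed:", exc)
        return None
    return raw.decode(charset, errors="replace")


def extract_links(html_text):
    """Parse an HTML string and return the list of link targets."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links
```

Possible usage, which needs network access: `page = fetch_text('http://www.voidspace.org.uk')`, then `extract_links(page)` if `page` is not None. Keeping the parsing in its own function means it can be tried on a literal HTML string without fetching anything.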