根据内容类型编码解码接收的字节
必须使用正确的字符编码对接收的字节进行解码,以将其解释为文本:
Python 3.x >= 3.0
import urllib.request
response = urllib.request.urlopen("http://stackoverflow.com/")
data = response.read()
encoding = response.info().get_content_charset()
html = data.decode(encoding)
Python 2.x <= 2.7
import urllib2
response = urllib2.urlopen("http://stackoverflow.com/")
data = response.read()
encoding = response.info().getencoding()
html = data.decode(encoding)