根据内容类型编码解码接收的字节

必须使用正确的字符编码对接收的字节进行解码,以将其解释为文本:

Python 3.x >= 3.0

import urllib.request

response = urllib.request.urlopen("http://stackoverflow.com/")
data = response.read()

encoding = response.info().get_content_charset()
html = data.decode(encoding)

Python 2.x <= 2.7

import urllib2
response = urllib2.urlopen("http://stackoverflow.com/")
data = response.read()

encoding = response.info().getencoding()
html = data.decode(encoding)