如今最常用的编码方式是unicode编码,Unicode编码可以把文字编码后夹在url后作为后缀
quote编码
#如今的常用编码方式:unicode编码
import urllib.parse
import urllib.request
import time
url = 'https://www.baidu.com/s?w='
headers={
'User-Agent':
'Mozilla/5.0 (Linux; Android 6.0; Nexus 5 Build/MRA58N) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/118.0.0.0 Mobile Safari/537.36 Edg/118.0.2088.46',
'Accept':
'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7'
}
#把汉字变成Unicode编码模式:编码
name=urllib.parse.quote("葬送的芙莉莲")
url=url+name
print(url)
request = urllib.request.Request(url=url, headers=headers)
response = urllib.request.urlopen(request)
content = response.read().decode('utf-8')
print(content)
用urlencode方式编码
#如今的常用编码方式:unicode编码
import urllib.parse
import urllib.request
import time
url = 'https://www.baidu.com/s?w='
headers={
'User-Agent':
'Mozilla/5.0 (Linux; Android 6.0; Nexus 5 Build/MRA58N) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/118.0.0.0 Mobile Safari/537.36 Edg/118.0.2088.46',
'Accept':
'text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7'
}
#把汉字变成Unicode编码模式:编码
name=urllib.parse.quote("葬送的芙莉莲")
url=url+name
print(url)
request = urllib.request.Request(url=url, headers=headers)
response = urllib.request.urlopen(request)
content = response.read().decode('utf-8')
print(content)
总结:
quote编码适用于只需要编码一个对象的时候,其传入的参数是一个字符串
urlencode编码适用于要编码多个对象的时候,其传入的参数是一个列表