Python編程基礎訓練之網絡爬蟲模擬登錄豆瓣網

一、基礎回顧

1、瀏覽器訪問網頁的過程

 2、URL

3、HTTP Request請求

4、HTTP請求類型

 5、HTTP Response響應

6、HTTP 狀態碼 

HTTP Requ

 二、使用錯誤的賬號密碼登錄豆瓣網,通過Chrome來獲取POST的數據字段

 三、通過POST方式,利用已註冊好的賬號密碼模擬登錄操作

# -*- coding: utf-8 -*-
"""
Created on Mon Jun 24 15:34:15 2019

@author: UnderMask
"""
#利用賬號密碼登錄豆瓣網,python代碼模擬登錄網頁操作
import requests
import http.cookiejar as cookielib
import random#隨機添加/修改User-Agent
ualist = [#一些可用的瀏覽器名稱
    "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.1 (KHTML, like Gecko) Chrome/22.0.1207.1 Safari/537.1",
    "Mozilla/5.0 (X11; CrOS i686 2268.111.0) AppleWebKit/536.11 (KHTML, like Gecko) Chrome/20.0.1132.57 Safari/536.11",
    "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/536.6 (KHTML, like Gecko) Chrome/20.0.1092.0 Safari/536.6",
    "Mozilla/5.0 (Windows NT 6.2) AppleWebKit/536.6 (KHTML, like Gecko) Chrome/20.0.1090.0 Safari/536.6",
    "Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.1 (KHTML, like Gecko) Chrome/19.77.34.5 Safari/537.1",
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/536.5 (KHTML, like Gecko) Chrome/19.0.1084.9 Safari/536.5",
    "Mozilla/5.0 (Windows NT 6.0) AppleWebKit/536.5 (KHTML, like Gecko) Chrome/19.0.1084.36 Safari/536.5",
    "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1063.0 Safari/536.3",
    "Mozilla/5.0 (Windows NT 5.1) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1063.0 Safari/536.3",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_8_0) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1063.0 Safari/536.3",
    "Mozilla/5.0 (Windows NT 6.2) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1062.0 Safari/536.3",
    "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1062.0 Safari/536.3",
    "Mozilla/5.0 (Windows NT 6.2) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1061.1 Safari/536.3",
    "Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1061.1 Safari/536.3",
    "Mozilla/5.0 (Windows NT 6.1) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1061.1 Safari/536.3",
    "Mozilla/5.0 (Windows NT 6.2) AppleWebKit/536.3 (KHTML, like Gecko) Chrome/19.0.1061.0 Safari/536.3",
    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/535.24 (KHTML, like Gecko) Chrome/19.0.1055.1 Safari/535.24",
    "Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/535.24 (KHTML, like Gecko) Chrome/19.0.1055.1 Safari/535.24" ]
url = "https://accounts.douban.com/j/mobile/login/basic"
headers = {"Connection":"Keep-alive", "User-Agent":random.choice(ualist)}#隨機添加/修改User-Agent
data = {'name':'xxxxxxxxxxx', 'password':'xxxxxx'}#自己的賬號密碼
r = requests.post(url, data = data, headers = headers)
print(r.text)#查看響應內容,r.text返回的是Unicode格式的數據
print(r.url)#查看完整url地址
print(r.status_code)#查看響應碼

四、運行效果

 Python代碼利用賬號密碼模擬登錄豆瓣網頁操作完成,可以類比此類網頁的登錄、過於複雜的驗證碼驗證網頁除外~

estHTTP 請求

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章