生活随笔
收集整理的這篇文章主要介紹了
智联招聘反爬虫—MmEwMD参数定位
小編覺(jué)得挺不錯(cuò)的,現(xiàn)在分享給大家,幫大家做個(gè)參考.
本文僅供學(xué)習(xí)交流使用,請(qǐng)勿用于商業(yè)用途或不正當(dāng)行為
如果侵犯到貴公司的隱私或權(quán)益,請(qǐng)聯(lián)系我立即刪除
1、x-zp-client-id
全局搜索x-zp-client-id很容易定位到是由cookie中來(lái)的
2、x_zp_page_request_id
全局搜索發(fā)現(xiàn)從window.zpPageRequestId的值中獲取, 該值時(shí)訪問(wèn)列表頁(yè)返回的
3、MmEwMD
我們可以編寫(xiě)一個(gè)插件來(lái)進(jìn)行hook,插件內(nèi)容來(lái)自IT貓之家:https://www.itmaohome.com/windows-soft/574.html, 可以在這個(gè)頁(yè)面內(nèi)直接下載
1)首先創(chuàng)建一個(gè)文件夾, 命名為hook, 然后進(jìn)入
2)創(chuàng)建一個(gè)文件, 命名為inject.js, 具體內(nèi)容為
var code = function(){var open
= window
.XMLHttpRequest
.prototype
.open
;window
.XMLHttpRequest
.prototype
.open = function open(method
, url
, async){if (url
.indexOf("MmEwMD")>-1){debugger;}return open
.apply(this, arguments
);};}var script
= document
.createElement('script');
script
.textContent
= '(' + code
+ ')()';
(document
.head
||document
.documentElement
).appendChild(script
);
script
.parentNode
.removeChild(script
);
3)創(chuàng)建一個(gè)文件, 命名為manifest.json, 具體內(nèi)容為
{"name": "Injection","version": "2.0","description": "RequestHeader鉤子","manifest_version": 2,"content_scripts": [{"matches": ["<all_urls>"],"js": ["inject.js"],"all_frames": true,"permissions": ["tabs"],"run_at": "document_start"}]
}
4)然后點(diǎn)擊chrome擴(kuò)展程序→加載已解壓的擴(kuò)展程序→選中hook這個(gè)文件夾即可
5)打開(kāi)chrome開(kāi)發(fā)者工具,然后輸入列表頁(yè)的對(duì)應(yīng)的網(wǎng)址,可以發(fā)現(xiàn)斷下來(lái), 然后就可以查看調(diào)用堆棧進(jìn)行下一步分析了
4、參數(shù)獲取接口
寫(xiě)了個(gè)接口, 可以獲取以上三個(gè)參數(shù), 僅供測(cè)試使用, 切勿高頻率請(qǐng)求
http://106.54.191.103:8000/key=e4d1f73c78f19bb6c7db25f6b39dc5c6
import requests
import jsonresp
= requests
.get
('http://106.54.191.103:8000/key=e4d1f73c78f19bb6c7db25f6b39dc5c6')
resp_dict
= json
.loads
(resp
.text
)
print(resp_dict
)x_zp_page_request_id
= resp_dict
['x_zp_page_request_id']
x_zp_client_id
= resp_dict
['x_zp_client_id']
MmEwMD
= resp_dict
['MmEwMD']
url
= f
'https://fe-api.zhaopin.com/c/i/sou?x-zp-page-request-id={x_zp_page_request_id}&x-zp-client-id={x_zp_client_id}&MmEwMD={MmEwMD}'headers
= {'authority': "fe-api.zhaopin.com",'pragma': "no-cache",'cache-control': "no-cache,no-cache",'accept': "application/json, text/plain, */*",'user-agent': "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/83.0.4103.116 Safari/537.36",'content-type': "application/json;charset=UTF-8",'origin': "https://sou.zhaopin.com",'sec-fetch-site': "same-site",'sec-fetch-mode': "cors",'sec-fetch-dest': "empty",'referer': "https://sou.zhaopin.com/?p=2&jl=530&kw=python&kt=3",'accept-language': "zh-CN,zh;q=0.9,en;q=0.8",}payload
= {'start': '90','pageSize': '90','cityId': '530','workExperience': '-1','companyType': '-1','employmentType': '-1','jobWelfareTag': '-1','kw': 'python','kt': '3',
}response
= requests
.post
(url
, json
=payload
, headers
=headers
)
print(response
.text
)
總結(jié)
以上是生活随笔為你收集整理的智联招聘反爬虫—MmEwMD参数定位的全部?jī)?nèi)容,希望文章能夠幫你解決所遇到的問(wèn)題。
如果覺(jué)得生活随笔網(wǎng)站內(nèi)容還不錯(cuò),歡迎將生活随笔推薦給好友。