This post was last edited by 北凉不悲凉 on 2018-1-18 16:49
url = 'https://www.hoomxb.com/plan/444'
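I need to scrape the join records on this page. The records are loaded dynamically. I found the URL they are loaded from in the browser's Network panel, but the POST request I build never succeeds and always returns 403. My code is below: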
import requests

headers = {
    'User-Agent': 'Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/57.0.2987.133 Safari/537.36',
    'Host': 'www.hoomxb.com',
    'Referer': 'https://www.hoomxb.com/plan/449',
    'Cookie': 'koa:sess=0uoCTCCnwWNokOwXPMUspYoD; koa:sess.sig=_QXuhUkPls_gQvQnsl5Ima0lYjo; Hm_lvt_669d03c874797a405408c4aafdff0c46=1516153119,1516245778,1516256946; Hm_lpvt_669d03c874797a405408c4aafdff0c46=1516256982',
}
params = {'id': '444'}
html_json = requests.post('https://www.hoomxb.com/api/plan/joinRecord', data=params, headers=headers)
print(html_json.status_code)  # 403
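Could someone help me write this code? It needs to fetch the join records as JSON. I only started with web scraping recently and I'm stuck, so any help is greatly appreciated.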
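One thing that might be worth trying, sketched here under assumptions rather than as a confirmed fix: the Referer in the code points at plan 449 while the request asks for id 444, and the hard-coded Cookie may already have expired. The sketch below lets a `requests.Session` pick up fresh cookies by loading the plan page first, then posts to the same API URL with a matching Referer. The `X-Requested-With` header and the helper names `build_headers` and `fetch_join_records` are guesses of mine, not something the site documents, and the 403 may have another cause entirely (for example a login requirement or anti-bot checks).

```python
import requests

# URLs taken from the post; the {plan_id} template is an assumption
# based on the page URL https://www.hoomxb.com/plan/444.
PLAN_URL = 'https://www.hoomxb.com/plan/{plan_id}'
API_URL = 'https://www.hoomxb.com/api/plan/joinRecord'


def build_headers(plan_id):
    """Build request headers whose Referer matches the requested plan."""
    return {
        'User-Agent': ('Mozilla/5.0 (Windows NT 6.1; WOW64) '
                       'AppleWebKit/537.36 (KHTML, like Gecko) '
                       'Chrome/57.0.2987.133 Safari/537.36'),
        'Referer': PLAN_URL.format(plan_id=plan_id),
        # Assumption: many AJAX endpoints check for this header.
        'X-Requested-With': 'XMLHttpRequest',
    }


def fetch_join_records(plan_id, timeout=10):
    """Load the plan page for fresh cookies, then request the join records."""
    with requests.Session() as session:
        session.headers.update(build_headers(plan_id))
        # The session stores whatever cookies the page sets, so no
        # hard-coded Cookie header is needed.
        session.get(PLAN_URL.format(plan_id=plan_id), timeout=timeout)
        response = session.post(API_URL, data={'id': plan_id}, timeout=timeout)
        response.raise_for_status()  # raises requests.HTTPError on 403
        return response.json()

# Usage (performs real network requests):
#     records = fetch_join_records('444')
#     print(records)
```

If this still returns 403, comparing the request in the Network panel field by field (method, headers, form data versus JSON body) against what the script sends is probably the next step. If the browser sends a JSON body, `json={'id': plan_id}` in place of `data=` would be the change to try.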