Code for scraping the Weibo hot search list

古城 · posted on 2021-4-26 14:39:16

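import time

import requests
from lxml import etree

# Weibo real-time hot search page
url = 'https://s.weibo.com/top/summary?Refer=top_hot&topnav=1&wvr='
header = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.132 Safari/537.36'}

resp = requests.get(url, headers=header)
# Decode the raw bytes as UTF-8, dropping any bytes that cannot be decoded
html = resp.content.decode(encoding='utf-8', errors='ignore')
tree = etree.HTML(html)
# Link text of every entry in the hot search table
title = tree.xpath('//*[@id="pl_top_realtimehot"]/table/tbody/tr/td/a/text()')

# %Y-%m-%d,%H:%M is the portable spelling of %F,%R
print(time.strftime('%Y-%m-%d,%H:%M') + ' Weibo hot search\n')
# Loop over what was actually scraped (at most 51 entries) instead of a
# fixed range(51), which raises IndexError when fewer entries come back
for t in title[:51]:
    print(t, '\n')
    time.sleep(1)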

no1024 · posted on 2022-1-28 16:22:09

print (' '.join([title[i]]),'\n')
IndexError: list index out of range
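
An IndexError like this one appears when a loop indexes title[i] for a fixed count of 51 but the XPath query returned fewer items, possibly none. Why the list came back short is not stated in the thread; a changed page layout or a login redirect are only guesses. A minimal sketch of a safer loop, assuming a hypothetical helper named format_hot_search that is not part of the original script:

```python
def format_hot_search(titles, limit=51):
    """Return up to `limit` non-empty titles, numbered from 1.

    Hypothetical helper: it never indexes past the end of `titles`,
    so a short or empty scrape result cannot raise IndexError.
    """
    # Drop whitespace-only strings and trim the rest
    cleaned = [t.strip() for t in titles if t.strip()]
    # Slicing is safe even when the list is shorter than `limit`
    return [f"{rank}. {text}" for rank, text in enumerate(cleaned[:limit], start=1)]


if __name__ == "__main__":
    # Sample data standing in for the XPath result
    sample = ["topic one", " topic two ", ""]
    lines = format_hot_search(sample)
    if not lines:
        print("No entries found; the page structure may differ from what the XPath expects.")
    for line in lines:
        print(line)
```

Passing the list returned by xpath() to this helper in place of the sample data would print whatever was scraped, and an empty result shows up as a message rather than a traceback.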

hnchshlily · posted on 2022-6-16 10:34:33

Traceback (most recent call last):
  File "C:/Users/Administrator/Desktop/2.py", line 2, in <module>
    import requests
ModuleNotFoundError: No module named 'requests'
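
This ModuleNotFoundError means the interpreter running the script cannot find the third-party requests package; the script also imports lxml, which is third-party as well. A minimal sketch for checking both before running the scraper, assuming a hypothetical helper named missing_modules:

```python
import importlib.util


def missing_modules(names):
    """Return the top-level module names that the import system cannot find.

    Hypothetical helper; find_spec returns None for a top-level module
    that is not installed, without actually importing it.
    """
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(["requests", "lxml"])
    if missing:
        print("Missing packages:", ", ".join(missing))
        # Installing with the same interpreter avoids mixing up environments
        print("Try: python -m pip install " + " ".join(missing))
    else:
        print("All required packages are installed.")
```

Running the suggested pip command with the same Python that runs the script is usually enough to clear the error.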
