Web scraping question

benson · posted 2020-11-22 11:59:44

Source code:
import requests
from bs4 import BeautifulSoup
url="https://fs.lianjia.com/zufang/"
r=requests.get(url)
print(r)
print(r.text)
s=BeautifulSoup(r.text)
print(s,"html.parser")
s.find_all("div",class_="")

Error:
Warning (from warnings module):
File "C:\Users\Administrator\Desktop\python\11.21 01.py", line 7
s=BeautifulSoup(r.text)
GuessedAtParserWarning: No parser was explicitly specified, so I'm using the best available HTML parser for this system ("html.parser"). This usually isn't a problem, but if you run this code on another system, or in a different virtual environment, it may use a different parser and behave differently.

The code that caused this warning is on line 7 of the file C:\Users\Administrator\Desktop\python\11.21 01.py. To get rid of this warning, pass the additional argument 'features="html.parser"' to the BeautifulSoup constructor.
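
A minimal sketch of the fix the warning itself suggests: name the parser explicitly when constructing the soup. The inline `html` string is a hypothetical stand-in for `r.text` so the snippet runs without a network request, and the warning check matches on the category name that appears in the message above.

```python
import warnings
from bs4 import BeautifulSoup

# Hypothetical stand-in for r.text; the real page is not reproduced here.
html = "<html><body><div class='item'><a href='/x'>x</a></div></body></html>"

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    # Passing the parser explicitly means BeautifulSoup does not have to guess one.
    soup = BeautifulSoup(html, features="html.parser")

# Collect any parser-guessing warnings raised while building the soup.
guessed = [w for w in caught if w.category.__name__ == "GuessedAtParserWarning"]
print(len(guessed))
```

Passing `"html.parser"` as the second positional argument, as in the later post, should have the same effect.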

benson · posted 2020-11-22 12:15:01

Sorry, I posted the wrong code.

benson · posted 2020-11-22 12:25:11

This is the correct version.
Source code:
import requests
from bs4 import BeautifulSoup
url="https://dg.lianjia.com/"
r=requests.get(url)
print(r)
print(r.text)
s=BeautifulSoup(r.text,"html.parser")
print(s,"html.parser")
s.find_all("div",class_="")
s.find_all("div",class_="pic-panel")
links_div=s.find_all("div",class_="pic-panel")
links=[div.a.get("href") for div in links_div]
print(links)
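Why is the result []?

sheeboard · posted 2020-11-23 11:02:21

I can't find any element with class_="pic-panel" in the page, so find_all has nothing to match and returns an empty list.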
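
One way to check this, sketched on made-up markup: look at which classes the `div` elements in the fetched HTML actually carry before choosing a selector. The `html` string and the `card` class below are assumptions for illustration only; the real page's markup is not shown in this thread, and in practice `r.text` would be parsed instead.

```python
from bs4 import BeautifulSoup

# Hypothetical stand-in markup; replace with r.text when testing a real response.
html = """
<div class="card"><a href="/a">A</a></div>
<div class="card">no link here</div>
"""

soup = BeautifulSoup(html, "html.parser")

# No div carries this class, so find_all returns an empty result.
missing = soup.find_all("div", class_="pic-panel")
print(len(missing))

# List the classes that div elements actually have in this HTML.
seen_classes = sorted({c for div in soup.find_all("div") for c in div.get("class", [])})
print(seen_classes)

# Skip divs without an <a> child; div.a is None there and .get would fail.
links = [div.a.get("href") for div in soup.find_all("div", class_="card") if div.a is not None]
print(links)
```

If the class seen in the browser never shows up in `seen_classes` for the real response, the HTML returned to `requests` probably differs from what the browser renders, or the element lives on a different page of the site.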
