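Thanks in advance, everyone. When I call re.findall(), Python reports a syntax error in my regular expression. I have already tested the pattern in a regex tool and it works there, but it fails with a syntax error as soon as Python runs it, and I really don't know how to fix it.
Here is the code: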
f = re.findall('<dd class="col2-7 word">[\s]+<span><a href="/seo/words/(?<grp0>[^"]+)"[\S\s]+?查百度[\S\s]+?<dd class="col2-2 center">[^\d]+(?<grp1>[^\D]+)[\s]+</dd>\n<dd class="col2-2 center">' , html)
And here is the error message:
Traceback (most recent call last):
File "E:\workspace_HBuilder\Python_Project\Python_Leaning\src\weixin_datadw\test.py", line 20, in <module>
get_index_5118("seo")
File "E:\workspace_HBuilder\Python_Project\Python_Leaning\src\weixin_datadw\test.py", line 14, in get_index_5118
f = re.findall(str('<dd class="col2-7 word">[\s]+<span><a href="/seo/words/(?<grp0>[^"]+)"[\S\s]+?查百度[\S\s]+?<dd class="col2-2 center">[^\d]+(?<grp1>[^\D]+)[\s]+</dd>\n<dd class="col2-2 center">'), html)
File "C:\Python27\lib\re.py", line 181, in findall
return _compile(pattern, flags).findall(string)
File "C:\Python27\lib\re.py", line 251, in _compile
raise error, v # invalid expression
sre_constants.error: syntax error
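
One likely cause, offered as a sketch rather than a confirmed diagnosis: the pattern uses the `(?<name>...)` named-group syntax that many regex tools accept, while Python's `re` module spells named groups `(?P<name>...)` and rejects the other form. The example below changes only that syntax and uses raw strings; the `html` fragment is hypothetical sample data made up to match the pattern's shape, not the real page.

```python
# -*- coding: utf-8 -*-
import re

# Hypothetical HTML fragment, invented only to exercise the pattern.
html = (
    '<dd class="col2-7 word">\n'
    '<span><a href="/seo/words/python">python</a></span> 查百度</dd>\n'
    '<dd class="col2-2 center">\n1234 </dd>\n'
    '<dd class="col2-2 center">'
)

# The (?<name>...) form from the question is rejected by Python's re module.
try:
    re.compile(r'(?<grp0>[^"]+)')
    dotnet_style_ok = True
except re.error:
    dotnet_style_ok = False

# Same pattern as in the question, with (?<name>...) changed to (?P<name>...)
# and written as raw strings so the backslashes reach the regex engine as-is.
pattern = (
    r'<dd class="col2-7 word">[\s]+<span><a href="/seo/words/(?P<grp0>[^"]+)"'
    r'[\S\s]+?查百度[\S\s]+?<dd class="col2-2 center">[^\d]+(?P<grp1>[^\D]+)'
    r'[\s]+</dd>\n<dd class="col2-2 center">'
)

# With two groups in the pattern, findall returns a list of (grp0, grp1) tuples.
matches = re.findall(pattern, html)
print(dotnet_style_ok)
print(matches)
```

On this sample data `dotnet_style_ok` ends up `False` and `matches` is `[('python', '1234')]`. Whether the rewritten pattern matches the real page depends on its actual markup, which is not shown here.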