
Q: Python (pattern + wildcard + pattern) [return] (pattern)

Stack Overflow user

Asked 2019-01-22 22:54:02

2 answers, 103 views, 0 followers, 0 votes

I am scraping a string with Selenium in Python and parsing it with re:

<div type="copy3" class="sc-bxivhb dHqnfT">756 W Peachtree St NW Atlanta GA 30308</div>

I want to get back:

756 W Peachtree St NW Atlanta GA 30308

This pattern:

("copy3").*?(?=</div>)

gives me back:

"copy3" class="sc-bxivhb dHqnfT">756 W Peachtree St NW Atlanta GA 30308

But I want to exclude everything before 756, up to and including the >.

How can I incorporate that into the pattern?

python

regex

2 Answers

Stack Overflow user

Accepted answer

Answered 2019-01-22 23:01:27

You are already scraping with Selenium, so get the text with Selenium as well:

from selenium.webdriver.common.by import By  # find_element_by_* was removed in Selenium 4.3

my_element = driver.find_element(By.CSS_SELECTOR, 'div[type="copy3"]')
address = my_element.text

Votes: 2

Stack Overflow user

Answered 2019-01-22 22:58:05

Match through the >, then capture the non-< characters that follow in a group, and extract that group:

type="copy3"[^>]+>([^<]+)

https://regex101.com/r/BX2tVj/1
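
As a minimal sketch of how this capture-group pattern could be used from Python's built-in re module (the variable names html and address are illustrative, not from the original answer):

```python
import re

html = '<div type="copy3" class="sc-bxivhb dHqnfT">756 W Peachtree St NW Atlanta GA 30308</div>'

# [^>]+> consumes the rest of the opening tag;
# ([^<]+) captures the text up to the next '<'.
match = re.search(r'type="copy3"[^>]+>([^<]+)', html)

# group(1) holds only the captured text, not the attribute prefix.
address = match.group(1) if match else None
print(address)
```

Using group(1) rather than group(0) is what drops the `type="copy3" class=...>` prefix from the result.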

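If you want the match itself to contain only what comes after the >, you will have to use a lookbehind. Python's re module only accepts fixed-width lookbehinds, so this is reliable only if you know exactly what the class="" attribute will contain: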

(?<=type="copy3" class="sc-bxivhb dHqnfT">)[^<]+

https://regex101.com/r/BX2tVj/2
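
A short sketch of the lookbehind variant with the built-in re module, assuming the class attribute is exactly the one shown in the question (the names html, pattern and address are illustrative):

```python
import re

html = '<div type="copy3" class="sc-bxivhb dHqnfT">756 W Peachtree St NW Atlanta GA 30308</div>'

# re requires a fixed-width lookbehind, so the tail of the opening tag
# is spelled out literally instead of using [^>]+.
pattern = r'(?<=type="copy3" class="sc-bxivhb dHqnfT">)[^<]+'
match = re.search(pattern, html)

# The lookbehind is not part of the match, so group(0) is just the text.
address = match.group(0) if match else None
print(address)
```

If the class value changes, this pattern simply stops matching, which is the fragility the answer warns about.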

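Alternatively, use the third-party regex module, which supports \K (the built-in re module does not). \K discards everything matched so far from the reported match: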

type="copy3"[^>]+>\K[^<]+

https://regex101.com/r/BX2tVj/3

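import regex  # third-party package: pip install regex

# Renamed from str to avoid shadowing the built-in type.
html = '<div type="copy3" class="sc-bxivhb dHqnfT">756 W Peachtree St NW Atlanta GA 30308</div>'
match = regex.search(r'type="copy3"[^>]+>\K[^<]+', html)
address = match.group(0)  # only the text after \K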

Votes: 1

Original page content provided by Stack Overflow. Translation support provided by Tencent Cloud Xiaowei's IT-domain engine.

Original link:

https://stackoverflow.com/questions/54317630
