python怎样清除html文件中的内容?Python要实现清除html文件中的内容有很多方法,例如使用join方法或者compile方法等等,接下来小编给大家分享几个Python清除html文件中的内容的方法及实例,感兴趣的朋友可以参考。
本教程操作环境:windows7系统、python3.9版,DELL G3电脑,该方法适用于所有品牌电脑。
python清除html文件中内容的方法:
方法1:
In [97]: str_ = '' ...: flag = 1 ...: for ele in test: ...: if ele == "<": ...: flag = 0 ...: elif ele == '>': ...: flag = 1 ...: continue ...: if flag == 1: ...: str_ = ele ...: In [98]: str_ Out[98]: 'just for testjust for testtest' In [99]: str_ = '' ...: flag = 1 ...: for ele in test: ...: if ele == "<": ...: flag = 0 ...: elif ele == '>': ...: flag = 1 ...: ele = ' ' ...: if flag == 1: ...: str_ = ele ...: In [100]: str_ Out[100]: ' just for test just for test test '
方法2:
import re In [156]: pat = re.compile('(?<=>).*?(?=<)') In [157]: pat.findall(test) Out[157]: ['just for test', '', '', 'just for test', '', 'test'] In [158]: ''.join(pat.findall(test)) Out[158]: 'just for testjust for testtest'
方法3:
pat = re.compile('>(.*?)<') ''.join(pat.findall(test))
方法4:
In [167]: pat = re.compile('<[^>] >', re.S) In [168]: pat.sub('', test) Out[168]: 'just for testjust for testtest'
对python清除html文件中的内容的方法就介绍到这了,上述实例有一定的参考价值,感兴趣的朋友可以了解看看,希望大家阅读完这篇文章能有所收获,想要了解更多请关注群英网络其它相关文章。
Copyright © QY Network Company Ltd. All Rights Reserved. 2003-2018 群英 版权所有 茂名市群英网络有限公司
增值电信经营许可证 : B1.B2-20140078 粤ICP备09006778号-36 粤公网安备 44090202000006号 粤工商备P091701000595