
[PY3] Summary of String Splitting, Matching, and Searching Methods

Published: 2024/9/5


What approaches are available for splitting, matching, and searching strings?

Splitting methods

1. str.split()

* Splits a string

* Returns a list

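    s1 = 'I  love  python'   # note: two spaces between the words

    # By default any run of whitespace is the delimiter, so consecutive spaces count as one
    print(s1.split())
    # ['I', 'love', 'python']

    # With an explicit ' ' delimiter every single space splits,
    # so two adjacent spaces produce an empty string between them
    print(s1.split(' '))
    # ['I', '', 'love', '', 'python']

    # Any character or string can serve as the delimiter
    print(s1.split('o'))
    # ['I  l', 've  pyth', 'n']

    # maxsplit=n limits the number of splits to n
    print(s1.split(maxsplit=1))
    # ['I', 'love  python']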

2. re.split()

* Allows multiple delimiters to be defined

    import re

    line = 'asdf fjdk; afed, fjek,asdf, foo'

    # Several characters can act as delimiters
    print(re.split(r'[;,\s]\s*', line))
    # ['asdf', 'fjdk', 'afed', 'fjek', 'asdf', 'foo']

    # Parentheses form a capture group, so the matched delimiters are returned too
    print(re.split(r'(;|,|\s)\s*', line))
    # ['asdf', ' ', 'fjdk', ';', 'afed', ',', 'fjek', ',', 'asdf', ',', 'foo']

    # (?:...) marks a non-capturing group, so the delimiters are dropped again
    print(re.split(r'(?:,|;|\s)\s*', line))
    # ['asdf', 'fjdk', 'afed', 'fjek', 'asdf', 'foo']
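When the delimiters are captured, the fields and the delimiters alternate in the result list, which makes it possible to separate them and reassemble the string. The following is a minimal sketch of that idea; the variable names `values`, `delimiters`, and `rebuilt` are chosen here for illustration only.

```python
import re

line = 'asdf fjdk; afed, fjek,asdf, foo'

# With a capture group, re.split() returns fields and delimiters interleaved.
fields = re.split(r'(;|,|\s)\s*', line)

values = fields[::2]        # even positions hold the fields
delimiters = fields[1::2]   # odd positions hold the captured delimiters

# Re-join each field with the delimiter that followed it.
# The last field has no delimiter, so pad the list with an empty string.
rebuilt = ''.join(v + d for v, d in zip(values, delimiters + ['']))

print(values)
print(delimiters)
print(rebuilt)
```

Because the trailing `\s*` sits outside the group, the whitespace after each delimiter is discarded, so the rebuilt string is a normalized version of the input rather than an exact copy.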

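Searching and matching methods

1. str.startswith() | str.endswith()

* Match at the start / end of a string
* Return True / False
* Commonly used to check whether a directory contains files of a given type, or to check a URL scheme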

    url = "http://www.python.org"

    # startswith('string') checks whether the text begins with 'string'
    print(url.startswith('http'))
    # True

    # endswith('string') checks whether the text ends with 'string'
    print(url.endswith('com'))
    # False

    # Both methods accept ('string', n, m) to restrict the check to the slice [n:m]
    print(url.endswith('n', 11, 17))
    # True

    # To test several candidates at once, pass a tuple; a list is rejected
    choices = ['http:', 'ftp:']
    # print(url.startswith(choices))
    # TypeError: startswith first arg must be str or a tuple of str, not list
    print(url.startswith(tuple(choices)))
    # True

    # endswith() can check whether a directory contains files with a given suffix
    import os
    filenames = os.listdir('/test')

    # Example 1
    print(filenames)
    # ['aa', 'zhuabao', '.python-version', 'test.sh', 'hh.c', '.test.py.swp', 'zhuabao2', 'abc', 'linshi.sh']
    print([name for name in filenames if name.endswith(('.sh', '.c'))])
    # ['test.sh', 'hh.c', 'linshi.sh']

    # Example 2
    if any(name.endswith(('.sh', '.c')) for name in os.listdir('/test')):
        print('have')
    # have

2. fnmatch() | fnmatchcase()

* Match using shell-style wildcards
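The original gives no example for this item, so here is a minimal sketch using the standard-library `fnmatch` module. The file names in `names` are made up for illustration.

```python
from fnmatch import fnmatch, fnmatchcase

# Hypothetical file names, used only for this illustration
names = ['Dat1.csv', 'Dat2.csv', 'config.ini', 'foo.py']

# fnmatchcase() always compares case-sensitively, on every platform
csv_files = [n for n in names if fnmatchcase(n, 'Dat*.csv')]
print(csv_files)                                  # ['Dat1.csv', 'Dat2.csv']

# * matches any run of characters, ? one character, [0-9] one digit
print(fnmatchcase('Dat45.csv', 'Dat[0-9]*'))      # True
print(fnmatchcase('foo.txt', '?oo.txt'))          # True
print(fnmatchcase('foo.txt', '*.TXT'))            # False

# fnmatch() follows the operating system's own case rules, so a pattern
# whose case differs from the name may match on Windows but not on Linux
print(fnmatch('foo.txt', '*.txt'))                # True
```

If results must be the same on every platform, `fnmatchcase()` is the safer choice of the two.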

3. str.find()

* Returns the index of the first occurrence, or -1 if the substring is not found
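This item also has no example in the original. The sketch below shows the return values of `str.find()`; the sample sentence is an assumption picked for illustration.

```python
# Sample text, chosen only for this illustration
text = 'yeah, but no, but yeah, but no, but yeah'

first = text.find('no')               # index of the first occurrence
missing = text.find('maybe')          # -1 means "not found"
second = text.find('no', first + 1)   # resume the search after the first hit

print(first)     # 10
print(missing)   # -1
print(second)    # 28

# -1 is not falsy and 0 is a valid index, so always compare against -1
if text.find('yeah') != -1:
    print('found')
```

For a plain membership test, `'no' in text` reads more clearly; `find()` is the tool when the position itself is needed.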

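4. re.match(r'')

* Match with a regular expression

* Only checks the beginning of the string

5. re.findall(r'')

* Matches starting from any position
* Returns the results as a list

6. re.finditer(r'')

* Returns the matches as an iterator

7. r'...$': end the regular expression with $

* Ensures an exact match through the end of the string

8. re.compile(r''): compile the regular expression first

* Use when performing many matching or searching operations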

    import re

    text1 = '2017/07/26'
    text2 = 'Nov 27,2012'
    text3 = 'Today is 11/27/2012. PyCon starts 3/13/2013.'
    text5 = '26/07/2017 is today,PyCon starts 3/13/2013.'

    # Compile a regular expression that matches slash-separated dates such as m/d/y
    datepat = re.compile(r'\d+/\d+/\d+')

    # pattern.match(string) tries to match at the beginning of string
    # (Python 3.7+ shows the object as <re.Match object; ...>)
    print(datepat.match(text1))
    # <_sre.SRE_Match object; span=(0, 10), match='2017/07/26'>
    print(datepat.match(text2))
    # None

    # match() only looks at the start of the string, so it finds at most the one match located there
    print(datepat.match(text3))
    # None
    print(datepat.match(text5))
    # <_sre.SRE_Match object; span=(0, 10), match='26/07/2017'>

    # That is not always what we want. One option is to append $,
    # which requires the match to run through the end of the string
    text6 = '26/07/2017abcdef'
    datepat1 = re.compile(r'\d+/\d+/\d+')
    print(datepat1.match(text6))
    # <_sre.SRE_Match object; span=(0, 10), match='26/07/2017'>
    datepat2 = re.compile(r'\d+/\d+/\d+$')
    print(datepat2.match(text6))
    # None

    # Another option is findall(string), which searches every position in string
    print(datepat.findall(text3))
    # ['11/27/2012', '3/13/2013']

    # findall() returns a list; finditer() returns an iterator of match objects
    for m in datepat.finditer(text5):
        print(m.groups())
    # ()    (empty, because this pattern has no capture groups yet)
    # ()

    # Capture groups
    datepat = re.compile(r'(\d+)/(\d+)/(\d+)')
    m = datepat.match(text1)
    print(m.group(0))
    # 2017/07/26
    print(m.group(1))
    # 2017
    print(m.group(2))
    # 07
    print(m.group(3))
    # 26
    print(m.groups())
    # ('2017', '07', '26')

    for month, day, year in datepat.findall(text3):
        print('{}-{}-{}'.format(year, month, day))
    # 2012-11-27
    # 2013-3-13
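Since `finditer()` yields match objects rather than plain strings, each result also carries its position in the text. The following is a small sketch of that, reusing the date pattern with capture groups; the `results` list is introduced here only to collect the values for display.

```python
import re

text = 'Today is 11/27/2012. PyCon starts 3/13/2013.'
datepat = re.compile(r'(\d+)/(\d+)/(\d+)')

results = []
for m in datepat.finditer(text):
    # group(0) is the whole match, span() its (start, end) offsets,
    # groups() the tuple of captured sub-matches
    results.append((m.group(0), m.span(), m.groups()))

for matched, span, parts in results:
    print(matched, span, parts)
```

This is useful when the surrounding text matters, for example to highlight or slice around each date; when only the matched strings are needed, `findall()` is shorter.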

9. The ? modifier

* Turns a greedy match into a non-greedy one

* The pattern then finds the shortest possible match

    import re

    text6 = 'Computer says "no." Phone says "yes."'

    pat1 = re.compile(r'\"(.*)\"')    # match text enclosed in double quotes
    print(pat1.findall(text6))
    # ['no." Phone says "yes.']

    pat2 = re.compile(r'\"(.*?)\"')   # add the ? modifier
    print(pat2.findall(text6))
    # ['no.', 'yes.']

10. (?:.|\n) | re.DOTALL

* Lets the dot (.) match every character, including a newline

* This makes it possible to match across multiple lines

    import re

    text7 = '/*this is a\nmultiline comment*/\n'   # a comment that spans two lines

    pat1 = re.compile(r'/\*(.*?)\*/')
    print(pat1.findall(text7))
    # []    nothing matches, because . does not match a newline

    pat2 = re.compile(r'/\*((?:.|\n)*?)\*/')   # replace . with (?:.|\n)
    print(pat2.findall(text7))
    # ['this is a\nmultiline comment']

    # re.DOTALL makes the dot match any character, including a newline
    pat3 = re.compile(r'/\*(.*?)\*/', re.DOTALL)
    print(pat3.findall(text7))
    # ['this is a\nmultiline comment']

Search-and-replace methods

1. str.replace()

    # S.replace(old, new[, count]) -> str
    text5 = "a b c d e e e"
    print(text5.replace("e", "a"))
    # a b c d a a a
    print(text5.replace("e", "a", 2))
    # a b c d a a e

2. re.sub() | flags=re.IGNORECASE

* Match and replace | match while ignoring case

    import re

    # sub(pattern, repl, string, count=0, flags=0)
    # 1st argument: the pattern to match
    # 2nd argument: the replacement
    # 3rd argument: the text to process
    # 4th argument: the maximum number of replacements (0 means all)
    text1 = "l o v e"
    print(re.sub(r'\s', '-', text1))
    # l-o-v-e
    print(re.sub(r'\s', '-', text1, count=1))
    # l-o v e

    # flags=re.IGNORECASE ignores case when matching
    text3 = 'UPPER PYTHON, lower python, Mixed Python'
    print(re.sub('python', 'snake', text3, flags=re.IGNORECASE))
    # UPPER snake, lower snake, Mixed snake

    # To make the replacement follow the case of the matched text, use a helper function
    def matchcase(word):
        def replace(m):
            text = m.group()
            if text.isupper():
                return word.upper()
            elif text.islower():
                return word.lower()
            elif text[0].isupper():
                return word.capitalize()
            else:
                return word
        return replace

    print(re.sub('python', matchcase('snake'), text3, flags=re.IGNORECASE))
    # UPPER SNAKE, lower snake, Mixed Snake

3. re.compile()

* Likewise, compile the pattern first when performing many replacements

    import re

    # The pattern can be compiled first, and capture groups can be referenced in the replacement
    text2 = 'Today is 11/27/2012. PyCon starts 3/13/2013.'
    datepat = re.compile(r'(\d+)/(\d+)/(\d+)')
    print(datepat.sub(r'\3-\1-\2', text2))
    # Today is 2012-11-27. PyCon starts 2013-3-13.

4. re.subn()

* Also returns the number of replacements made

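    # subn() returns the new text together with the number of replacements made
    # (datepat and text2 are the ones defined in the re.compile() example)
    newtext, n = datepat.subn(r'\3-\1-\2', text2)
    print(newtext)
    # Today is 2012-11-27. PyCon starts 2013-3-13.
    print(n)
    # 2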

Reposted from: https://www.cnblogs.com/snsdzjlz320/p/7235791.html
