用法
string.split( sep = None, maxsplit = -1)
string 要操作字符串
sep 分隔符,默认值为whitespace空白符
maxsplit 最大分割次数,默认值为-1,表示无限制
如果同时有多个分隔符怎么分割呢?
可以用循环多次分割来实现,例如:
>>> s = '6[5,12]3[2,6]1;35]67[8;9;11]12' >>> for j in '[],;': t=[i.split(j) for i in t] t=[i for j in t for i in j] >>> t ['6', '5', '12', '3', '2', '6', '1', '35', '67', '8', '9', '11', '12'] >>>
帮助
字符串方法str.split()帮助:
Help on method_descriptor: split(self, /, sep=None, maxsplit=-1) Return a list of the words in the string, using sep as the delimiter string. sep The delimiter according which to split the string. None (the default value) means split according to any whitespace, and discard empty strings from the result. maxsplit Maximum number of splits to do. -1 (the default value) means no limit.
正则
懂正则表达式的可以一步到位:
>>> import re >>> s = '6[5,12]3[2,6]1;35]67[8;9;11]12' >>> re.split('\[|\]|,|;',s) ['6', '5', '12', '3', '2', '6', '1', '35', '67', '8', '9', '11', '12']
注:竖线 | 是分隔符的分隔;也可用[]包括分隔符,[]中的不用竖线分开;两种方法都要注意转义符的使用。
>>> import re >>> s = '6[5,12]3[2,6]1;35]67[8;9;11]12' >>> re.split('[\[\],;]',s) ['6', '5', '12', '3', '2', '6', '1', '35', '67', '8', '9', '11', '12']
附:常见正则
1.测试正则表达式是否匹配字符串的全部或部分
regex=ur"" #正则表达式
if re.search(regex, subject): do_something() else: do_anotherthing()
2.测试正则表达式是否匹配整个字符串
regex=ur"\Z" #正则表达式末尾以\Z结束
if re.match(regex, subject): do_something() else: do_anotherthing()
3.创建一个匹配对象,然后通过该对象获得匹配细节(Create an object with details about how the regex matches (part of) a string)
regex=ur"" #正则表达式
match = re.search(regex, subject) if match: # match start: match.start() # match end (exclusive): atch.end() # matched text: match.group() do_something() else: do_anotherthing()
4.获取正则表达式所匹配的子串(Get the part of a string matched by the regex)
regex=ur"" #正则表达式
match = re.search(regex, subject) if match: result = match.group() else: result = ""
5. 获取捕获组所匹配的子串(Get the part of a string matched by a capturing group)
regex=ur"" #正则表达式
match = re.search(regex, subject) if match: result = match.group(1) else: result = ""
6. 获取有名组所匹配的子串(Get the part of a string matched by a named group)
regex=ur"" #正则表达式
match = re.search(regex, subject) if match: result = match.group"groupname") else: result = ""
7. 将字符串中所有匹配的子串放入数组中(Get an array of all regex matches in a string)
result = re.findall(regex, subject)
8.遍历所有匹配的子串(Iterate over all matches in a string)
for match in re.finditer(r"<(.*?)\s*.*?/\1>", subject) # match start: match.start() # match end (exclusive): atch.end() # matched text: match.group()
9.通过正则表达式字符串创建一个正则表达式对象(Create an object to use the same regex for many operations)
reobj = re.compile(regex)
10.用法1的正则表达式对象版本(use regex object for if/else branch whether (part of) a string can be matched)
reobj = re.compile(regex) if reobj.search(subject): do_something() else: do_anotherthing()
11.用法2的正则表达式对象版本(use regex object for if/else branch whether a string can be matched entirely)
reobj = re.compile(r"\Z") #正则表达式末尾以\Z 结束
if reobj.match(subject): do_something() else: do_anotherthing()
12.创建一个正则表达式对象,然后通过该对象获得匹配细节(Create an object with details about how the regex object matches (part of) a string)
reobj = re.compile(regex) match = reobj.search(subject) if match: # match start: match.start() # match end (exclusive): atch.end() # matched text: match.group() do_something() else: do_anotherthing()
13.用正则表达式对象获取匹配子串(Use regex object to get the part of a string matched by the regex) 文章来源地址https://www.yii666.com/blog/126608.html
reobj = re.compile(regex) match = reobj.search(subject) if match: result = match.group() else: result = ""
14.用正则表达式对象获取捕获组所匹配的子串(Use regex object to get the part of a string matched by a capturing group)
reobj = re.compile(regex) match = reobj.search(subject) if match: result = match.group(1) else: result = ""
15.用正则表达式对象获取有名组所匹配的子串(Use regex object to get the part of a string matched by a named group)
reobj = re.compile(regex) match = reobj.search(subject) if match: result = match.group("groupname") else: result = ""
16.用正则表达式对象获取所有匹配子串并放入数组(Use regex object to get an array of all regex matches in a string)
reobj = re.compile(regex) result = reobj.findall(subject)
17.通过正则表达式对象遍历所有匹配子串(Use regex object to iterate over all matches in a string)
reobj = re.compile(regex) for match in reobj.finditer(subject): # match start: match.start() # match end (exclusive): match.end() # matched text: match.group()
18.字符串替换
1).替换所有匹配的子串
#用newstring替换subject中所有与正则表达式regex匹配的子串
result = re.sub(regex, newstring, subject)
2).替换所有匹配的子串(使用正则表达式对象) 文章来源站点https://www.yii666.com/
reobj = re.compile(regex)
result = reobj.sub(newstring, subject)
19.字符串拆分
1).字符串拆分
result = re.split(regex, subject)
2).字符串拆分(使用正则表示式对象)
reobj = re.compile(regex) result = reobj.split(subject)