日韩性视频-久久久蜜桃-www中文字幕-在线中文字幕av-亚洲欧美一区二区三区四区-撸久久-香蕉视频一区-久久无码精品丰满人妻-国产高潮av-激情福利社-日韩av网址大全-国产精品久久999-日本五十路在线-性欧美在线-久久99精品波多结衣一区-男女午夜免费视频-黑人极品ⅴideos精品欧美棵-人人妻人人澡人人爽精品欧美一区-日韩一区在线看-欧美a级在线免费观看

歡迎訪問 生活随笔!

生活随笔

當前位置: 首頁 > 编程资源 > 编程问答 >内容正文

编程问答

被忽视的fuzzywuzzy库

發布時間:2024/10/8 编程问答 28 豆豆
生活随笔 收集整理的這篇文章主要介紹了 被忽视的fuzzywuzzy库 小編覺得挺不錯的,現在分享給大家,幫大家做個參考.

fuzzywuzzy包一個可以對字符串進行模糊匹配的包

from fuzzywuzzy import fuzz from fuzzywuzzy import process
字符串的對比

fuzz.ratio()對位置敏感,全匹配

fuzz.partial_ratio()非完全匹配

str1 = '毛利是個小菜比'str2 = '毛利是個小菜比,毛利是個小菜比'print("fuzz.ratio相似度:",fuzz.ratio(str1,str2))print("fuzz.partial_ratio相似度:",fuzz.partial_ratio(str1,str2)) fuzz.ratio相似度: 64 fuzz.partial_ratio相似度: 100 str1 = '毛利說:是個小菜比' str2 = '毛利說是個小菜比' print("fuzz.ratio相似度:",fuzz.ratio(str1,str2)) print("fuzz.partial_ratio相似度:",fuzz.partial_ratio(str1,str2)) fuzz.ratio相似度: 94 fuzz.partial_ratio相似度: 88

忽略順序匹配(token_sort_ratio)

str1 = '毛利說:是個小菜比' str2 = '是個小菜比:毛利說' print("fuzz.ratio相似度:",fuzz.ratio(str1,str2)) print("fuzz.partial_ratio相似度:",fuzz.partial_ratio(str1,str2)) print("token_sort_ratio相似度:",fuzz.token_sort_ratio(str1,str2)) fuzz.ratio相似度: 56 fuzz.partial_ratio相似度: 56 token_sort_ratio相似度: 100

去重子集匹配(token_set_ratio)

str1 = '毛利說:是個小菜比' str2 = '毛利說:是個小小菜比' print("fuzz.ratio相似度:",fuzz.ratio(str1,str2)) print("fuzz.partial_ratio相似度:",fuzz.partial_ratio(str1,str2)) print("token_sort_ratio相似度:",fuzz.token_sort_ratio(str1,str2)) print("token_set_ratio相似度:",fuzz.token_set_ratio(str1,str2)) fuzz.ratio相似度: 95 fuzz.partial_ratio相似度: 89 token_sort_ratio相似度: 95 token_set_ratio相似度: 95 print(fuzz.token_sort_ratio("fuzzy was a bear", "fuzzy fuzzy was a bear")) print(fuzz.token_set_ratio("fuzzy was a bear", "fuzzy fuzzy was a bear")) 84 100

process

用來返回模糊匹配的字符串和相似度

choices = ["python爬蟲教程", "python機器學習教程", "Python數據分析教程", "pythonweb開發教程"] print(process.extract("數據分析", choices, limit=3)) print(process.extractOne("分析", choices)) [('Python數據分析教程', 90), ('python爬蟲教程', 0), ('python機器學習教程', 0)] ('Python數據分析教程', 90)
案例

求和

import numpy as np import pandas as pd from fuzzywuzzy import fuzz from fuzzywuzzy import process def enum_row(row):print(row['state']) def find_state_code(row):if row['state'] != 0:print(process.extractOne(row['state'], states, score_cutoff=80)) def capital(str):return str.capitalize() def correct_state(row):if row['state'] != 0:state = process.extractOne(row['state'], states, score_cutoff=80)if state:state_name = state[0]return ' '.join(map(capital, state_name.split(' ')))return row['state'] def fill_state_code(row):if row['state'] != 0:state = process.extractOne(row['state'], states, score_cutoff=80)if state:state_name = state[0]return state_to_code[state_name]return ''if __name__ == "__main__":pd.set_option('display.width', 200)data = pd.read_excel('.\\sales.xlsx', sheet_name='sheet1', header=0)print('data.head() = \n', data.head())print('data.tail() = \n', data.tail())print('data.dtypes = \n', data.dtypes)print('data.columns = \n', data.columns)for c in data.columns:print(c, end=' ')print()data['total'] = data['Jan'] + data['Feb'] + data['Mar']print(data.head())print(data['Jan'].sum())print(data['Jan'].min())print(data['Jan'].max())print(data['Jan'].mean())print('=============')# 添加一行s1 = data[['Jan', 'Feb', 'Mar', 'total']].sum()print(s1)s2 = pd.DataFrame(data=s1)print(s2)print(s2.T)print(s2.T.reindex(columns=data.columns))# 即:s = pd.DataFrame(data=data[['Jan', 'Feb', 'Mar', 'total']].sum()).Ts = s.reindex(columns=data.columns, fill_value=0)print(s)data = data.append(s, ignore_index=True)data = data.rename(index={15:'Total'})print(data.tail())# apply的使用print('==============apply的使用==========')data.apply(enum_row, axis=1)state_to_code = {"VERMONT": "VT", "GEORGIA": "GA", "IOWA": "IA", "Armed Forces Pacific": "AP", "GUAM": "GU","KANSAS": "KS", "FLORIDA": "FL", "AMERICAN SAMOA": "AS", "NORTH CAROLINA": "NC", "HAWAII": "HI","NEW YORK": "NY", "CALIFORNIA": "CA", "ALABAMA": "AL", "IDAHO": "ID","FEDERATED STATES OF MICRONESIA": "FM","Armed Forces Americas": "AA", "DELAWARE": "DE", "ALASKA": "AK", "ILLINOIS": "IL","Armed Forces Africa": "AE", "SOUTH DAKOTA": "SD", "CONNECTICUT": "CT", "MONTANA": "MT","MASSACHUSETTS": "MA","PUERTO RICO": "PR", "Armed Forces Canada": "AE", "NEW HAMPSHIRE": "NH", "MARYLAND": "MD","NEW MEXICO": "NM","MISSISSIPPI": "MS", "TENNESSEE": "TN", "PALAU": "PW", "COLORADO": "CO","Armed Forces Middle East": "AE","NEW JERSEY": "NJ", "UTAH": "UT", "MICHIGAN": "MI", "WEST VIRGINIA": "WV", "WASHINGTON": "WA","MINNESOTA": "MN", "OREGON": "OR", "VIRGINIA": "VA", "VIRGIN ISLANDS": "VI","MARSHALL ISLANDS": "MH","WYOMING": "WY", "OHIO": "OH", "SOUTH CAROLINA": "SC", "INDIANA": "IN", "NEVADA": "NV","LOUISIANA": "LA","NORTHERN MARIANA ISLANDS": "MP", "NEBRASKA": "NE", "ARIZONA": "AZ", "WISCONSIN": "WI","NORTH DAKOTA": "ND","Armed Forces Europe": "AE", "PENNSYLVANIA": "PA", "OKLAHOMA": "OK", "KENTUCKY": "KY","RHODE ISLAND": "RI","DISTRICT OF COLUMBIA": "DC", "ARKANSAS": "AR", "MISSOURI": "MO", "TEXAS": "TX", "MAINE": "ME"}states = list(state_to_code.keys())print(fuzz.ratio('Python Package', 'PythonPackage'))print(process.extract('Mississippi', states))print(process.extract('Mississipi', states, limit=1))print(process.extractOne('Mississipi', states))data.apply(find_state_code, axis=1)print('Before Correct State:\n', data['state'])data['state'] = data.apply(correct_state, axis=1)print('After Correct State:\n', data['state'])data.insert(5, 'State Code', np.nan)data['State Code'] = data.apply(fill_state_code, axis=1)print(data)# group byprint('==============group by================')print(data.groupby('State Code'))print('All Columns:\n')print(data.groupby('State Code').sum())print('Short Columns:\n')print(data[['State Code', 'Jan', 'Feb', 'Mar', 'total']].groupby('State Code').sum())# 寫入文件data.to_excel('sales_result.xlsx', sheet_name='Sheet1', index=False)

這方法好復雜,看來以后要寫下office的筆記了

與50位技術專家面對面20年技術見證,附贈技術全景圖

總結

以上是生活随笔為你收集整理的被忽视的fuzzywuzzy库的全部內容,希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網站內容還不錯,歡迎將生活随笔推薦給好友。

主站蜘蛛池模板: 五月色综合 | 日本少妇一区 | 国产小精品| 激情 小说 亚洲 图片 伦 | 一级免费看片 | 精品日韩制服无码久久久久久 | 草草屁屁影院 | 波多野结衣国产 | 亚洲熟伦熟女新五十路熟妇 | 国产精品69久久久久孕妇欧美 | 人人妻人人藻人人爽欧美一区 | 日本系列第一页 | 一区二区三区三区在线 | 色哟哟国产精品色哟哟 | 免费观看成人 | 色综合天天综合网天天看片 | 久久久精品视频在线观看 | 狼人伊人av | 青青在线精品 | 欧美日韩国产一级片 | av在线电影网站 | 一级全黄裸体免费观看视频 | 日本韩国中文字幕 | 在线视频精品免费 | 午夜久久福利 | 亚洲免费观看高清在线观看 | www亚洲成人| 久久精品欧美一区二区三区麻豆 | 精品动漫一区二区三区 | 又骚又黄的视频 | 丰满人妻一区二区三区53视频 | 色爱成人综合 | 色涩色 | 婷婷777| 白白色在线播放 | 免费a v网站 | 欧美一区免费观看 | 新婚夫妇白天啪啪自拍 | 美女被草 | 日本精品中文字幕 | 黄污视频网站 | 69天堂网| 操xxxx| 亚洲精品女人久久久 | 午夜福利电影 | 婷婷伊人五月 | 免费毛片视频 | 三级亚洲 | 亚洲精品欧美激情 | 日日夜夜免费精品 | 91蜜桃婷婷狠狠久久综合9色 | 亚洲精品乱码久久久久久麻豆不卡 | 男人操女人免费视频 | 青青艹视频 | 日韩人妻精品一区二区三区视频 | 美女黄视频网站 | 日日摸日日碰夜夜爽无码 | 爱情岛亚洲首页论坛小巨 | 四虎精品久久 | 五月激情开心网 | 97精品熟女少妇一区二区三区 | 亚洲午夜av在线 | 亚洲美女www午夜 | 日本少妇做爰全过程毛片 | 久久性生活视频 | 91精品国产乱码在线观看 | 国产视频精品在线 | 精品午夜福利在线观看 | 国产三级全黄裸体 | 欧美三级手机在线观看 | 久久天堂av综合合色蜜桃网 | 国产精品日韩一区二区 | 热99在线| 免费伊人 | 国产aaa毛片 | 激情av网站 | 中文字幕精品无码亚 | 秋霞无码一区二区 | 五月综合久久 | 九九热伊人 | 成人性生交大片免费看96 | 国产激情av一区二区三区 | 波多野结衣中文在线 | 中国无码人妻丰满熟妇啪啪软件 | 国产精品人人 | 影音先锋中文字幕在线视频 | 五月激情婷婷网 | 蜜桃精品噜噜噜成人av | 日本黄色片免费看 | 亚洲我不卡 | 欧美在线一二三四区 | 欧美男人操女人 | 天堂网资源 | 中国少妇初尝黑人巨大 | 免费国产黄 | 女人一区二区三区 | 色噜噜日韩精品欧美一区二区 | 亚洲av人无码激艳猛片服务器 | 天天爱夜夜爽 |