當前位置：首頁 > 编程语言 > python >内容正文

python

【Python】pandas模块中更改Series的数据类型

發(fā)布時間：2025/3/15 python 46 豆豆

生活随笔收集整理的這篇文章主要介紹了【Python】pandas模块中更改Series的数据类型小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.

今天我們主要解決以下實際問題：一份黑名單數(shù)據(jù)存儲在excel中，由于數(shù)據(jù)量龐大，現(xiàn)需要通過pandas找到某一列的重復數(shù)據(jù)，處理后再存入到excel中。
pandas 是基于NumPy 的一種工具，該工具是為了解決數(shù)據(jù)分析任務而創(chuàng)建的,主要數(shù)據(jù)結(jié)構(gòu)為兩個類：
DataFrame: 可以理解為表格，類似于Excel的表格 pandas.core.frame.DataFrame
Series: 表示單列。DataFrame包含多個列，即多個Series，每個Series都有名稱。pandas.core.series.Series

Pandas所支持的數(shù)據(jù)類型（dtype）:?
1. float?（float64）
2. int?（int64，uint64）
3. bool?
4. datetime64[ns]? （2013-01-02）
5. datetime64[ns, tz]?
6. timedelta[ns]?
7. category?
8. object?（字符串）
默認的數(shù)據(jù)類型是int64,float64

以下是原始的excel文件

先查看文件中Series每一列的數(shù)據(jù)類型

import pandas as pd# 更改數(shù)據(jù)類型 def change_data_type():print(excel_df.dtypes)if __name__ == '__main__':excel_df = pd.read_excel('E:\zenglingwei\\test\\5.xlsx')change_data_type()

我們發(fā)現(xiàn)blacklistValue默認是int類型，但我們知道身份證18位，再次存入excel中時后面幾位會變成0，所以我們需要對這列進行數(shù)據(jù)類型轉(zhuǎn)換。主要有兩種思路，一種是讀取excel時轉(zhuǎn)換，另外一種是讀取后轉(zhuǎn)換。

一、讀取時全部轉(zhuǎn)換為字符串，dtype='object'或者dtype='str'

import pandas as pd# 更改數(shù)據(jù)類型 def change_data_type():print(excel_df.dtypes)if __name__ == '__main__':excel_df = pd.read_excel('E:\zenglingwei\\test\\5.xlsx',dtype='object') # dtype='str'change_data_type()

二、讀取時指定列轉(zhuǎn)換為字符串，object或者str

# 更改數(shù)據(jù)類型 def change_data_type():print(excel_df.dtypes)if __name__ == '__main__':excel_df = pd.read_excel('E:\zenglingwei\\test\\5.xlsx',dtype = {'blacklistValue' : object,'priority':str}) # dtype='str'change_data_type()

三、讀取后轉(zhuǎn)換為字符串：?astype(str)，不可以使用astype(object)-->存入到excel時還是int類型。

import pandas as pd# 更改數(shù)據(jù)類型 def change_data_type():excel_df[['blacklistValue','priority']] = excel_df[['blacklistValue','priority']].astype(str)print(excel_df.dtypes)excel_df.to_excel('excel_to_python.xls',sheet_name='sheet', index=False)if __name__ == '__main__':excel_df = pd.read_excel('E:\zenglingwei\\test\\5.xlsx') # dtype='str'change_data_type()

創(chuàng)作挑戰(zhàn)賽新人創(chuàng)作獎勵來咯，堅持創(chuàng)作打卡瓜分現(xiàn)金大獎

總結(jié)

以上是生活随笔為你收集整理的【Python】pandas模块中更改Series的数据类型的全部內(nèi)容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網(wǎng)站內(nèi)容還不錯，歡迎將生活随笔推薦給好友。

上一篇： react实现全选和反选_全选的实现
下一篇： python 随机数

日韩av黄I国产麻豆传媒I国产91av视频在线观看I日韩一区二区三区在线看I美女国产在线I麻豆视频国产在线观看I成人黄色短片

python

【Python】pandas模块中更改Series的数据类型

總結(jié)