Changing Data Types in Python: the astype() Method and the to_numeric() Function

Published: 2023/12/13 · python · 豆豆

Contents

Explicitly Specifying Data Types
- Inspecting types with the dtypes attribute
- Specifying the data type when creating a pandas object
Converting Data Types
- Casting types with the astype() method
- Converting types with the to_numeric() function

Explicitly Specifying Data Types

Inspecting types with the dtypes attribute

import pandas as pd

# Every value is a string, so each column is stored with the object dtype
df = pd.DataFrame({'A': ['1', '2', '4'],
                   'B': ['9', '-80', '5.3'],
                   'C': ['x', '5.9', '0']})
print("df.dtypes:\n", df.dtypes)
print("df:\n", df)

Output:

df.dtypes:
 A    object
B    object
C    object
dtype: object
df:
    A    B    C
0  1    9    x
1  2  -80  5.9
2  4  5.3    0

Specifying the data type when creating a pandas object

# dtype applies to every column, so the numeric strings are parsed as integers
data = pd.DataFrame({'A': ['1', '2', '4'],
                     'B': ['9', '80', '5']},
                    dtype='int')
print("data:\n", data)
print("data.dtypes:\n", data.dtypes)

Output:

data:
    A   B
0  1   9
1  2  80
2  4   5
data.dtypes:
 A    int32
B    int32
dtype: object
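The dtype argument of the DataFrame constructor takes a single dtype for the whole frame. When the columns need different types at creation time, one option is to build each column as a Series with its own dtype. The sketch below is only an illustration: the name `mixed` and the choice of 'int64' and 'float32' are assumptions, not part of the original example.

```python
import pandas as pd

# DataFrame(dtype=...) takes one dtype for the whole frame; building each
# column as a Series lets every column carry its own explicit dtype.
mixed = pd.DataFrame({
    'A': pd.Series([1, 2, 4], dtype='int64'),
    'B': pd.Series([9, 80, 5], dtype='float32'),
})
print(mixed.dtypes)
```

Naming the width explicitly ('int64' rather than 'int') also keeps the result the same across platforms, whereas the int32 shown above depends on the machine the original example ran on.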

Converting Data Types

Casting types with the astype() method

astype(dtype, copy=True, errors='raise', **kwargs)

The main parameters have the following meanings:

dtype: the target data type.

copy: whether to return a copy; defaults to True.

errors: how to handle conversion errors. It can be raise or ignore and defaults to raise. With raise the exception is allowed to propagate; with ignore the exception is suppressed.

Use astype() to convert column B of the DataFrame df to float (the column contains '5.3', so it cannot be converted to int directly):

print("df['B']:\n", df['B'])
print("df['B'].astype:\n", df['B'].astype(dtype='float'))

Output:

df['B']:
 0      9
1    -80
2    5.3
Name: B, dtype: object
df['B'].astype:
 0     9.0
1   -80.0
2     5.3
Name: B, dtype: float64
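astype() can also be called on the whole DataFrame with a dict that maps column names to target types, which converts several columns in one step. A minimal sketch, reusing the df from the first example; the name `converted` and the chosen target types are assumptions made for illustration.

```python
import pandas as pd

df = pd.DataFrame({'A': ['1', '2', '4'],
                   'B': ['9', '-80', '5.3'],
                   'C': ['x', '5.9', '0']})

# A dict maps column names to target dtypes; columns not listed (C) are left as-is
converted = df.astype({'A': 'int64', 'B': 'float64'})
print(converted.dtypes)
```

astype() returns a new object, so df itself still holds the original strings.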

Only column B was converted, rather than every column, because column C contains a non-numeric string ('x') that cannot be parsed as a number; forcing the conversion raises a ValueError. (Setting errors to ignore suppresses the exception, but the result is the original, unconverted object. No type conversion takes place; the call simply does not fail.)

print("df['C']:\n", df['C'])
print("df['C'].astype(errors='ignore'):\n", df['C'].astype(dtype='float', errors='ignore'))

Output:

df['C']:
 0      x
1    5.9
2      0
Name: C, dtype: object
df['C'].astype(errors='ignore'):
 0      x
1    5.9
2      0
Name: C, dtype: object
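For comparison, the default errors='raise' behaviour can be observed by catching the exception. This is a small sketch, not part of the original article; `col_c` and `failed` are names invented for the illustration.

```python
import pandas as pd

col_c = pd.Series(['x', '5.9', '0'], name='C')

try:
    col_c.astype('float')
    failed = False
except ValueError as exc:
    # 'x' cannot be parsed as a float, so the default errors='raise' fires
    failed = True
    print("conversion failed:", exc)
```

Catching the exception makes the failure explicit, whereas errors='ignore' silently hands back the unconverted data.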

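Converting types with the to_numeric() function

to_numeric() cannot operate on a DataFrame directly; it works on one-dimensional data.

pandas.to_numeric(arg, errors='raise', downcast=None)

The commonly used parameters have the following meanings:

arg: the data to convert; it can be a list, tuple, or Series.

errors: how to handle conversion errors. Besides raise and ignore it also accepts coerce; the default is raise. With raise the exception is allowed to propagate, with ignore the exception is suppressed and the input is returned unchanged, and with coerce values that cannot be parsed are replaced by NaN.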
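The signature above also lists downcast, which the parameter descriptions do not cover. As a brief sketch of what it does: after parsing, to_numeric() can shrink the result to the smallest numeric dtype that still holds the values. The variable names `small_int` and `small_float` are made up for this example.

```python
import pandas as pd

# downcast='integer' picks the smallest integer dtype that fits the values
small_int = pd.to_numeric(pd.Series(['1', '2', '4']), downcast='integer')
print(small_int.dtype)

# downcast='float' does the same for floating-point results
small_float = pd.to_numeric(pd.Series(['9', '-80', '5.3']), downcast='float')
print(small_float.dtype)
```

With the default downcast=None the same inputs would come back as int64 and float64.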

The advantage of to_numeric() over astype() is that it removes a limitation of the latter: if the data contains any value that cannot be parsed as a number, astype() fails. to_numeric() handles this case because its errors parameter accepts coerce, which replaces every unparseable value with a missing value (NaN) and then converts the rest.

se = pd.Series(df['A'])
se1 = pd.Series(df['B'])
se2 = pd.Series(df['C'])
print("df['A']:\n", df['A'])
print("to_numeric(df['A']):\n", pd.to_numeric(se))
print("df['B']:\n", df['B'])
print("to_numeric(df['B']):\n", pd.to_numeric(se1))
print("df['C']:\n", df['C'])
# Note: errors='ignore' is deprecated for to_numeric() as of pandas 2.2
print("to_numeric(df['C'], errors='ignore'):\n", pd.to_numeric(se2, errors='ignore'))
print("to_numeric(df['C'], errors='coerce'):\n", pd.to_numeric(se2, errors='coerce'))

Output:

df['A']:
 0    1
1    2
2    4
Name: A, dtype: object
to_numeric(df['A']):
 0    1
1    2
2    4
Name: A, dtype: int64
df['B']:
 0      9
1    -80
2    5.3
Name: B, dtype: object
to_numeric(df['B']):
 0     9.0
1   -80.0
2     5.3
Name: B, dtype: float64
df['C']:
 0      x
1    5.9
2      0
Name: C, dtype: object
to_numeric(df['C'], errors='ignore'):
 0      x
1    5.9
2      0
Name: C, dtype: object
to_numeric(df['C'], errors='coerce'):
 0    NaN
1    5.9
2    0.0
Name: C, dtype: float64
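Since to_numeric() works on one column at a time, a whole DataFrame can be converted by passing the function to DataFrame.apply(), which calls it once per column and forwards extra keyword arguments. A minimal sketch using the df from the first example; `numeric_df` is a name chosen for illustration.

```python
import pandas as pd

df = pd.DataFrame({'A': ['1', '2', '4'],
                   'B': ['9', '-80', '5.3'],
                   'C': ['x', '5.9', '0']})

# apply() calls to_numeric once per column, forwarding errors='coerce'
numeric_df = df.apply(pd.to_numeric, errors='coerce')
print(numeric_df.dtypes)

# Coerced entries show up as NaN, so they can be counted per column
print(numeric_df.isna().sum())
```

Counting the NaN values afterwards is a simple way to see how many entries could not be parsed, since coerce does not report them on its own.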
