Python數據處理的實用技巧有哪些

發布時間：2022-04-01 13:43:21 來源：億速云閱讀：146 作者：iii 欄目：開發技術

今天小編給大家分享一下Python數據處理的實用技巧有哪些的相關知識點，內容詳細，邏輯清晰，相信大部分人都還太了解這方面的知識，所以分享這篇文章給大家參考一下，希望大家閱讀完這篇文章后有所收獲，下面我們一起來了解一下吧。

我使用的 Pandas 版本如下，順便也導入 Pandas 庫。

>>> import pandas as pd
>>> pd.__version__
'0.25.1'

在開始前先確保解釋器和數據集在同一目錄下：

>>> import os
>>> os.chdir('D://source/dataset') # 這是我的數據集所在目錄
>>> os.listdir() # 確認此目錄已經存在 IMDB-Movie-Data 數據集
['drinksbycountry.csv', 'IMDB-Movie-Data.csv', 'movietweetings', 'titanic_eda_data.csv', 'titanic_train_data.csv']

準備工作就位后，正式開始數據處理技巧之旅。

1 Pandas 移除某列

導入數據

>>> df = pd.read_csv("IMDB-Movie-Data.csv")
>>> df.head(1) # 導入并顯示第一行
   Rank                    Title                    Genre  ...   Votes Revenue (Millions) Metascore
0     1  Guardians of the Galaxy  Action,Adventure,Sci-Fi  ...  757074             333.13      76.0

[1 rows x 12 columns]

使用 pop 方法移除指定列：

>>> meta = df.pop("Title").to_frame() # 移除 Title 列

確認是否已被移除：

>>> df.head(1) # df 變為 11列
   Rank                    Genre  ... Revenue (Millions) Metascore
0     1  Action,Adventure,Sci-Fi  ...             333.13      76.0

[1 rows x 11 columns]

2 統計標題單詞數

pop 后得到 meta，顯示 meta 前 3 行：

>>> meta.head(3)
                     Title
0  Guardians of the Galaxy
1               Prometheus
2                    Split

標題是由單詞組成，中間用空格分隔。

# .str.count(" ") + 1 得到單詞個數 
>>> meta["words_count"] = meta["Title"].str.count(" ") + 1 
>>> meta.head(3) # words_count 列代表單詞個數
                     Title  words_count
0  Guardians of the Galaxy            4
1               Prometheus            1
2                    Split            1

3 Genre 頻次統計

下面統計電影 Genre 的頻次，

>>> vc = df["Genre"].value_counts()

下面顯示電影 Genre 的 Top5 ，最高頻為出現 50 次的 Action,Adventure,Sci-Fi 類，次之為 48 次的 Drama 類：

>>> vc.head()
Action,Adventure,Sci-Fi    50
Drama                      48
Comedy,Drama,Romance       35
Comedy                     32
Drama,Romance              31
Name: Genre, dtype: int64

展示 Top5 的餅狀圖：

>>> import matplotlib.pyplot as plt
>>> vc[:5].plot(kind='pie')
<matplotlib.axes._subplots.AxesSubplot object at 0x000001D65B114948>
>>> plt.show()

Python數據處理的實用技巧有哪些

以上就是“Python數據處理的實用技巧有哪些”這篇文章的所有內容，感謝各位的閱讀！相信大家閱讀完這篇文章都有很大的收獲，小編每天都會為大家更新不同的知識，如果還想學習更多的知識，請關注億速云行業資訊頻道。

向AI問一下細節

91超碰碰碰碰久久久久久综合_超碰av人澡人澡人澡人澡人掠_国产黄大片在线观看画质优化_txt小说免费全本

Python數據處理的實用技巧有哪些

1 Pandas 移除某列

2 統計標題單詞數

3 Genre 頻次統計

猜你喜歡

91超碰碰碰碰久久久久久综合_超碰av人澡人澡人澡人澡人掠_国产黄大片在线观看画质优化_txt小说免费全本

Python數據處理的實用技巧有哪些

1 Pandas 移除某列

2 統計標題單詞數

3 Genre 頻次統計

猜你喜歡

最新資訊

相關推薦

相關標簽