熊貓 - 數組groupBy結果

我試圖得到一組國家使用大熊貓最流行的名字。我已經在片段中看到了一半，但我不清楚如何將groupedByCountry轉換爲有序表格。熊貓 - 數組groupBy結果

import math 
import pandas 
csv = pandas.read_csv("./name_country.csv.gz", compression="gzip") 

data = csv[["name",'country']] 

filtered = roleIni[data.country.notnull()] 

groupedByCountry = filtered.groupby("country")

來源

2015-11-06 henry.oswald

您可以使用GROUPBY size，然後使用nlargest：

In [11]: df = pd.DataFrame([["andy", "GB"], ["bob", "US"], ["chris", "GB"]], columns=["name", "country"]) 

In [12]: df.groupby("country").size().nlargest(1) 
Out[12]: 
country 
GB 2 
dtype: int64

它可能更但是能有效地完成對列直接value_counts，然後採取head（head(n)將得到的前n最受歡迎的國家）：

In [21]: df["country"].value_counts().head(1) 
Out[21]: 
GB 2 
Name: country, dtype: int64

來源

2015-11-06 18:23:39

熊貓 - 數組groupBy結果

回答

相關問題