0
我有一個CSV文件看起來像這樣:Python的 - 每月彙總並計算平均
Date,Sentiment
2014-01-03,0.4
2014-01-04,-0.03
2014-01-09,0.0
2014-01-10,0.07
2014-01-12,0.0
2014-02-24,0.0
2014-02-25,0.0
2014-02-25,0.0
2014-02-26,0.0
2014-02-28,0.0
2014-03-01,0.1
2014-03-02,-0.5
2014-03-03,0.0
2014-03-08,-0.06
2014-03-11,-0.13
2014-03-22,0.0
2014-03-23,0.33
2014-03-23,0.3
2014-03-25,-0.14
2014-03-28,-0.25
etc
我的目標是月彙總的日期和計算月平均。日期可能不會以1或1月開始。問題是我有很多數據,這意味着我有更多年。爲此,我想找到最快的日期(月),並從那裏開始計算月份和平均值。例如:
Month count, average
1, 0.4 (<= the earliest month)
2, -0.3
3, 0.0
...
12, 0.1
13, -0.4 (<= new year but counting of month is continuing)
14, 0.3
我用熊貓來打開CSV
data = pd.read_csv("pks.csv", sep=",")
所以在data['Date']
我有日期和data['Sentiment']
我有值。任何想法如何做到這一點?
太棒了,那正是我需要的。非常感謝你! –