2016-08-02 34 views
0

我想在鏈接http://bokeh.pydata.org/en/latest/docs/gallery/stocks.html集團在散景小時,情節

2004-01-05,00:00:00,01:00:00,Mon,20504,792 
2004-01-05,01:00:00,02:00:00,Mon,16553,783 
2004-01-05,02:00:00,03:00:00,Mon,18944,790 
2004-01-05,03:00:00,04:00:00,Mon,17534,750 
2004-01-06,00:00:00,01:00:00,Tue,17262,747 
2004-01-06,01:00:00,02:00:00,Tue,19072,777 
2004-01-06,02:00:00,03:00:00,Tue,18275,785 

我想用列2中得到像在背景虛化股票數據的情節,如:開始時間和5:計數,我想按列day分組並在相應小時內對counts進行求和。

代碼:不給輸出

import numpy as np 
import pandas as pd 
#from bokeh.layouts import gridplot 
from bokeh.plotting import figure, show, output_file 

data = pd.read_csv('one_hour.csv') 
data.column = ['date', 'startTime', 'endTime', 'day', 'count', 'unique'] 

p1 = figure(x_axis_type='startTime', y_axis_type='count', title="counts per hour") 
p1.grid.grid_line_alpha=0.3 
p1.xaxis.axis_label = 'startTime' 
p1.yaxis.axis_label = 'count' 

output_file("count.html", title="time_graph.py") 
show(gridplot([[p1]], plot_width=400, plot_height=400)) # open a browser 

讀列和情節沒有任何問題,但通過與和運算的列數據應用組的東西,我不能執行。

感謝幫助,謝謝!

+0

@JoeR我能得到的散點圖,但並不是說我想 –

+0

你有沒有實際創建數據框的一個?你發佈的內容不是數據框。 –

+0

@JoeR我已經更新了代碼,至今我已經嘗試過了。 –

回答

1

聽起來這是你需要的東西:

data.groupby('startTime')['count'].sum() 

輸出:

00:00:00 37766 
01:00:00 35625 
02:00:00 37219 
03:00:00 17534 
+0

感謝解決方案,但對如何將其與散景代碼混淆有點困惑? –