2016-03-03 88 views

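I am trying to automatically extract electricity quotes from this site. Once I set the postcode (e.g. 3000), I can download (manually) the PDF files containing the results.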

I am using the httr package:

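library(httr)
library(XML)  # provides htmlParse()
qr <- POST("http://www.qenergy.com.au/What-Are-Your-Options",
           query = list(postcode = 3000))
# Parse the raw response text as HTML
res <- htmlParse(content(qr, as = "text"), asText = TRUE)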

The problem is that the URLs of the PDF files are not in the response to this request. Any help would be appreciated.

Answer


Try this, which sends the postcode as a form-encoded request body instead of a query string:

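library(httr)
# Send the postcode as a form-encoded POST body, not as a URL query string.
qr <- POST("http://www.qenergy.com.au/What-Are-Your-Options",
           encode = "form",
           body = list(postcode = 3000))
res <- content(qr)
# XPath subsetting with `[` assumes content() returned an XML-package document;
# with an xml2 document, xml2::xml_find_all() would be the equivalent.
pdfs <- as(res['//a[contains(@href, "pdf")]/@href'], "character")
head(pdfs)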
# [1] "flux-content/qenergy/pdf/VIC price fact sheet jemena distribution zone business/Jemena-Freedom-Biz-5-Day-Time-of-Use-A210.pdf" 
# [2] "flux-content/qenergy/pdf/VIC price fact sheet jemena distribution zone business/Jemena-Freedom-Biz-7-Day-Time-of-Use-A250.pdf" 
# [3] "flux-content/qenergy/pdf/VIC price fact sheet jemena distribution zone business/Jemena-Freedom-Biz-Single-Rate-CL.pdf"   
# [4] "flux-content/qenergy/pdf/VIC price fact sheet jemena distribution zone business/Jemena-Freedom-Biz-Single-Rate.pdf"    
# [5] "flux-content/qenergy/pdf/VIC price fact sheet united energy distribution zone business/United-Freedom-Biz-5-Day-Time-of-Use.pdf" 
# [6] "flux-content/qenergy/pdf/VIC price fact sheet united energy distribution zone business/United-Freedom-Biz-7-Day-Time-of-Use.pdf"
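
The paths in `pdfs` are relative and contain spaces, so they cannot be passed to a downloader as they are. Below is a minimal sketch of one way to turn them into full URLs and save the files. It assumes the links resolve against the site root (that is, the page has no `<base>` element), which the output above does not confirm. `base_url` and `download_pdfs()` are names introduced here for illustration only.

```r
# Two of the relative paths returned by the XPath query above.
pdfs <- c(
  "flux-content/qenergy/pdf/VIC price fact sheet jemena distribution zone business/Jemena-Freedom-Biz-Single-Rate-CL.pdf",
  "flux-content/qenergy/pdf/VIC price fact sheet jemena distribution zone business/Jemena-Freedom-Biz-Single-Rate.pdf"
)

# Assumption: relative hrefs resolve against the site root.
base_url <- "http://www.qenergy.com.au/"

# Percent-encode the spaces; URLencode() leaves "/" untouched by default.
encoded <- vapply(pdfs, utils::URLencode, character(1), USE.NAMES = FALSE)
pdf_urls <- paste0(base_url, encoded)

# Hypothetical helper: save each PDF under its own file name in dest_dir.
download_pdfs <- function(urls, dest_dir = tempdir()) {
  dest <- file.path(dest_dir, basename(urls))
  for (i in seq_along(urls)) {
    # mode = "wb" keeps binary files intact on Windows
    download.file(urls[i], dest[i], mode = "wb")
  }
  invisible(dest)
}

# download_pdfs(pdf_urls)  # uncomment to fetch the files (needs network access)
```

The download call is left commented out so the snippet can be run without touching the network; the saved file names are the last path component of each URL.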