2013-07-10 54 views
1

我試着在R上讀一些在線pdf文檔。我用readRDF函數。我的劇本是這樣的readPDF(tm package)in R

safex <- readPDF(PdftotextOptions='-layout')(elem=list(uri='C:/Users/FCG/Desktop/NoteF7000.pdf'),language='en',id='id1') 

[R顯示運行命令有狀態309.我嘗試了不同的pdftotext選項的消息。然而,這是相同的信息。並且創建的文本文件沒有內容。

誰能read this pdf

+0

我假設你有所有的'PATH's? –

+0

@RomanLuštrik,你的意思是我編輯Windows環境變量的路徑? – Lawrence

+0

是的,這些路徑是爲了什麼?你能否使用該功能使用的所有程序? –

回答

3

readPDFbugs,可能是不值得費心(看看this well-documented struggle吧)。

假設......

  1. 你有xpdf安裝(見here瞭解詳細信息)

  2. 你的路都是爲了(見here對於如何做到這一點的詳細信息)並且您已重新啓動計算機。

那麼你可能會更好避免readPDF,而是使用此解決方法:

system(paste('"C:/Program Files/xpdf/pdftotext.exe"', 
      '"C:/Users/FCG/Desktop/NoteF7000.pdf"'), wait=FALSE) 

然後閱讀文本文件導入R是這樣的...

require(tm) 
mycorpus <- Corpus(URISource("C:/Users/FCG/Desktop/NoteF7001.txt")) 

而且具有看看確認它進展順利:

inspect(mycorpus) 

A corpus with 1 text document 

The metadata consists of 2 tag-value pairs and a data frame 
Available tags are: 
    create_date creator 
Available variables in the data frame are: 
    MetaID 

[[1]] 
Market Notice 
Number: Date F7001 08 May 2013 

New IDX SSF (EWJG) The following new IDX SSF contract will be added to the list and will be available for trade today. 

Summary Contract Specifications Contract Code Underlying Instrument Bloomberg Code ISIN Code EWJG EWJG IShares MSCI Japan Index Fund (US) EWJ US EQUITY US4642868487 1 (R1 per point) 

Contract Size/Nominal 

Expiry Dates & Times 

10am New York Time; 14 Jun 2013/16 Sep 2013 

Underlying Currency Quotations Minimum Price Movement (ZAR) Underlying Reference Price 

USD/ZAR Bloomberg Code (USDZAR Currency) Price per underlying share to two decimals. R0.01 (0.01 in the share price) 

4pm underlying spot level as captured by the JSE. 

Currency Reference Price 

The same method as the one utilized for the expiry of standard currency futures on standard quarterly SAFEX expiry dates. 

JSE Limited Registration Number: 2005/022939/06 One Exchange Square, Gwen Lane, Sandown, South Africa. Private Bag X991174, Sandton, 2146, South Africa. Telephone: +27 11 520 7000, Facsimile: +27 11 520 8584, www.jse.co.za 

Executive Director: NF Newton-King (CEO), A Takoordeen (CFO) Non-Executive Directors: HJ Borkum (Chairman), AD Botha, MR Johnston, DM Lawrence, A Mazwai, Dr. MA Matooane , NP Mnxasana, NS Nematswerani, N Nyembezi-Heita, N Payne Alternate Directors: JH Burke, LV Parsons 

Member of the World Federation of Exchanges 

Company Secretary: GC Clarke 
Settlement Method 

Cash Settled 

- 

Clearing House Fees - 

On-screen IDX Futures Trading: o 1 BP for Taker (Aggressor) o Zero Booking Fees for Maker (Passive) o No Cap o Floor of 0.01 Reported IDX Futures Trades o 1.75 BP for both buyer and seller o No Cap o Floor of 0.01 

Initial Margin Class Spread Margin V.S.R. Expiry Date 

R 10.00 R 5.00 3.5 14/06/2013, 16/09/2013 

The above instrument has been designated as "Foreign" by the South African Reserve Bank 

Should you have any queries regarding IDX Single Stock Futures, please contact the IDX team on 011 520-7399 or [email protected] 

Graham Smale Director: Bonds and Financial Derivatives Tel: +27 11 520 7831 Fax:+27 11 520 8831 E-mail: [email protected] 

Distributed by the Company Secretariat +27 11 520 7346 

Page 2 of 2