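How to turn rvest output into a table

I'm brand new to R, so I'll do my best to explain this. I've been playing around with data scraping using the 'rvest' package. In this example, I'm scraping US state populations from a table on Wikipedia. The code I used is: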
library(rvest)
statepop = read_html("https://en.wikipedia.org/wiki/List_of_U.S._states_and_territories_by_population")
forecasthtml = html_nodes(statepop, "td")
forecasttext = html_text(forecasthtml)
forecasttext
The output looks like this:
[2] "7000100000000000000♠1"
[3] " California"
[4] "39,250,017"
[5] "37,254,503"
[6] "7001530000000000000♠53"
[7] "738,581"
[8] "702,905"
[9] "12.15%"
[10] "7000200000000000000♠2"
[11] "7000200000000000000♠2"
[12] " Texas"
[13] "27,862,596"
[14] "25,146,105"
[15] "7001360000000000000♠36"
[16] "763,031"
[17] "698,487"
[18] "8.62%"
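How can I turn these strings of text into a table laid out the way it appears on the original Wikipedia page (with columns, rows, etc.)?
Use 'html_table()' instead of 'html_text()'. – Nate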
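To illustrate the suggestion, here is a minimal sketch of selecting the whole "table" node and passing it to html_table(), rather than selecting each "td" cell and flattening it with html_text(). It assumes a small inline HTML table in place of the live Wikipedia page so that it runs offline. The column names (State, Estimate, Census) and the object names (page, tables, pop) are made up for the example, and the figures are copied from the output in the question.

```r
library(rvest)

# Hypothetical stand-in for the Wikipedia page: a literal HTML string
# with invented column headers and two rows taken from the question's output.
page <- read_html("<table>
  <tr><th>State</th><th>Estimate</th><th>Census</th></tr>
  <tr><td> California</td><td>39,250,017</td><td>37,254,503</td></tr>
  <tr><td> Texas</td><td>27,862,596</td><td>25,146,105</td></tr>
</table>")

# Select whole <table> nodes instead of individual <td> cells.
tables <- html_nodes(page, "table")

# Convert the first table into a data frame (rows and columns preserved).
pop <- html_table(tables[[1]])

# Values with thousands separators come back as text; strip the commas
# before converting them to numbers.
pop$Estimate <- as.numeric(gsub(",", "", as.character(pop$Estimate)))
pop$Census <- as.numeric(gsub(",", "", as.character(pop$Census)))

print(pop)
```

With the real page, html_nodes(statepop, "table") would likely return several tables, so it is worth checking length(tables) and inspecting each one to find the population table; which index that is depends on the current page layout.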