我試圖將此乳房癌威斯康星州數據集從列表轉換爲包含列的數據框。將url中的數據列表轉換爲python中的csv
這些列名:
# Attribute Domain
-- -----------------------------------------
1. Sample code number id number
2. Clump Thickness 1 - 10
3. Uniformity of Cell Size 1 - 10
4. Uniformity of Cell Shape 1 - 10
5. Marginal Adhesion 1 - 10
6. Single Epithelial Cell Size 1 - 10
7. Bare Nuclei 1 - 10
8. Bland Chromatin 1 - 10
9. Normal Nucleoli 1 - 10
10. Mitoses 1 - 10
11. Class: (2 for benign, 4 for malignant)
我導入的數據集分成蟒蛇這樣
導入請求
link = "http://archive.ics.uci.edu/ml/machine-learning-databases/breast-cancer-wisconsin/breast-cancer-wisconsin.data"
f = requests.get(link)
print (f.text)
並將數據視爲列表機智^ h逗號:
1000025,5,1,1,1,2,1,3,1,1,2
1002945,5,4,4,5,7,10,3,2,1,2
1015425,3,1,1,1,2,2,3,1,1,2
1016277,6,8,8,1,3,4,3,7,1,2
1017023,4,1,1,3,2,1,3,1,1,2
我需要逗號分隔成列和名稱添加到列
我試過,但沒有奏效
import requests
import pandas as pd
import io
urlData = requests.get(f.text).content
rawData = pd.read_csv(io.StringIO(urlData.decode('utf-8')))
可能重複[鏈接](https://stackoverflow.com/a/41880513/3959965 ) – dalonlobo
[Pandas read \ _csv from url]可能重複(https://stackoverflow.com/questions/32400867/pandas-read-csv-from-url) – miradulo
只是'pd.read_csv(link,header = None)' - 比較簡單:) – miradulo