Casting string to int gives null
I have a Spark DataFrame, results, with two string columns that I want to cast to numeric:
>>> results.show()
+--------------------+-----------------+------------------------+
| Hospital Name|HCAHPS Base Score|HCAHPS Consistency Score|
+--------------------+-----------------+------------------------+
|"ADIRONDACK MEDIC...| "43"| "20"|
|"BAYLOR MEDICAL C...| "32"| "20"|
|"GOOD SHEPHERD ME...| "25"| "20"|
|"GOOD SHEPHERD ME...| "25"| "20"|
|"MASONIC HOME AND...| "Not Available"| "Not Available"|
|"ST HELENA HOSPITAL"| "41"| "20"|
| "TOURO INFIRMARY"| "15"| "18"|
|"WAHIAWA GENERAL ...| "17"| "10"|
|"ANNA JAQUES HOSP...| "27"| "18"|
| "CMC-BLUE RIDGE"| "31"| "18"|
|"EVANSTON REGIONA...| "15"| "15"|
|"OKLAHOMA SPINE H...| "79"| "20"|
|"PICKENS COUNTY M...| "Not Available"| "Not Available"|
|"PORTNEUF MEDICAL...| "11"| "17"|
|"PRESENCE SAINT J...| "20"| "17"|
|"RIVERSIDE MEDICA...| "39"| "20"|
|"RIVERSIDE MEDICA...| "39"| "20"|
|"RIVERSIDE MEDICA...| "39"| "20"|
|"SOUTH GEORGIA ME...| "3 out of 10"| "24"|
|"TAMPA GENERAL HO...| "23"| "16"|
+--------------------+-----------------+------------------------+
Trying the following gives me a table of null values:
>>> from pyspark.sql.types import IntegerType
>>> results2 = results.select(results["Hospital Name"], results["HCAHPS Base Score"].cast(IntegerType()).alias("HCAHPS Base Score"), results["HCAHPS Consistency Score"].cast(IntegerType()).alias("HCAHPS Consistency Score"))
>>> results2.show()
+--------------------+-----------------+------------------------+
| Hospital Name|HCAHPS Base Score|HCAHPS Consistency Score|
+--------------------+-----------------+------------------------+
|"ADIRONDACK MEDIC...| null| null|
|"BAYLOR MEDICAL C...| null| null|
|"GOOD SHEPHERD ME...| null| null|
|"GOOD SHEPHERD ME...| null| null|
|"MASONIC HOME AND...| null| null|
|"ST HELENA HOSPITAL"| null| null|
| "TOURO INFIRMARY"| null| null|
|"WAHIAWA GENERAL ...| null| null|
|"ANNA JAQUES HOSP...| null| null|
| "CMC-BLUE RIDGE"| null| null|
|"EVANSTON REGIONA...| null| null|
|"OKLAHOMA SPINE H...| null| null|
|"PICKENS COUNTY M...| null| null|
|"PORTNEUF MEDICAL...| null| null|
|"PRESENCE SAINT J...| null| null|
|"RIVERSIDE MEDICA...| null| null|
|"RIVERSIDE MEDICA...| null| null|
|"RIVERSIDE MEDICA...| null| null|
|"SOUTH GEORGIA ME...| null| null|
|"TAMPA GENERAL HO...| null| null|
+--------------------+-----------------+------------------------+
only showing top 20 rows
Is it not possible to cast string columns to integers in PySpark?
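One possible explanation, judging only from the show() output above: the values seem to contain literal double-quote characters (the cell holds "43" including the quotes, not 43), and with Spark's default non-ANSI settings a string that is not a valid integer casts to null. The sketch below is an assumption-laden illustration, not a confirmed diagnosis. parse_score is a hypothetical plain-Python model of the cleanup, and cast_quoted_column is a hypothetical helper that applies the same idea to a DataFrame using regexp_replace from pyspark.sql.functions followed by cast.

```python
import re
from typing import Optional


def parse_score(raw: Optional[str]) -> Optional[int]:
    """Hypothetical plain-Python model: strip literal quotes, then parse."""
    if raw is None:
        return None
    cleaned = raw.strip().strip('"')
    # Only plain digit strings become integers; anything else
    # (e.g. 'Not Available') maps to None, mirroring a null result.
    return int(cleaned) if re.fullmatch(r"-?\d+", cleaned) else None


def cast_quoted_column(df, name):
    """Hypothetical helper: the same cleanup on a Spark DataFrame column."""
    # Imported lazily so parse_score can be tried without Spark installed.
    from pyspark.sql import functions as F

    # Remove every literal double quote, then cast the remainder to int.
    stripped = F.regexp_replace(F.col(name), '"', "")
    return df.withColumn(name, stripped.cast("int"))
```

Assuming results is the DataFrame shown above, usage might look like results2 = cast_quoted_column(cast_quoted_column(results, "HCAHPS Base Score"), "HCAHPS Consistency Score"). Rows such as "Not Available" or "3 out of 10" would presumably still come out as null, since they are not integers even after the quotes are removed.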