這是我的數據的一個樣本:刪除標點符號格式的文本 - 星火
case time (especially it's purse), read manual care, follow care instructions make stays waterproof -- example, inspect rubber seals doors (especially battery/memory card door open time)
xm "life support" picture . flip part bit flimsy guessing won't long . sound great altec speaker dock it! chance back base (xm3020) . traveling bag connect laptop extra speaker . amount paid ($25).
我想刪除所有標點符號除了點,並與length < = 2
刪除的話,比如我的預期輸出()。是:
case time especially its purse read manual care follow care instructions . make stays waterproof example inspect rubber seals doors especially batterymemory card door open time
life support picture . flip part bit flimsy guessing wont long . sound great altec speaker dock chance back base xm3020 . traveling bag connect laptop extra speaker . amount paid $25 .
,這應該在Scala中實現, 我已經試過:
replaceAll("""\\W\s""", "")
replaceAll(""""[^a-zA-Z\.]""", "")
但無法正常工作,任何人都可以幫助我嗎?
'$ 25'有一個特殊的字符,你沒有刪除。 – tuxdna