我想清理使用命令額外空格的數據框中的一列。數據幀有近800萬條記錄Python熊貓錯誤,同時刪除額外的空白空間
datt2.My_variable=datt2.My_variable.str.replace('\s+', ' ')
我最終得到下面的錯誤
MemoryError Traceback (most recent call last)
<ipython-input-10-158a51cfaa3d> in <module>()
----> 1 datt2.My_variable=datt2.My_variable.str.replace('\s+', ' ')
c:\python27\lib\site-packages\pandas\core\strings.pyc in replace(self, pat, repl, n, case, flags)
1504 def replace(self, pat, repl, n=-1, case=True, flags=0):
1505 result = str_replace(self._data, pat, repl, n=n, case=case,
-> 1506 flags=flags)
1507 return self._wrap_result(result)
1508
c:\python27\lib\site-packages\pandas\core\strings.pyc in str_replace(arr, pat, repl, n, case, flags)
334 f = lambda x: x.replace(pat, repl, n)
335
--> 336 return _na_map(f, arr)
337
338
c:\python27\lib\site-packages\pandas\core\strings.pyc in _na_map(f, arr, na_result, dtype)
152 def _na_map(f, arr, na_result=np.nan, dtype=object):
153 # should really _check_ for NA
--> 154 return _map(f, arr, na_mask=True, na_value=na_result, dtype=dtype)
155
156
c:\python27\lib\site-packages\pandas\core\strings.pyc in _map(f, arr, na_mask, na_value, dtype)
167 try:
168 convert = not all(mask)
--> 169 result = lib.map_infer_mask(arr, f, mask.view(np.uint8), convert)
170 except (TypeError, AttributeError):
171
pandas\src\inference.pyx in pandas.lib.map_infer_mask (pandas\lib.c:65837)()
pandas\src\inference.pyx in pandas.lib.maybe_convert_objects (pandas\lib.c:56806)()
MemoryError:
如果你使用'datt2.My_variable.str.replace(r'\ s +','',inplace = True,regex = True)'? –
@WiktorStribiżew,這個函數沒有參數'inplace'或'regex'。我仍然繼續前進並執行它,並且錯誤消息正如所預期的那樣:TypeError:replace()得到了一個意外的關鍵字參數'inplace'' – Enthusiast
這是很多你正在處理的數據嗎? – languitar