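Calculating conditional probabilities from frequencies in Python
I am trying to calculate conditional probabilities of the form P(A=a|B=b,C=c), where a is an element of ['high', 'medium', 'low'], b is an element of ['0-20', '20-40', '40-60', '60-80', '80-inf'], and c is an element of ['male', 'female'].
I have a frequency dictionary that looks like this: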
{('high', '0-20', 'female'): 11,
('high', '0-20', 'male'): 43,
('high', '20-40', 'female'): 10,
('high', '20-40', 'male'): 17,
('high', '40-60', 'female'): 11,
('high', '40-60', 'male'): 10,
('high', '60-80', 'female'): 2,
('high', '60-80', 'male'): 1,
('high', '80-inf', 'female'): 0,
('high', '80-inf', 'male'): 0,
('low', '0-20', 'female'): 130,
('low', '0-20', 'male'): 159,
('low', '20-40', 'female'): 186,
('low', '20-40', 'male'): 297,
('low', '40-60', 'female'): 71,
('low', '40-60', 'male'): 144,
('low', '60-80', 'female'): 35,
('low', '60-80', 'male'): 53,
('low', '80-inf', 'female'): 1,
('low', '80-inf', 'male'): 2,
('medium', '0-20', 'female'): 90,
('medium', '0-20', 'male'): 194,
('medium', '20-40', 'female'): 72,
('medium', '20-40', 'male'): 116,
('medium', '40-60', 'female'): 46,
('medium', '40-60', 'male'): 49,
('medium', '60-80', 'female'): 12,
('medium', '60-80', 'male'): 22,
('medium', '80-inf', 'female'): 1,
('medium', '80-inf', 'male'): 2}
What I want is a dictionary that looks like:
{('high', '0-20', 'female'): P(A='high'| B='0-20', C='female'),
etc...,
}
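What have you tried so far? – tinySandy
Those don't look like frequencies... they look more like counts that need to be divided by a total count to yield frequencies... Is it the processing of the given data to get to your starting frequencies that you don't understand, or the concept of conditional probabilities and how to compute them from discrete probabilities? – twalberg
@twalberg You're right, these are raw counts. It's the concept of computing conditional probabilities that I'm having trouble with, because they are conditioned on more than one variable. –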
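One possible way to approach this, assuming the dictionary holds raw counts N(a, b, c) as the comments suggest: for a fixed pair (b, c), P(A=a|B=b,C=c) can be estimated as N(a, b, c) divided by the sum of N(a', b, c) over all values a'. The sketch below is only an illustration of that idea. The names `counts`, `conditional_probabilities`, and `cond_prob` are made up for the example, and returning `None` for a (b, c) group with no observations is an assumption, not something the question specifies.

```python
from collections import defaultdict

# Raw counts keyed by (a, b, c), copied from the question.
counts = {
    ('high', '0-20', 'female'): 11,
    ('high', '0-20', 'male'): 43,
    ('high', '20-40', 'female'): 10,
    ('high', '20-40', 'male'): 17,
    ('high', '40-60', 'female'): 11,
    ('high', '40-60', 'male'): 10,
    ('high', '60-80', 'female'): 2,
    ('high', '60-80', 'male'): 1,
    ('high', '80-inf', 'female'): 0,
    ('high', '80-inf', 'male'): 0,
    ('low', '0-20', 'female'): 130,
    ('low', '0-20', 'male'): 159,
    ('low', '20-40', 'female'): 186,
    ('low', '20-40', 'male'): 297,
    ('low', '40-60', 'female'): 71,
    ('low', '40-60', 'male'): 144,
    ('low', '60-80', 'female'): 35,
    ('low', '60-80', 'male'): 53,
    ('low', '80-inf', 'female'): 1,
    ('low', '80-inf', 'male'): 2,
    ('medium', '0-20', 'female'): 90,
    ('medium', '0-20', 'male'): 194,
    ('medium', '20-40', 'female'): 72,
    ('medium', '20-40', 'male'): 116,
    ('medium', '40-60', 'female'): 46,
    ('medium', '40-60', 'male'): 49,
    ('medium', '60-80', 'female'): 12,
    ('medium', '60-80', 'male'): 22,
    ('medium', '80-inf', 'female'): 1,
    ('medium', '80-inf', 'male'): 2,
}


def conditional_probabilities(counts):
    """Estimate P(A=a | B=b, C=c) for every (a, b, c) key in counts."""
    # Total count for each conditioning pair (b, c), summed over all a.
    totals = defaultdict(int)
    for (a, b, c), n in counts.items():
        totals[(b, c)] += n

    # Divide each joint count by its group total.
    # Assumption: a group with zero observations maps to None.
    return {
        (a, b, c): (n / totals[(b, c)] if totals[(b, c)] else None)
        for (a, b, c), n in counts.items()
    }


cond_prob = conditional_probabilities(counts)

# For ('0-20', 'female') the group total is 11 + 130 + 90 = 231,
# so this prints the value of 11 / 231.
print(cond_prob[('high', '0-20', 'female')])
```

With this normalization, the probabilities for the three values of a within any one (b, c) group sum to 1, which is a quick sanity check on the result.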