我有一組特徵向量,因爲我已使用獲得的句子:試圖追加值,以矢量的numpy的陣列設有
with open(sys.argv[1]) as trainingSentences:
sentence2region2value = json.loads(trainingSentences.read())
train_wordlist = []
for sentence,locations in sentence2region2value.iteritems():
train_wordlist.append(" ".join(sentence_to_words(sentence, True)))
vectorizer = CountVectorizer(analyzer = "word", \
tokenizer = None, \
preprocessor = None, \
stop_words = None, \
max_features = 5000)
train_data_features = vectorizer.fit_transform(train_wordlist)
train_data_features = train_data_features.toarray()
欲還添加標籤爲所有這些492個的特徵向量進行邏輯迴歸。這種「預測」標籤包含在sentence2region2value
詞典:
{sentence: Y
{parsedsentence: Z
{prediction: X,
location-values:{"Qatar": [32,221,31]},{"Dubai": [12,123,421]},.....}
目前,我嘗試使用此:
for prediction in sentence2region2value["sentence"]["parsedsentence"].iteritems():
for i in train_data_features:
train_data_features[i] = np.append(train_data_features[i],np.array(prediction))
但它無法正常工作。有任何想法嗎?