Python：分類圖像中的對象

camera = webcam; % Connect to the camera 
nnet = alexnet; % Load the neural net 

while true 
    picture = camera.snapshot;    % Take a picture  
    picture = imresize(picture,[227,227]); % Resize the picture 

    label = classify(nnet, picture);  % Classify the picture 

    image(picture);  % Show the picture 
    title(char(label)); % Show the label 
    drawnow; 
end

我在互聯網上找到了這個matlab代碼。它顯示一個窗口，上面有來自網絡攝像頭的圖片，並且很快也會命名圖片中的內容（「鍵盤」，「bootle」，「鉛筆」，「時鐘」...）。我想在python中做到這一點。到目前爲止，我有這樣的：Python：分類圖像中的對象

import cv2 
import sys 
faceCascade = cv2.CascadeClassifier("haarcascade_frontalface_default.xml") 
video_capture = cv2.VideoCapture(0) 

while True: 
    ret, frame = video_capture.read() 
    gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY) 

    faces = faceCascade.detectMultiScale(
     gray, 
     scaleFactor=1.1, 
     minNeighbors=5, 
     minSize=(30, 30), 
     flags=cv2.cv.CV_HAAR_SCALE_IMAGE 
    ) 

    # Draw a rectangle around the faces 
    for (x, y, w, h) in faces: 
     cv2.rectangle(frame, (x, y), (x+w, y+h), (0, 255, 0), 2) 
    cv2.imshow('Video', frame) 

    if cv2.waitKey(1) & 0xFF == ord('q'): 
     break

這是alreay很相似，但只檢測面。 matlab代碼使用alexnet。我想這是一個基於imagenet數據的預訓練網絡（http://www.image-net.org/）。但它不再可用。我將如何在python中做到這一點？

（這裏有類似的問題，但它是4歲，我認爲現在有更新的技術）。

來源

2017-05-28 jms

第一個示例使用1000個類中的一個對象選擇標記整個圖片。第二個發現圖片中只有一個類對象，但包含每個實例的位置。你能否澄清一下，你只想複製Python中的第一個例子？將兩者結合也是可能的（圍繞多個類別標識對象的框），但是可以比任何一個例子都要複雜得多。 –

我只對「label = classify（nnet，picture）」感興趣;％分類圖片「 I.e.爲pciture獲得一個分類標籤。 – jms

通過「tensorflow」軟件包和預先培訓的網絡「vgg16」，解決方案非常簡單。請參閱https://github.com/machrisaa/tensorflow-vgg/blob/master/test_vgg16.py

來源

2017-05-31 14:01:25 jms

Python：分類圖像中的對象

回答

相關問題