PyCaffe - changing a script to take 10 images as input instead of one

I have a project in which I want to use Yahoo's OpenNSFW network from a Python script, but right now the script accepts only a single image and takes about 270 ms to compute the forward pass (a bit too slow).

I think batching the images up, 50 at a time, would be faster, but I'm not sure whether this can be done using the deploy prototxt file.

I modified the deploy.prototxt file by changing the first dim from 1 to 10, like this:

name: "ResNet_50_1by2_nsfw" 
layer { 
    name: "data" 
    type: "Input" 
    top: "data" 
    input_param { shape: { dim: 10 dim: 3 dim: 224 dim: 224 } } 
} 
...

Now I need a way to make the matching change in the Python script that uses it. The code is here:

#!/usr/bin/env python 
""" 
Copyright 2016 Yahoo Inc. 
Licensed under the terms of the 2 clause BSD license. 
Please see LICENSE file in the project root for terms. 
""" 

import numpy as np 
import os 
import sys 
import argparse 
import glob 
import time 
from PIL import Image 
from StringIO import StringIO 
import caffe 


def resize_image(data, sz=(256, 256)):
    """
    Resize image. Please use this resize logic for best results instead of the
    caffe, since it was used to generate training dataset
    :param str data:
        The image data
    :param sz tuple:
        The resized image dimensions
    :returns bytearray:
        A byte array with the resized image
    """
    img_data = str(data)
    im = Image.open(StringIO(img_data))
    if im.mode != "RGB":
        im = im.convert('RGB')
    imr = im.resize(sz, resample=Image.BILINEAR)
    fh_im = StringIO()
    imr.save(fh_im, format='JPEG')
    fh_im.seek(0)
    return bytearray(fh_im.read())

def caffe_preprocess_and_compute(pimg, caffe_transformer=None, caffe_net=None,
    output_layers=None):
    """
    Run a Caffe network on an input image after preprocessing it to prepare
    it for Caffe.
    :param PIL.Image pimg:
        PIL image to be input into Caffe.
    :param caffe.Net caffe_net:
        A Caffe network with which to process pimg after preprocessing.
    :param list output_layers:
        A list of the names of the layers from caffe_net whose outputs are
        to be returned. If this is None, the default outputs for the network
        are returned.
    :return:
        Returns the requested outputs from the Caffe net.
    """
    if caffe_net is not None:

        # Grab the default output names if none were requested specifically.
        if output_layers is None:
            output_layers = caffe_net.outputs

        img_data_rs = resize_image(pimg, sz=(256, 256))
        image = caffe.io.load_image(StringIO(img_data_rs))

        H, W, _ = image.shape
        _, _, h, w = caffe_net.blobs['data'].data.shape
        h_off = max((H - h)/2, 0)
        w_off = max((W - w)/2, 0)
        crop = image[h_off:h_off + h, w_off:w_off + w, :]
        transformed_image = caffe_transformer.preprocess('data', crop)
        transformed_image.shape = (1,) + transformed_image.shape

        input_name = caffe_net.inputs[0]
        all_outputs = caffe_net.forward_all(blobs=output_layers,
                    **{input_name: transformed_image})

        outputs = all_outputs[output_layers[0]][0].astype(float)
        return outputs
    else:
        return []


def main(argv):
    pycaffe_dir = os.path.dirname(__file__)

    parser = argparse.ArgumentParser()
    # Required arguments: input file.
    parser.add_argument(
        "input_file",
        help="Path to the input image file"
    )

    # Optional arguments.
    parser.add_argument(
        "--model_def",
        help="Model definition file."
    )
    parser.add_argument(
        "--pretrained_model",
        help="Trained model weights file."
    )

    args = parser.parse_args()
    image_data = open(args.input_file).read()

    # Pre-load caffe model.
    nsfw_net = caffe.Net(args.model_def,  # pylint: disable=invalid-name
        args.pretrained_model, caffe.TEST)

    # Load transformer
    # Note that the parameters are hard-coded for best results
    caffe_transformer = caffe.io.Transformer({'data': nsfw_net.blobs['data'].data.shape})
    caffe_transformer.set_transpose('data', (2, 0, 1))  # move image channels to outermost
    caffe_transformer.set_mean('data', np.array([104, 117, 123]))  # subtract the dataset-mean value in each channel
    caffe_transformer.set_raw_scale('data', 255)  # rescale from [0, 1] to [0, 255]
    caffe_transformer.set_channel_swap('data', (2, 1, 0))  # swap channels from RGB to BGR

    # Classify.
    scores = caffe_preprocess_and_compute(image_data, caffe_transformer=caffe_transformer, caffe_net=nsfw_net, output_layers=['prob'])

    # Scores is the array containing SFW/NSFW image probabilities
    # scores[1] indicates the NSFW probability
    print "NSFW score: " , scores[1]


if __name__ == '__main__':
    main(sys.argv)

Is there a simple way to do this?

2017-01-16 lollercoaster

You can see an example of how multiple images are fed into a classifier in classifier.py, the example script provided with Caffe.

Basically, you need to make transformed_image a 4D array, with the different images stacked along axis 0.
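As a rough sketch of that idea (not the code from classifier.py), the per-image preprocessing from the question can be applied in a loop and the results stacked with NumPy. The helper names `center_crop`, `stack_batch` and `nsfw_scores` are made up for illustration. `transformer` is assumed to be the `caffe.io.Transformer` built in `main()`, `net` the loaded `caffe.Net`, and `images` a list of arrays as returned by `caffe.io.load_image` after the script's own `resize_image` step.

```python
import numpy as np


def center_crop(image, h, w):
    """Center-crop an H x W x C array to h x w, as the original script does."""
    H, W = image.shape[:2]
    h_off = max((H - h) // 2, 0)
    w_off = max((W - w) // 2, 0)
    return image[h_off:h_off + h, w_off:w_off + w, :]


def stack_batch(images, transformer, crop_hw=(224, 224)):
    """Preprocess each image and stack the results along axis 0.

    `transformer` only needs a preprocess(name, image) method; in the
    question's script that is the caffe.io.Transformer instance.
    """
    h, w = crop_hw
    processed = [transformer.preprocess('data', center_crop(im, h, w))
                 for im in images]
    return np.stack(processed, axis=0)  # shape: (N, 3, h, w)


def nsfw_scores(net, batch, output_layer='prob'):
    """Forward a (N, 3, h, w) batch and return one NSFW score per image."""
    input_name = net.inputs[0]
    outputs = net.forward_all(blobs=[output_layer], **{input_name: batch})
    # Row i holds the [SFW, NSFW] probabilities for image i, so take
    # column 1 of every row instead of only element [0] as before.
    return outputs[output_layer][:, 1].astype(float)
```

With the 10-image deploy.prototxt from the question, something like `nsfw_scores(nsfw_net, stack_batch(images, caffe_transformer))` should then yield one score per image. Note that the original function returns only `[0]` of the output, which would silently drop every image after the first.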

2017-01-17 07:06:59 Shai
