《TensorFlow實例之 VGG》

文獻來源： https://www.cs.toronto.edu/~frossard/post/vgg16/

VGG卷積神經網絡是牛津大學在2014年提出來的模型。當這個模型被提出時，由於它的簡潔性和實用性，馬上成爲了當時最流行的卷積神經網絡模型。它在圖像分類和目標檢測任務中都表現出非常好的結果。在2014年的ILSVRC比賽中，VGG 在Top-5中取得了92.3%的正確率。

衆所周知，VGG是一個良好的特徵提取器，其與訓練好的模型也經常被用來做其他事情，比如計算perceptual loss(風格遷移和超分辨率任務中)，儘管現在resnet和inception網絡等等具有很高的精度和更加簡便的網絡結構，但是在特徵提取上，VGG一直是一個很好的網絡，所以說，當你的某些任務上resnet或者inception等表現並不好時，不妨試一下VGG，或許會有意想不到的結果。

VGG之所以是一個很好的特徵提取器，除了和它的網絡結構有關，還和它的訓練方式有關係，VGG並不是直接訓練完成的，它使用了逐層訓練的方法。

另外，VGG對於Alexnet來說，改進並不是很大，主要改進就在於使用了小卷積核，網絡是分段卷積網絡，通過maxpooling過度，同時網絡更深更寬。

VGG16命名由來即該神經網絡層結構爲：13層卷積層，3層鏈接層。

運行環境：python3，Ubuntu

文件列表：

　　 1:模型權重：vgg16_weights.npz

　　2:TensorFlow模型：vgg16.py

　　3:類名列表：imagenet_classes.py

　　4:輸入實例：m1.jpg

代碼實例：

vgg16.py

########################################################################################

# Davi Frossard, 2016                                                                  #

# VGG16 implementation in TensorFlow                                                   #

# Details:                                                                             #

# http://www.cs.toronto.edu/~frossard/post/vgg16/                                      #

#                                                                                      #

# Model from https://gist.github.com/ksimonyan/211839e770f7b538e2d8#file-readme-md     #

# Weights from Caffe converted using https://github.com/ethereon/caffe-tensorflow      #

########################################################################################



import tensorflow as tf

import numpy as np

from scipy.misc import imread, imresize

from imagenet_classes import class_names





class vgg16:

    def __init__(self, imgs, weights=None, sess=None):

        self.imgs = imgs

        self.convlayers()

        self.fc_layers()

        self.probs = tf.nn.softmax(self.fc3l)

        if weights is not None and sess is not None:

            self.load_weights(weights, sess)





    def convlayers(self):

        self.parameters = []



        # zero-mean input

        with tf.name_scope('preprocess') as scope:

            mean = tf.constant([123.68, 116.779, 103.939], dtype=tf.float32, shape=[1, 1, 1, 3], name='img_mean')

            images = self.imgs-mean



        # conv1_1

        with tf.name_scope('conv1_1') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 3, 64], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(images, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[64], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv1_1 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv1_2

        with tf.name_scope('conv1_2') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 64, 64], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv1_1, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[64], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv1_2 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # pool1

        self.pool1 = tf.nn.max_pool(self.conv1_2,

                               ksize=[1, 2, 2, 1],

                               strides=[1, 2, 2, 1],

                               padding='SAME',

                               name='pool1')



        # conv2_1

        with tf.name_scope('conv2_1') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 64, 128], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.pool1, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[128], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv2_1 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv2_2

        with tf.name_scope('conv2_2') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 128, 128], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv2_1, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[128], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv2_2 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # pool2

        self.pool2 = tf.nn.max_pool(self.conv2_2,

                               ksize=[1, 2, 2, 1],

                               strides=[1, 2, 2, 1],

                               padding='SAME',

                               name='pool2')



        # conv3_1

        with tf.name_scope('conv3_1') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 128, 256], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.pool2, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv3_1 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv3_2

        with tf.name_scope('conv3_2') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 256, 256], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv3_1, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv3_2 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv3_3

        with tf.name_scope('conv3_3') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 256, 256], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv3_2, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[256], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv3_3 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # pool3

        self.pool3 = tf.nn.max_pool(self.conv3_3,

                               ksize=[1, 2, 2, 1],

                               strides=[1, 2, 2, 1],

                               padding='SAME',

                               name='pool3')



        # conv4_1

        with tf.name_scope('conv4_1') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 256, 512], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.pool3, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[512], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv4_1 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv4_2

        with tf.name_scope('conv4_2') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 512, 512], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv4_1, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[512], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv4_2 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv4_3

        with tf.name_scope('conv4_3') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 512, 512], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv4_2, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[512], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv4_3 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # pool4

        self.pool4 = tf.nn.max_pool(self.conv4_3,

                               ksize=[1, 2, 2, 1],

                               strides=[1, 2, 2, 1],

                               padding='SAME',

                               name='pool4')



        # conv5_1

        with tf.name_scope('conv5_1') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 512, 512], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.pool4, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[512], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv5_1 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv5_2

        with tf.name_scope('conv5_2') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 512, 512], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv5_1, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[512], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv5_2 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # conv5_3

        with tf.name_scope('conv5_3') as scope:

            kernel = tf.Variable(tf.truncated_normal([3, 3, 512, 512], dtype=tf.float32,

                                                     stddev=1e-1), name='weights')

            conv = tf.nn.conv2d(self.conv5_2, kernel, [1, 1, 1, 1], padding='SAME')

            biases = tf.Variable(tf.constant(0.0, shape=[512], dtype=tf.float32),

                                 trainable=True, name='biases')

            out = tf.nn.bias_add(conv, biases)

            self.conv5_3 = tf.nn.relu(out, name=scope)

            self.parameters += [kernel, biases]



        # pool5

        self.pool5 = tf.nn.max_pool(self.conv5_3,

                               ksize=[1, 2, 2, 1],

                               strides=[1, 2, 2, 1],

                               padding='SAME',

                               name='pool4')



    def fc_layers(self):

        # fc1

        with tf.name_scope('fc1') as scope:

            shape = int(np.prod(self.pool5.get_shape()[1:]))

            fc1w = tf.Variable(tf.truncated_normal([shape, 4096],

                                                         dtype=tf.float32,

                                                         stddev=1e-1), name='weights')

            fc1b = tf.Variable(tf.constant(1.0, shape=[4096], dtype=tf.float32),

                                 trainable=True, name='biases')

            pool5_flat = tf.reshape(self.pool5, [-1, shape])

            fc1l = tf.nn.bias_add(tf.matmul(pool5_flat, fc1w), fc1b)

            self.fc1 = tf.nn.relu(fc1l)

            self.parameters += [fc1w, fc1b]



        # fc2

        with tf.name_scope('fc2') as scope:

            fc2w = tf.Variable(tf.truncated_normal([4096, 4096],

                                                         dtype=tf.float32,

                                                         stddev=1e-1), name='weights')

            fc2b = tf.Variable(tf.constant(1.0, shape=[4096], dtype=tf.float32),

                                 trainable=True, name='biases')

            fc2l = tf.nn.bias_add(tf.matmul(self.fc1, fc2w), fc2b)

            self.fc2 = tf.nn.relu(fc2l)

            self.parameters += [fc2w, fc2b]



        # fc3

        with tf.name_scope('fc3') as scope:

            fc3w = tf.Variable(tf.truncated_normal([4096, 1000],

                                                         dtype=tf.float32,

                                                         stddev=1e-1), name='weights')

            fc3b = tf.Variable(tf.constant(1.0, shape=[1000], dtype=tf.float32),

                                 trainable=True, name='biases')

            self.fc3l = tf.nn.bias_add(tf.matmul(self.fc2, fc3w), fc3b)

            self.parameters += [fc3w, fc3b]



    def load_weights(self, weight_file, sess):

        weights = np.load(weight_file)

        keys = sorted(weights.keys())

        for i, k in enumerate(keys):

            print(i, k, np.shape(weights[k]))

            sess.run(self.parameters[i].assign(weights[k]))



if __name__ == '__main__':

    sess = tf.Session()

    imgs = tf.placeholder(tf.float32, [None, 224, 224, 3])

    vgg = vgg16(imgs, 'vgg16_weights.npz', sess)



    img1 = imread('m1.jpg', mode='RGB')

    img1 = imresize(img1, (224, 224))



    prob = sess.run(vgg.probs, feed_dict={vgg.imgs: [img1]})[0]

    preds = (np.argsort(prob)[::-1])[0:5]

    for p in preds:

        print (class_names[p], prob[p])

結果展示：

關於VGG16具體實現見鏈接：https://blog.csdn.net/errors_in_life/article/details/65950699

另一詳細介紹見：http://www.360doc.com/content/17/0624/10/10408243_666125650.shtml

《TensorFlow實例之 VGG》

這個網絡爬蟲代碼，拿到數據之後如何存到csv文件中去？

即刻放大鏡。跟隨鼠標，屏幕任意位置放大

【面試準備】【SQL】數據庫有哪些約束？

.NET開源強大、易於使用的緩存框架 - FusionCache

BGE M3-Embedding 模型介紹

面試，有時候是個運氣活

Pytorch實例----NLP之文本分類

統計學習方法——第1章統計學習方法概論

統計學習方法——第2章感知機模型

Pytorch學習基礎——LSTM從訓練到測試

統計學習方法——第2章感知機模型編程實現

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結