TensorFlow學(xué)習(xí)筆記（4）：基于MNIST數(shù)據(jù)的softmax regression

ACb0y 發(fā)布于2019-07-25 11:22 / 2214人閱讀

摘要：前言本文基于官網(wǎng)的寫成。輸入數(shù)據(jù)是，全稱是，是一組由這個機構(gòu)搜集的手寫數(shù)字掃描文件和每個文件對應(yīng)標簽的數(shù)據(jù)集，經(jīng)過一定的修改使其適合機器學(xué)習(xí)算法讀取。這個數(shù)據(jù)集可以從牛的不行的教授的網(wǎng)站獲取。

前言

本文基于TensorFlow官網(wǎng)的Tutorial寫成。輸入數(shù)據(jù)是MNIST，全稱是Modified National Institute of Standards and Technology，是一組由這個機構(gòu)搜集的手寫數(shù)字掃描文件和每個文件對應(yīng)標簽的數(shù)據(jù)集，經(jīng)過一定的修改使其適合機器學(xué)習(xí)算法讀取。這個數(shù)據(jù)集可以從牛的不行的Yann LeCun教授的網(wǎng)站獲取。

本文首先使用sklearn的LogisticRegression()進行訓(xùn)練，得到的參數(shù)繪制效果如下（紅色表示參數(shù)估計結(jié)果為負，藍色表示參數(shù)估計結(jié)果為正，綠色代表參數(shù)估計結(jié)果為零）：

從圖形效果看，我們發(fā)現(xiàn)藍色點組成的輪廓與對應(yīng)的數(shù)字輪廓還是比較接近的。

然后本文使用tensorflow對同樣的數(shù)據(jù)集進行了softmax regression的訓(xùn)練，得到的參數(shù)繪制效果如下：

藍色點組成的輪廓與對應(yīng)的數(shù)字輪廓比較接近。但是對比上下兩幅截圖，感覺tensorflow的效果更平滑一些。不過從測試集的準確率來看，二者都在92%左右，sklearn稍微好一點。注意，92%的準確率看起來不錯，但其實是一個很低的準確率，按照官網(wǎng)教程的說法，應(yīng)該要感到羞愧。

代碼

#!/usr/bin/env python
# -*- coding=utf-8 -*-
# @author: 陳水平
# @date: 2017-01-10
# @description: implement a softmax regression model upon MNIST handwritten digits
# @ref: http://yann.lecun.com/exdb/mnist/

import gzip
import struct
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn import preprocessing
from sklearn.metrics import accuracy_score
import tensorflow as tf

# MNIST data is stored in binary format, 
# and we transform them into numpy ndarray objects by the following two utility functions
def read_image(file_name):
    with gzip.open(file_name, "rb") as f:
        buf = f.read()
        index = 0
        magic, images, rows, columns = struct.unpack_from(">IIII" , buf , index)
        index += struct.calcsize(">IIII")

        image_size = ">" + str(images*rows*columns) + "B"
        ims = struct.unpack_from(image_size, buf, index)
        
        im_array = np.array(ims).reshape(images, rows, columns)
        return im_array

def read_label(file_name):
    with gzip.open(file_name, "rb") as f:
        buf = f.read()
        index = 0
        magic, labels = struct.unpack_from(">II", buf, index)
        index += struct.calcsize(">II")
        
        label_size = ">" + str(labels) + "B"
        labels = struct.unpack_from(label_size, buf, index)

        label_array = np.array(labels)
        return label_array

print "Start processing MNIST handwritten digits data..."
train_x_data = read_image("MNIST_data/train-images-idx3-ubyte.gz")
train_x_data = train_x_data.reshape(train_x_data.shape[0], -1).astype(np.float32)
train_y_data = read_label("MNIST_data/train-labels-idx1-ubyte.gz")
test_x_data = read_image("MNIST_data/t10k-images-idx3-ubyte.gz")
test_x_data = test_x_data.reshape(test_x_data.shape[0], -1).astype(np.float32)
test_y_data = read_label("MNIST_data/t10k-labels-idx1-ubyte.gz")

train_x_minmax = train_x_data / 255.0
test_x_minmax = test_x_data / 255.0

# Of course you can also use the utility function to read in MNIST provided by tensorflow
# from tensorflow.examples.tutorials.mnist import input_data
# mnist = input_data.read_data_sets("MNIST_data/", one_hot=False)
# train_x_minmax = mnist.train.images
# train_y_data = mnist.train.labels
# test_x_minmax = mnist.test.images
# test_y_data = mnist.test.labels

# We evaluate the softmax regression model by sklearn first
eval_sklearn = False
if eval_sklearn:
    print "Start evaluating softmax regression model by sklearn..."
    reg = LogisticRegression(solver="lbfgs", multi_class="multinomial")
    reg.fit(train_x_minmax, train_y_data)
    np.savetxt("coef_softmax_sklearn.txt", reg.coef_, fmt="%.6f")  # Save coefficients to a text file
    test_y_predict = reg.predict(test_x_minmax)
    print "Accuracy of test set: %f" % accuracy_score(test_y_data, test_y_predict)

eval_tensorflow = True
batch_gradient = False
if eval_tensorflow:
    print "Start evaluating softmax regression model by tensorflow..."
    # reformat y into one-hot encoding style
    lb = preprocessing.LabelBinarizer()
    lb.fit(train_y_data)
    train_y_data_trans = lb.transform(train_y_data)
    test_y_data_trans = lb.transform(test_y_data)

    x = tf.placeholder(tf.float32, [None, 784])
    W = tf.Variable(tf.zeros([784, 10]))
    b = tf.Variable(tf.zeros([10]))
    V = tf.matmul(x, W) + b
    y = tf.nn.softmax(V)

    y_ = tf.placeholder(tf.float32, [None, 10])

    loss = tf.reduce_mean(-tf.reduce_sum(y_ * tf.log(y), reduction_indices=[1]))
    optimizer = tf.train.GradientDescentOptimizer(0.5)
    train = optimizer.minimize(loss)

    init = tf.initialize_all_variables()

    sess = tf.Session()
    sess.run(init)

    if batch_gradient:
        for step in range(300):
            sess.run(train, feed_dict={x: train_x_minmax, y_: train_y_data_trans})
            if step % 10 == 0:
                print "Batch Gradient Descent processing step %d" % step
        print "Finally we got the estimated results, take such a long time..."
    else:
        for step in range(1000):
            sample_index = np.random.choice(train_x_minmax.shape[0], 100)
            batch_xs = train_x_minmax[sample_index, :]
            batch_ys = train_y_data_trans[sample_index, :]
            sess.run(train, feed_dict={x: batch_xs, y_: batch_ys})
            if step % 100 == 0:
                print "Stochastic Gradient Descent processing step %d" % step
    np.savetxt("coef_softmax_tf.txt", np.transpose(sess.run(W)), fmt="%.6f")  # Save coefficients to a text file
    correct_prediction = tf.equal(tf.argmax(y, 1), tf.argmax(y_, 1))
    accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))
    print "Accuracy of test set: %f" % sess.run(accuracy, feed_dict={x: test_x_minmax, y_: test_y_data_trans})

輸出如下：

Start processing MNIST handwritten digits data...
Start evaluating softmax regression model by sklearn...
Accuracy of test set: 0.926300
Start evaluating softmax regression model by tensorflow...
Stochastic Gradient Descent processing step 0
Stochastic Gradient Descent processing step 100
Stochastic Gradient Descent processing step 200
Stochastic Gradient Descent processing step 300
Stochastic Gradient Descent processing step 400
Stochastic Gradient Descent processing step 500
Stochastic Gradient Descent processing step 600
Stochastic Gradient Descent processing step 700
Stochastic Gradient Descent processing step 800
Stochastic Gradient Descent processing step 900
Accuracy of test set: 0.917400

思考

sklearn的估計時間有點長，因為每一輪參數(shù)更新都是基于全量的訓(xùn)練集數(shù)據(jù)算出損失，再算出梯度，然后再改進結(jié)果的。

tensorflow采用batch gradient descent估計算法時，時間也比較長，原因同上。

tensorflow采用stochastic gradient descent估計算法時間短，最后的估計結(jié)果也挺好，相當(dāng)于每輪迭代只用到了部分數(shù)據(jù)集算出損失和梯度，速度變快，但可能bias增加；所以把迭代次數(shù)增多，這樣可以降低variance，總體上的誤差相比batch gradient descent并沒有差多少。

附錄

參數(shù)效果的繪圖采用R實現(xiàn)，示例代碼如下：

library(dplyr)
library(tidyr)
library(ggplot2)

t <- read.table("coef_softmax_tf.txt")

n <- t %>% 
  tibble::rownames_to_column("digit") %>%
  gather(var_name, var_value, -digit) %>%
  mutate(var_name=stringr::str_sub(var_name, 2))
n$var_name <- as.numeric(n$var_name)
n$digit <- as.numeric(n$digit)
n <- n %>% 
  mutate(digit=digit-1, var_name=var_name-1, y=28 - floor(var_name/28), x=var_name %% 28, v=ifelse(var_value>0, 1, ifelse(var_value<0, -1, 0)))

ggplot(n) + geom_point(aes(x=x,y=y,color=as.factor(v))) + facet_wrap(~digit)

GPU云服務(wù)器云服務(wù)器學(xué)習(xí)tensorflow的 tensorflow的學(xué)習(xí) 基于機器學(xué)習(xí)的基于深度學(xué)習(xí)的語音增強

文章版權(quán)歸作者所有，未經(jīng)允許請勿轉(zhuǎn)載,若此文章存在違規(guī)行為，您可以聯(lián)系管理員刪除。

轉(zhuǎn)載請注明本文地址：http://systransis.cn/yun/38362.html

發(fā)表評論

登陸后可評論

0條評論

ACb0y

男|高級講師

我要關(guān)注我要私信

TA的文章

短信發(fā)送平臺的推廣技巧有哪些？3個小技巧要記牢！

閱讀 938·2021-11-22 13:53
Hostigger：2021年黑色星期五促銷Black Friday Discounts開啟，VPS

閱讀 2561·2021-10-15 09:40
沙利文發(fā)布首個2021去中心化云計算市場趨勢概覽安邁云布局切中未來趨勢_云資訊

閱讀 1044·2021-10-14 09:42
網(wǎng)絡(luò)地址和主機地址怎么算-怎么算網(wǎng)絡(luò)地址,主機地址,廣播地址？

閱讀 3679·2021-09-22 15:59
富士通數(shù)據(jù)在暗網(wǎng)出售犯罪團伙是如何“運作”的?

閱讀 924·2021-09-02 09:47
CSS 提示工具(Tooltip)

閱讀 2464·2019-08-30 15:54
淺析CSS定位

閱讀 1473·2019-08-29 17:14
使用 Nginx 編譯 Sass 和 Scss

閱讀 432·2019-08-29 15:15

成人国产在线小视频_日韩寡妇人妻调教在线播放_色成人www永久在线观看_2018国产精品久久_亚洲欧美高清在线30p_亚洲少妇综合一区_黄色在线播放国产_亚洲另类技巧小说校园_国产主播xx日韩_a级毛片在线免费

資訊專欄INFORMATION COLUMN

上云采購季！| 2核2G4M爆款云服務(wù)器低至59元/年，更有多臺、長期優(yōu)惠，快來選購！

TensorFlow學(xué)習(xí)筆記（4）：基于MNIST數(shù)據(jù)的softmax regression

相關(guān)文章

TensorFlow學(xué)習(xí)筆記（7）：TensorBoard——Tensor與Graph可視化

TensorFlow學(xué)習(xí)筆記（8）：基于MNIST數(shù)據(jù)的循環(huán)神經(jīng)網(wǎng)絡(luò)RNN

TensorFlow學(xué)習(xí)筆記（5）：基于MNIST數(shù)據(jù)的卷積神經(jīng)網(wǎng)絡(luò)CNN

發(fā)表評論

0條評論

ACb0y

男|高級講師

TA的文章

短信發(fā)送平臺的推廣技巧有哪些？3個小技巧要記牢！

Hostigger：2021年黑色星期五促銷Black Friday Discounts開啟，VPS

沙利文發(fā)布首個2021去中心化云計算市場趨勢概覽安邁云布局切中未來趨勢_云資訊

網(wǎng)絡(luò)地址和主機地址怎么算-怎么算網(wǎng)絡(luò)地址,主機地址,廣播地址？

富士通數(shù)據(jù)在暗網(wǎng)出售犯罪團伙是如何“運作”的?

CSS 提示工具(Tooltip)

淺析CSS定位

使用 Nginx 編譯 Sass 和 Scss

最新活動

資訊專欄INFORMATION COLUMN

上云采購季！| 2核2G4M爆款云服務(wù)器低至59元/年，更有多臺、長期優(yōu)惠，快來選購！

TensorFlow學(xué)習(xí)筆記（4）：基于MNIST數(shù)據(jù)的softmax regression

相關(guān)文章

發(fā)表評論

0條評論

男|高級講師

TA的文章

最新活動

上云采購季！| 2核2G4M爆款云服務(wù)器低至59元/年，更有多臺、長期優(yōu)惠，快來選購！