當(dāng)前位置：首頁 > 编程语言 > python >内容正文

python

python手写字体程序_深度学习---手写字体识别程序分析（python）

發(fā)布時間：2024/3/12 python 50 豆豆

生活随笔收集整理的這篇文章主要介紹了 python手写字体程序_深度学习---手写字体识别程序分析（python）小編覺得挺不錯的,現(xiàn)在分享給大家,幫大家做個參考.

我想大部分程序員的第一個程序應(yīng)該都是“hello world”，在深度學(xué)習(xí)領(lǐng)域，這個“hello world”程序就是手寫字體識別程序。

這次我們詳細(xì)的分析下手寫字體識別程序，從而可以對深度學(xué)習(xí)建立一個基本的概念。

1.初始化權(quán)重和偏置矩陣，構(gòu)建神經(jīng)網(wǎng)絡(luò)的架構(gòu)

import numpy as np

class network():

def __init__(self, sizes):

self.num_layers = len(sizes)

self.sizes = sizes

self.biases = [ np.random.randn(y,1) for y in sizes[1:] ]

self.weights = [ np.random.randn(y,x) for x,y in zip(sizes(:-1), sizes(1:)) ]

在實例化一個神經(jīng)網(wǎng)絡(luò)時，去初始化權(quán)重和偏置的矩陣，例如

network0 = network([784, 30, 10])

可以初始化一個3層的神經(jīng)網(wǎng)絡(luò)，各層神經(jīng)元的個數(shù)分別為 784， 30 ， 10

2. 如何去反向傳播計算代價函數(shù)的梯度？

這個過程可以大概概括如下：

(1)正向傳播，獲得每個神經(jīng)元的帶權(quán)輸出和激活因子(a)

(2)計算輸出層的誤差

(3)反向傳播計算每一層的誤差和梯度

用python實現(xiàn)的代碼如下：

def backprop(self, x, y):

delta_w = [ np.zeros(w.shape) for w in self.weights]

delta_b = [ np.zeros(b.shape) ?for b in self.biases ]

#計算每個神經(jīng)元的帶權(quán)輸入z及激活值

zs = []

activation = x

activations = [x]

for b,w in zip(self.biases, self.weights):

z = np.dot(w, activation) + b

zs.append(z)

activation = sigmod(z)

activations.append(activation)

#計算輸出層誤差(這里采用的是二次代價函數(shù))

delta = (activations[-1] - y) * sigmod_prime(zs[-1])

delta_w[-1] = np.dot(delta, activations[-2].transpose())

delta_b[-1] = delta

#反向傳播

for l in xrange(2, self.num_layers):

delta = np.dot(delta_w[-l+1].transpose(),delta)*sigmod_prime(zs[-l])

delta_w[-l] = np.dot(delta, activations[-l-1].transpose())

delta_b[-l] = delta

return delta_w, delta_b

3.如何梯度下降，更新權(quán)重和偏置？

通過反向傳播獲得了更新權(quán)重和偏置的增量，進一步進行更新，梯度下降。

def update_mini_batch(self, mini_batch, eta):

delta_w = [ np.zeros(w.shape) for w in self.weights ]

delta_b = [ np.zeros(b.shape) for b in self.biases ]

for x,y in mini_batch:

(這里針對一個小批量內(nèi)所有樣本，應(yīng)用反向傳播，積累權(quán)重和偏置的變化)

delta_w_p, delta_b_p = self.backprop(x,y)

delta_w = [ dt_w + dt_w_p for dt_w,dt_w_p in zip(delta_w, delta_w_p)]

delta_b = [ dt_b + dt_b_p for?dt_b,dt_b_p in zip(delta_b, delta_b_p)]

self.weights = [ w-(eta/len(mini_batch)*nw) for w,nw in zip(self.weights, delta_w)]

self.biases = [ b-(eta/len(mini_batch)*nb) for b,nb in zip(self.biases, delta_b)]

def SGD(self, epochs, training_data, ?mini_batch_size,eta, test_data=None):

if test_data:

n_tests = len(tast_data)

n_training_data = len(training_data)

for i in xrange(0, epochs):

random.shuffle(training_data)

mini_batches = [ ?training_data[k:k+mini_batch_size]

for k in xrange(0, n_training_data, mini_batch_size)

]

for mini_batch in mini_batches:

self.update_mini_batch(mini_batch, eta)

總結(jié)

以上是生活随笔為你收集整理的python手写字体程序_深度学习---手写字体识别程序分析（python）的全部內(nèi)容，希望文章能夠幫你解決所遇到的問題。

如果覺得生活随笔網(wǎng)站內(nèi)容還不錯，歡迎將生活随笔推薦給好友。

上一篇： Java 快速深度克隆对象 [Fast
下一篇： python 爬取链家网北京租房信息

日韩av黄I国产麻豆传媒I国产91av视频在线观看I日韩一区二区三区在线看I美女国产在线I麻豆视频国产在线观看I成人黄色短片

python

python手写字体程序_深度学习---手写字体识别程序分析（python）

總結(jié)