圖像識別的前期工作——使用pillow進行圖像處理

-Advertisement-

本文主要介紹使用pillow對圖像進行簡單處理，進而引出圖像處理與手寫識別的關係。 ...

　　pillow是個很好用的python圖像處理庫，可以到官方網站下載最新的文件。如果官網的任何PIL版本都不能與自己的python版本對應，或安裝成功後發現運行出錯，可以嘗試從一個非官方的whl網站下載：http://www.lfd.uci.edu/~gohlke/pythonlibs/#scipy 這個網站的內容相當豐富，而且版本齊全。

打開圖片

from PIL import Image
import matplotlib.pyplot as plt

img = Image.open('girl.png')
img.show()

　　控制台顯示：size=(461, 603), mode=RGBA, format=PNG

代碼很簡單，但PIL使用操作系統的預設方式打開圖片，我們需要用一些更牛叉的方式打開：

1 from PIL import Image
2 import matplotlib.pyplot as plt
3 
4 img = Image.open('girl0.png')
5 model = img.convert('L')
6 plt.figure("girl")
7 #the argument comp is Colormap
8 plt.imshow(model, cmap='pink')
9 plt.show()

　　其中img.convert指定一種色彩模式：

1 (1-bit pixels, black and white, stored with one pixel per byte)
L (8-bit pixels, black and white)
P (8-bit pixels, mapped to any other mode using a colour palette)
RGB (3x8-bit pixels, true colour)
RGBA (4x8-bit pixels, true colour with transparency mask)
CMYK (4x8-bit pixels, colour separation)
YCbCr (3x8-bit pixels, colour video format)
I (32-bit signed integer pixels)
F (32-bit floating point pixels)

分離rgba

　　rgb指紅綠藍光色三原色，a指alpha通道，一般用作不透明度參數

img = Image.open('girl0.png')
# 分離rgba
r, g, b, a = img.split()  
plt.figure("girl0")
plt.imshow(r)
plt.show()

需要註意的是，並非所有圖片都有alpha通道，此時 img.split()僅能返回r,g,b

顯示多個圖片

from PIL import Image
import matplotlib.pyplot as plt

img = Image.open('girl0.png')
gray = img.convert('L')
# 分離rgba
r, g, b, a = img.split()  
plt.figure("girl")

def setPlot(num, title):
    #subplot(nrows, ncols, plot_number)
    #圖表的整個繪圖區域被等分為numRows行和numCols列，然後按照從左到右、從上到下的順序對每個區域進行編號，左上區域的編號為1
    plt.subplot(2, 3, num)
    plt.title(title)
    plt.axis('off')
    
setPlot(1, 'origin')
plt.imshow(img)

setPlot(2, 'gray')
plt.imshow(gray, cmap='gray')

setPlot(3, 'rgba')
# 合併rgba
plt.imshow(Image.merge('RGBA', (r, g, b, a)))

setPlot(4, 'r')
plt.imshow(r)
  
setPlot(5, 'g')
plt.imshow(g)

setPlot(6, 'b')
plt.imshow(b)

二值化處理

到了關鍵時刻

from PIL import Image
import matplotlib.pyplot as plt

#二值化處理
img = Image.open('girl0.png')
gray = img.convert('L')

WHITE, BLACK = 1, 0
img_new = gray.point(lambda x: WHITE if x > 128 else BLACK)
plt.imshow(img_new, cmap='gray')
plt.show()

　　圖片由像素組成，每個像素對應著rgb值，整個圖片可以看成一個矩陣。我們將大於128的像素點轉換為1，其它轉換為0。如果有一張背景色是彩色的手寫文字，經過二值化處理後得到這樣的圖片：

圖片壓縮

如果圖片大小不一，不利於下一步工作，在此需要將圖片壓縮成統一大小，對於手寫數字，可將其壓縮為32*32

 1 #等比例壓縮圖片
 2 #參考 http://fc-lamp.blog.163.com/blog/static/174566687201282424018946/
 3 def resizeImg(**args):
 4     #dst_w,dst_h  目標圖片大小,  save_q  圖片質量
 5     args_key = {'ori_img':'', 'dst_img':'', 'dst_w':'', 'dst_h':'', 'save_q':75}
 6     arg = {}
 7     for key in args_key:
 8         if key in args:
 9             arg[key] = args[key]
10         
11     im = Image.open(arg['ori_img'])
12     ori_w, ori_h = im.size
13     widthRatio = heightRatio = None
14     ratio = 1
15     if (ori_w and ori_w > arg['dst_w']) or (ori_h and ori_h > arg['dst_h']):
16         if arg['dst_w'] and ori_w > arg['dst_w']:
17             widthRatio = float(arg['dst_w']) / ori_w
18         if arg['dst_h'] and ori_h > arg['dst_h']:
19             heightRatio = float(arg['dst_h']) / ori_h
20 
21         if widthRatio and heightRatio:
22             if widthRatio < heightRatio:
23                 ratio = widthRatio
24             else:
25                 ratio = heightRatio
26 
27         if widthRatio and not heightRatio:
28             ratio = widthRatio
29         if heightRatio and not widthRatio:
30             ratio = heightRatio
31             
32         newWidth = int(ori_w * ratio)
33         newHeight = int(ori_h * ratio)
34     else:
35         newWidth = ori_w
36         newHeight = ori_h
37     
38     im.resize((newWidth, newHeight), Image.ANTIALIAS).save(arg['dst_img'], quality=arg['save_q'])

　　可以將二值化處理後的圖片列印出來

 1 resizeImg(ori_img='7.jpg', dst_img='7_1.jpg', dst_w=32, dst_h=32, save_q=60)
 2 
 3 #二值化處理
 4 img = Image.open('7_1.jpg')
 5 gray = img.convert('L')
 6 
 7 WHITE, BLACK = 1, 0
 8 img_new = gray.point(lambda x: WHITE if x > 128 else BLACK)
 9 arr = nmp.array(img_new)
10 
11 for i in range(arr.shape[0]):
12     print(arr[i].flatten())

　　於是手寫數字變成了這樣：

　　這就好玩了。其基本思路是將多維特征轉換為容易識別的二維特征，使用KNN或神經網路等方法進行學習，從而使電腦識別出正確的數字。後續文章將會介紹如何設別。

參考文獻：

http://fc-lamp.blog.163.com/blog/static/174566687201282424018946

作者：我是8位的

出處：http://www.cnblogs.com/bigmonkey

本文以學習、研究和分享為主，如需轉載，請聯繫本人，標明作者和出處，非商業用途！

您的分享是我們最大的動力!

-Advertisement-

更多相關文章

Build 2017 | 今兒來說說火得不行的認知服務吧（內附微軟開發者大會線上峰會報名地址）

認知服務是一種操作簡單、功能強大的 REST API，只需在應用中加入幾行代碼，就可以藉助強大的演算法開發應用程式。這些功能可跨設備、跨平臺，無論 iOS、Android 或 Windows，都能輕鬆實現。 ...
只為更快、更省、更安全的 Azure CDN

CDN 服務想必大家都不陌生，搞網站的，開發應用的，少不了都要用到。通過將內容緩存在各地的 CDN 節點，讓身處不同地區，或使用不同網路運營商的用戶都可以就近獲取內容，獲得快速流暢的訪問體驗。 ...
“雲贊獎”投票結果出爐！快來看看你和你的心中所屬是否獲獎了！

首先在此感謝所有參與互動的小伙伴，感謝大家對“雲贊獎”活動的支持。本次“雲贊獎”項目大賽，共有 11 個項目參賽，讓粉絲們大喊過癮！上千名粉絲參與投票互動，讓這次大賽的[參賽者]倍感興奮！經過大家幾天的激烈角逐，“雲贊獎”活動圓滿落下了帷幕，最終有三個項目脫穎而出 ...
Azure 5 月新公佈（二）

Azure 5 月新發佈（二）：CDN 圖片處理功能, CDN Restful API, 新版 CDN 管理門戶, 計量名稱變更延期 ...
雲計算安全合規認證哪家強？

由世紀互聯獨立運營的 Microsoft Azure 和 Office 365，作為首個落地中國市場的國際公有雲服務，在採用業界領先的微軟雲計算技術為客戶提供可信賴雲服務的同時，嚴格遵循國際和國內法律法規和標準規定，獲得多項權威認證，同時秉承安全性、隱私保護、合規性及透明度四項原則，為廣大用戶提供基... ...
MD5加密--Java

MD5 Message Digest Algorithm MD5（中文名為消息摘要演算法第五版）為電腦安全領域廣泛使用的一種散列函數，用以提供消息的完整性保護。該演算法的文件號為RFC 1321（R.Rivest,MIT Laboratory for Computer Science and RSA ...
python3高級編程

1. SMTP發送郵件 internet相關協議： http:網頁訪問相關，httplib,urllib,xmlrpclib ftp:文件傳輸相關, ftplib, urllib nntp:新聞和帖子相關, nntplib smtp:發送郵件相關, smtplib pop3:接收郵件相關, popl ...
一鍵上傳

import cn.XXXX.bos.utils.PinYin4jUtils; import org.apache.commons.lang3.StringUtils; @Action("areaAction_uploadFile") public String areaAction_uploadF... ...