Python學習案例之Web版語音合成播報

前言語音合成技術能將用戶輸入的文字，轉換成流暢自然的語音輸出，並且可以支持語速、音調、音量設置，打破傳統文字式人機交互的方式，讓人機溝通更自然。應用場景將游戲場景中的公告、任務或派單信息通過語音播報，讓玩家玩游戲或配送員送貨的同時，也可接聽新任務。文學小說類軟體，可以利用百度語音合成技術將文 ...

前言

語音合成技術能將用戶輸入的文字，轉換成流暢自然的語音輸出，並且可以支持語速、音調、音量設置，打破傳統文字式人機交互的方式，讓人機溝通更自然。

應用場景

將游戲場景中的公告、任務或派單信息通過語音播報，讓玩家玩游戲或配送員送貨的同時，也可接聽新任務。

文學小說類軟體，可以利用百度語音合成技術將文學小說作品進行高質量的朗讀，流暢清晰，解放雙眼，暢聽世界。

軟體架構

Python3.7.2、Django2.1.7、baidu-aip(百度語音API)

案例

這裡只展示部分代碼，有興趣的同學可以自行下載源碼安裝調試。

import os
import time
import codecs
from aip import AipSpeech
from django.shortcuts import render
from django.http import HttpResponse


'''
pip install --upgrade pip
pip install django
pip install baidu-aip
'''


def main(request):
    return render(request, 'index.html')


def m_main(request):
    return render(request, 'm_index.html')


def convert(request):
    message = request.POST.get("message")
    switch = request.POST.get("switch")
    mp3 = du_say(message, switch)
    return HttpResponse(mp3)


def du_say(message, switch):
    write_txt(message)
    app_id = '*****'
    api_key = '*****'
    secret_key = '*****'
    client = AipSpeech(app_id, api_key, secret_key)
    if switch == "true":
        switch = 3
    else:
        switch = 4
    result = client.synthesis(message, 'zh', 1, {
        'vol': 5, 'per': switch,
    })
    t = time.time()
    now_time = lambda: int(round(t * 1000))
    path = os.getcwd() + os.path.sep + "static" + os.path.sep + "audio"+os.path.sep
    audio = path+str(now_time())+'.mp3'
    # 識別正確返回語音二進位 錯誤則返回dict 參照下麵錯誤碼
    if not isinstance(result, dict):
        with open(audio, 'wb') as f:
            f.write(result)
    return str(now_time())+'.mp3'


def write_txt(message):
    t = time.time()
    now_time = lambda: int(round(t * 1000))
    path = os.getcwd() + os.path.sep + "static" + os.path.sep + "text"+os.path.sep
    text = path+str(now_time())+'.txt'
    with codecs.open(text, 'a', encoding='utf8')as f:
        f.write(message)

本地部署

從碼雲拉取項目到本地：

https://gitee.com/52itstyle/baidu-speech.git

配置百度語音API：

# 自行註冊申請
https://console.bce.baidu.com/ai/#/ai/speech/app/list

啟動項目：

# 切換到項目根目錄，執行
manage.py runserver

外網部署

這裡以Linux為例，代理使用 openresty。

安裝 Python3

wget https://www.python.org/ftp/python/3.7.1/Python-3.7.1.tar.xz

事先安裝依賴，否則後期安裝會報錯：

yum -y install zlib*
yum -y install libffi-devel

下麵開始正式安裝：

# 解壓
tar -xvf Python-3.7.1.tar.xz
# 切換大目錄
cd Python-3.7.1
# 配置編譯
./configure
# 編譯安裝
make && make install

安裝 Django

pip install Django

安裝成功以後需要重新配置並編譯安裝 Python3：

# 配置編譯
./configure
# 編譯安裝
make && make install

安裝伺服器 uwsgi

pip3 install uwsgi

上傳項目到伺服器，並切換到 speech 目錄：

# 目錄下新建文件夾
mkdir script

在 script 下新增 uwsgi.ini (項目中已經配置好，自行修改路徑即可)：

# uwsig使用配置文件啟動
[uwsgi]
# 項目目錄
chdir=/www/speech/
# 指定項目的application
module=speech.wsgi:application
# 指定sock的文件路徑       
socket=/www/speech/script/uwsgi.sock
# 進程個數       
workers=5
pidfile=/www/speech/script/uwsgi.pid
# 指定IP埠       
http=127.0.0.1:8001
# 指定靜態文件
static-map=/static=/www/speech/static
# 啟動uwsgi的用戶名和用戶組
uid=root
gid=root
# 啟用主進程
master=true
# 自動移除unix Socket和pid文件當服務停止的時候
vacuum=true
# 序列化接受的內容，如果可能的話
thunder-lock=true
# 啟用線程
enable-threads=true
# 設置自中斷時間
harakiri=30
# 設置緩衝
post-buffering=4096
# 設置日誌目錄
daemonize=/www/speech/script/uwsgi.log

然後使用以下命令啟動：

uwsgi  --ini uwsgi.ini

執行命令，查看是否啟動成功：

[root@AY140216131049Z script]# ps -ef|grep uwsgi  
root      3040     1  0 Nov21 ?        00:00:03 uwsgi --ini uwsgi.ini
root      3041  3040  0 Nov21 ?        00:00:00 uwsgi --ini uwsgi.ini
root      3042  3040  0 Nov21 ?        00:00:00 uwsgi --ini uwsgi.ini
root      3043  3040  0 Nov21 ?        00:00:00 uwsgi --ini uwsgi.ini
root      3044  3040  0 Nov21 ?        00:00:00 uwsgi --ini uwsgi.ini
root      3045  3040  0 Nov21 ?        00:00:00 uwsgi --ini uwsgi.ini
root      3046  3040  0 Nov21 ?        00:00:00 uwsgi --ini uwsgi.ini
root      6606  6580  0 18:13 pts/0    00:00:00 grep --color=auto uwsgi

重啟：

uwsgi --reload uwsgi.pid

配置Nginx代理：

server {
        listen  80;
        server_name  speech.52itstyle.vip;
        charset utf-8;
        location / {
           include uwsgi_params; # 導入一個Nginx模塊他是用來和uWSGI進行通訊的
           uwsgi_connect_timeout 30; # 設置連接uWSGI超時時間
           uwsgi_pass unix:/www/speech/script/uwsgi.sock; # 指定uwsgi的sock文件所有動態請求就會直接丟給他
        }
        # 動靜分離 Nginx 處理靜態請求
        location /static {
            root /www/speech/;
        }
   }

如果啟動HTTPS：

server {
         listen 80;
         listen 443 ssl;
         server_name  speech.52itstyle.vip;
         #ssl on;
         #證書路徑
         ssl_certificate    /usr/local/openresty/nginx/cert/1901523_speech.52itstyle.vip.pem;
         #私鑰路徑
         ssl_certificate_key   /usr/local/openresty/nginx/cert/1901523_speech.52itstyle.vip.key;
         #緩存有效期
         ssl_session_timeout 5m;
         #可選的加密演算法,順序很重要,越靠前的優先順序越高.
         ssl_ciphers ECDHE-RSA-AES128-GCM-SHA256:ECDHE:ECDH:AES:HIGH:!NULL:!aNULL:!MD5:!ADH:!RC4;
         #安全鏈接可選的加密協議
         ssl_protocols TLSv1 TLSv1.1 TLSv1.2;
         ssl_prefer_server_ciphers on;

        location / {
           include uwsgi_params; # 導入一個Nginx模塊他是用來和uWSGI進行通訊的
           uwsgi_connect_timeout 30; # 設置連接uWSGI超時時間
           uwsgi_pass unix:/www/speech/script/uwsgi.sock; # 指定uwsgi的sock文件所有動態請求就會直接丟給他
        }
        # 動靜分離 Nginx 處理靜態請求
        location /static {
            root /www/speech/;
        }
   }