python語音識別whisper如何使用

發布時間：2023-02-27 15:04:20 來源：億速云閱讀：230 作者：iii 欄目：開發技術

這篇文章主要介紹了python語音識別whisper如何使用的相關知識，內容詳細易懂，操作簡單快捷，具有一定借鑒價值，相信大家閱讀完這篇python語音識別whisper如何使用文章都會有所收獲，下面我們一起來看看吧。

whisper語音識別

Whisper 是一種通用的語音識別模型。它在不同音頻的大型數據集上進行訓練，也是一個多任務模型，可以執行多語言語音識別以及語音翻譯和語言識別。
stable-ts在 OpenAI 的 Whisper 之上修改并添加了更大的破解代碼發布，生成更準確的階段時間切換，并在無須額外推介的情況下獲得申領

安裝

pip install openai-whisper 
pip install stable-ts

Size	Parameters	English-only model	Multilingual model	Required VRAM	Relative speed
tiny	39 M	tiny.en	tiny	~1 GB	~32x
base	74 M	base.en	base	~1 GB	~16x
small	244 M	small.en	small	~2 GB	~6x
medium	769 M	medium.en	medium	~5 GB	~2x
large	1550 M	N/A	large	~10 GB	1x

示例

模型越大，越精確，相應話費的時間越長
自帶語言識別功能，language最好加上，下面歌曲識別為英語，加后為中文
stable_whisper 是 whisper 進化版

import whisper
import stable_whisper as whisper

class WhisperTranscriber(object):

    def __init__(self, model_name):
        self.model = whisper.load_model(model_name)

    def whisper_transcribe(self, audio_path):
        audio = self.model.transcribe(audio_path, fp16=False, language='Chinese')
        return audio['text']

if __name__ == '__main__':

    transcriber = WhisperTranscriber("base")
    text = transcriber.whisper_transcribe("257853511.mp3")
    print(text)

python語音識別whisper如何使用

可能是伴奏聲音過大，你才出來這是什么歌了嗎？stable_whisper 別的用法、生成字幕

import stable_whisper
model = stable_whisper.load_model('base')
results = model.transcribe('257853511.mp3', fp16=False, language='Chinese')
stable_whisper.results_to_sentence_srt(results, 'audio')
stable_whisper.results_to_sentence_word_ass(results, 'audio.ass')

封裝工具

buzz

如果遇到簡繁轉換可以石下面

pip install zhconv

zh-cn 大陸簡體
zh-hant 繁體

from zhconv import convert     
convert('Python是一種動態的、面向對象的腳本語言', 'zh-hant')
'Python是一種動態的、面向對象的腳本語言'

關于“python語音識別whisper如何使用”這篇文章的內容就介紹到這里，感謝各位的閱讀！相信大家對“python語音識別whisper如何使用”知識都有一定的了解，大家如果還想學習更多知識，歡迎關注億速云行業資訊頻道。

向AI問一下細節

91超碰碰碰碰久久久久久综合_超碰av人澡人澡人澡人澡人掠_国产黄大片在线观看画质优化_txt小说免费全本

python語音識別whisper如何使用

whisper語音識別

示例

封裝工具

猜你喜歡

91超碰碰碰碰久久久久久综合_超碰av人澡人澡人澡人澡人掠_国产黄大片在线观看画质优化_txt小说免费全本

python語音識別whisper如何使用

whisper語音識別

示例

封裝工具

猜你喜歡

最新資訊

相關推薦

相關標簽