Based on STM32 of CNN Speech Keyword Command Recognition System

Wenbo KUANG; Weiping LUO

doi:10.15878/j.cnki.instrumentation.2023.01.003

Articles

Vol. 10 No. 1 (2023)

Based on STM32 of CNN Speech Keyword Command Recognition System

Wenbo KUANG
Weiping LUO

PDF

DOI: https://doi.org/10.15878/j.cnki.instrumentation.2023.01.003
Submitted: November 26, 2023
Published: 2023-03-10

Abstract

Speech recognition is a hot topic in the field of artificial intelligence. Generally, speech recognition models can only run on large servers or dedicated chips. This paper presents a keyword speech recognition system based on a neural network and a conventional STM32 chip. To address the limited Flash and ROM resources on the STM32 MCU chip, the deployment of the speech recognition model is optimized to meet the requirements of keyword recognition. Firstly, the audio information obtained through sensors is subjected to MFCC (Mel Fre-quency Cepstral Coefficient) feature extraction, and the extracted MFCC features are input into a CNN (Convolutional Neural Network) for deep feature extraction. Then, the features are input into a fully connected layer, and finally, the speech keyword is classified and predicted. Deploying the model to the STM32F429, the prediction model achieves an accuracy of 90.58%, a decrease of less than 1% compared to the accuracy of 91.49% running on a computer, with good performance.

Downloads

Download data is not yet available.

How to Cite

KUANG , W., & LUO , W. (2023). Based on STM32 of CNN Speech Keyword Command Recognition System. Instrumentation, 10(1). https://doi.org/10.15878/j.cnki.instrumentation.2023.01.003

This work is licensed under a Creative Commons Attribution 4.0 International License.