Design of a low-power coprocessor for mid-size vocabulary speech recognition systems

Peng Li, Hua Tang

Research output: Contribution to journalArticlepeer-review

9 Scopus citations

Abstract

Speech recognition systems have gained popularity in consumer electronics. This paper presents a custom-designed coprocessor for output probability calculation (OPC), which is the most computation-intensive processing step in continuous hidden Markov model (CHMM)-based speech recognition algorithms. To save hardware resource and reduce power consumption, a polynomial addition-based method is used to compute add-log instead of the traditional look-up table-based method. In addition, the optimal tradeoff between speech processing delay, energy consumption, and hardware resources is explored for the coprocessor. The proposed coprocessor has been implemented and tested in Xilinx Spartan-3A DSP XC3SD3400A, and also validated using the standard-cell-based approach in IBM μm technology. To implement an entire speech recognition system, SAMSUNG S3C44b0X (containing an ARM7) is used as the micro-controller to execute the rest of speech processing. Tested with a 358-state 3-mixture 27-feature 800-word HMM, S3C44b0X operates at 40 MHz and coprocessor at 10 MHz to meet the real-time requirement, and the recognition accuracy is 95.2%. Power consumption of the micro-controller is 10 mW, and that of the coprocessor 15.2 mW. The overall speech recognition system achieves the lowest energy consumption per word recognition among many reported designs. Experiment and analysis show that the speech recognition system based on the proposed coprocessor is especially suitable for mid-size vocabulary (1001000 words) recognition tasks.

Original languageEnglish (US)
Article number5658175
Pages (from-to)961-970
Number of pages10
JournalIEEE Transactions on Circuits and Systems I: Regular Papers
Volume58
Issue number5
DOIs
StatePublished - Jan 1 2011

Keywords

  • Coprocessor
  • VLSI
  • custom design
  • field-programmable gate array (FPGA)
  • hardware implementation
  • hidden Markov model (HMM)
  • speech recognition

Fingerprint Dive into the research topics of 'Design of a low-power coprocessor for mid-size vocabulary speech recognition systems'. Together they form a unique fingerprint.

Cite this