Status Report #5: 10/26 (Spencer)

  • Audio transcription is too slow, so investigated a signal-processing-based approach to speed up the system.
  • Researched MFCCs (mel-frequency cepstral coefficients) and their significance for speech recognition.
  • Ran tests to check the speed of the MFCC library (librosa).
  • Worked on integrating librosa with the audio input code from previous weeks. Added timing code: librosa can process an audio chunk from the previous system in 0.005 sec, which is good news for us.
  • Next steps: talk to Prof. Stern about MFCCs and the best way to recognize matching speech; integrate a simple end-to-end system for the in-lab demo.
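
A timing check like the one described above could look roughly like the sketch below. It is not the project's actual code: the helper names `mean_runtime`, `librosa_mfcc`, and `demo`, the 16 kHz sample rate, the 1-second noise chunk, and `n_mfcc=13` are all assumptions made for illustration.

```python
import time


def mean_runtime(extract, chunk, repeats=20):
    """Return the mean wall-clock seconds per call of extract(chunk)."""
    extract(chunk)  # warm-up call so one-time setup cost is not counted
    start = time.perf_counter()
    for _ in range(repeats):
        extract(chunk)
    return (time.perf_counter() - start) / repeats


def librosa_mfcc(chunk, sr=16000, n_mfcc=13):
    """Compute MFCCs for one audio chunk (sr and n_mfcc are assumed values)."""
    import librosa  # imported lazily so the timing helper works without it

    return librosa.feature.mfcc(y=chunk, sr=sr, n_mfcc=n_mfcc)


def demo():
    """Time MFCC extraction on a synthetic chunk; needs numpy and librosa."""
    import numpy as np

    sr = 16000
    # Hypothetical stand-in for a chunk from the audio input code:
    # one second of low-level Gaussian noise.
    chunk = np.random.default_rng(0).normal(0, 0.1, sr).astype(np.float32)
    seconds = mean_runtime(lambda c: librosa_mfcc(c, sr=sr), chunk)
    print(f"mean MFCC time per chunk: {seconds:.4f} s")
```

Calling `demo()` prints the mean time per chunk. The number depends on the machine and the chunk length, so it will not necessarily match the 0.005 sec reported above.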
