WORK ACCOMPLISHED:
Over the past two weeks, I completed a number of important tasks, including some deliverables that were overdue.
First, I completed the integration of the USB microphone with the Raspberry Pi and successfully recorded sample audio of a user stating the destination for their journey.
With a sample recording in hand, I began testing the Google speech-to-text AI framework, comparing which models work best for extracting the required text.
Log of different tests with different models
Sample transcript from audio (the actual audio is me saying: “Ride take me to Tepper”)
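One way the model comparison could be scored is with word error rate (WER) against the known spoken phrase, followed by a simple pattern match to pull out the destination. The sketch below is only an illustration under assumptions: the function names `word_error_rate` and `extract_destination`, the "take me to" phrasing rule, and the misrecognized transcript in the usage note are all hypothetical and are not taken from my actual test log.

```python
import re


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        # With an empty reference, any output counts as fully wrong.
        return 1.0 if hyp else 0.0
    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def extract_destination(transcript: str):
    """Return the text after 'take me to', or None if the phrase is absent.

    The trigger phrase is an assumption made for this sketch.
    """
    match = re.search(r"\btake me to\s+(.+)", transcript, flags=re.IGNORECASE)
    if match is None:
        return None
    return match.group(1).strip(" .!?")
```

For the spoken phrase "Ride take me to Tepper", a perfect transcript scores a WER of 0, while a made-up misrecognition such as "right take me to pepper" has two wrong words out of five. A lower WER across several recordings would suggest a better-suited model, although a model with a slightly higher WER may still be preferable if it gets the destination word right.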
I also worked on some of the logic for translating the destination text returned by the speech-to-text endpoint into latitude and longitude coordinates using the Google Maps Geocoding API.
Code Snippet:
Output for Destination of “Tepper School of Business”:
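For readers who want the general shape of the geocoding step, here is a minimal sketch rather than my exact code. It assumes the JSON endpoint of the Geocoding API and its documented response layout (`results[0].geometry.location`); the helper names `build_geocode_url` and `parse_lat_lng` are hypothetical, and the network request itself is left out so the example runs offline.

```python
from urllib.parse import urlencode

# JSON endpoint of the Google Maps Geocoding API.
GEOCODE_ENDPOINT = "https://maps.googleapis.com/maps/api/geocode/json"


def build_geocode_url(address: str, api_key: str) -> str:
    """Build the request URL for a free-text destination (hypothetical helper)."""
    query = urlencode({"address": address, "key": api_key})
    return f"{GEOCODE_ENDPOINT}?{query}"


def parse_lat_lng(response: dict):
    """Return (lat, lng) from a decoded Geocoding response, or None on failure."""
    if response.get("status") != "OK" or not response.get("results"):
        return None
    location = response["results"][0]["geometry"]["location"]
    return location["lat"], location["lng"]


if __name__ == "__main__":
    print(build_geocode_url("Tepper School of Business", "YOUR_API_KEY"))
    # Placeholder response with made-up coordinates, NOT real API output.
    sample = {
        "status": "OK",
        "results": [{"geometry": {"location": {"lat": 40.0, "lng": -80.0}}}],
    }
    print(parse_lat_lng(sample))
```

Keeping URL construction and response parsing separate from the HTTP call makes both pieces easy to test on the Raspberry Pi without spending API quota. Returning `None` for a non-`OK` status lets the caller ask the user to repeat the destination.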
PROGRESS:
I am on schedule after making good progress this week. However, I will likely need to pick up the pace to leave ample time for integrating the different components of my feature.
NEXT WEEK DELIVERABLES:
I will primarily continue testing to find the most suitable model for speech-to-text transcription. I will also start working on the navigation suggestion logic using an R-tree algorithm.