Justin Ankrom’s Status Report for 4/26

This week I did two main things. First, I ran unit tests on the 4 models we had selected: llama3-8b, falcon-7b, qwen2.5-7b, and vicuna-7b. The tests and results can be found here: https://github.com/jankrom/Voice-Vault/tree/main/server/model-testing. This involved setting up the tests for each model, writing a Python script, and saving the results as a PNG for each model. I found that llama3 had the best accuracy at 100%, while qwen and vicuna both scored around 90%. Falcon scored 0% accuracy, which was very surprising. I looked into it further, and one possible explanation is that it is optimized for code: many of its responses were in JavaScript and similar languages, even though the model is described as optimized for conversation. Because of these results, I removed falcon from our options, so we will now offer only the other 3 models to pick from. I also had to modify our website to list only those 3.

Additionally, this week I deployed our website on Vercel so that it can be accessed from anywhere. Here is the link: https://voice-vault-18500.vercel.app/. I had to learn how to deploy websites on Vercel and fix issues that were occurring when building my app in production mode. This week was meant to be lighter, since the schedule was designed to leave some room for slack.
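To illustrate the kind of accuracy check described above for the model tests, here is a minimal sketch. It is not the actual script from the repository: the test cases, the keyword-matching rule, and the two stand-in "models" (`fake_chat_model`, `fake_code_model`) are all hypothetical and exist only to show how a per-model accuracy score could be computed.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical test cases: (prompt, keyword expected somewhere in the response).
TEST_CASES: List[Tuple[str, str]] = [
    ("What is the capital of France?", "paris"),
    ("What is 2 + 2?", "4"),
]


def accuracy(generate: Callable[[str], str], cases: List[Tuple[str, str]]) -> float:
    """Return the fraction of cases whose response contains the expected keyword."""
    if not cases:
        return 0.0
    passed = sum(
        1 for prompt, expected in cases
        if expected.lower() in generate(prompt).lower()
    )
    return passed / len(cases)


def fake_chat_model(prompt: str) -> str:
    """Stand-in for a conversational model (assumption, not a real LLM call)."""
    answers = {
        "What is the capital of France?": "The capital of France is Paris.",
        "What is 2 + 2?": "2 + 2 equals 4.",
    }
    return answers.get(prompt, "")


def fake_code_model(prompt: str) -> str:
    """Stand-in for a model that answers every prompt with code."""
    return "console.log('hello');"


# Score each stand-in model on the same test cases.
scores: Dict[str, float] = {
    fn.__name__: accuracy(fn, TEST_CASES)
    for fn in (fake_chat_model, fake_code_model)
}

for name, score in scores.items():
    print(f"{name}: {score:.0%}")
```

A model that answers every prompt with code fails every keyword check under this rule, which would be consistent with the failure mode I saw with falcon. The real tests may use a different scoring rule.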

My progress is on schedule. I don’t have any more work left on the project itself, only the final deliverables: the final poster, final video, and final report, which I will complete in the following week.
