This week, we set things up for the interim demo. Couple of changes that we made:
- We moved from a 700 Million parameter model to a 165M parameter model.
- We swapped over our quantization techniques due to the custom kernel not being able to accept the smaller model for this quantization form.
- We used a client based access control system, and as Prof. Theo said, we observe a starvation based model in our resource requests.
The details will be analyzed further in individual status reports.
We are well on schedule and have actually hit our basic MVP. The only thing left to do now would be to iterate and try to improve the performance of our system. We are also exploring how to extract power and telemetry signals from the FPGA for simple visualizations.