In the first half of this week I was looking into softcore IPs for CPU and GPU to be synthesized later for performance benchmark. Then after talking with Prof. Benson we decided that we would instead compare our benchmark on commercial CPUs and GPUs using non-quantized version of the models. I also worked on securing and initial setup of the FPGA. We chose and obtained from the class inventory the Ultra-96 V2 FPGA and I am currently trying to boot linux on the FPGA. In case of insufficient onboard ram, I have also obtained permission to use one of the research FPGA from Professor Ken Mai if needs arise later into the semester.
My plan for next week are:
- Complete FPGA setup and look into ways to easily interface into the FPGA
- Start the design of overall architecture of the FPGA for ternary llm inference
- Setup tool chain on Vivado focusing on data transfer between the FPGA PS and PL