Eric’s Status Report for 3/8/25

This week, I mainly focused on refining and updating the design report, in particular the Use-Case Requirements, Architecture and/or Principle of Operation, and Design Requirements sections. Some specific changes I made are:

  • Architecture and/or Principle of Operation: I refined the block diagram and system interactions, ensuring that the data flow from image capture → edge processing → cloud verification → database matching was clearly structured. I also improved the documentation and planning of how the edge processor filters detections and sends high- and low-confidence cases to the cloud, reducing unnecessary bandwidth use (a sketch of this filtering logic follows this list).
  • Design Requirements: The biggest change since the Design presentation was updating the cloud reliability target. After reviewing existing cloud service reliability standards, I adjusted the uptime requirement from 95% to 97% to strike a balance between AWS’s guaranteed availability and real-world reliability needs. This change ensures that the system remains operational and responsive in high-demand scenarios, reducing the likelihood of missed detections due to cloud downtime.
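
To make the filtering concrete, here is a minimal sketch of the confidence-based routing described above. The threshold values and the send_to_cloud helper are illustrative assumptions, not our final implementation.

    HIGH_CONF = 0.85  # assumed cutoff for trusting the edge OCR result as-is
    LOW_CONF = 0.40   # assumed cutoff below which detections are dropped

    def send_to_cloud(payload: dict) -> None:
        # Placeholder for the actual upload (e.g., an HTTPS POST to our backend).
        print("uploading:", list(payload.keys()))

    def route_detection(plate_text: str, confidence: float, crop_bytes: bytes) -> None:
        """Decide whether a detection is handled on-device or escalated to the cloud."""
        if confidence >= HIGH_CONF:
            # High-confidence read: send only the small text payload for matching.
            send_to_cloud({"plate": plate_text, "conf": confidence})
        elif confidence >= LOW_CONF:
            # Uncertain read: send the cropped plate image (not the full frame)
            # so the larger cloud model can re-run OCR, limiting bandwidth use.
            send_to_cloud({"conf": confidence, "crop": crop_bytes})
        # Anything below LOW_CONF is discarded on-device.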

I also worked with Richard to further define how the cloud infrastructure handles license plate matching and how it would be implemented, specifically using Supabase and AWS SageMaker. My progress is on schedule, and we have begun testing timing on the Raspberry Pi. Next week I plan to continue working with Richard on testing the models on the Raspberry Pi, and hopefully begin testing with images from the camera module.
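
As a rough illustration of the matching step, the sketch below checks a recognized plate against a table of active alerts using the supabase-py client. The table and column names (amber_alerts, plate, active) and the credentials are placeholder assumptions, not our final schema.

    from supabase import create_client

    supabase = create_client("https://<project>.supabase.co", "<service-role-key>")

    def plate_matches_alert(plate_text: str) -> bool:
        """Return True if the recognized plate appears in an active alert row."""
        response = (
            supabase.table("amber_alerts")   # assumed table name
            .select("plate")
            .eq("plate", plate_text)
            .eq("active", True)              # assumed column flagging live alerts
            .execute()
        )
        return len(response.data) > 0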

Eric’s Status Report for 2/22/25

This week, I spent a lot of time working on the design review presentation and practicing, since I presented on Wednesday. This involved doing research related to the AMBER Alert use case, specifically for our timing requirements, since we wanted them to be based on the expected situation. I found that the 60-second requirement was sufficient for the average lane-change frequency on the highway (once every 2.71 mi) but not for the worst-case merging scenario (20 seconds), so I made that change to the requirements. I also worked more with the PaddleOCR testing to continue exploring how it performs under more extreme weather conditions. Finally, I worked with Richard to set up the basic YOLOv11-to-PaddleOCR pipeline, where YOLOv11 crops the image down to the plate and PaddleOCR runs character recognition on the cropped image; a rough sketch of this pipeline is below.
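
In the sketch, the weights filename and the sample image path are placeholder assumptions; everything else follows the standard Ultralytics and PaddleOCR APIs.

    from ultralytics import YOLO
    from paddleocr import PaddleOCR
    import cv2

    detector = YOLO("plate_yolo11n.pt")             # assumed fine-tuned weights file
    ocr = PaddleOCR(use_angle_cls=True, lang="en")  # angle classifier helps slanted plates

    def read_plates(image_path: str):
        """Yield (text, confidence) pairs for each plate YOLOv11 finds in the image."""
        image = cv2.imread(image_path)
        results = detector(image)
        for box in results[0].boxes.xyxy:           # one (x1, y1, x2, y2) per detection
            x1, y1, x2, y2 = map(int, box)
            crop = image[y1:y2, x1:x2]              # hand only the plate region to OCR
            for line in ocr.ocr(crop)[0] or []:     # each line: [box, (text, confidence)]
                yield line[1]

    for text, conf in read_plates("dashcam_frame.jpg"):  # assumed sample image
        print(text, conf)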


My progress is on schedule. Next week, I plan to continue testing PaddleOCR with the YOLOv11 model integration and explore methods to increase performance. I plan to use larger datasets to see how the overall pipeline performs, as well as begin checking the inference time; a simple timing harness like the sketch below should suffice.
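
In this sketch, run_pipeline stands in for our detect-plus-OCR function; the warm-up call keeps one-time model loading out of the measurement.

    import time

    def average_latency(run_pipeline, image_paths):
        """Return the average seconds per image for the given pipeline function."""
        run_pipeline(image_paths[0])    # warm-up so model loading isn't counted
        start = time.perf_counter()
        for path in image_paths:
            run_pipeline(path)
        return (time.perf_counter() - start) / len(image_paths)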


Team’s Status Report for 2/15/25

The most significant risk is that the edge compute solution may not deliver enough performance (precision and recall) to meet our MVP targets. The contingency plan is a two-phase approach: if we need more accuracy than the edge-compute Raspberry Pi can provide, we send the image to the cloud, where a more sophisticated model can give us better results.

A change we made to the existing design is that we are now using a Raspberry Pi 4 rather than a 5. This change was made because all of the Raspberry Pi 5s available in storage were claimed very quickly, and since we wanted to test our software on actual hardware as soon as possible, we took a Raspberry Pi 4 instead. While it is unfortunate that we're unable to use the most powerful hardware available, this should not have any impact on our ability to create an MVP or final device, since the process for loading the models on the two devices is nearly identical. When we run our model, if its performance is within the order of magnitude expected of a compact processor, we can spend our currently plentiful remaining budget on the more powerful Raspberry Pi 5.

We have trained the model we will most likely use for our MVP: a YOLOv11n model trained for 400 epochs on an open-source license plate detection dataset. It can be found here. We have also looked into existing OCR methods and chosen PaddleOCR from among them, which we're currently experimenting with.
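
For reference, the training call follows the standard Ultralytics API, roughly as sketched below; the dataset YAML path is a placeholder standing in for the open-source plate dataset.

    from ultralytics import YOLO

    model = YOLO("yolo11n.pt")               # start from the pretrained nano checkpoint
    model.train(data="license_plates.yaml",  # assumed dataset config path
                epochs=400, imgsz=640)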

Aside from the model, the Raspberry Pi 4 setup is currently in development, with a GitHub repo to be populated by next week. The camera module is also expected to arrive next week.

Part A written by Richard Sbaschnig:


Our device aims to improve public safety by detecting license plates listed in active AMBER Alerts from a dashcam. Since these alerts are issued to identify suspected kidnappers of children, increasing the search coverage of AMBER Alerts with our device will help law enforcement find these vehicles, and the kidnappers driving them, sooner. The device should also have a deterrent effect, since would-be kidnappers would be less inclined to act knowing that devices all around can identify their car and notify the police automatically.

Part B written by Tzen-Chuen:

Our device's social considerations do not present an obvious point of concern. The different groups that will interact through our device are the manufacturer, the consumer, and the potential child abduction victim. The main point of contention may be between the consumer/end user and the manufacturer, as the manufacturer may install our device without the end user being aware of it, but I believe this can be mitigated through an explicit opt-in system.


Part C written by Eric:

Our license plate recognition system is designed with affordability as one of its focuses, utilizing low-cost Raspberry Pis and camera modules. This provides a more accessible alternative to expensive surveillance systems, ensuring that even communities with limited resources can use our system. This can be especially beneficial in rural areas with little existing infrastructure, since our device would be mounted as a dashcam, allowing for wider reach and greater impact.

Eric’s Status Report for 2/15/25

This week, I focused on researching and testing OCR models for license plate recognition. I experimented with PaddleOCR and EasyOCR, since multiple users reported that Tesseract OCR does not perform well on this task. I tested PaddleOCR's and EasyOCR's performance on license plates with different orientations and angles. To ensure accurate comparisons, I set up a structured testing workflow and collected sample images from various scenarios; the comparison boiled down to running both engines on the same images, roughly as sketched below. After testing, I found that PaddleOCR consistently outperformed EasyOCR when handling rotated or slanted plates. Based on these results, I decided to move forward with PaddleOCR as the primary OCR engine for the project. I also started looking into ways to eliminate detected text that isn't from the license plate number.
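
In the sketch, the image paths are placeholder assumptions; the calls follow the public PaddleOCR and EasyOCR APIs.

    from paddleocr import PaddleOCR
    import easyocr

    paddle = PaddleOCR(use_angle_cls=True, lang="en")  # angle classifier for rotated plates
    easy = easyocr.Reader(["en"])

    for path in ["plate_straight.jpg", "plate_slanted.jpg"]:  # assumed sample images
        paddle_lines = paddle.ocr(path)[0] or []     # each line: [box, (text, confidence)]
        easy_lines = easy.readtext(path)             # each entry: (box, text, confidence)
        print(path)
        print("  PaddleOCR:", [(t, round(c, 2)) for _, (t, c) in paddle_lines])
        print("  EasyOCR:  ", [(t, round(c, 2)) for _, t, c in easy_lines])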

My progress is on schedule. Next week, I plan to work on integrating PaddleOCR with the YOLOv11 model, and figure out what changes are needed for it to run on the Raspberry Pi. If necessary, I will experiment with different PaddleOCR configurations to further refine accuracy and speed.

The image below shows PaddleOCR’s results on an example plate:


Eric’s Status Report for 2/8/25

This week, I worked on the proposal presentation slides, conducted background research on license plate recognition methods, and explored available recognition models like OpenALPR, EasyOCR, and YOLO. I also examined competitors, including Genetec, PLATESMART MOBILE DEFENDER, and Nvidia Metropolis. I experimented with publicly available solutions and found that current methods usually involve several steps of narrowing the image down to the license plate before running OCR. For example, they first locate the car in the image, then the license plate, and then run character recognition. In my research, I also discovered that OpenALPR, although free, has not been updated in 5-7 years and seems to have relatively poor performance compared to more modern alternatives.

My progress is on schedule, and next week I plan to work on the design proposal, research available and relevant datasets, and try the baseline YOLOv11 without fine-tuning to see whether license plates are already one of the classes in its training set and how it performs; a quick sketch of that check is below. I will also research different preprocessing techniques to improve recognition accuracy under varying conditions such as lighting and motion blur.
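
The class check itself should be simple, along these lines; the frame path is a placeholder assumption.

    from ultralytics import YOLO

    model = YOLO("yolo11n.pt")            # stock pretrained weights, no fine-tuning
    print(model.names)                    # class id -> name map; look for anything plate-like
    results = model("dashcam_frame.jpg")  # assumed sample image
    results[0].show()                     # visualize what the baseline actually detects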