After weeks of crawling, preprocessing, modeling, and fine-tuning, this marks the final wrap-up of the Real-Time Daily Activity Recognizer project.
From simple grayscale CNNs to EfficientNetB0 with dropout and augmentation β this journey was a hands-on dive into the full image classification pipeline.
This project combined image crawling, model training, and webcam integration to simulate a real-world computer vision task.
I handled everything from data collection to inference β using Keras, OpenCV, and transfer learning.
selenium
& urllib
EfficientNetB0
+ Dropout(0.5)
cv2.VideoCapture
)ImageDataGenerator
matplotlib
(loss/accuracy curves, saved to /figures
)Most important lesson: Data quality matters more than model architecture.
β Crawling images manually had too many noisy, irrelevant samples β next time Iβll use ImageNet, Kaggle datasets, or structured action datasets.
Technically, integrating OpenCV inference was super fun β it gave an actual feeling of βreal-time AIβ.
This was never meant to be production-grade β but it taught me the entire pipeline.
From crawling to inference, from augmentation to callbacks, I got to see where things break and how to improve them.
Now itβs time to move on to new projects, with better data and deeper models.
β
Project: Completed
π§ Lessons: Internalized
π₯ Next stop: Something bigger.