Vegetable Vision: A Production-Oriented CNN Image Classification Project Practice

A deep learning course project by Singapore Polytechnic students, demonstrating how to refactor course notebooks into a production-ready project with MLOps practices, CI/CD pipelines, security scans, and complete documentation.

Tags: CNN · Convolutional Neural Network · Image Classification · TensorFlow · Keras · MLOps · Deep Learning · Data Augmentation · CI/CD
Published 2026-04-29 17:14 · Recent activity 2026-04-29 17:21 · Estimated read: 6 min

Section 01

Vegetable Vision: A Production-Oriented CNN Image Classification Project Guide

Vegetable Vision is a project by Singapore Polytechnic student Goh Kun Ming, which transforms a deep learning course assignment (Jupyter Notebook) into a production-ready machine learning project. It integrates MLOps practices, CI/CD pipelines, security scans, and complete documentation. The project focuses on vegetable image classification using CNNs, compares multiple architectures, and demonstrates how to bridge academic work to production-level engineering.


Section 02

Project Background & Motivation

The project originated from a course assignment (ST1504 Deep Learning CA1 Part A) at Singapore Polytechnic, where the task was to build a CNN classifier for vegetable images in a single Jupyter Notebook. The author recognized the need to refactor that notebook into maintainable, reusable code that could be extended later, which led to Vegetable Vision: a project that retains its academic value while adopting modern software engineering and MLOps best practices.


Section 03

Core Technical Goals & Model Architecture Comparisons

The project's key technical goals include:

  1. Implementing multi-class vegetable image classification with TensorFlow/Keras, comparing input sizes (23px vs. 101px) and the effect of data augmentation.
  2. Evaluating five CNN architectures: a Sequential model (baseline), the Functional API (flexible), residual connections (to alleviate vanishing gradients), an Inception-like design (multi-scale features), and depthwise separable convolutions (lightweight, reduced compute); a minimal Keras sketch of two of these variants follows the list.
  3. Refactoring the notebook into a maintainable codebase that meets production-level standards.
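
As a rough illustration of how two of these variants can be expressed in Keras, here is a minimal sketch; the input size, layer widths, and class count below are assumptions for illustration, not the project's actual configuration.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

NUM_CLASSES = 15              # illustrative class count, not the project's actual value
INPUT_SHAPE = (101, 101, 3)   # illustrative input size


def sequential_baseline():
    """Plain Sequential CNN used as a baseline."""
    return models.Sequential([
        layers.Input(shape=INPUT_SHAPE),
        layers.Conv2D(32, 3, activation="relu", padding="same"),
        layers.MaxPooling2D(),
        layers.Conv2D(64, 3, activation="relu", padding="same"),
        layers.MaxPooling2D(),
        layers.Flatten(),
        layers.Dense(128, activation="relu"),
        layers.Dense(NUM_CLASSES, activation="softmax"),
    ])


def residual_block(x, filters):
    """Residual connection: add the block input back onto its output."""
    shortcut = x
    x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    x = layers.Conv2D(filters, 3, padding="same")(x)
    if shortcut.shape[-1] != filters:                 # match channel counts before adding
        shortcut = layers.Conv2D(filters, 1, padding="same")(shortcut)
    x = layers.Add()([x, shortcut])
    return layers.Activation("relu")(x)


def depthwise_separable_model():
    """Lightweight model built from depthwise separable convolutions."""
    inputs = layers.Input(shape=INPUT_SHAPE)
    x = layers.SeparableConv2D(32, 3, activation="relu", padding="same")(inputs)
    x = layers.MaxPooling2D()(x)
    x = layers.SeparableConv2D(64, 3, activation="relu", padding="same")(x)
    x = layers.GlobalAveragePooling2D()(x)
    outputs = layers.Dense(NUM_CLASSES, activation="softmax")(x)
    return models.Model(inputs, outputs)
```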

Section 04

Project Structure & MLOps Practices

Structure:

  • Original notebook kept in root and notebooks/ for academic integrity.
  • src/vegetable_vision/ for modular code (model definitions, training, data processing).
  • tests/ for pytest suites (config validation, data loading, notebook tools); an illustrative test sketch follows this list.
  • docs/ for comprehensive documentation.
  • A notebook-splitting script that divides large notebooks into smaller files while keeping them in sync with the original.
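
As a sketch of what a configuration-validation test in tests/ might look like, consider the following; the config dictionary and key names are hypothetical, not the project's actual API.

```python
# tests/test_config.py: illustrative sketch; the config structure is an assumption,
# not the project's actual configuration module.

EXAMPLE_CONFIG = {
    "image_size": 101,
    "batch_size": 32,
    "epochs": 10,
    "data_dir": "data/vegetables",
}

REQUIRED_KEYS = {"image_size", "batch_size", "epochs", "data_dir"}


def test_required_keys_present():
    """Every required key should exist in the training configuration."""
    assert REQUIRED_KEYS <= EXAMPLE_CONFIG.keys()


def test_numeric_fields_are_positive_integers():
    """Numeric hyperparameters should be positive integers."""
    for key in ("image_size", "batch_size", "epochs"):
        assert isinstance(EXAMPLE_CONFIG[key], int)
        assert EXAMPLE_CONFIG[key] > 0
```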

MLOps:

  • Dependency management: pyproject.toml, separate runtime (requirements.txt) and dev (requirements-dev.txt) dependencies.
  • Code quality: Ruff (formatting and linting), Bandit (security scanning), pip-audit (dependency vulnerability audit).
  • CI/CD: GitHub Actions run tests, code checks, security scans, and CodeQL analysis on each commit.

Section 05

Dataset & Training Process

The vegetable image dataset uses standard train/validation/test splits. Due to academic use restrictions, data is not committed to version control (via .env and .gitignore). Training scripts have a command-line interface to specify data directory, model type, image size, and epochs. Evaluation scripts load saved models to generate performance reports, enabling easy integration into automation pipelines.
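
A minimal sketch of what such a training entry point could look like is shown below; the flag names and defaults are assumptions for illustration, not the project's actual interface.

```python
# Illustrative training CLI sketch; flag names and defaults are assumptions,
# not the project's actual command-line interface.
import argparse


def main() -> None:
    parser = argparse.ArgumentParser(description="Train a vegetable image classifier.")
    parser.add_argument("--data-dir", required=True, help="Root directory of the image dataset")
    parser.add_argument(
        "--model",
        default="sequential",
        choices=["sequential", "functional", "residual", "inception", "separable"],
        help="Which CNN architecture to train",
    )
    parser.add_argument("--image-size", type=int, default=101, help="Square input size in pixels")
    parser.add_argument("--epochs", type=int, default=10, help="Number of training epochs")
    args = parser.parse_args()

    # A real script would build tf.data pipelines from args.data_dir, construct the
    # chosen model, train it for args.epochs, and save the weights plus an evaluation report.
    print(f"Training {args.model} on {args.data_dir} at {args.image_size}px for {args.epochs} epochs")


if __name__ == "__main__":
    main()
```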


Section 06

Documentation & Educational/Industry Value

Documentation:

  • PROJECT_CONTEXT.md: project background and design decisions.
  • DATASET.md: data format, source, and usage restrictions.
  • MLOPS.md: MLOps practices.
  • MODEL_CARD.md: model details and limitations.
  • ARCHITECTURE.md: code structure.
  • CI_SECURITY.md: CI and security configuration.

Value:

  • Educational: Shows students how to turn academic prototypes into production-ready products—critical for AI industry competitiveness.
  • Industry: Provides a lightweight MLOps reference using simple tools (pytest, Ruff, GitHub Actions) to build effective quality assurance systems.

Section 07

Limitations & Future Improvement Directions

Limitations:

  • Full model reproduction requires original dataset and GPU resources.
  • Test coverage focuses on infrastructure code (not model training logic, which needs heavy compute).

Future Directions:

  • Integrate experiment tracking tools (Weights & Biases/MLflow).
  • Add model version management (e.g., DVC).
  • Implement model serving (e.g., FastAPI); a minimal serving sketch follows this list.
  • Expand test coverage to model inference paths.
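
To make the serving direction concrete, here is a hypothetical FastAPI sketch; the endpoint, model path, input size, and preprocessing are assumptions and not part of the current project.

```python
# Hypothetical serving sketch for the FastAPI direction; the model path, input size,
# and endpoint are assumptions, not part of the current project.
import io

import numpy as np
import tensorflow as tf
from fastapi import FastAPI, File, UploadFile
from PIL import Image

app = FastAPI(title="Vegetable Vision serving sketch")
model = tf.keras.models.load_model("models/vegetable_cnn.keras")  # assumed model path


@app.post("/predict")
async def predict(file: UploadFile = File(...)):
    # Decode the uploaded image and resize it to the model's assumed input size.
    image = Image.open(io.BytesIO(await file.read())).convert("RGB").resize((101, 101))
    batch = np.expand_dims(np.asarray(image, dtype=np.float32) / 255.0, axis=0)
    probs = model.predict(batch)[0]
    return {"class_index": int(np.argmax(probs)), "confidence": float(np.max(probs))}
```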