Billboard Segmentation Using Segformer
This project leverages the Segformer pretrained model for billboard segmentation via semantic segmentation, with a focus on Indian billboards. By utilizing transfer learning on a custom dataset, the model precisely classifies and outlines billboards within images, facilitating efficient detection and analysis. The methodology incorporates several key improvements, including:
Dataset Augmentation to increase model robustness.
Data Balancing to address class imbalance issues.
Advanced Post-Processing techniques to refine segmentation results.
These techniques significantly enhance segmentation accuracy, addressing challenges commonly encountered in binary semantic segmentation tasks.
Dataset Format
The dataset is structured as follows:
You can download the dataset from the following link:
Google Colab Demo
To train the Segformer model for billboard segmentation, you can follow along with the Google Colab notebook linked below. The notebook provides step-by-step instructions for data preprocessing, model training, and evaluation.
Evaluation
The following graph shows the training and validation performance over time, demonstrating the model's convergence and improvement in segmentation accuracy:
Test Images
Below are sample outputs of the model on test images. These showcase the model's ability to accurately segment billboards:
Final Results
Here are some final outputs after post-processing, showing the refined billboard segmentation:
This README is focused solely on the training aspect of the project. A separate README file will cover API-related details and deployment.
Last updated