This project is an Image Search System that employs advanced image segmentation and caption generation techniques. It enables users to upload images and receive relevant product suggestions based on similarities in the generated captions.
To install the project, follow these steps:
- Clone the repository:
git clone <repository-url>
- Navigate to the project directory:
cd <project-directory>
- Install dependencies:
pip install -r requirements.txt
- Run the backend server:
python backend.py
- Run the frontend interface:
gradio frontend.py
After launching the frontend, upload an image of the product you want to search for and click the search button. The model generates a caption for the image; copy the generated caption, paste it into the search bar, and click the search button again. The system then searches for similar products and displays the results.
- Python
- Gradio
- Segmentation Models
- Salesforce BLIP
- TensorFlow
- Hugging Face Transformers
The development of this image search system involves four key steps:
- Image Segmentation: The system begins with an image segmentation model built on the U2-Net architecture. The model is trained on a dataset of images and their corresponding masks, allowing it to segment product images effectively (a minimal inference sketch follows this list).
- Caption Generation: After segmentation, the system uses the Salesforce BLIP Transformer for caption generation. The BLIP model is fine-tuned on a fashion product dataset of segmented images paired with captions, leveraging learned representations from both image and text modalities to produce descriptive captions for segmented images (see the captioning sketch below).
- Integration with Gradio: Gradio, a user-friendly library for building web applications around machine learning models, provides the interface. An intuitive UI lets users upload images for segmentation and caption generation. On the backend, uploaded images are segmented with the trained U2-Net model and captioned with the fine-tuned BLIP model, and real-time inference gives users prompt feedback on image submission (a wiring sketch appears after this list).
- Product Search Mechanism: Once the system can segment images and generate captions in real time, a search for similar products based on the generated captions is implemented. Captions are tokenized into meaningful tokens representing product attributes, the tokens are transformed into feature representations, and a similarity search retrieves products with matching features (see the search sketch below).
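The segmentation step can be summarized with the minimal inference sketch below. It assumes that u2net/u2net.py exposes the standard U2NETP class from the original U2-Net repository and that models/u2netp.pth is the matching checkpoint; the actual logic in backend.py and u2net/detect.py may differ.

```python
import numpy as np
import torch
from PIL import Image
from torchvision import transforms

from u2net.u2net import U2NETP  # assumed to match the reference U2-Net code

# Load the lightweight U2-Net variant and its pretrained weights.
model = U2NETP(3, 1)
model.load_state_dict(torch.load("models/u2netp.pth", map_location="cpu"))
model.eval()

def segment(image_path: str) -> Image.Image:
    """Return a normalized saliency mask for the given product image."""
    img = Image.open(image_path).convert("RGB")
    tensor = transforms.Compose([
        transforms.Resize((320, 320)),   # U2-Net's usual input size
        transforms.ToTensor(),
    ])(img).unsqueeze(0)
    with torch.no_grad():
        d1, *_ = model(tensor)           # d1 is the fused saliency output
    mask = d1[0, 0].numpy()
    mask = (mask - mask.min()) / (mask.max() - mask.min() + 1e-8)
    return Image.fromarray((mask * 255).astype(np.uint8))
```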
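Caption generation can be sketched with the Hugging Face BLIP API; the public Salesforce/blip-image-captioning-base checkpoint used here is a stand-in for the project's fine-tuned model.

```python
from PIL import Image
from transformers import BlipForConditionalGeneration, BlipProcessor

# Stand-in checkpoint; the project fine-tunes BLIP on a fashion dataset.
CHECKPOINT = "Salesforce/blip-image-captioning-base"
processor = BlipProcessor.from_pretrained(CHECKPOINT)
model = BlipForConditionalGeneration.from_pretrained(CHECKPOINT)

def caption(image_path: str) -> str:
    """Generate a short descriptive caption for a (segmented) image."""
    image = Image.open(image_path).convert("RGB")
    inputs = processor(images=image, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=30)
    return processor.decode(output_ids[0], skip_special_tokens=True)
```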
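The Gradio wiring can be sketched as follows; the real frontend.py likely differs in layout and in how it calls the backend, so the pipeline body here is a placeholder.

```python
import gradio as gr

def search(image):
    # Placeholder pipeline: the real backend runs U2-Net segmentation,
    # BLIP captioning, and the caption-based product search.
    generated_caption = "a red cotton t-shirt"         # stand-in caption
    matches = ["product 1", "product 2", "product 3"]  # stand-in results
    return generated_caption, "\n".join(matches)

demo = gr.Interface(
    fn=search,
    inputs=gr.Image(type="pil", label="Product image"),
    outputs=[
        gr.Textbox(label="Generated caption"),
        gr.Textbox(label="Similar products"),
    ],
)

if __name__ == "__main__":
    demo.launch()
```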
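One plausible realization of the caption-based search is TF-IDF features with cosine similarity, sketched below; scikit-learn and the toy catalog are assumptions, and the project may use a different feature representation.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Toy catalog of product captions; in practice these would come from the
# fashion product dataset used to fine-tune BLIP.
catalog = [
    "red cotton t-shirt with round neck",
    "blue slim-fit denim jeans",
    "black leather biker jacket",
]

vectorizer = TfidfVectorizer()                 # tokenizes and weights terms
catalog_vectors = vectorizer.fit_transform(catalog)

def find_similar(query_caption: str, top_k: int = 3):
    """Rank catalog items by cosine similarity to the query caption."""
    query_vector = vectorizer.transform([query_caption])
    scores = cosine_similarity(query_vector, catalog_vectors)[0]
    ranked = scores.argsort()[::-1][:top_k]
    return [(catalog[i], float(scores[i])) for i in ranked]

print(find_similar("red t-shirt"))
```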
.
├── README.md
├── backend.py
├── bg.py
├── cmd
│   ├── cli.py
│   └── server.py
├── final.ipynb
├── flagged
├── frontend.py
├── github.py
├── models
│   ├── u2aa
│   ├── u2ab
│   ├── u2ac
│   ├── u2ad
│   ├── u2haa
│   ├── u2hab
│   ├── u2hac
│   ├── u2had
│   └── u2netp.pth
├── out.txt
├── requirements.txt
├── u2net
│   ├── data_loader.py
│   ├── detect.py
│   └── u2net.py
└── utilities.py
4 directories, 23 files
- U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection
- BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
- Adarsh Anand
- Aniket Chaudhari
- Rajat Singh
- Vivek Bandrele