Website Data Scraper

Overview

Website Data Scraper is a Python-based tool that scrapes a specified website URL and extracts data into CSV format. The extracted data includes image URLs and information stored within specific HTML tags (e.g., <h1>, <p>, etc.). This project is ideal for those looking to collect structured data from web pages for analysis or storage.

Features

Extracts text data from HTML tags such as <h1>, <p>, and more.
Extracts image URLs.
Stores extracted data in CSV format.
Easy configuration for different websites and HTML structures.

Steps

Clone the repository:

git clone https://github.com/yourusername/website-data-scraper.git
cd website-data-scraper

now install all the above dependencies
now run the command
```
python AdvancedScraper.py
```

now enter the url for which you want to scrap h1,p,a or any of the tags you can modify it too

Please enter the URL: https://en.wikipedia.org/wiki/Main_Page
Extracting data from https://en.wikipedia.org/wiki/Main_Page...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Website Data Scraper

Overview

Features

Steps

Files

README.md

Latest commit

History

README.md

File metadata and controls

Website Data Scraper

Overview

Features

Steps