Website Data Scraper

Overview

Website Data Scraper is a Python-based tool that scrapes a web page at a URL you specify and writes the extracted data to a CSV file. The extracted data includes image URLs and the text stored within specific HTML tags (e.g., <h1>, <p>). This project is suited to collecting structured data from web pages for analysis or storage.

Features

Extracts text data from HTML tags such as <h1>, <p>, and more.
Extracts image URLs.
Stores extracted data in CSV format.
Easy configuration for different websites and HTML structures.
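
The features above can be sketched roughly as follows. This is a minimal illustration and not the code in AdvancedScraper.py: it assumes only the Python standard library (`html.parser`, `csv`), and the names `TagTextExtractor`, `extract_rows`, and `rows_to_csv` are hypothetical. The default tag list of `h1` and `p` is likewise an assumption taken from the examples in this README.

```python
import csv
import io
from html.parser import HTMLParser


class TagTextExtractor(HTMLParser):
    """Collect the text inside selected tags and the src of every <img>.

    Hypothetical helper for illustration; not part of the project.
    """

    def __init__(self, tags=("h1", "p")):
        super().__init__()
        self.tags = set(tags)
        self.rows = []          # (tag, content) pairs in document order
        self._open_tag = None   # tag currently being captured, if any
        self._buffer = []       # text fragments seen inside that tag

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.rows.append(("img", src))
        elif tag in self.tags and self._open_tag is None:
            self._open_tag = tag
            self._buffer = []

    def handle_data(self, data):
        if self._open_tag is not None:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == self._open_tag:
            # Join fragments and collapse runs of whitespace.
            text = " ".join("".join(self._buffer).split())
            if text:
                self.rows.append((tag, text))
            self._open_tag = None


def extract_rows(html, tags=("h1", "p")):
    """Return (tag, content) rows for the chosen tags plus image URLs."""
    parser = TagTextExtractor(tags)
    parser.feed(html)
    parser.close()
    return parser.rows


def rows_to_csv(rows):
    """Serialize rows to CSV text with a header line."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["tag", "content"])
    writer.writerows(rows)
    return out.getvalue()
```

Passing a different `tags` tuple, for example `("h2", "a")`, is one way the "easy configuration" idea could look; the real script may expose this differently.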

Steps

Clone the repository:

git clone https://github.com/yourusername/website-data-scraper.git
cd website-data-scraper

Install the project's dependencies, then run the scraper:
```
python AdvancedScraper.py
```

When prompted, enter the URL of the page you want to scrape. The script extracts tags such as <h1>, <p>, and <a>; you can modify the script to target other tags. For example:

Please enter the URL: https://en.wikipedia.org/wiki/Main_Page
Extracting data from https://en.wikipedia.org/wiki/Main_Page...
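
The prompt-and-fetch flow shown in the session above might look something like the sketch below. It is an assumption about how such a step could be written with the standard library's `urllib`, not a description of AdvancedScraper.py. The function names `normalize_url`, `fetch_html`, `absolute_image_url`, and `main`, the `User-Agent` string, and the 10-second timeout are all hypothetical.

```python
from urllib.parse import urljoin, urlparse
from urllib.request import Request, urlopen


def normalize_url(raw):
    """Strip whitespace, default to https://, and reject non-http(s) URLs."""
    url = raw.strip()
    if "://" not in url:
        url = "https://" + url
    parts = urlparse(url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"not a valid http(s) URL: {raw!r}")
    return url


def fetch_html(url, timeout=10):
    """Download a page and decode it using the charset the server reports."""
    request = Request(url, headers={"User-Agent": "website-data-scraper-example"})
    with urlopen(request, timeout=timeout) as response:
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")


def absolute_image_url(page_url, src):
    """Resolve a possibly relative <img src> against the page URL."""
    return urljoin(page_url, src)


def main():
    """Interactive entry point mirroring the prompts in the sample session."""
    url = normalize_url(input("Please enter the URL: "))
    print(f"Extracting data from {url}...")
    html = fetch_html(url)
    print(f"Fetched {len(html)} characters")
```

Calling `main()` starts the interactive prompt. Resolving image paths with `urljoin` matters because pages often use relative `src` values, which are not useful in a CSV file unless they are made absolute.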
