HuggingFace Daily Papers Abstracts Extractor

This project automates the process of downloading, summarizing, and converting daily papers from Hugging Face into easily readable formats.

Features

Download daily papers from Hugging Face API
Extract abstracts and generate markdown summaries
Handle empty files and weekends/holidays
Avoid reprocessing existing files

Project Structure

hf_daily_papers/
│
├── data/
│   ├── input/  # Downloaded JSON files
│   ├── output/ # Generated markdown files
│
├── src/
│   ├── download_daily_papers.py
│   ├── daily_papers_abstract_extractor.py
│
└── README.md

Installation

Clone this repository:

git clone https://github.com/elsatch/daily_hf_papers_abstracts.git
cd hf_daily_papers

Install the required dependencies:
```
pip install requests
```

Usage

Download daily papers:
```
python src/download_daily_papers.py [YYYYMMDD]
```
If no date is provided, it will download papers for the current date.
Process JSON files and generate markdown summaries:
```
python src/daily_papers_abstract_extractor.py
```

Notes

The scripts handle empty files that may occur during weekends or holidays.
Existing processed files are not overwritten to avoid unnecessary reprocessing.
You can run these scripts daily to keep up with the latest papers.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is open source and available under the MIT License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

HuggingFace Daily Papers Abstracts Extractor

Features

Project Structure

Installation

Usage

Notes

Contributing

License

Files

README.md

Latest commit

History

README.md

File metadata and controls

HuggingFace Daily Papers Abstracts Extractor

Features

Project Structure

Installation

Usage

Notes

Contributing

License