saloua7/pmc_pdf_download

PMC Article Downloader

This Python script searches PubMed Central (PMC) for a given term, retrieves the identifiers of the matching articles (PMCIDs), downloads the corresponding PDF files from the NCBI FTP server, and extracts the downloaded files.
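The search step can be sketched as follows. This is a minimal illustration and not the script's actual code: the function names `build_esearch_url`, `parse_pmcids`, and `search_pmcids` are hypothetical, and the sketch uses only the standard library (`urllib`) so it runs without extra installs, whereas the script itself lists `requests` as a dependency. It assumes the E-utilities ESearch endpoint is queried with `db=pmc` and `retmode=json`, and that the numeric IDs it returns become PMCIDs once prefixed with `PMC`.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# NCBI E-utilities ESearch endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"


def build_esearch_url(term: str, retmax: int = 20) -> str:
    """Build an ESearch URL that queries the PMC database and asks for JSON."""
    params = {"db": "pmc", "term": term, "retmax": retmax, "retmode": "json"}
    return f"{ESEARCH_URL}?{urlencode(params)}"


def parse_pmcids(payload: dict) -> list[str]:
    """Turn the numeric IDs of an ESearch JSON response into PMC-prefixed IDs."""
    uids = payload.get("esearchresult", {}).get("idlist", [])
    return [f"PMC{uid}" for uid in uids]


def search_pmcids(term: str, retmax: int = 20) -> list[str]:
    """Run the search over the network and return the matching PMCIDs."""
    with urlopen(build_esearch_url(term, retmax), timeout=30) as response:
        return parse_pmcids(json.load(response))
```

Calling `search_pmcids("machine learning", retmax=5)` would return up to five PMCIDs. Keeping URL construction and response parsing in separate functions lets both be checked without network access.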

Features

  • Retrieves PMC article identifiers based on a search term using NCBI's E-utilities.
  • Downloads PDF files from the NCBI FTP server.
  • Extracts and cleans up the downloaded files.
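The extract-and-clean-up step might look like the sketch below. It is an illustration under stated assumptions, not the script's actual implementation: it assumes each download is a `.tar.gz` package that contains the article PDF alongside other files, and the function name `extract_pdfs` and its parameters are hypothetical.

```python
import shutil
import tarfile
from pathlib import Path


def extract_pdfs(archive_path: str, output_dir: str) -> list[Path]:
    """Copy every PDF out of a .tar.gz package, then delete the package."""
    archive = Path(archive_path)
    out = Path(output_dir)
    out.mkdir(parents=True, exist_ok=True)

    kept = []
    with tarfile.open(archive, "r:gz") as tar:
        for member in tar:
            # Skip directories, links, and anything that is not a PDF.
            if not (member.isfile() and member.name.lower().endswith(".pdf")):
                continue
            source = tar.extractfile(member)
            # Keep only the file name so nothing is written outside output_dir.
            target = out / Path(member.name).name
            with source, open(target, "wb") as handle:
                shutil.copyfileobj(source, handle)
            kept.append(target)

    # Clean up: the archive is no longer needed once the PDFs are copied.
    archive.unlink()
    return kept
```

Copying members one by one, rather than unpacking the whole archive, means no temporary directory has to be removed afterwards and non-PDF files never reach the disk.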

Requirements

  • Python 3.12
  • Third-party libraries:
    • requests
    • pandas
    • beautifulsoup4
  • Standard-library modules (no installation needed):
    • tarfile
    • shutil
    • os

Install the third-party libraries with pip:

pip install requests pandas beautifulsoup4
