Tutorials

Python Download PDF From URL Using BeautifulSoup4 and Requests Library

In this tutorial, I will teach you how to download PDF files from URLs using Python programming language. The complete script to download pdfs from website is given below.

We will make use of Beautiful Soup 4 and Requests libraries to build the functionality of downloading PDF files from URLs.

Python Download PDF From URL

Install Dependencies

pip install requests
pip install bs4

code.py

# Import libraries 
import requests 
from bs4 import BeautifulSoup 

# URL from which pdfs to be downloaded 
url = "https://nanonets.com/blog/deep-learning-ocr/"

# Requests URL and get response object 
response = requests.get(url) 

# Parse text obtained 
soup = BeautifulSoup(response.text, 'html.parser') 

# Find all hyperlinks present on webpage 
links = soup.find_all('a') 

i = 0

# From all links check for pdf link and 
# if present download file 
for link in links: 
    if ('.pdf' in link.get('href', [])): 
        i += 1
        print("Downloading file: ", i) 

        # Get response object for link 
        response = requests.get(link.get('href')) 

        # Write content in pdf file 
        pdf = open("pdf"+str(i)+".pdf", 'wb') 
        pdf.write(response.content) 
        pdf.close() 
        print("File ", i, " downloaded") 

print("All PDF files downloaded")

Run the Project

python code.py
Furqan

Well. I've been working for the past three years as a web designer and developer. I have successfully created websites for small to medium sized companies as part of my freelance career. During that time I've also completed my bachelor's in Information Technology.

Recent Posts

MiniMax-M1 vs GPT-4o vs Claude 3 Opus vs LLaMA 3 Benchmarks

MiniMax-M1 is a new open-weight large language model (456 B parameters, ~46 B active) built with hybrid…

June 22, 2025

How to Use Husky with npm to Manage Git Hooks

Managing Git hooks manually can quickly become tedious and error-prone—especially in fast-moving JavaScript or Node.js…

June 22, 2025

How to Use Lefthook with npm to Manage Git Hooks

Git hooks help teams enforce code quality by automating checks at key stages like commits…

June 22, 2025

Lefthook vs Husky: Which Git Hooks Tool is Better? [2025]

Choosing the right Git hooks manager directly impacts code quality, developer experience, and CI/CD performance.…

June 22, 2025

Llama 3.1 vs GPT-4 Benchmarks

We evaluated the performance of Llama 3.1 vs GPT-4 models on over 150 benchmark datasets…

July 24, 2024

Transforming Manufacturing with Industrial IoT Solutions and Machine Learning

The manufacturing industry is undergoing a significant transformation with the advent of Industrial IoT Solutions.…

July 6, 2024