Tutorials

Python Download PDF From URL Using BeautifulSoup4 and Requests Library

In this tutorial, I will teach you how to download PDF files from URLs using Python programming language. The complete script to download pdfs from website is given below.

We will make use of Beautiful Soup 4 and Requests libraries to build the functionality of downloading PDF files from URLs.

Python Download PDF From URL

Install Dependencies

pip install requests
pip install bs4

code.py

# Import libraries 
import requests 
from bs4 import BeautifulSoup 

# URL from which pdfs to be downloaded 
url = "https://nanonets.com/blog/deep-learning-ocr/"

# Requests URL and get response object 
response = requests.get(url) 

# Parse text obtained 
soup = BeautifulSoup(response.text, 'html.parser') 

# Find all hyperlinks present on webpage 
links = soup.find_all('a') 

i = 0

# From all links check for pdf link and 
# if present download file 
for link in links: 
    if ('.pdf' in link.get('href', [])): 
        i += 1
        print("Downloading file: ", i) 

        # Get response object for link 
        response = requests.get(link.get('href')) 

        # Write content in pdf file 
        pdf = open("pdf"+str(i)+".pdf", 'wb') 
        pdf.write(response.content) 
        pdf.close() 
        print("File ", i, " downloaded") 

print("All PDF files downloaded")

Run the Project

python code.py
Furqan

Well. I've been working for the past three years as a web designer and developer. I have successfully created websites for small to medium sized companies as part of my freelance career. During that time I've also completed my bachelor's in Information Technology.

Recent Posts

AppCleaner vs Pearcleaner: Best Free Mac Cleaner 2025?

Quick Takeaway: If you want a simple, no-fuss app uninstaller that just works, AppCleaner is your best bet.…

October 21, 2025

Mem0 Alternatives: Complete Guide to AI Memory Solutions in 2025

Looking for the right AI memory solution but not sure if Mem0 fits your needs?…

October 21, 2025

Splashtop Alternatives: Top Remote Desktop Solutions for 2025

Looking for better remote access options? You're not alone. Many IT teams and businesses are…

October 21, 2025

Same.new Alternatives: Complete Guide to AI Web App Builders in 2025

Looking for alternatives to Same.new? You're not alone. While Same.new promises to clone websites and…

October 21, 2025

Coolify vs Dokploy: I Tested Both — Here’s What You Need to Know

If you're paying steep bills to Heroku, Vercel, or Netlify and wondering if there's a…

October 21, 2025

MiniMax-M1 vs GPT-4o vs Claude 3 Opus vs LLaMA 3 Benchmarks

MiniMax-M1 is a new open-weight large language model (456 B parameters, ~46 B active) built with hybrid…

August 31, 2025