Tutorials

Python Download PDF From URL Using BeautifulSoup4 and Requests Library

In this tutorial, I will teach you how to download PDF files from URLs using Python programming language. The complete script to download pdfs from website is given below.

We will make use of Beautiful Soup 4 and Requests libraries to build the functionality of downloading PDF files from URLs.

Python Download PDF From URL

Install Dependencies

pip install requests
pip install bs4

code.py

# Import libraries 
import requests 
from bs4 import BeautifulSoup 

# URL from which pdfs to be downloaded 
url = "https://nanonets.com/blog/deep-learning-ocr/"

# Requests URL and get response object 
response = requests.get(url) 

# Parse text obtained 
soup = BeautifulSoup(response.text, 'html.parser') 

# Find all hyperlinks present on webpage 
links = soup.find_all('a') 

i = 0

# From all links check for pdf link and 
# if present download file 
for link in links: 
    if ('.pdf' in link.get('href', [])): 
        i += 1
        print("Downloading file: ", i) 

        # Get response object for link 
        response = requests.get(link.get('href')) 

        # Write content in pdf file 
        pdf = open("pdf"+str(i)+".pdf", 'wb') 
        pdf.write(response.content) 
        pdf.close() 
        print("File ", i, " downloaded") 

print("All PDF files downloaded")

Run the Project

python code.py
Furqan

Well. I've been working for the past three years as a web designer and developer. I have successfully created websites for small to medium sized companies as part of my freelance career. During that time I've also completed my bachelor's in Information Technology.

Recent Posts

Llama 3.1 vs GPT-4 Benchmarks

We evaluated the performance of Llama 3.1 vs GPT-4 models on over 150 benchmark datasets…

July 24, 2024

Transforming Manufacturing with Industrial IoT Solutions and Machine Learning

The manufacturing industry is undergoing a significant transformation with the advent of Industrial IoT Solutions.…

July 6, 2024

How can IT Professionals use ChatGPT?

If you're reading this, you must have heard the buzz about ChatGPT and its incredible…

September 2, 2023

ChatGPT in Cybersecurity: The Ultimate Guide

How to Use ChatGPT in Cybersecurity If you're a cybersecurity geek, you've probably heard about…

September 1, 2023

Add Cryptocurrency Price Widget in WordPress Website

Introduction In the dynamic world of cryptocurrencies, staying informed about the latest market trends is…

August 30, 2023

Best Addons for The Events Calendar Elementor Integration

The Events Calendar Widgets for Elementor has become easiest solution for managing events on WordPress…

August 30, 2023