pdf-image-extractor

v1.0.0

Extract images from PDFs in original, 800x800 web-ready, or compressed under 50KB formats for printing, e-commerce, or social media use.

0· 70· 1 versions· 0 current· 0 all-time· Updated 3d ago· MIT-0

PDF Image Extractor

Extract images from PDF files with three output modes.

Three Modes

ModeCommandBest For
Originalpython scripts/extract_original.pyPrinting, high-quality
Standardpython scripts/extract_standard.pyWebsite, e-commerce
Compresspython scripts/extract_compress.pyWeChat, Douyin, social

Quick Start

# Install dependencies
pip install pymupdf pillow

# Configure
# Edit CONFIG section at top of each script:
# - pdf_folder: Path to your PDF folder
# - output_folder: Where to save images

# Run
python scripts/extract_original.py    # Original quality
python scripts/extract_standard.py    # 800x800 square
python scripts/extract_compress.py    # Compressed sharing

Output Structure

output_folder/
├── PDF_Name_1/
│   ├── page001_img01.jpg
│   └── ...
└── PDF_Name_2/
    └── ...

Requirements

  • Python 3.7+
  • PyMuPDF (pymupdf)
  • Pillow (PIL)

Use Cases

  • Furniture manufacturers: Extract catalog images for printing
  • E-commerce: Prepare product images for websites
  • Sales: Compress images for WeChat Moments, Douyin

License: MIT
Author: 小杰 @ AI家具链

Version tags

latestvk97c1czns11fw3trbz5wb64q3x85h949