Extract Images from Docs

😫 The Pain Point

You received a Word document with 50 embedded images, and you need them as separate files for your website. Copying and pasting them one by one is tedious and can reduce image quality.

🚀 Agentic Solution

An Image Extractor that pulls all embedded media from documents.

Key Features:

Multiple Formats: Word (DOCX), PowerPoint (PPTX), PDF.
Original Quality: Extracts each image at the resolution stored in the document.
Batch Processing: Processes a whole folder of documents at once.

⚔️ Phase 1: Commander (Quick Fix)

For a quick, one-off extraction.

Prompt:

“I have a Word document report.docx with embedded images. Write a Python script to:

Extract: All images from the document.

Naming: Save as report_img_001.png, report_img_002.jpg, etc.

Output: Save to extracted_images/ folder.

Print the number of extracted images. Handle documents without images gracefully.”

Result: All images extracted at original quality.
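
For reference, here is a minimal sketch of the kind of script this prompt might produce. It assumes the standard library's zipfile module instead of a third-party package, and the function name extract_docx_images is hypothetical. It copies everything under word/media/ unchanged, which may include vector formats such as EMF, and it numbers files by sorted archive name rather than by their order in the document.

```python
import zipfile
from pathlib import Path


def extract_docx_images(docx_path, out_dir="extracted_images"):
    """Copy every file under word/media/ out of a DOCX; return the saved paths."""
    docx_path = Path(docx_path)
    out_dir = Path(out_dir)
    saved = []
    with zipfile.ZipFile(docx_path) as archive:
        # Embedded images are stored under word/media/ inside the archive.
        media = sorted(
            name for name in archive.namelist()
            if name.startswith("word/media/") and not name.endswith("/")
        )
        if media:
            # Only create the output folder when there is something to save.
            out_dir.mkdir(parents=True, exist_ok=True)
        for index, name in enumerate(media, start=1):
            ext = Path(name).suffix.lower() or ".bin"
            target = out_dir / f"{docx_path.stem}_img_{index:03d}{ext}"
            # Bytes are written as-is, so the embedded quality is preserved.
            target.write_bytes(archive.read(name))
            saved.append(target)
    return saved


# Example usage with the file named in the prompt:
# images = extract_docx_images("report.docx")
# print(f"Extracted {len(images)} image(s)")
```

A document without images simply returns an empty list, so the count printed is 0 and no output folder is created.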

🏗️ Phase 2: Architect (Permanent Tool)

Engineering Prompt:

**Role:** Python Tool Developer
**Task:** Create a "Document Image Extractor".

**Requirements:**
1.  **GUI:**
    *   Select document or folder.
    *   Format filter (DOCX, PPTX, PDF).
    *   Preview extracted images.
    *   Naming pattern input.

2.  **Logic:**
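    *   DOCX: Use python-docx to read the embedded image parts (stored under word/media/ in the archive).
    *   PPTX: Use python-pptx to collect the images from each slide.
    *   PDF: Extract embedded image streams with a PDF library (e.g., PyMuPDF or pypdf).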

3.  **Deliverables:**
    *   `extract_images.py`
    *   `run.bat`, `run.sh`
    *   `requirements.txt`
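
The format filter and naming pattern from the requirements can be sketched independently of any GUI. The snippet below is an illustration only: the names SUPPORTED, find_documents, and build_name, and the `{doc}_img_{n:03d}{ext}` pattern syntax, are assumptions and not part of the original prompt.

```python
from pathlib import Path

# Extensions the tool is asked to support, keyed by the filter label.
SUPPORTED = {"DOCX": ".docx", "PPTX": ".pptx", "PDF": ".pdf"}


def find_documents(folder, formats=("DOCX", "PPTX", "PDF")):
    """Return files in `folder` whose extension matches the selected formats."""
    wanted = {SUPPORTED[f.upper()] for f in formats}
    return sorted(
        p for p in Path(folder).iterdir()
        if p.is_file() and p.suffix.lower() in wanted
    )


def build_name(pattern, doc_stem, index, ext):
    """Fill a naming pattern such as '{doc}_img_{n:03d}{ext}'."""
    return pattern.format(doc=doc_stem, n=index, ext=ext)


# Example usage (the folder name is hypothetical):
# for doc in find_documents("incoming_docs", formats=["DOCX", "PPTX"]):
#     print(doc.name, build_name("{doc}_img_{n:03d}{ext}", doc.stem, 1, ".png"))
```

Keeping this logic in plain functions lets the same code serve both the GUI and a command-line batch run.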

🧠 Prompt Decoding

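DOCX internals: A DOCX file is a ZIP archive containing XML parts and media files. Embedded images are stored under word/media/ (PPTX files use ppt/media/ in the same way), which is why they can be extracted without any loss of quality.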
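
To see this structure for yourself, the archive can be inspected with Python's built-in zipfile module. This is a small sketch, and list_media is a hypothetical helper name.

```python
import zipfile


def list_media(path):
    """Return the archive entries stored in a DOCX or PPTX media folder."""
    prefixes = ("word/media/", "ppt/media/")
    with zipfile.ZipFile(path) as archive:
        return [
            name for name in archive.namelist()
            if name.startswith(prefixes) and not name.endswith("/")
        ]


# Example usage:
# print(list_media("report.docx"))
```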

🛠️ Instructions

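Install: pip install python-docx python-pptx (PDF extraction needs an additional PDF library such as PyMuPDF: pip install pymupdf)
Copy the prompt into your AI assistant → Run the generated script.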

Related Workflows

PDF Merge

PDF Split

PDF to Images

PDF Watermark

Remove Geo Tag

Data Faker VN
