PDF Extractor Pro

Extension · Tools · 3 users

Overview

Extract text and tables from digital and scanned PDFs - fully local, no cloud, no APIs.

PDF Text & Table Extractor - 100% Local OCR

Extract text and tables from both digital and scanned PDF files directly inside your browser. Everything runs locally on your device - no servers, no cloud processing, no API keys, and no data uploads. Works completely offline after installation and OCR language data caching.

Privacy First

• 100% local processing
• No backend or cloud services
• No external API calls
• No account required
• No tracking
• Works offline after setup

Your PDF files never leave your computer.

Features

Digital PDF Text Extraction
Extract text from standard PDFs while preserving the original reading order using PDF.js.

OCR for Scanned PDFs
Convert image-based PDFs into searchable text using built-in OCR.
• English OCR support
• Automatically detects scanned pages
• No manual page selection required

Automatic Table Detection
Extract tables without manually drawing boxes or selecting regions. Uses coordinate-based row and column grouping to detect table structures automatically.

Multiple Export Formats
• JSON – Structured output with text blocks, tables, and OCR content
• CSV – Table data only
• Plain Text – Text with table rendering
• Excel (.xlsx) – Separate sheets for text blocks, tables, and OCR pages

Multiple Input Methods

Upload PDF Files
Process PDF documents directly from your computer.

Extract PDFs From Browser Tabs
Open any PDF URL in Chrome and extract its contents without downloading files manually.

How to Use

Upload a PDF:
1. Click the extension icon.
2. Click "Upload PDF".
3. Select a PDF file.
4. Wait while pages are processed (OCR pages typically take 2–5 seconds each).
5. Choose an export format.
6. Download the extracted results.

Extract From the Current Browser Tab:
1. Open any PDF in Chrome.
2. Click the extension icon.
3. The "Current Tab" option becomes available automatically.
4. Click it to process the PDF directly from the active tab.

OCR Mode

ON (Default)
• Scanned pages are processed with Tesseract OCR.
• Image-based pages are converted into searchable text.

OFF
• Faster processing.
• Image-based pages are skipped.
• Useful when working with text-based PDFs only.

During the first OCR run, the extension downloads the English language model (eng.traineddata, approximately 10 MB) and stores it locally. Once cached, all future OCR operations work completely offline.

Export Formats

JSON
Structured output containing text blocks, detected tables, and OCR text.

CSV
Exports detected tables with separate sections for each table and page.

Plain Text
Exports all extracted text with readable table formatting.

Excel (.xlsx)
Creates separate worksheets for text blocks, tables, and OCR content.

Perfect For

• Researchers
• Students
• Accountants
• Data analysts
• Office work
• Archiving documents
• Converting scanned PDFs
• Extracting tables from reports and invoices

Fast, private, and fully local PDF text and table extraction for Google Chrome.
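For readers curious about the coordinate-based row and column grouping mentioned under Automatic Table Detection, here is a minimal sketch of one way such grouping could work. It is not the extension's actual code: the `TextItem` shape, the function names (`groupRows`, `columnAnchors`, `toGrid`), and the tolerance values are all assumptions made for illustration. The idea is to treat text fragments with nearly equal vertical positions as one row, and to cluster left edges into column anchors.

```typescript
// Hypothetical shape for a positioned text fragment (an assumption, not the
// extension's real data model). Coordinates are in page units.
interface TextItem {
  text: string;
  x: number; // left edge
  y: number; // baseline; PDF coordinates grow upward
}

// Group fragments into rows: a fragment joins the current row when its y value
// is within `tolerance` of that row's first fragment.
function groupRows(items: TextItem[], tolerance = 2): TextItem[][] {
  const sorted = [...items].sort((a, b) => b.y - a.y); // top of page first
  const rows: TextItem[][] = [];
  for (const item of sorted) {
    const last = rows[rows.length - 1];
    if (last && Math.abs(last[0].y - item.y) <= tolerance) {
      last.push(item);
    } else {
      rows.push([item]);
    }
  }
  // Order each row left to right.
  return rows.map((row) => row.sort((a, b) => a.x - b.x));
}

// Cluster left edges into column anchors: a new anchor starts whenever an x
// value lies more than `tolerance` to the right of the previous anchor.
function columnAnchors(items: TextItem[], tolerance = 5): number[] {
  const xs = items.map((i) => i.x).sort((a, b) => a - b);
  const anchors: number[] = [];
  for (const x of xs) {
    if (anchors.length === 0 || x - anchors[anchors.length - 1] > tolerance) {
      anchors.push(x);
    }
  }
  return anchors;
}

// Build a grid of cell strings by assigning each fragment to its nearest anchor.
function toGrid(items: TextItem[], rowTol = 2, colTol = 5): string[][] {
  const anchors = columnAnchors(items, colTol);
  return groupRows(items, rowTol).map((row) => {
    const cells = new Array<string>(anchors.length).fill("");
    for (const item of row) {
      let best = 0;
      for (let c = 1; c < anchors.length; c++) {
        if (Math.abs(anchors[c] - item.x) < Math.abs(anchors[best] - item.x)) {
          best = c;
        }
      }
      cells[best] = cells[best] ? `${cells[best]} ${item.text}` : item.text;
    }
    return cells;
  });
}

// Usage with made-up coordinates that are slightly misaligned, as in real PDFs.
const sample: TextItem[] = [
  { text: "Name", x: 10, y: 100 },
  { text: "Qty", x: 100, y: 100 },
  { text: "Apple", x: 10, y: 80 },
  { text: "3", x: 101, y: 80 },
  { text: "Pear", x: 11, y: 60.5 },
  { text: "7", x: 100, y: 60 },
];
console.log(toGrid(sample));
// prints [ [ 'Name', 'Qty' ], [ 'Apple', '3' ], [ 'Pear', '7' ] ]
```

A fixed tolerance is the simplest choice. A real detector would likely need to scale it with font size and handle right-aligned or merged cells, which this sketch ignores.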

0 out of 5
No ratings

Details

Version: 1.0.1
Updated: July 19, 2026
Offered by: Liminal Vault
Size: 773KiB
Languages: English
Developer email: contact@liminalvault.com
Non-trader
This developer has not identified itself as a trader. For consumers in the European Union, please note that consumer rights do not apply to contracts between you and this developer.

Privacy

PDF Extractor Pro has disclosed the following information regarding the collection and usage of your data. More detailed information can be found in the developer's privacy policy.