Paperless-ngx

Document management with OCR and full‑text search

35.5k stars
2.2k forks
Last commit: 11h ago
Repo age: 4y

Paperless-ngx is a self-hosted document management system (DMS) focused on turning paper and digital files into searchable, organized records. It ingests documents from multiple sources, runs OCR and text extraction, and provides a web UI and API to manage, find, and automate document handling.

Key Features

  • Automated ingestion (“consume” folder) plus upload via web UI and REST API
  • OCR and text extraction for searchable PDFs/images (typically via Tesseract)
  • Full-text search with filters (tags, correspondents, document types, dates, fields)
  • Metadata model: correspondents, document types, tags, custom fields, and rules
  • Email ingestion (IMAP) to automatically import attachments and assign metadata
  • Document workflows: matching rules, automatic tagging, and metadata assignment
  • Multi-user support with permissions/roles and an admin interface
  • Preview and download originals/archived PDFs; versioned/organized storage
  • Integrations via API and container-first deployment (Docker/Compose)
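
As a rough illustration of the REST API and full-text search listed above, the sketch below builds (but does not send) a search request. It assumes a token-authenticated instance and the `/api/documents/?query=...` search endpoint; the base URL, the token value, and the helper name `build_search_request` are placeholders chosen for this example, so check the API documentation of your installed version before relying on it.

```python
from urllib.parse import urlencode
from urllib.request import Request


def build_search_request(base_url: str, token: str, query: str) -> Request:
    """Build (without sending) a full-text search request.

    Assumptions: the instance exposes /api/documents/ with a `query`
    parameter and accepts "Authorization: Token <token>" headers.
    """
    params = urlencode({"query": query})
    url = f"{base_url.rstrip('/')}/api/documents/?{params}"
    headers = {
        "Authorization": f"Token {token}",
        "Accept": "application/json",
    }
    return Request(url, headers=headers)


# Hypothetical local instance and token, for illustration only.
request = build_search_request("http://localhost:8000/", "abc", "electricity bill")
print(request.full_url)
```

Passing the returned object to `urllib.request.urlopen` would perform the actual call against a running instance.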

Use Cases

  • Personal “paperless” home archive for bills, receipts, manuals, and letters
  • Small office record-keeping with consistent naming, tagging, and search
  • Automatic import pipeline from scanner + email for invoices and statements

Limitations and Considerations

  • OCR quality and language support depend on installed OCR language packs and scan quality
  • Accurate auto-classification relies on well-tuned matching rules and consistent inputs
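
To make the point about matching rules concrete, here is a simplified sketch of how keyword-based rules might decide whether a tag applies to OCR'd text. It is not the project's actual implementation: the function name `matches` and the mode names `any`, `all`, `literal`, and `regex` are assumptions for illustration only.

```python
import re


def matches(text: str, pattern: str, mode: str) -> bool:
    """Return True if `text` satisfies `pattern` under the given mode.

    Hypothetical helper: a simplified, case-insensitive model of
    keyword matching, not Paperless-ngx's own code.
    """
    lowered = text.lower()
    words = pattern.lower().split()
    if mode in ("any", "all"):
        if not words:
            return False  # an empty pattern never matches
        # Whole-word hit test for each keyword in the pattern.
        hits = (re.search(rf"\b{re.escape(w)}\b", lowered) is not None for w in words)
        return any(hits) if mode == "any" else all(hits)
    if mode == "literal":
        # The whole pattern must appear as one phrase.
        return re.search(rf"\b{re.escape(pattern.lower())}\b", lowered) is not None
    if mode == "regex":
        return re.search(pattern, text, re.IGNORECASE) is not None
    raise ValueError(f"unknown mode: {mode}")


ocr_text = "Invoice from ACME Corp"
print(matches(ocr_text, "acme globex", "any"))
print(matches(ocr_text, "acme globex", "all"))
```

A loose rule (one common keyword in `any` mode) tends to over-tag, while a strict one can miss documents whose OCR output is noisy, which is why consistent scans and tuned rules matter.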

Paperless-ngx is well-suited for users who want reliable OCR-backed search, structured metadata, and automated ingestion to maintain a long-term, searchable archive. Its strong import options and rule-based processing make it practical for both home and small-team document workflows.

Similar Services

Stirling PDF

Self-hosted web app for PDF manipulation and conversion

72.9k stars
6.2k forks
Last commit: 1d ago

Web-based PDF toolkit for merge/split/convert/OCR/redact/sign and more, with an optional API and Docker deployment.

Alternative to: Adobe Acrobat (+2 more)

Reactive Resume

A free and open-source resume builder you can host yourself

34.3k stars
3.8k forks
Last commit: 1d ago

Self-hosted resume/CV builder with templates, versioning, JSON import/export, and PDF export for creating and managing multiple resumes.

Alternative to: Canva Resume Builder (+5 more)

ConvertX

Self-hosted file conversion server with a web UI and API

13.6k stars
724 forks
Last commit: 14h ago

ConvertX is a self-hosted file conversion service that provides a web interface and API to convert documents, images, audio, and video using a containerized toolchain.

Alternative to: Smallpdf (+3 more)

Documenso

Open-source document signing and workflow platform

12.2k stars
2.2k forks
Last commit: 1d ago

Self-hosted platform for preparing, sending, and tracking legally binding e-signatures with templates, audit trails, and team workflows.

Alternative to: DocuSign (+9 more)

DocuSeal

Open-source document signing and form workflows

11.1k stars
917 forks
Last commit: 3d ago

Self-hosted eSignature platform for creating templates, collecting form data, and signing PDFs with audit trails and team workflows.

Alternative to: DocuSign (+9 more)

OmniTools

Self-hosted web toolbox for everyday developer and data tasks

8.1k stars
492 forks
Last commit: 29d ago

A self-hosted, browser-based collection of utilities for encoding/decoding, text and data conversion, and other common developer-friendly tools in one place.

Alternative to: CyberChef (hosted/online instances) (+3 more)