Metadata-Version: 2.1
Name: papermerge-core
Version: 2.1.0
Summary: Open source document management system for digital archives
Home-page: https://www.papermerge.com/
License: Apache 2.0
Keywords: PDF,OCR,optical character recognition,PDF/A,scanning,dms,document-management,paperless,archives
Author: Eugen Ciur
Author-email: eugen@papermerge.com
Maintainer: Eugen Ciur
Maintainer-email: eugen@papermerge.com
Requires-Python: >=3.8,<3.11
Classifier: Environment :: Web Environment
Classifier: Framework :: Django
Classifier: License :: OSI Approved :: Apache Software License
Classifier: License :: Other/Proprietary License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3
Classifier: Programming Language :: Python :: 3.8
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3
Requires-Dist: Whoosh (>=2.7.4,<3.0.0)
Requires-Dist: celery[gevent,redis] (>=5.2,<6.0)
Requires-Dist: channels (>=3.0,<4.0)
Requires-Dist: channels-redis (>=3.2,<4.0)
Requires-Dist: configula (>=0.5,<0.6)
Requires-Dist: daphne (>=3.0.2,<4.0.0)
Requires-Dist: django (>=4.0,<5.0)
Requires-Dist: django-celery-results (>=2.2,<3.0)
Requires-Dist: django-cors-headers (>=3.9.0,<4.0.0)
Requires-Dist: django-dynamic-preferences (>=1.13,<2.0)
Requires-Dist: django-haystack (>=3.2.1,<4.0.0)
Requires-Dist: django-modelcluster (>=6.0,<7.0)
Requires-Dist: django-mptt (>=0.13.4,<0.14.0)
Requires-Dist: django-polymorphic (>=3.0,<4.0)
Requires-Dist: django-polymorphic-tree (>=2.0,<3.0)
Requires-Dist: django-rest-knox (>=4.2.0,<5.0.0)
Requires-Dist: django-taggit (>=3.0.0,<4.0.0)
Requires-Dist: django_filter (>=21.1,<22.0)
Requires-Dist: djangorestframework (>=3.13.1,<4.0.0)
Requires-Dist: djangorestframework-jsonapi (>=5.0.0,<6.0.0)
Requires-Dist: drf-spectacular (>=0.22.0,<0.23.0)
Requires-Dist: drf-spectacular-sidecar (>=2022.3.21,<2023.0.0)
Requires-Dist: elasticsearch (>=7,<8)
Requires-Dist: lxml (>=4.9.0,<5.0.0)
Requires-Dist: ocrmypdf (>=13.5.0,<14.0.0)
Requires-Dist: ocrmypdf-papermerge (>=0.4.5,<0.5.0)
Requires-Dist: pdf2image (>=1.16.0,<2.0.0)
Requires-Dist: persisting-theory (>=1.0,<2.0)
Requires-Dist: psycopg2 (>=2.9.2,<3.0.0)
Requires-Dist: psycopg2-binary (>=2.9.2,<3.0.0)
Requires-Dist: python-magic (>=0.4,<0.5)
Requires-Dist: pyyaml (>=6.0,<7.0)
Requires-Dist: redis (>=4.3.3,<5.0.0)
Requires-Dist: uWSGI (>=2.0.20,<3.0.0)
Requires-Dist: uritemplate (>=4.1.1,<5.0.0)
Requires-Dist: yapian-haystack (>=3.1.0,<4.0.0)
Project-URL: Documentation, https://docs.papermerge.io/
Description-Content-Type: text/markdown

[![Tests](https://github.com/papermerge/papermerge-core/actions/workflows/tests.yml/badge.svg)](https://github.com/papermerge/papermerge-core/actions/workflows/tests.yml)

# Papermerge REST API Server


This python package is the heart of Papermerge project. It consists of a set
of reusable Django apps which are consumed across different bundles of
Papermerge Document Management System (DMS).

Technically speaking, it contains following Django apps:

* ``papermerge.core`` - the epicenter of Papermerge DMS project
* ``papermerge.notifications`` - Django Channels app for sending notifications via websockets
* ``papermerge.search`` - RESTful search. Supports four backends: [Xapian](https://getting-started-with-xapian.readthedocs.io/en/latest/),
  [Whoosh](https://whoosh.readthedocs.io/en/latest/intro.html), [Elasticsearch](https://github.com/elastic/elasticsearch),
  [Solr](https://solr.apache.org/).


## What is Papermerge?

Papermerge is an open source document management system (DMS) primarily
designed for archiving and retrieving your digital documents. Instead of
having piles of paper documents all over your desk, office or drawers - you
can quickly scan them and configure your scanner to directly upload to
Papermerge DMS. Papermerge DMS on its turn will extract text data from the
scanned documents using Optical Character Recognition (OCR) technology the
index it and make it searchable. You will be able to quickly find any
(scanned!) document using full text search capabilities.

Papermerge is perfect tool to manage documents in PDF, JPEG, TIFF and PNG formats.

## Features Highlights

* RESTul API
* Works well with PDF documents
* OCR (Optical Character Recognition) of the documents (uses [OCRmyPDF](https://github.com/ocrmypdf/OCRmyPDF))
* Full Text Search of the scanned documents (supports four search engine backends, uses [Xapian](https://getting-started-with-xapian.readthedocs.io/en/latest/) by default)
* Document Versions
* Tags - assign colored tags to documents or folders
* Documents and Folders - users can organize documents in folders
* Multi-User (supports user groups)
* User permissions management
* Page Management - delete, reorder, cut & paste pages (uses [PikePDF](https://github.com/pikepdf/pikepdf))


## Documentation

Online documentation is available at [https://docs.papermerge.io](https://docs.papermerge.io/)


## Tests

Test suite is divided into two big groups:

1. tests.core
2. tests.search


First group is concerned with tests which do not depend on elasticsearch while
second one **tests.search** is concerned with tests for which **depend on elasticsearch**
and as result run very slow (hence the grouping). In
order to run `tests.core` tests you need to have redis up and running; in
order to run `test.search` you need to both **redis and elasticsearch** up and
running.

Before running core tests suite, make sure redis service is up and running. Run tests:

     poetry run task test-core

Before running search tests suite, make sure both **redis and elasticsearch**
services are up and running:

     poetry run task test-search

In order to run all tests suite (core + search):

    poetry run task test


## Linting

Use following command to make sure that your code is formatted per PEP8 spec:

    poetry run task lint

