Tabla de Contenidos

Paperless-ng

Aplicación para almacenar documentos escaneados y buscar fácilmente en ellos.

Paperless-ng es un derivado del proyecto original Paperless, pero que añada una nueva interfaz y más opciones.

Instalación

Docker

Paperless-ng puede utilizar SQLite o PostgreSQL como motores de bases de datos.

Utilizaremos 3 ficheros:

Fichero docker-compose.env:

# The UID and GID of the user used to run paperless in the container. Set this
# to your UID and GID on the host so that you have write access to the
# consumption directory.
USERMAP_UID=1000
USERMAP_GID=1000
 
# Additional languages to install for text recognition, separated by a
# whitespace. Note that this is
# different from PAPERLESS_OCR_LANGUAGE (default=eng), which defines the
# default language used when guessing the language from the OCR output.
# The container installs English, German, Italian, Spanish and French by
# default.
# See https://packages.debian.org/search?keywords=tesseract-ocr-&searchon=names&suite=buster
# for available languages.
#PAPERLESS_OCR_LANGUAGES=tur ces
 
###############################################################################
# Paperless-specific settings                                                 #
###############################################################################
 
# All settings defined in the paperless.conf.example can be used here. The
# Docker setup does not use the configuration file.
# A few commonly adjusted settings are provided below.
 
# Adjust this key if you plan to make paperless available publicly. It should
# be a very long sequence of random characters. You don't need to remember it.
PAPERLESS_SECRET_KEY=lo-que-quieras
 
# Use this variable to set a timezone for the Paperless Docker containers. If not specified, defaults to UTC.
PAPERLESS_TIME_ZONE=Europe/Madrid
 
# The default language to use for OCR. Set this to the language most of your
# documents are written in.
PAPERLESS_OCR_LANGUAGE=spa

Para elegir el idioma español en Tesseract (el software que usa Paperless-ng para el OCR), se indica con el valor spa.

Utilizando SQLite:

version: "3.4"
services:
  broker:
    image: redis:6.0
    container_name: redis-paperless
    restart: always

  webserver:
    image: jonaswinkler/paperless-ng:0.9.13
    container_name: paperless-app
    restart: always
    depends_on:
      - broker
    ports:
      - 8000:8000
    healthcheck:
      test: ["CMD", "curl", "-f", "http://localhost:8000"]
      interval: 30s
      timeout: 10s
      retries: 5
    volumes:
      - data:/usr/src/paperless/data
      - media:/usr/src/paperless/media
      - ./export:/usr/src/paperless/export
      - ./consume:/usr/src/paperless/consume
    env_file: docker-compose.env
    environment:
      PAPERLESS_REDIS: redis://broker:6379
 

volumes:
  data:
  media:

Tras crear los contenedores (docker-compose up -d), hay que crear una cuenta de administrador:

docker-compose run --rm webserver createsuperuser

Nos hará preguntas:

Starting redis-paperless ... done
Operations to perform:
  Apply all migrations: admin, auth, authtoken, contenttypes, django_q, documents, paperless_mail, sessions
Running migrations:
  No migrations to apply.
Username (leave blank to use 'paperless'): foo
Email address: foo@example.org
Password:
Password (again):
Superuser created successfully.