Deploy Container API
Everything you need to know to deploy the pdfRest Container API with your Docker framework.
Loading the image
The Docker container can be obtained in tar form from links provided by the Datalogics Sales team. Questions regarding container licensing can be directed towards sales@datalogics.com.
Once the image has been downloaded you will load it into your environment, likely with docker load
.
Official Docker documentation is located here
Using the Container API image
This guide covers two containerized deployment methods, Docker and Kubernetes. It references some Environment Variables which are covered in-depth in the Configure Container API Guide.
Docker
The simplest Docker Compose file to define the pdfRest Container API can be found below:
This will instruct Docker to set up a pdfRest instances on a Linux machine using the pdfRest container image previously loaded with docker load
.
The optional PDFREST_SERVER_DOMAIN
Environment Variable formats the input and output URLs of documents uploaded to and processed by the API.
The ports
section of the YAML instructs the stack to listen for API calls on port 80 and forward those calls to port 3000 inside of the container.
You can change the listener to a port other than 80, but 3000 is mandatory for the forwarding port.
If you require shared storage between multiple running instances, set up a shared volume as described in the Docker storage volume documentation and configure the volumes
section of the YAML to mount that volume as shown below:
Kubernetes
Using the image previously loaded with docker load
you can now create and expose a deployment. The only item to note is that the pdfRest server listens on port 3000
When running multiple instances you will want to set up a shared volume so that all instances of pdfRest can read and write to the same location. This ensures that all input and output documents are available no matter which instance a processing or retrieval API call load-balances to. This requires the creation of a PersistentVolume.
Here, we will demonstrate with a hostPath PersistentVolume for testing and development purposes. It is not recommended to use a hostPath in a production cluster. A cluster administrator should provision a networked resource such as an NFS share, a Google Cloud persistent disk, Azure File Share, or an AWS EFS volume.
First, create the PersistentVolume:
save it as pdfrest-pv.yaml
and apply it to your deployment with:
This configuration specifies that the volume will be found at /mnt/data/
on the cluster Node. It also defines the StorageClass name for the PersistentVolume as manual
, which will be used to bind PersistentVolumeClaim requests to this PersistentVolume.
Then, create the PersistentVolumeClaim:
Save it as pdfrest-pvc.yaml
and apply it to your deployment with:
The PersistentVolumeClaim should be automatically bound to the PersistentVolume you created.
Then configure the pdfRest deployment to use the PersistentVolumeClaim you made as a volume.
Edit the deployment.yaml
, under spec > template > spec
: