The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF). All of these file types can be parsed through a single interface, making Tika useful for search engine indexing, content analysis, translation, and much more.
To create a new Proxmox VE Apache Tika LXC, run the command below in the Proxmox VE Shell.
To Update Apache Tika, run the command below (or type update) in the LXC Console.
bash -c "$(wget -qLO - https://github.com/community-scripts/ProxmoxVE/raw/main/ct/apache-tika.sh)"
Default settings
CPU: 1vCPU
RAM: 1GB
HDD: 10GB
Default Interface: IP:9998