-
Type:
Bug
-
Status: Closed
-
Resolution: Fixed
-
Affects Version/s: 7.0.X, 7.1.X, 7.2.X, Master
-
Fix Version/s: 7.0.0 DXP FP88, 7.0.10.13 DXP SP13, 7.0.X, 7.1.10 DXP FP15, 7.1.X, 7.2.10 DXP FP1, 7.2.1 CE GA2, 7.2.X, 7.3.10 DXP GA1, Master
-
Component/s: Core Infrastructure, Documents & Media
-
Branch Version/s:7.2.x, 7.1.x, 7.0.x
-
Backported to Branch:Committed
-
Fix Priority:2
-
Sprint:027 - Sandshrew
-
Git Pull Request:
Tika writes some warnings traces due to missing libraries:
- jai-imageio-core
- jai-imageio-jpeg2000
- jbig2-imageio
- sqlite-jdbc
All except sqlite-jdbc are indirect dependencies of PDFBox
Steps to reproduce
- Before starting Liferay, configure in portal-ext.properties:
text.extraction.fork.process.enabled=true text.extraction.fork.process.mime.types=application/x-tika-ooxml,application/pdf
- Upload a PDF file to document library
- Expected behavior: no warning is written to log file
- Wrong behavior: following warnings are written to log file
Loading jar:file:/C:/Liferay/Bundles_Builds/portal-master/tomcat-9.0.17/webapps/ROOT/WEB-INF/lib/portal-impl.jar!/system.properties Jun 12, 2019 10:08:07 AM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: TIFFImageWriter not loaded. tiff files will not be processed See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io for optional dependencies. J2KImageReader not loaded. JPEG2000 files will not be processed. See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io for optional dependencies. Jun 12, 2019 10:08:07 AM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: org.xerial's sqlite-jdbc is not loaded. Please provide the jar on your classpath to parse sqlite files. See tika-parsers/pom.xml for the correct version.
- Expected behavior: no warning is written to log file
- Discovered while testing
-
LPS-96651 FileUtil.extractText is not working correctly in case of configuring text.extraction.fork.process.enabled=true
- Closed