-
Type:
Bug
-
Status: Closed
-
Resolution: Fixed
-
Affects Version/s: 6.2.X EE, 7.0.X, 7.1.X, 7.2.X, Master
-
Fix Version/s: 6.2.X EE, 7.0.0 DXP FP84, 7.0.10.12 DXP SP12, 7.0.X, 7.1.10 DXP FP13, 7.1.X, 7.2.10 DXP FP1, 7.2.X, 7.2.1 CE GA2, 7.3.10 DXP GA1, Master
-
Component/s: Documents & Media
-
Branch Version/s:7.2.x, 7.1.x, 7.0.x, 6.2.x
-
Backported to Branch:Committed
-
Fix Priority:3
-
Sprint:027 - Sandshrew
-
Git Pull Request:
FileUtil.extractText is not working correctly in case of configuring text.extraction.fork.process.enabled=true in portal-ext.properties
Steps to reproduce
- Before starting Liferay, configure text.extraction.fork.process.enabled=true in portal-ext.properties
- Create a docx or odt file with content:
This is a test
- Create a site
- Attach created file to document library in previous created site
- Get indexed data from elasticsearch, opening following URL: (note: set your groupId)
http://localhost:9200/_search?q=%2BgroupId:<<YOUR_GROUPID>>+%2BentryClassName:com.liferay.document.library.kernel.model.DLFileEntry&pretty
- Review "content_en_US" field of each returned document:
- Expected behavior: the document has a "This is a test" text in "content_en_US" field
- Wrong behavior: the document doesn't have any text in "content_en_US"
- Expected behavior: the document has a "This is a test" text in "content_en_US" field
- relates
-
LPE-16800 FileUtil.extractText is not working correctly when text.extraction.fork.process.enabled=true
-
- Closed
-
- Testing discovered
-
LPS-96654 LPS-70810 is reproduced in case of enabling text.extraction.fork.process.* in portal-ext.properties
- Closed
-
LPS-96779 Tika writes some warnings traces due to missing libraries
- Closed
-
LPS-96932 Thumbnails and Previews are not generated and an error is thrown with some PDFs generated from scanned documents
- Closed
-
LPS-96652 An ExceptionInInitializerError is thrown in case of enabling text.extraction.fork.process.* in portal-ext.properties
- Closed