Uploaded image for project: 'PUBLIC - Liferay Portal Community Edition'
  1. PUBLIC - Liferay Portal Community Edition
  2. LPS-96651

FileUtil.extractText is not working correctly in case of configuring text.extraction.fork.process.enabled=true

Details

    Description

      FileUtil.extractText is not working correctly in case of configuring text.extraction.fork.process.enabled=true in portal-ext.properties

      Steps to reproduce

      1. Before starting Liferay, configure text.extraction.fork.process.enabled=true in portal-ext.properties
      2. Create a docx or odt file with content:

        This is a test

      3. Create a site
      4. Attach created file to document library in previous created site
      5. Get indexed data from elasticsearch, opening following URL: (note: set your groupId)
        http://localhost:9200/_search?q=%2BgroupId:<<YOUR_GROUPID>>+%2BentryClassName:com.liferay.document.library.kernel.model.DLFileEntry&pretty
      6. Review "content_en_US" field of each returned document:
        • Expected behavior: the document has a "This is a test" text in "content_en_US" field
        • Wrong behavior: the document doesn't have any text in "content_en_US"

      Attachments

        Issue Links

          Activity

            People

              yvonne.han Yvonne Han
              jorge.diaz Jorge Diaz
              Kiyoshi Lee Kiyoshi Lee
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved:
                3 years, 51 weeks, 2 days ago

                Packages

                  Version Package
                  6.2.X EE
                  7.0.0 DXP FP84
                  7.0.10.12 DXP SP12
                  7.0.X
                  7.1.10 DXP FP13
                  7.1.X
                  7.2.10 DXP FP1
                  7.2.1 CE GA2
                  7.2.X
                  7.3.10 DXP GA1
                  Master