Solr-web-plugin doesn't get snippets from the localized fields and provides incosistent results in case of searches from other locales for terms occured only in the default-locale version of a Web Content

Description

Solr-web-plugin currently is not designed to get snippets from the localised content fields from the result SolrDocument (field "highlights").

By default, only the field "content" is declared as stored and indexed field in the schema.xml of solr-web-plugin. However, if a customer creates a translation for a given web content, the localised versions of the fields (sush as "content" --> "content_xx_XX") are saved in the Document stored by Solr. Therefore, only searches from the default-locale will be succeed.

Adding the new field(s) (e.g. "content_en_US" and "content_el_GR" to the "schema.xml" solves the problem above, but the results never be highlighted. It's because the method "SolrIndexSearcherImpl.getSnippet()" reads the snippets only from field "content" (even if the localised versions also exist). Because, the field "content" always stores the content corresponding to the current default-locale, results of searches from the default-locale always be highlighted.

By default, Solr generates snippets for the field declared in the element "<defaultSearchField>" in the "schema.xml". It is a fallback field and deprecated from solr-3.6.0. Solr uses it when no field(s) is/are passed as param(s) "hl.fl" to the given solrQuery.

There are two ways to tell Solr explicitly for which fields to generate highlighted snippets:
1. Adding a param to the "SolrQuery" when executing the method "SolrIndexSearcherImpl.translateQuery()" :
e.g. solrQuery.setParam("hl.fl", "content_el_GR");

2. Adding the following line to the "requestHandler" called "standard" (under solr-1.4.1, or "search" under solr-3.5.0) in the "solrconfig.xml":
e.g. <str name="hl.fl">content,content_en_US,content_el_GR</str>

--> Thus, if a customer is planning to support translations, they have to modify the "schema.xml" and take into account the special characteristics of the given language. Different languages may need different "tokanizer" and/or "filter(s)" hence new "fieldType" declaration in the "schema.xml".

The other problem is that search from other locales for a term, which occurs only in the default-locale version, generates confusing result in case of Web Contents: there will be result, but the portal will display the localized version of the content, which obviously does not contain the term! No highlight, no term in the displayed content. It's because the field "content" is always part of the query, thus, Solr will find matches, and Solr-web-plugin will pass the Document (built from the result SolrDocument) to the portal, where the displaying process tries to highlight in the localized field (without success, of course). Unfortunately, Lucene (LuceneIndexSearcherImpl) works like this, too. (Tested on trunk)

Attachments

7

Activity

Show:

Sophia.Zhang January 22, 2013 at 1:20 AM

Passed it due to I passed , they fixed together so that I can't make comments for them separately.

Marko Ivkovic September 7, 2012 at 4:22 AM

still having the same problem on liferay 6.1.1 ga2 CE, solr 1.4.1 and solr web plugin from Liferay marketplace (for liferay 6.1 ga2)...
the problem is with Spanish content, followed the steps from this ticket, but it did not work and the search
only works for the default language of the portal (in my case English)..
any ideas for a solution, it seems to me that this issue is not yet fixed?

Michael Saechang July 25, 2012 at 11:06 AM
Edited

Committed on:
Plugins 6.1.x CE GIT ID: 5578d6035df03868bf835cae5739a75a825ac60d.
Plugins 6.2.x GIT ID: dd23e8dbc26d399abfb06a63002daedd3b32c3e9.

Tibor Lipusz June 8, 2012 at 2:37 AM

Modifying summary and description

Tibor Lipusz June 4, 2012 at 5:39 AM

Check the attachments and use the files if necessary.

Fixed

Details

Assignee

Reporter

Labels

Branch Version/s

6.1.x

Backported to Branch

Committed

Fix Priority

3

Git Pull Request

Components

Affects versions

Priority

Zendesk Support

Created June 4, 2012 at 2:06 AM
Updated June 24, 2023 at 3:53 PM
Resolved June 9, 2012 at 1:11 PM
Loading...