apache - Lucene Solr not highlighting special characters like dot, slash in search result


As the title says: in the results I get back from Solr, special characters in the search term are not highlighted.

<em>00</em>:<em>00.000z</em>  

Solr parameters:

&hl.simple.pre=<em>&hl.simple.post=</em> 

example query: all:* , hello/world

<em>hello</em> / <em>world</em> 

field analyzer:

<fieldType name="text_en" class="solr.TextField" positionIncrementGap="100">
  <analyzer type="index">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <!-- in this example, we will only use synonyms at query time
    <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
    -->
    <!-- Case insensitive stop word removal.
      add enablePositionIncrements=true in both the index and query
      analyzers to leave a 'gap' for more accurate phrase queries.
    -->
    <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="lang/stopwords_en.txt"
            enablePositionIncrements="true"
            />
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.EnglishPossessiveFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:
    <filter class="solr.EnglishMinimalStemFilterFactory"/>
    -->
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
  <analyzer type="query">
    <tokenizer class="solr.StandardTokenizerFactory"/>
    <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
    <filter class="solr.StopFilterFactory"
            ignoreCase="true"
            words="lang/stopwords_en.txt"
            enablePositionIncrements="true"
            />
    <filter class="solr.LowerCaseFilterFactory"/>
    <filter class="solr.EnglishPossessiveFilterFactory"/>
    <filter class="solr.KeywordMarkerFilterFactory" protected="protwords.txt"/>
    <!-- Optionally you may want to use this less aggressive stemmer instead of PorterStemFilterFactory:
    <filter class="solr.EnglishMinimalStemFilterFactory"/>
    -->
    <filter class="solr.PorterStemFilterFactory"/>
  </analyzer>
</fieldType>

StandardTokenizer will weed out characters such as /, so perhaps you're looking for WhitespaceTokenizer here. Other than that, the colon (:) has special significance in the Lucene query parser and in edismax, so perhaps you want to try your luck with the simpler, more robust dismax query parser.
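
The WhitespaceTokenizer suggestion above could be sketched roughly as follows. This is a minimal, untested sketch and not a drop-in replacement: the field type name text_en_ws is made up for illustration, and the stop word, possessive and stemming filters from the original text_en type are left out on purpose so that only the tokenizer change is visible.

```xml
<!-- Hypothetical field type (the name "text_en_ws" is an assumption, not from the original schema).
     WhitespaceTokenizerFactory splits on whitespace only, so "hello/world" and
     "00:00.000z" each stay a single token and can be highlighted as a whole. -->
<fieldType name="text_en_ws" class="solr.TextField" positionIncrementGap="100">
  <!-- A single <analyzer> without a type attribute is used at both index and query time. -->
  <analyzer>
    <tokenizer class="solr.WhitespaceTokenizerFactory"/>
    <filter class="solr.LowerCaseFilterFactory"/>
  </analyzer>
</fieldType>
```

The trade-off is that punctuation attached to a word, such as a trailing comma, also stays inside the token, so a search for the bare word may no longer match it. Documents need to be reindexed after the index-time analyzer changes. Independently of the tokenizer, the dismax parser mentioned above can be tried per request by adding defType=dismax to the query parameters.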

