Can we configure analyzers to not exclude specific characters

Shivashankar Maddanimath Tue, 27 Jan 2015 23:04:18 -0800

Hi,

I am using Lucene's standard analyzer and UAX29URLEmailTokenizer. These 
analyzers drop some characters such as "+" (so I can't search for "C++"). Is 
there any way to configure them to keep specific characters when tokenizing?
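
As an illustration only, the behaviour in question can be sketched in plain 
Python. This is not Lucene code: both function names and both regular 
expressions are made up for the example, and the first one is only a rough 
stand-in for a tokenizer that discards punctuation.

```python
import re

# Hypothetical stand-in for a tokenizer that keeps only runs of letters
# and digits, so punctuation such as "+" is discarded.
def tokenize_default(text):
    return [t.lower() for t in re.findall(r"[A-Za-z0-9]+", text)]

# Hypothetical variant that also keeps "+" or "#" when they directly
# follow a word, so terms like "C++" and "C#" survive as single tokens.
def tokenize_keep_symbols(text):
    return [t.lower() for t in re.findall(r"[A-Za-z0-9]+[+#]*", text)]

print(tokenize_default("I write C++ and Java"))
print(tokenize_keep_symbols("I write C++ and Java"))
```

With the first function a query for "C++" is reduced to the token "c"; with 
the second it stays "c++". In Lucene itself, one approach that may fit is a 
MappingCharFilter applied before the tokenizer (for example, mapping "c++" to 
a placeholder such as "cplusplus"), used at both index and query time; 
whether that suits a given setup is an assumption to verify.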

Regards,
Shiv

-----Original Message-----
From: "Luis A Lastras" <[email protected]>
Sent: 25-01-2015 08:05 AM
To: "[email protected]" <[email protected]>
Subject: Absolute term position in scoring

Is it possible to incorporate the position of a matching term (say, as 
measured from the top of the document) into Lucene's scoring function? The 
scenario is that if documents tend to talk about the most important material 
at the beginning, we would like to give preference to documents that mention 
a term close to the top.

Thanks,

Luis

Luis A Lastras, Ph.D.
Research Staff Member & Manager, Concept Analytics, IBM Watson
Member of the IBM Academy of Technology
IBM Master Inventor
email: [email protected] | Tel: 914-945-3613 | Cell: 914-382-1879
address:  1101 Kitchawan Rd, Office 28-132, Yorktown Heights, NY, 10598
