Hi Robert Muir, we will check on this. Thanks a lot for the pointers.
--
*K*umaran
*R*
On Mon, Jan 16, 2023 at 11:16 PM Robert Muir wrote:
> On Tue, Jan 10, 2023 at 2:04 AM Kumaran Ramasubramanian
> wrote:
> >
> > For handling Indian regional languages, what is the
For handling Indian regional languages, what is the advisable approach?
1. Indexing each language data(Tamil, Hindi etc) in specific fields like
content_tamil, content_hindi with specific per field Analyzer like Tamil
for content_tamil, HindiAnalyzer for content_hindi?
2. Indexing all language da
Thanks a lot for your inputs Michael. I will check about FunctionQuery.
Thanks again :-)
--
Kumaran R
Chennai, India
On Fri, Sep 3, 2021 at 9:22 PM Michael Sokolov wrote:
> Sorry I'm not sure I understand what you're trying to do. Maybe you
> want to match a document having
with 150 INR records). Any pointers to know about this in
detail?
Kumaran R
Chennai, India
On Fri, Sep 3, 2021 at 12:08 AM Michael Sokolov wrote:
> Have you looked at the expressions module? It provides support for
> user-defined computation using values from the index based on a
s. Thanks in advance
--
Kumaran R
Chennai, India
Hi All,
Any input would be appreciated.. thanks in advance.
Kumaran R
On Tue, Mar 10, 2020, 11:44 PM Kumaran Ramasubramanian
wrote:
>
>
> Hi Mikhail
>
> Thanks for the input. But i would like to suggest title of the available
> documents (for the query typed in search b
its help widget )
[image: Screenshot from 2020-03-10 23-40-14.png]
--
*K*umaran
*R*
On Wed, Mar 4, 2020 at 2:44 AM Mikhail Khludnev wrote:
> Hi,
>
> org.apache.lucene.search.spell.DirectSpellChecker
>
>
> On Tue, Mar 3, 2020 at 8:14 AM Kumaran Ramasubramanian >
> wr
Hi All,
I would like to compute autocompletion based on one field's data. For
example, title field of a list of webpages. Is there anyway to achieve
this?
Regards
Kumaran R
arch
https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping.html
https://www.elastic.co/guide/en/elasticsearch/reference/current/cluster-state.html
--
Kumaran R
I got it Erick.. Thank you..
-
Kumaran R
On Fri, Aug 11, 2017 at 10:35 PM, Erick Erickson
wrote:
> Encrypting the _tokens_ inevitably leads to reduced capabilities BTW.
> Trivial example:
> I have these tokens in my index
> run
> runner
> running
> runs
>
>
Hi Ishan, thank you :-)
-
-
Kumaran R
On Mon, Aug 7, 2017 at 10:53 PM, Ishan Chattopadhyaya <
ichattopadhy...@gmail.com> wrote:
> Harry Ochiai (Hitachi) has some index encryption solution,
> https://www.slideshare.net/maggon/securing-solr-search-data-in-the-cloud
> I think it
Hi Erick, i want to encrypt some fields of an document which has personal
identifiable information ( both indexed and stored data)... for eg: email,
mobilenumber etc.. i am able to find LUCENE-6966 alone while googling it..
any related pointers in solr or latest lucene version?
-
-
Kumaran R
Hi Erick,
Thanks for the information. Any pointers about encryption options in
solr?
--
Kumaran R
On Mon, Aug 7, 2017 at 9:17 PM, Erick Erickson
wrote:
> Encryption in Solr has a bunch of ramifications. Do you care about
>
> - encryption at rest or in memory?
> - en
ith sorting. "More Like
> This" would be unreliable. There are many other features of a robust search
> engine that would be impacted, and an index with encrypted terms would be
> useful for only exact matches, which usually results in a poor search
> experience.
https://stackoverflow.com/questions/36604551/adding-encryption-to-solr-lucene-indexes
--
Kumaran R
ery$FilterStrategy.filteredBulkScorer(FilteredQuery.java:504)
> at
> org.apache.lucene.search.FilteredQuery$1.bulkScorer(FilteredQuery.java:150)
If i use BooleanQuery and wrap it with ConstantScoreQuery, shall i use 1
lakh boolean clauses in booleanquery ?
-
-
Kumaran R
On Wed, Jul 19,
f you have
many clauses. But like TermsQuery, BooleanFilter always consume all
matching documents from all its clauses. So if you intersect it with a
selective query, it is wasteful.
Le mar. 18 juil. 2017 à 11:42, Kumaran Ramasubramanian
a écrit :
> Hi Adrien,
>
> Thanks for your in
> FilterClause filterClause = new FilterClause(filter,
> BooleanClause.Occur.SHOULD);
> boolFilter.add(filterClause);
> }
Do you see any problem in using
TermsFilter over TermsQuery?
btw, i will test with TermsQuery and let you know.
--
teredScorer(FilteredQuery.java:542)
>
> at
>> org.apache.lucene.search.FilteredQuery$FilterStrategy.filteredBulkScorer(FilteredQuery.java:504)
>
> at
>> org.apache.lucene.search.FilteredQuery$1.bulkScorer(FilteredQuery.java:150)
>
>
Any pointers are much appreciated... Thank you..
--
Kumaran R
Hi Steve
Thanks for the input. How to apply WordDelimiterGraphFilter
/ WordDelimiterFilter for email tokens alone using email regex ? i want to
have only analyzed tokens for other tokens with other type of special
characters...
--
Kumaran R
On Thu, Jun 15, 2017 at 7:43 PM, Steve Rowe
n my existing custom analyzer without using email
analyzer ( perfieldanalyzer ) for email field.. And so i can apply this
tokenizer for email terms of all fields..
-
Kumaran R
gt; bytes[] bin1bytes = bin1ref.bytes;
>
>
Snippet from: http://stackoverflow.com/a/34324561/1382168
--
Kumaran R
On Thu, Feb 16, 2017 at 12:38 AM, Adrien Grand wrote:
> I think it is hard to come up with a general rule, but there is certainly a
> per-field overhead. There are som
all stored fields ( except field to be
updated) and index everything again ( remove and add remaining fields ).
While searching, i use _all_ blob field to search in texts of all fields
data.
--
Kumaran R
I always use filter when i need to add more than 1024 ( for no scoring
cases ). If filter is removed in lucene 6, what will happen to
maxbooleanclauses limit? Am i missing anything?
-
Kumaran R
On Jan 12, 2017 5:01 AM, "Trejkaz" wrote:
On Thu, Jan 21, 2016 at 4:25 AM, Adrien Gr
Hi
I want to know the purpose of having final in analyzers.
For eg: classicanalyzer. It will be easy to add asciifolding filter over
classicanalyzer.
-
Kumaran R
On Jan 12, 2017 5:41 AM, "Michael McCandless"
wrote:
I don't think it's about efficiency but rather about n
the link mike. i will look into LongPoint in recent versions.
--
Kumaran R
On Fri, Dec 23, 2016 at 4:51 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> Note that Erick is giving you the Solr syntax below, but if you are
> using Lucene directly, that obviously d
Thank you Adrien.
"NumericDocValuesField is the one that supports sorting."
Does this mean LongField/IntField just supports lexicographic order in
sorting?
-
Kumaran R
On Dec 22, 2016 11:28 PM, "Adrien Grand" wrote:
Le jeu. 22 déc. 2016 à 18:50, Kumaran Ramasubrama
in both 1 and 2 cases?
-
Kumaran R
Thanks Mike. We are planning to move MMapDirectory in both indexing and
searching.Regarding ulimit change and read during merging, i just tried
to know the impact of mmapdir during indexing.
-
Kumaran R
On Nov 30, 2016 4:18 AM, "Michael McCandless"
wrote:
>
> It's OK
Thanks a lot Uwe!!! Do we get any benefit on using MMapDirectory over
NIOFSDir during indexing? During merging? Is it ok to change to
MMapDirectory during search alone?
--
Kumaran R
On Nov 24, 2016 11:27 PM, "Erick Erickson" wrote:
>
> Thanks Uwe!
>
>
>
>
> On Th
opy blocks of data
> between the disk or filesystem cache and our buffers residing in Java heap.
> This needs to be done on every search request, over and over again.
--
Kumaran R
On Wed, Nov 23, 2016 at 9:17 PM, Erick Erickson
wrote:
> see Uwe's blog:
> http://blog.thetaphi.de
--
Kumaran R
Hi All,
Can anyone say, is it advisable to have index with both analyzed and
not_analyzed values in one field?
Use case: i have custom fields in my product which can be configured
differently ( ANALYZED and NOT_ANALYZED ) in different modules
--
Kumaran R
On Wed, Oct 26, 2016 at 12
join & elasticsearch uses aggregations ) Is
there any limitations in lucene's implementation ?
--
Kumaran R
Hi Rajnish
It is not advisable to index values with two data types in a field.
Features like phrase query, sorting may break in those indexes.
related previous discussion :
http://www.gossamer-threads.com/lists/lucene/java-user/289159?do=post_view_flat#289159
-
Kumaran R
On Fri, Nov 4
,omitNorms,indexOptions=DOCS_ONLY]
--
Kumaran R
Hi Adrien
Thanks a lot for the pointer.
--
Kumaran R
On Wed, Oct 19, 2016 at 8:07 PM, Adrien Grand wrote:
> You would need to override the wrapComponents method in order to wrap the
> tokenstream. See for instance Lucene's LimitTokenCountAnalyzer.
>
> Le mar. 18 oct. 2016
Hi Adrien
How to do this? Any Pointers?
> If it is fine to add the ascii folding filter at the end of the analysis
chain, then you could use AnalyzerWrapper.
>
-
Kumaran R
On Tue, Oct 11, 2016 at 9:59 PM, Kumaran Ramasubramanian wrote:
>
>
> @Ahmet, Uwe: T
etaphi.de
> eMail: u...@thetaphi.de
>
> > -Original Message-
> > From: Adrien Grand [mailto:jpou...@gmail.com]
> > Sent: Tuesday, October 11, 2016 4:37 PM
> > To: java-user@lucene.apache.org
> > Subject: Re: How to add ASCIIFoldingFilter in ClassicAnalyzer
> >
>
?
public final class ClassicAnalyzer extends StopwordAnalyzerBase
>
--
Kumaran R
Thank you shai. Will check them and let you know for clarifications.
-
Kumaran R
On Sep 27, 2016 10:05 AM, "Shai Erera" wrote:
>
> Hey,
>
> Here's a blog I wrote a couple years ago about using facet associations:
> http://shaierera.blogspot.com/2013/01/facet-asso
ation using
SortedSetDocValues and got it simply by changing the query and opening the
IndexReader for each level of query using SortedSetDocValuesReaderState. "
Is it possible easily?
-
Kumaran R
On Sep 27, 2016 9:38 AM, "Michael McCandless"
wrote:
>
> Weighted facets is
er features, but its a nice option. I
> personally don't think its the end of the world if Mike's patch doesnt
> support all the features of the faceting module initially or even ever.
what
is meant by
weighted facets
? what are
othe
r
facets
features ?
--
Kumaran R
).
Because, i want to make all existing messages in a group as visible to any
new member... so i need to reindex all messages with that newly added
member id..
Is index time join (for second case ) or query time join ( for first case )
can be best fit?
--
Kumaran R
On Tue, Aug 30, 2016 at 1:55 PM
to delete a member id or
edit a group name without reindexing of its children??
is it possible to avoid reindexing? Which lucene class is best fit for
this?
Related Article:
http://blog.mikemccandless.com/2012/01/searching-relational-content-with.html
--
Kumaran R
functionality
we will miss because of this?
other than faster faceting...Is there any benefit of using taxonomy index
over docvalues field for faceting?
--
Kumaran R
find😃f😄g😅h
Any further pointers on the same ??
--
Kumaran R
On Mon, Aug 1, 2016 at 12:07 AM, Kumaran Ramasubramanian
wrote:
>
> Hi All,
>
> Is there any pointers on storing smileys in lucene index?? Any help is
much appreciated.
>
> Thank you.
>
> --
>
Hi All,
Is there any pointers on storing smileys in lucene index?? Any help is much
appreciated.
Thank you.
--
Kumaran R
On Jul 30, 2016 12:24 PM, "Kumaran Ramasubramanian"
wrote:
>
> Hi All,
>
> Am using lucene 4.10.4. Using lucene index, Is there any way to store and
Hi All,
Am using lucene 4.10.4. Using lucene index, Is there any way to store and
retrieve latest utf8 and utf8mb4 emoji / smiley characters?? In any latest
lucene version??
Thanks in advance.
--
Kumaran R
Ok mike.. thanks for the explanation... i have another doubt...
i read in some article like, we can have one storedfield & docvalue field
with same field... is it so?
--
Kumaran R
On Thu, Jul 28, 2016 at 9:29 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> O
Hi Mike,
For your information, am using lucene 4.10.4.. am i missing anything?
--
Kumaran R
On Wed, Jul 27, 2016 at 1:52 AM, Kumaran Ramasubramanian wrote:
>
> Hi Mike,
>
> 1.if we index one field as analyzed and not analyzed using same name,
> phrase queries are not
t; Expect many more anomalies. At best you need to
> index both forms as text rather than mixing numeric
> and text data.
Thanks in advance...
--
Kumaran R
On Sun, Jul 24, 2016 at 1:54 AM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> On Sat, Jul 23, 2016 at 4:48
Hi Mike,
*Two different fields can be the same name*
Is it so? You mean we can index one field as docvaluefield and also stored
field, Using same name?
And AFAIK, We cannot index one field as analyzed and not analyzed using the
same name. Am i right?
Kumaran R
On Jul 21, 2016 11:50 PM
inconsistency expected ? Any pointers is much appreciated...
Thank you :-)
--
Kumaran R
Dear All,
Is there pointers on how to take my first step regarding below mentioned
requirement? Any help is much appreciated. Thanks.
when i dont not get matched records for a query, i want to try term
> modifiers ( as per order ) to retrieve related results ( at least )
>
>
--
:-)
--
**
Kumaran R
Thanks a lot for the clarification mike :-)
--
**
Kumaran R
On Thu, Jun 2, 2016 at 2:38 AM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> Well, each access may involve disk seeks (if the pages are not already hot
> in the OS's IO cache).
>
> So if you do
performance or time taken
issue in retrieving values of a docvalue field using atomicreader over
StoredField?
Please clarify me what am i missing?
Thanks in advance
--
Kumaran R
s before adding to index*
stored
Thanks in advance :-)
--
Kumaran R
yes. All fields are stored in index. so sounds like working.
Thanks a lot jack :-) Nice to meet you here :-)
--
Kumaran R
On Fri, Dec 18, 2015 at 12:49 AM, Jack Krupansky
wrote:
> You could certainly read your stored values from your current index and
> then write new documents to
Hi Jack Krupansky
Thanks for the reply. That will work fine. But i am trying to use the
stored values instead of hitting database for reindex. Isn't it better way
to reindex? Any inputs?
--
Kumaran R
On Thu, Dec 17, 2015 at 11:50 PM, Jack Krupansky
wrote:
> Delete the full i
need to reindex all of your data.
>
> -- Jack Krupansky
>
> On Thu, Dec 17, 2015 at 6:10 AM, Kumaran Ramasubramanian > wrote:
>
>> Dear All
>>
>> i am using lucene 4.10.4. Is there any more information i missed to
>> provide? Please let me know.
>>
>&
Dear All
i am using lucene 4.10.4. Is there any more information i missed to
provide? Please let me know.
--
Kumaran R**
On Wed, Dec 16, 2015 at 10:35 PM, Kumaran Ramasubramanian <
kums@gmail.com> wrote:
>
> Hi All,
>
> Previous Post -
> http://www.gossamer-thr
Hi All,
Previous Post -
http://www.gossamer-threads.com/lists/lucene/java-user/289159
i have indexed one field "STATUS" as both IntField & String field in
same index. Now i want to take IntField containing documents and change the
value of field "STATUS" to string with norms & positions ( t
Hi Uwe
Thanks for the clarification.
--
Kumaran R
On Wed, Nov 25, 2015 at 2:32 PM, Uwe Schindler wrote:
> Hi,
>
> To use a real phrase (more than one term) as part of a filter, you have to
> convert the PhraseQuery to a Filter: new QueryWrapperFilter(phrasequery).
> The
Hi All,
Am using lucene 4.10.4. Is it right to add analyzed multi valued fields
& phrase query for the same field in boolean filter. i believe we could not
apply analyzers to values in filters. So am not getting results for those
filters' match.
String phraseTerm = "hello world"
> Term term =
data. Since you're not sorting you should be
> OK with the caveat that searching for
> "02" won't match an indexed value of "2" unless you
> remove all leading zeros at both index and query
> time.
>
> Best,
> Erick
>
> On Tue, Oct 27, 2015 at
Yes Will, You are right. But i dont use "status" field for sorting. i have
other fields that is being used for sorting specifically. And so i dont
face any issues in sorting as of now.
--
Kumaran R
On Tue, Oct 27, 2015 at 7:20 PM, will wrote:
> Kumaran -
>
> Are
it is supported to have both Field & LongField with same field name in same
index. And also Please suggest me any articles discussing this kind of
problem.
Thanks :-)
Related links:
http://www.gossamer-threads.com/lists/lucene/java-user/109530
--
**
Kumaran R
Hi Sachin
i want to look into ur indexing code. please share it
-
Kumaran R
On Tue, Aug 19, 2014 at 7:18 PM, Sachin Kulkarni
wrote:
> Hi,
>
> Sorry for all the code, It got sent out accidentally.
>
> The following code is part of the Benchmark utility in Lucen
Hi Sachin Kulkarni,
If possible, Please share your code.
-
Kumaran R
On Tue, Aug 19, 2014 at 9:07 AM, Sachin Kulkarni
wrote:
> Hi,
>
> I am using Lucene 4.6.0.
>
> I have been storing 5 fields for my documents in the index, namely body,
> title, docname, docdate
Hi Gaurav
Thanks for the clarification. If possible, please share your NRT
manager API related code example. i believe, it will help me to understand
little better.
-
Kumaran R
On Tue, Aug 5, 2014 at 12:39 PM, Gaurav gupta
wrote:
> Thanks Kumaran and Erik for resolving my quer
on this )
2. Merge will be taken care by default values( merge factor 2) of
lucene. If u need to control more on merge policy, please go through
about merge by size or by number of segments or many merge policies.
Hope this will help you a little bit.
--
Kumaran R
Sent from Phone
> On 04-
You should know two things to get this.
1.Indexed fields can be searched.
2.Stored fields can be fetched.
Check your code whether you are storing all fields.
--
Kumaran R
Sent from Phone
> On 04-Aug-2014, at 7:13 pm, Sachin Kulkarni wrote:
>
> Hi,
>
> I am using lucene 4
limitations and drawbacks in using hash values in lucene index.
-
Kumaran R
Provide some more information like lucene version, sample code,
parameters involved in indexing and searching.
--
Kumaran R
> On 18-Jul-2014, at 6:52 pm, Priyanka Tufchi
> wrote:
>
> Hi All
>
> I am matching and ranking two set of Docs using apache lucene and I passes
> p
Hi David
Thanks for your response. Actually i dont know where to post. Thats why.
i will make sure sending to one list in future. could you please direct me
on my question?
- Kumaran
> You'll be lucky to get help with the spatial module from Lucene 3.x, which
> is what you are
pache.lucene.search.IndexSearcher.searchWithFilter(IndexSearcher.java:258)
> at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:218) at
> org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:199) at
> org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:177) at
> org.apache.lucene.search.Searcher.search(Searcher.java:49) at
> com.zoho.training.RadialSearch.search(RadialSearch.java:246) at
> com.zoho.training.RadialSearch.main(RadialSearch.java:281)
Kumaran R
Am using lucene 3.0.3. i have two filters (BooleanFilter and
DistanceFilter).what is the best way to do logical operations between these
two filters?
BooleanFilter: ( OR,AND filters using FilterClause )
BooleanFilter boolFilter = new BooleanFilter();
Term term = new Term("city", "texas");
stackoverflow.com/questions/22929423/how-lucene-handles-sortfield-with-comparator-if-sortfield-is
Kumaran R
pache.lucene.search.IndexSearcher.searchWithFilter(IndexSearcher.java:258)
> at org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:218) at
> org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:199) at
> org.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:177) at
> org.apache.lucene.search.Searcher.search(Searcher.java:49) at
> com.zoho.training.RadialSearch.search(RadialSearch.java:246) at
> com.zoho.training.RadialSearch.main(RadialSearch.java:281)
Kumaran R
Hi,
I've indexed Book Title,Author Name,Contents and some other fields.
Previously I gave option to search string in any of those fields and I
displayed results from getting fields "Title","Author Name","Contents" from
hits resulted docs.
Now I want to display "Title" & "Author Name" list w
Hi,
Does Lucene search FSDirectory as well as buffered in-memory docs while
we are calling searcher.search(query)?
Why I'm asking this is, I've indexed my doc with mergeFactor &
Max.Buff.Docs = 50 and I've optimized and closed it at mid-night
only.Beforeoptimization, my search gives partial
all your indexes? That would help folks
give you better responses and perhaps suggest other ways of
solving your problem.
Erick
On 3/14/07, senthil kumaran <[EMAIL PROTECTED]> wrote:
>
> Hi.
> I have more index directories (>6) all in GB,and searching my query
> with
>
Hi.
I have more index directories (>6) all in GB,and searching my query with
single IndexSearcher to all indexes one after another.i.e. I create one
IndexSearcher for index1 and search over that.Finally I close that and
create new IndexSearcher for index2 and so on. If i get 200 total results
Hi,
I've indexed 4 among 5 fields with Field.Store.YES & Field.Index.NO. And
indexed the remaining one, say it's Field Name is *content*, with
Field.Store.YES & Field.Index.Tokenized(It's value is collective value of
other 4 fields and some more values).So my search always based on
*content*fie
85 matches
Mail list logo