jflex) plays a
vital role to escape characters to get an expected token while indexing.
Any help is much appreciated...!!!
--
Regards,
Chitra
etInstance(ICUFoldingFilter.class.
> getResourceAsStream("utr30.nrm"), "utr30", Normalizer2.Mode.COMPOSE);
Normalizer2 normalizeFilter = new FilteredNormalizer2(base, unicodeSet);
> TokenStream tok = new ICUNormalizer2Filter(tok, normalizeFilter);
Kindly help me to resolve this.
--
Regards,
Chitra
nsliterator rule formed right? Kindly look at the below code
TokenStream tok = new ICUTransformFilter(tok,
Transliterator.getInstance("Greek;
> Lower; NFD; [:Nonspacing Mark:] Remove; NFC;"));
Kindly help me to resolve this.
Regards,
Chitra
Hi,
I found the difference and understand the behavior of both
tokenizers appropriately.
Could you please suggest me which one is the better to use
ClassicTokenizer/StandardTokenizer?
--
Regards,
Chitra
20, 2017 at 12:10 AM, Robert Muir <rcm...@gmail.com> wrote:
> easy, don't use classictokenizer: use standardtokenizer instead.
>
> On Thu, Oct 19, 2017 at 9:37 AM, Chitra <chithu.r...@gmail.com> wrote:
> > Hi,
> > I indexed a term 'ⒶeŘꝋꝒɫⱯŋɇ' (aeropl
d void setReader(final Reader reader) throws
> IOException
> {
>
> src.setMaxTokenLength(ClassicAnalyzer.DEFAULT_MAX_TOKEN_LENGTH);
> super.setReader(reader);
> }
> };
> }
Am I missing anything? Is that expected behavior for my input or any reason
behind such abnormal behavior?
--
Regards,
Chitra
Hi all,
Any help would be greatly appreciated.
--
Regards,
Chitra
... It has internalization support for all
languages. Here, I am not sure whether it will break my existing language
behavior in the index.
Is there any way to make ICUFoldingFilter as language specific?
Kindly post your suggestions.
--
Regards,
Chitra
Kindly post your suggestions.
--
Regards,
Chitra
ated.
--
Regards,
Chitra
Hi Ahmet,
Thank you so much for the reply.
I have tried but it seems, ASCIIFoldingFilter is not supporting greek
accent characters and it supports only Latin like accent characters. Am I
missing anything?
Chitra
On Wed, Sep 27, 2017 at 5:47 PM, Ahmet Arslan <i
<http://www.unicode.org/charts/PDF/U0370.pdf> for providing accent
insensitive search...
Kindly suggest the better solution to achieve this...? Does
ICUFoldingFilter solve my use-case?
--
Regards,
Chitra
ery for multi-select faceting?? Both the classes are
inter-connected.
Kindly provide your valuable suggestions...
Thanks,
Chitra
each dim using the implementation that dim used (taxonomy, sorted
> set, or range). The TestRangeFacetCounts shows another example of
> this.
>
> Mike McCandless
>
> http://blog.mikemccandless.com
>
>
> On Sat, Feb 18, 2017 at 6:33 AM, Chitra R <chithu.r...@gmail.com> wrote:
>
y. So only it was throwing
an exception when I search through drillsideways search. Am I right or
missed anything?
Kindly help me to solve my issue.
Regards,
Chitra
On Sat, Feb 18, 2017 at 4:29 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> Hi,
>
> I think you are c
che.
>
> In case you want to warm the filesystem cache explicitly, which could be a
> good idea if you have plenty of filesystem cache for your index (ie. the
> unused memory of the system is larger than the index), you can look into
> using MMapDirectory.setPreload.
>
> Le ven.
entException: dimension "price" was not indexed
at org.apache.lucene.facet.sortedset.SortedSetDocValuesFacetCounts.
> getTopChildren(SortedSetDocValuesFacetCounts.java:91)
at org.apache.lucene.facet.MultiFacets.getAllDims(MultiFacets.java:74)
Am I did anything wrong???
Kindly post your suggestions.
T
Any suggestions Kindly help me to move forward.
Regards,
Chitra
On Wed, Feb 15, 2017 at 9:23 PM, Chitra R <chithu.r...@gmail.com> wrote:
> Hi,
> Thanks for the suggestion. But in the case of drill sideways
> search, retrieving allDimensions (using Facets.
is,
*At initial search, Whether the required page (os pages of Lucene index
files) for that field (a single field) will be loaded to filesystem cache
or all the fields info will be loaded to filesystem cache from disk?*
Regards,
Chitra
On Fri, Feb 17, 2017 at 7:05 PM, Adrien Grand <jpou...@gmail.com>
: *
- During the first search, I ran the Query Say (new TermQuery(new
Term("name","Chitra"))) and which yields 1 lakh documents as result. Time
taken for first search = 50 - 60 ms nearly.
- And for the second search, I ran the Query Say (new TermQuery(new
Term("ani
ll be used.
So how can I add path-traversed numericRanges?
Am I missed anything?
Kindly post your suggestions.
Regards,
Chitra
On Wed, Feb 15, 2017 at 3:28 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> Hi, have a look at the RangeFacetsExample.java under the lucene/demo
&g
is the better way to add pathtraversed ranges
( ie adding pathtraversed ranges in numericRangeQuery or
adding pathtraversed ranges in filter).??Or Any other solution.???
Thanks & Regards,
Chitra
Sent from my iPhone
-
To unsubsc
fic
> field which map per-segment ordinals to/from global ordinal space.
>
Thanks,
Chitra
Hi,
How can I improve the performance of SortedSetDocValuesReaderState
which is responsible for computing '$facets' global ordinals at search time?
Is there any better way to warm up this state instead of putting it in
cache?
Any help is much appreciated.
Thanks & Regards,
Chitra
On
because of old indexSearcher in DocValuesReaderState and
newIndexSearcher in topDocs computation)?
2. How can we maintain SortedSetDocValuesReaderState in cache?
Kindly post your suggestions.
Thanks & Regards,
Chitra
Thank you so much, Shai...
Chitra
On Wed, Nov 30, 2016 at 2:17 PM, Shai Erera <ser...@gmail.com> wrote:
> This feature is not available in Lucene currently, but it shouldn't be hard
> to add it. See Mike's comment here:
> http://blog.mikemccandless.com/2013/05/d
would like to implement it in numeric range faceting by splitting the
numeric values (getting from resulted documents) into ranges.
Chitra
On Wed, Nov 30, 2016 at 3:51 AM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> Doc values fields are never loaded into memory; at most s
(say a single
field which is used for sorting) load in memory?
Any help is much appreciated.
Regards,
Chitra
Hi,
When opening SortedSetDocValuesReaderState at search time, whether
the whole doc value files (.dvd & .dvm) information are loaded in memory or
specified field information(say $facets field) alone load in memory?
Any help is much appreciated.
Regards,
Chitra
On Tue, Nov 22,
.
Regards,
Chitra
Kindly post your suggestions.
Regards,
Chitra
On Sat, Nov 19, 2016 at 1:38 PM, Chitra R <chithu.r...@gmail.com> wrote:
> Hey, I got it clearly. Thank you so much. Could you please help us to
> implement it in our use case?
>
>
> In our case,
. And hope this
won't put overload at index time, also at search time this will be better.
Kindly post your suggestions.
Regards,
Chitra
On Fri, Nov 18, 2016 at 7:15 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> I think you've summed up exactly the differences!
>
suggestions...
Regards,
Chitra
On Thu, Nov 17, 2016 at 6:40 PM, Chitra R <chithu.r...@gmail.com> wrote:
> Okay. I agree with you, Taxonomy maintains and supports hierarchical
> facets during indexing. Hope hierarchical in the sense, we might index the
> field
> Publish date : 201
missed anything...
So What is the need of hierarchical facets? Could you please explain
it(hierarchical facets) in the real-world use case?
Regards,
Chitra
On Wed, Nov 16, 2016 at 7:36 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> You store dimension + string (a single v
ed search UI because we
don't "lose" the facet counts after drilling in) behind storing path and
dimension for the given SSDVF field? Else anything?
Regards,
Chitra
Hey, thank you so much for the fast response, I agree NRT refresh is
somewhat costly operations and this is the major pitfa
to store path and dimension in
SSDVF field?
Kindly post your suggestions.
Regards,
Chitra
On Sat, Nov 12, 2016 at 4:03 AM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> On Fri, Nov 11, 2016 at 5:21 AM, Chitra R <chithu.r...@gmail.com> wrote:
>
> >
this method concurrently?
Kindly post your suggestions.
Thanks,
Chitra
On Thu, Nov 10, 2016 at 4:34 PM, Shai Erera <ser...@gmail.com> wrote:
> Hi
>
> The reason IMO is historic - ES and Solr had faceting solutions before
> Lucene had it. There were discussions in the past abou
Hi,
Kindly post your suggestions..
Chitra
On Mon, Sep 26, 2016 at 3:48 PM, Chitra R <chithu.r...@gmail.com> wrote:
> Hi,
>
> Issues(LUCENE-4795): Add FacetsCollector based on SortedSetDocValues
>
> https://issues.apache.org/jira/browse/LUCENE-4795
>
>
&
.
Thanks in advance.
Regards,
Chitra
ur search speed will suffer _greatly_ as you get
> more and more documents in your corpus if you use only
> DocValues but don't have indexed="true" set.
>
>
> Best,
> Erick
>
> On Tue, May 31, 2016 at 10:51 PM, Chitra R <chithu.r...@gmail.com> wrote:
> &g
ocvalue fields? is range query
not possible?? or anyother?
Thanks,
Chitra
Hi,
Could you please explain how to store the SortedNumericDocValuesField
to include in the code?
Thanks,
Chitra
On Tue, May 31, 2016 at 3:02 PM, Chitra R <chithu.r...@gmail.com> wrote:
> Thanks.
>
> On Fri, May 27, 2016 at 6:02 PM, Adrien Grand <jpou..
ent lower level functionality like building
> a histogram of the prices of all matching documents, you would need to
> build a custom collector and use these sorted numeric doc values to get the
> prices.
>
> Le ven. 27 mai 2016 à 14:10, Chitra R <chithu.r...@gmail.com> a écrit :
>
&
arch(query, 10,sort);
>
> for (ScoreDoc scoreDoc : topdocs.scoreDocs) {
> doc = reader.document(scoreDoc.doc);
> System.out.println(scoreDoc);
> }
>
So what is the purpose of sortedDocValues obtained
from DocValues.getSortedNumeric(atomicReader, "Numericdoc_pri
Hi,
Actually I like to print the sorted numeric list from the
sortedDocValues and I dont know which api have to use . Could you please
help me to achieve this?
Thanks,
Chitra
On Thu, May 26, 2016 at 8:17 PM, Michael McCandless <
luc...@mikemccandless.com> wrote:
> This looks ab
doc_price", 15L));
..
for (AtomicReaderContext context : indexReader.leaves()) {
AtomicReader atomicReader = context.reader();
SortedNumericDocValues
sortedDocValues=DocValues.getSortedNumeric(atomicReader,
"Numericdoc_price");
}
Thanks,
Chitra
46 matches
Mail list logo