[jira] [Commented] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number

2013-10-11 Thread Lakshmi Venkataswamy (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13792742#comment-13792742
 ] 

Lakshmi Venkataswamy commented on SOLR-4824:


I have tested 4.5.0 version and the same behavior has been observed.  So we are 
staying with 3.6 in production for now.

 Fuzzy / Faceting results are changed after ingestion of documents past a 
 certain number 
 

 Key: SOLR-4824
 URL: https://issues.apache.org/jira/browse/SOLR-4824
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2, 4.3
 Environment: Ubuntu 12.04 LTS 12.04.2 
 jre1.7.0_17
 jboss-as-7.1.1.Final
Reporter: Lakshmi Venkataswamy

 In upgrading from SOLR 3.6 to 4.2/4.3 and comparing results on fuzzy queries, 
 I found that after a certain number of documents were ingested the fuzzy 
 query had drastically lower number of results.  We have approximately 18,000 
 documents per day and after ingesting approximately 40 days of documents, the 
 next incremental day of documents results in a lower number of results of a 
 fuzzy search.
 The query :  
 http://10.100.1.xx:8080/solr/corex/select?q=cc:worde~1facet=onfacet.field=datefl=datefacet.sort
 produces the following result before the threshold is crossed
 responselst name=responseHeader
 int name=status0/intint name=QTime2349/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/
 str name=qcc:worde~1/strstr 
 name=facet.fielddate/str/lst/lstresult name=response 
 numFound=362803 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-312866/int
 int name=2013-01-0111372/int
 int name=2013-01-0211514/int
 int name=2013-01-0312015/int
 int name=2013-01-0411746/int
 int name=2013-01-0510853/int
 int name=2013-01-0611053/int
 int name=2013-01-0711815/int
 int name=2013-01-0811427/int
 int name=2013-01-0911475/int
 int name=2013-01-1011461/int
 int name=2013-01-1112058/int
 int name=2013-01-1211335/int
 int name=2013-01-1312039/int
 int name=2013-01-1412064/int
 int name=2013-01-1512234/int
 int name=2013-01-1612545/int
 int name=2013-01-1711766/int
 int name=2013-01-1812197/int
 int name=2013-01-1911414/int
 int name=2013-01-2011633/int
 int name=2013-01-2112863/int
 int name=2013-01-2212378/int
 int name=2013-01-2311947/int
 int name=2013-01-2411822/int
 int name=2013-01-2511882/int
 int name=2013-01-2610474/int
 int name=2013-01-2711051/int
 int name=2013-01-2811776/int
 int name=2013-01-2911957/int
 int name=2013-01-3011260/int
 int name=2013-01-318511/int
 /lst/lstlst name=facet_dates/lst 
 name=facet_ranges//lst/response
 Once the 40 days of documents ingested threshold is crossed the results drop 
 as show below for the same query
 responselst name=responseHeader
 int name=status0/intint name=QTime2/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/str 
 name=qcc:worde~1/strstr name=facet.fielddate/str/lst/lst
 result name=response numFound=1338 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-310/int
 int name=2013-01-0141/int
 int name=2013-01-0221/int
 int name=2013-01-0324/int
 int name=2013-01-0419/int
 int name=2013-01-059/int
 int name=2013-01-0611/int
 int name=2013-01-0717/int
 int name=2013-01-0814/int
 int name=2013-01-0924/int
 int name=2013-01-1043/int
 int name=2013-01-1114/int
 int name=2013-01-1252/int
 int name=2013-01-1357/int
 int name=2013-01-1425/int
 int name=2013-01-1517/int
 int name=2013-01-1634/int
 int name=2013-01-1711/int
 int name=2013-01-1816/int
 int name=2013-01-19121/int
 int name=2013-01-2033/int
 int name=2013-01-2126/int
 int name=2013-01-2259/int
 int name=2013-01-2327/int
 int name=2013-01-2410/int
 int name=2013-01-259/int
 int name=2013-01-266/int
 int name=2013-01-2716/int
 int name=2013-01-2811/int
 int name=2013-01-2915/int
 int name=2013-01-3021/int
 int name=2013-01-31109/int
 int name=2013-02-0111/int
 int name=2013-02-027/int
 int name=2013-02-0310/int
 int name=2013-02-048/int
 int name=2013-02-0513/int
 int name=2013-02-0675/int
 int name=2013-02-0777/int
 int name=2013-02-0831/int
 int name=2013-02-0935/int
 int name=2013-02-1022/int
 int name=2013-02-1118/int
 int name=2013-02-1211/int
 int name=2013-02-1368/int
 int name=2013-02-1440/int
 /lst/lstlst name=facet_dates/lst 
 name=facet_ranges//lst/response
 I have also tested this with different months of data and have seen the same 
 issue  around the number of documents.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Commented] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number

2013-05-22 Thread Lakshmi Venkataswamy (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13664179#comment-13664179
 ] 

Lakshmi Venkataswamy commented on SOLR-4824:


That makes sense.  So as a test I tried to restrict the search query to only 30 
days of data after I had ingested the additional 11 days.  This should have 
returned the same number 362,803 as before but it did not.  I got 1263 results. 

I also noticed something else.  We have a production system that is using Solr 
3.5.  I also have test systems on Solr 3.6.2 and Solr 4.3 using a smaller 
subset of production data.  The physical size of the index is very different in 
4.3 for the same data , number of fields, configuration etc.

Solr 3.5 Averages 150 Kb / document 
Solr 3.6.2   Averages 148 KB / document
Solr 4.3 Averages 75 KB / document


 Fuzzy / Faceting results are changed after ingestion of documents past a 
 certain number 
 

 Key: SOLR-4824
 URL: https://issues.apache.org/jira/browse/SOLR-4824
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2, 4.3
 Environment: Ubuntu 12.04 LTS 12.04.2 
 jre1.7.0_17
 jboss-as-7.1.1.Final
Reporter: Lakshmi Venkataswamy

 In upgrading from SOLR 3.6 to 4.2/4.3 and comparing results on fuzzy queries, 
 I found that after a certain number of documents were ingested the fuzzy 
 query had drastically lower number of results.  We have approximately 18,000 
 documents per day and after ingesting approximately 40 days of documents, the 
 next incremental day of documents results in a lower number of results of a 
 fuzzy search.
 The query :  
 http://10.100.1.xx:8080/solr/corex/select?q=cc:worde~1facet=onfacet.field=datefl=datefacet.sort
 produces the following result before the threshold is crossed
 responselst name=responseHeader
 int name=status0/intint name=QTime2349/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/
 str name=qcc:worde~1/strstr 
 name=facet.fielddate/str/lst/lstresult name=response 
 numFound=362803 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-312866/int
 int name=2013-01-0111372/int
 int name=2013-01-0211514/int
 int name=2013-01-0312015/int
 int name=2013-01-0411746/int
 int name=2013-01-0510853/int
 int name=2013-01-0611053/int
 int name=2013-01-0711815/int
 int name=2013-01-0811427/int
 int name=2013-01-0911475/int
 int name=2013-01-1011461/int
 int name=2013-01-1112058/int
 int name=2013-01-1211335/int
 int name=2013-01-1312039/int
 int name=2013-01-1412064/int
 int name=2013-01-1512234/int
 int name=2013-01-1612545/int
 int name=2013-01-1711766/int
 int name=2013-01-1812197/int
 int name=2013-01-1911414/int
 int name=2013-01-2011633/int
 int name=2013-01-2112863/int
 int name=2013-01-2212378/int
 int name=2013-01-2311947/int
 int name=2013-01-2411822/int
 int name=2013-01-2511882/int
 int name=2013-01-2610474/int
 int name=2013-01-2711051/int
 int name=2013-01-2811776/int
 int name=2013-01-2911957/int
 int name=2013-01-3011260/int
 int name=2013-01-318511/int
 /lst/lstlst name=facet_dates/lst 
 name=facet_ranges//lst/response
 Once the 40 days of documents ingested threshold is crossed the results drop 
 as show below for the same query
 responselst name=responseHeader
 int name=status0/intint name=QTime2/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/str 
 name=qcc:worde~1/strstr name=facet.fielddate/str/lst/lst
 result name=response numFound=1338 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-310/int
 int name=2013-01-0141/int
 int name=2013-01-0221/int
 int name=2013-01-0324/int
 int name=2013-01-0419/int
 int name=2013-01-059/int
 int name=2013-01-0611/int
 int name=2013-01-0717/int
 int name=2013-01-0814/int
 int name=2013-01-0924/int
 int name=2013-01-1043/int
 int name=2013-01-1114/int
 int name=2013-01-1252/int
 int name=2013-01-1357/int
 int name=2013-01-1425/int
 int name=2013-01-1517/int
 int name=2013-01-1634/int
 int name=2013-01-1711/int
 int name=2013-01-1816/int
 int name=2013-01-19121/int
 int name=2013-01-2033/int
 int name=2013-01-2126/int
 int name=2013-01-2259/int
 int name=2013-01-2327/int
 int name=2013-01-2410/int
 int name=2013-01-259/int
 int name=2013-01-266/int
 int name=2013-01-2716/int
 int name=2013-01-2811/int
 int name=2013-01-2915/int
 int name=2013-01-3021/int
 int name=2013-01-31109/int
 int name=2013-02-0111/int
 int name=2013-02-027/int
 int name=2013-02-0310/int
 int name=2013-02-048/int
 int name=2013-02-0513/int
 int name=2013-02-0675/int
 int name=2013-02-0777/int
 int name=2013-02-0831/int
 int name=2013-02-0935/int
 int name=2013-02-1022/int
 int 

[jira] [Created] (SOLR-4824) Faceting results are changed after ingestion of documents past a certain number

2013-05-15 Thread Lakshmi Venkataswamy (JIRA)
Lakshmi Venkataswamy created SOLR-4824:
--

 Summary: Faceting results are changed after ingestion of documents 
past a certain number 
 Key: SOLR-4824
 URL: https://issues.apache.org/jira/browse/SOLR-4824
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.3, 4.2
 Environment: Ubuntu 12.04 LTS 12.04.2 
jre1.7.0_17
jboss-as-7.1.1.Final
Reporter: Lakshmi Venkataswamy


In upgrading from SOLR 3.6 to 4.2/4.3 I and comparing results on fuzzy queries, 
I found that after a certain number of documents were ingested the fuzzy query 
has drastically lower number of results.  We have approximately 18,000 
documents per day and after ingesting approximately 40 days of documents, the 
next incremental day of documents results in a lower number of results of a 
fuzzy search.

The query :  
http://10.100.1.48:8080/solr/coreTV3/select?q=cc:worde~1facet=onfacet.field=datefl=datefacet.sort

produces the following result before the threshold is crossed

responselst name=responseHeader
int name=status0/intint name=QTime2349/intlst name=paramsstr 
name=faceton/strstr name=fldate/strstr name=facet.sort/
str name=qcc:worde~1/strstr 
name=facet.fielddate/str/lst/lstresult name=response 
numFound=362803 start=0/result
lst name=facet_countslst name=facet_queries/lst 
name=facet_fieldslst name=date
int name=2012-12-312866/int
int name=2013-01-0111372/int
int name=2013-01-0211514/int
int name=2013-01-0312015/int
int name=2013-01-0411746/int
int name=2013-01-0510853/int
int name=2013-01-0611053/int
int name=2013-01-0711815/int
int name=2013-01-0811427/int
int name=2013-01-0911475/int
int name=2013-01-1011461/int
int name=2013-01-1112058/int
int name=2013-01-1211335/int
int name=2013-01-1312039/int
int name=2013-01-1412064/int
int name=2013-01-1512234/int
int name=2013-01-1612545/int
int name=2013-01-1711766/int
int name=2013-01-1812197/int
int name=2013-01-1911414/int
int name=2013-01-2011633/int
int name=2013-01-2112863/int
int name=2013-01-2212378/int
int name=2013-01-2311947/int
int name=2013-01-2411822/int
int name=2013-01-2511882/int
int name=2013-01-2610474/int
int name=2013-01-2711051/int
int name=2013-01-2811776/int
int name=2013-01-2911957/int
int name=2013-01-3011260/int
int name=2013-01-318511/int
/lst/lstlst name=facet_dates/lst name=facet_ranges//lst/response

Once the 40 days of documents ingested threshold is crossed the results drop as 
show below for the same query

responselst name=responseHeader
int name=status0/intint name=QTime2/intlst name=paramsstr 
name=faceton/strstr name=fldate/strstr name=facet.sort/str 
name=qcc:worde~1/strstr name=facet.fielddate/str/lst/lst
result name=response numFound=1338 start=0/result
lst name=facet_countslst name=facet_queries/lst 
name=facet_fieldslst name=date
int name=2012-12-310/int
int name=2013-01-0141/int
int name=2013-01-0221/int
int name=2013-01-0324/int
int name=2013-01-0419/int
int name=2013-01-059/int
int name=2013-01-0611/int
int name=2013-01-0717/int
int name=2013-01-0814/int
int name=2013-01-0924/int
int name=2013-01-1043/int
int name=2013-01-1114/int
int name=2013-01-1252/int
int name=2013-01-1357/int
int name=2013-01-1425/int
int name=2013-01-1517/int
int name=2013-01-1634/int
int name=2013-01-1711/int
int name=2013-01-1816/int
int name=2013-01-19121/int
int name=2013-01-2033/int
int name=2013-01-2126/int
int name=2013-01-2259/int
int name=2013-01-2327/int
int name=2013-01-2410/int
int name=2013-01-259/int
int name=2013-01-266/int
int name=2013-01-2716/int
int name=2013-01-2811/int
int name=2013-01-2915/int
int name=2013-01-3021/int
int name=2013-01-31109/int
int name=2013-02-0111/int
int name=2013-02-027/int
int name=2013-02-0310/int
int name=2013-02-048/int
int name=2013-02-0513/int
int name=2013-02-0675/int
int name=2013-02-0777/int
int name=2013-02-0831/int
int name=2013-02-0935/int
int name=2013-02-1022/int
int name=2013-02-1118/int
int name=2013-02-1211/int
int name=2013-02-1368/int
int name=2013-02-1440/int
/lst/lstlst name=facet_dates/lst name=facet_ranges//lst/response

I have also tested this with different months of data and have seen the same 
issue  around the number of documents.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

-
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org



[jira] [Updated] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number

2013-05-15 Thread Lakshmi Venkataswamy (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lakshmi Venkataswamy updated SOLR-4824:
---

Summary: Fuzzy / Faceting results are changed after ingestion of documents 
past a certain number   (was: Faceting results are changed after ingestion of 
documents past a certain number )

 Fuzzy / Faceting results are changed after ingestion of documents past a 
 certain number 
 

 Key: SOLR-4824
 URL: https://issues.apache.org/jira/browse/SOLR-4824
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2, 4.3
 Environment: Ubuntu 12.04 LTS 12.04.2 
 jre1.7.0_17
 jboss-as-7.1.1.Final
Reporter: Lakshmi Venkataswamy

 In upgrading from SOLR 3.6 to 4.2/4.3 I and comparing results on fuzzy 
 queries, I found that after a certain number of documents were ingested the 
 fuzzy query has drastically lower number of results.  We have approximately 
 18,000 documents per day and after ingesting approximately 40 days of 
 documents, the next incremental day of documents results in a lower number of 
 results of a fuzzy search.
 The query :  
 http://10.100.1.48:8080/solr/coreTV3/select?q=cc:worde~1facet=onfacet.field=datefl=datefacet.sort
 produces the following result before the threshold is crossed
 responselst name=responseHeader
 int name=status0/intint name=QTime2349/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/
 str name=qcc:worde~1/strstr 
 name=facet.fielddate/str/lst/lstresult name=response 
 numFound=362803 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-312866/int
 int name=2013-01-0111372/int
 int name=2013-01-0211514/int
 int name=2013-01-0312015/int
 int name=2013-01-0411746/int
 int name=2013-01-0510853/int
 int name=2013-01-0611053/int
 int name=2013-01-0711815/int
 int name=2013-01-0811427/int
 int name=2013-01-0911475/int
 int name=2013-01-1011461/int
 int name=2013-01-1112058/int
 int name=2013-01-1211335/int
 int name=2013-01-1312039/int
 int name=2013-01-1412064/int
 int name=2013-01-1512234/int
 int name=2013-01-1612545/int
 int name=2013-01-1711766/int
 int name=2013-01-1812197/int
 int name=2013-01-1911414/int
 int name=2013-01-2011633/int
 int name=2013-01-2112863/int
 int name=2013-01-2212378/int
 int name=2013-01-2311947/int
 int name=2013-01-2411822/int
 int name=2013-01-2511882/int
 int name=2013-01-2610474/int
 int name=2013-01-2711051/int
 int name=2013-01-2811776/int
 int name=2013-01-2911957/int
 int name=2013-01-3011260/int
 int name=2013-01-318511/int
 /lst/lstlst name=facet_dates/lst 
 name=facet_ranges//lst/response
 Once the 40 days of documents ingested threshold is crossed the results drop 
 as show below for the same query
 responselst name=responseHeader
 int name=status0/intint name=QTime2/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/str 
 name=qcc:worde~1/strstr name=facet.fielddate/str/lst/lst
 result name=response numFound=1338 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-310/int
 int name=2013-01-0141/int
 int name=2013-01-0221/int
 int name=2013-01-0324/int
 int name=2013-01-0419/int
 int name=2013-01-059/int
 int name=2013-01-0611/int
 int name=2013-01-0717/int
 int name=2013-01-0814/int
 int name=2013-01-0924/int
 int name=2013-01-1043/int
 int name=2013-01-1114/int
 int name=2013-01-1252/int
 int name=2013-01-1357/int
 int name=2013-01-1425/int
 int name=2013-01-1517/int
 int name=2013-01-1634/int
 int name=2013-01-1711/int
 int name=2013-01-1816/int
 int name=2013-01-19121/int
 int name=2013-01-2033/int
 int name=2013-01-2126/int
 int name=2013-01-2259/int
 int name=2013-01-2327/int
 int name=2013-01-2410/int
 int name=2013-01-259/int
 int name=2013-01-266/int
 int name=2013-01-2716/int
 int name=2013-01-2811/int
 int name=2013-01-2915/int
 int name=2013-01-3021/int
 int name=2013-01-31109/int
 int name=2013-02-0111/int
 int name=2013-02-027/int
 int name=2013-02-0310/int
 int name=2013-02-048/int
 int name=2013-02-0513/int
 int name=2013-02-0675/int
 int name=2013-02-0777/int
 int name=2013-02-0831/int
 int name=2013-02-0935/int
 int name=2013-02-1022/int
 int name=2013-02-1118/int
 int name=2013-02-1211/int
 int name=2013-02-1368/int
 int name=2013-02-1440/int
 /lst/lstlst name=facet_dates/lst 
 name=facet_ranges//lst/response
 I have also tested this with different months of data and have seen the same 
 issue  around the number of documents.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number

2013-05-15 Thread Lakshmi Venkataswamy (JIRA)

 [ 
https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lakshmi Venkataswamy updated SOLR-4824:
---

Description: 
In upgrading from SOLR 3.6 to 4.2/4.3 and comparing results on fuzzy queries, I 
found that after a certain number of documents were ingested the fuzzy query 
had drastically lower number of results.  We have approximately 18,000 
documents per day and after ingesting approximately 40 days of documents, the 
next incremental day of documents results in a lower number of results of a 
fuzzy search.

The query :  
http://10.100.1.xx:8080/solr/corex/select?q=cc:worde~1facet=onfacet.field=datefl=datefacet.sort

produces the following result before the threshold is crossed

responselst name=responseHeader
int name=status0/intint name=QTime2349/intlst name=paramsstr 
name=faceton/strstr name=fldate/strstr name=facet.sort/
str name=qcc:worde~1/strstr 
name=facet.fielddate/str/lst/lstresult name=response 
numFound=362803 start=0/result
lst name=facet_countslst name=facet_queries/lst 
name=facet_fieldslst name=date
int name=2012-12-312866/int
int name=2013-01-0111372/int
int name=2013-01-0211514/int
int name=2013-01-0312015/int
int name=2013-01-0411746/int
int name=2013-01-0510853/int
int name=2013-01-0611053/int
int name=2013-01-0711815/int
int name=2013-01-0811427/int
int name=2013-01-0911475/int
int name=2013-01-1011461/int
int name=2013-01-1112058/int
int name=2013-01-1211335/int
int name=2013-01-1312039/int
int name=2013-01-1412064/int
int name=2013-01-1512234/int
int name=2013-01-1612545/int
int name=2013-01-1711766/int
int name=2013-01-1812197/int
int name=2013-01-1911414/int
int name=2013-01-2011633/int
int name=2013-01-2112863/int
int name=2013-01-2212378/int
int name=2013-01-2311947/int
int name=2013-01-2411822/int
int name=2013-01-2511882/int
int name=2013-01-2610474/int
int name=2013-01-2711051/int
int name=2013-01-2811776/int
int name=2013-01-2911957/int
int name=2013-01-3011260/int
int name=2013-01-318511/int
/lst/lstlst name=facet_dates/lst name=facet_ranges//lst/response

Once the 40 days of documents ingested threshold is crossed the results drop as 
show below for the same query

responselst name=responseHeader
int name=status0/intint name=QTime2/intlst name=paramsstr 
name=faceton/strstr name=fldate/strstr name=facet.sort/str 
name=qcc:worde~1/strstr name=facet.fielddate/str/lst/lst
result name=response numFound=1338 start=0/result
lst name=facet_countslst name=facet_queries/lst 
name=facet_fieldslst name=date
int name=2012-12-310/int
int name=2013-01-0141/int
int name=2013-01-0221/int
int name=2013-01-0324/int
int name=2013-01-0419/int
int name=2013-01-059/int
int name=2013-01-0611/int
int name=2013-01-0717/int
int name=2013-01-0814/int
int name=2013-01-0924/int
int name=2013-01-1043/int
int name=2013-01-1114/int
int name=2013-01-1252/int
int name=2013-01-1357/int
int name=2013-01-1425/int
int name=2013-01-1517/int
int name=2013-01-1634/int
int name=2013-01-1711/int
int name=2013-01-1816/int
int name=2013-01-19121/int
int name=2013-01-2033/int
int name=2013-01-2126/int
int name=2013-01-2259/int
int name=2013-01-2327/int
int name=2013-01-2410/int
int name=2013-01-259/int
int name=2013-01-266/int
int name=2013-01-2716/int
int name=2013-01-2811/int
int name=2013-01-2915/int
int name=2013-01-3021/int
int name=2013-01-31109/int
int name=2013-02-0111/int
int name=2013-02-027/int
int name=2013-02-0310/int
int name=2013-02-048/int
int name=2013-02-0513/int
int name=2013-02-0675/int
int name=2013-02-0777/int
int name=2013-02-0831/int
int name=2013-02-0935/int
int name=2013-02-1022/int
int name=2013-02-1118/int
int name=2013-02-1211/int
int name=2013-02-1368/int
int name=2013-02-1440/int
/lst/lstlst name=facet_dates/lst name=facet_ranges//lst/response

I have also tested this with different months of data and have seen the same 
issue  around the number of documents.

  was:
In upgrading from SOLR 3.6 to 4.2/4.3 I and comparing results on fuzzy queries, 
I found that after a certain number of documents were ingested the fuzzy query 
has drastically lower number of results.  We have approximately 18,000 
documents per day and after ingesting approximately 40 days of documents, the 
next incremental day of documents results in a lower number of results of a 
fuzzy search.

The query :  
http://10.100.1.48:8080/solr/coreTV3/select?q=cc:worde~1facet=onfacet.field=datefl=datefacet.sort

produces the following result before the threshold is crossed

responselst name=responseHeader
int name=status0/intint name=QTime2349/intlst name=paramsstr 
name=faceton/strstr name=fldate/strstr name=facet.sort/
str name=qcc:worde~1/strstr 
name=facet.fielddate/str/lst/lstresult name=response 
numFound=362803 start=0/result
lst name=facet_countslst name=facet_queries/lst 
name=facet_fieldslst name=date
int name=2012-12-312866/int
int name=2013-01-0111372/int
int name=2013-01-0211514/int
int 

[jira] [Commented] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number

2013-05-15 Thread Lakshmi Venkataswamy (JIRA)

[ 
https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13658732#comment-13658732
 ] 

Lakshmi Venkataswamy commented on SOLR-4824:


Not sure I understand.  When I have 30 days of data I get 362,803 results.  
When I add another 11 days worth of data the same search returns 1,338 results. 
 Even if there is a maximum limit would I not see a capping of the results as 
opposed to a drastic drop ?  

 Fuzzy / Faceting results are changed after ingestion of documents past a 
 certain number 
 

 Key: SOLR-4824
 URL: https://issues.apache.org/jira/browse/SOLR-4824
 Project: Solr
  Issue Type: Bug
Affects Versions: 4.2, 4.3
 Environment: Ubuntu 12.04 LTS 12.04.2 
 jre1.7.0_17
 jboss-as-7.1.1.Final
Reporter: Lakshmi Venkataswamy

 In upgrading from SOLR 3.6 to 4.2/4.3 and comparing results on fuzzy queries, 
 I found that after a certain number of documents were ingested the fuzzy 
 query had drastically lower number of results.  We have approximately 18,000 
 documents per day and after ingesting approximately 40 days of documents, the 
 next incremental day of documents results in a lower number of results of a 
 fuzzy search.
 The query :  
 http://10.100.1.xx:8080/solr/corex/select?q=cc:worde~1facet=onfacet.field=datefl=datefacet.sort
 produces the following result before the threshold is crossed
 responselst name=responseHeader
 int name=status0/intint name=QTime2349/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/
 str name=qcc:worde~1/strstr 
 name=facet.fielddate/str/lst/lstresult name=response 
 numFound=362803 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-312866/int
 int name=2013-01-0111372/int
 int name=2013-01-0211514/int
 int name=2013-01-0312015/int
 int name=2013-01-0411746/int
 int name=2013-01-0510853/int
 int name=2013-01-0611053/int
 int name=2013-01-0711815/int
 int name=2013-01-0811427/int
 int name=2013-01-0911475/int
 int name=2013-01-1011461/int
 int name=2013-01-1112058/int
 int name=2013-01-1211335/int
 int name=2013-01-1312039/int
 int name=2013-01-1412064/int
 int name=2013-01-1512234/int
 int name=2013-01-1612545/int
 int name=2013-01-1711766/int
 int name=2013-01-1812197/int
 int name=2013-01-1911414/int
 int name=2013-01-2011633/int
 int name=2013-01-2112863/int
 int name=2013-01-2212378/int
 int name=2013-01-2311947/int
 int name=2013-01-2411822/int
 int name=2013-01-2511882/int
 int name=2013-01-2610474/int
 int name=2013-01-2711051/int
 int name=2013-01-2811776/int
 int name=2013-01-2911957/int
 int name=2013-01-3011260/int
 int name=2013-01-318511/int
 /lst/lstlst name=facet_dates/lst 
 name=facet_ranges//lst/response
 Once the 40 days of documents ingested threshold is crossed the results drop 
 as show below for the same query
 responselst name=responseHeader
 int name=status0/intint name=QTime2/intlst name=paramsstr 
 name=faceton/strstr name=fldate/strstr name=facet.sort/str 
 name=qcc:worde~1/strstr name=facet.fielddate/str/lst/lst
 result name=response numFound=1338 start=0/result
 lst name=facet_countslst name=facet_queries/lst 
 name=facet_fieldslst name=date
 int name=2012-12-310/int
 int name=2013-01-0141/int
 int name=2013-01-0221/int
 int name=2013-01-0324/int
 int name=2013-01-0419/int
 int name=2013-01-059/int
 int name=2013-01-0611/int
 int name=2013-01-0717/int
 int name=2013-01-0814/int
 int name=2013-01-0924/int
 int name=2013-01-1043/int
 int name=2013-01-1114/int
 int name=2013-01-1252/int
 int name=2013-01-1357/int
 int name=2013-01-1425/int
 int name=2013-01-1517/int
 int name=2013-01-1634/int
 int name=2013-01-1711/int
 int name=2013-01-1816/int
 int name=2013-01-19121/int
 int name=2013-01-2033/int
 int name=2013-01-2126/int
 int name=2013-01-2259/int
 int name=2013-01-2327/int
 int name=2013-01-2410/int
 int name=2013-01-259/int
 int name=2013-01-266/int
 int name=2013-01-2716/int
 int name=2013-01-2811/int
 int name=2013-01-2915/int
 int name=2013-01-3021/int
 int name=2013-01-31109/int
 int name=2013-02-0111/int
 int name=2013-02-027/int
 int name=2013-02-0310/int
 int name=2013-02-048/int
 int name=2013-02-0513/int
 int name=2013-02-0675/int
 int name=2013-02-0777/int
 int name=2013-02-0831/int
 int name=2013-02-0935/int
 int name=2013-02-1022/int
 int name=2013-02-1118/int
 int name=2013-02-1211/int
 int name=2013-02-1368/int
 int name=2013-02-1440/int
 /lst/lstlst name=facet_dates/lst 
 name=facet_ranges//lst/response
 I have also tested this with different months of data and have seen the same 
 issue  around the number of documents.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your