[jira] [Commented] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number
    [ https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13792742#comment-13792742 ]

Lakshmi Venkataswamy commented on SOLR-4824:

I have tested version 4.5.0 and observed the same behavior, so we are staying with 3.6 in production for now.

Fuzzy / Faceting results are changed after ingestion of documents past a certain number

                Key: SOLR-4824
                URL: https://issues.apache.org/jira/browse/SOLR-4824
            Project: Solr
         Issue Type: Bug
   Affects Versions: 4.2, 4.3
        Environment: Ubuntu 12.04 LTS 12.04.2
                     jre1.7.0_17
                     jboss-as-7.1.1.Final
           Reporter: Lakshmi Venkataswamy

In upgrading from SOLR 3.6 to 4.2/4.3 and comparing results on fuzzy queries, I found that after a certain number of documents were ingested the fuzzy query had a drastically lower number of results. We have approximately 18,000 documents per day, and after ingesting approximately 40 days of documents, the next incremental day of documents results in fewer results for a fuzzy search.

The query:

    http://10.100.1.xx:8080/solr/corex/select?q=cc:worde~1&facet=on&facet.field=date&fl=date&facet.sort

produces the following result before the threshold is crossed:

    <response>
    <lst name="responseHeader">
      <int name="status">0</int>
      <int name="QTime">2349</int>
      <lst name="params">
        <str name="facet">on</str>
        <str name="fl">date</str>
        <str name="facet.sort"/>
        <str name="q">cc:worde~1</str>
        <str name="facet.field">date</str>
      </lst>
    </lst>
    <result name="response" numFound="362803" start="0"></result>
    <lst name="facet_counts">
      <lst name="facet_queries"/>
      <lst name="facet_fields">
        <lst name="date">
          <int name="2012-12-31">2866</int>
          <int name="2013-01-01">11372</int>
          <int name="2013-01-02">11514</int>
          <int name="2013-01-03">12015</int>
          <int name="2013-01-04">11746</int>
          <int name="2013-01-05">10853</int>
          <int name="2013-01-06">11053</int>
          <int name="2013-01-07">11815</int>
          <int name="2013-01-08">11427</int>
          <int name="2013-01-09">11475</int>
          <int name="2013-01-10">11461</int>
          <int name="2013-01-11">12058</int>
          <int name="2013-01-12">11335</int>
          <int name="2013-01-13">12039</int>
          <int name="2013-01-14">12064</int>
          <int name="2013-01-15">12234</int>
          <int name="2013-01-16">12545</int>
          <int name="2013-01-17">11766</int>
          <int name="2013-01-18">12197</int>
          <int name="2013-01-19">11414</int>
          <int name="2013-01-20">11633</int>
          <int name="2013-01-21">12863</int>
          <int name="2013-01-22">12378</int>
          <int name="2013-01-23">11947</int>
          <int name="2013-01-24">11822</int>
          <int name="2013-01-25">11882</int>
          <int name="2013-01-26">10474</int>
          <int name="2013-01-27">11051</int>
          <int name="2013-01-28">11776</int>
          <int name="2013-01-29">11957</int>
          <int name="2013-01-30">11260</int>
          <int name="2013-01-31">8511</int>
        </lst>
      </lst>
      <lst name="facet_dates"/>
      <lst name="facet_ranges"/>
    </lst>
    </response>

Once the threshold of 40 days of ingested documents is crossed, the results drop as shown below for the same query:

    <response>
    <lst name="responseHeader">
      <int name="status">0</int>
      <int name="QTime">2</int>
      <lst name="params">
        <str name="facet">on</str>
        <str name="fl">date</str>
        <str name="facet.sort"/>
        <str name="q">cc:worde~1</str>
        <str name="facet.field">date</str>
      </lst>
    </lst>
    <result name="response" numFound="1338" start="0"></result>
    <lst name="facet_counts">
      <lst name="facet_queries"/>
      <lst name="facet_fields">
        <lst name="date">
          <int name="2012-12-31">0</int>
          <int name="2013-01-01">41</int>
          <int name="2013-01-02">21</int>
          <int name="2013-01-03">24</int>
          <int name="2013-01-04">19</int>
          <int name="2013-01-05">9</int>
          <int name="2013-01-06">11</int>
          <int name="2013-01-07">17</int>
          <int name="2013-01-08">14</int>
          <int name="2013-01-09">24</int>
          <int name="2013-01-10">43</int>
          <int name="2013-01-11">14</int>
          <int name="2013-01-12">52</int>
          <int name="2013-01-13">57</int>
          <int name="2013-01-14">25</int>
          <int name="2013-01-15">17</int>
          <int name="2013-01-16">34</int>
          <int name="2013-01-17">11</int>
          <int name="2013-01-18">16</int>
          <int name="2013-01-19">121</int>
          <int name="2013-01-20">33</int>
          <int name="2013-01-21">26</int>
          <int name="2013-01-22">59</int>
          <int name="2013-01-23">27</int>
          <int name="2013-01-24">10</int>
          <int name="2013-01-25">9</int>
          <int name="2013-01-26">6</int>
          <int name="2013-01-27">16</int>
          <int name="2013-01-28">11</int>
          <int name="2013-01-29">15</int>
          <int name="2013-01-30">21</int>
          <int name="2013-01-31">109</int>
          <int name="2013-02-01">11</int>
          <int name="2013-02-02">7</int>
          <int name="2013-02-03">10</int>
          <int name="2013-02-04">8</int>
          <int name="2013-02-05">13</int>
          <int name="2013-02-06">75</int>
          <int name="2013-02-07">77</int>
          <int name="2013-02-08">31</int>
          <int name="2013-02-09">35</int>
          <int name="2013-02-10">22</int>
          <int name="2013-02-11">18</int>
          <int name="2013-02-12">11</int>
          <int name="2013-02-13">68</int>
          <int name="2013-02-14">40</int>
        </lst>
      </lst>
      <lst name="facet_dates"/>
      <lst name="facet_ranges"/>
    </lst>
    </response>

I have also tested this with different months of data and have seen the same issue at around the same number of documents.

--
This message was sent by Atlassian JIRA
(v6.1#6144)

To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
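As a quick sanity check on the two responses quoted in the report, the per-day facet counts can be added up and compared with `numFound`. The sketch below is only an illustration: the lists are transcribed from the report, the helper names are made up for this example, and the check assumes every matching document carries exactly one `date` value.

```python
# Per-day facet counts transcribed from the two responses in the report,
# in the order listed (starting at 2012-12-31).
before = [
    2866, 11372, 11514, 12015, 11746, 10853, 11053, 11815,
    11427, 11475, 11461, 12058, 11335, 12039, 12064, 12234,
    12545, 11766, 12197, 11414, 11633, 12863, 12378, 11947,
    11822, 11882, 10474, 11051, 11776, 11957, 11260, 8511,
]
after = [
    0, 41, 21, 24, 19, 9, 11, 17, 14, 24, 43, 14, 52, 57, 25, 17,
    34, 11, 16, 121, 33, 26, 59, 27, 10, 9, 6, 16, 11, 15, 21, 109,
    11, 7, 10, 8, 13, 75, 77, 31, 35, 22, 18, 11, 68, 40,
]


def facets_match_num_found(counts, num_found):
    """True when the per-day facet counts add up to the reported numFound."""
    return sum(counts) == num_found


def shrunk_days(old, new):
    """Number of days present in both responses whose count went down."""
    return sum(1 for o, n in zip(old, new) if n < o)


print(facets_match_num_found(before, 362803))
print(facets_match_num_found(after, 1338))
print(shrunk_days(before, after), "of", len(before), "previously indexed days shrank")
```

Both responses are internally consistent under that assumption, and the drop affects days that were already indexed before the extra documents were added, not only the newly ingested days.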
[jira] [Commented] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number
    [ https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13664179#comment-13664179 ]

Lakshmi Venkataswamy commented on SOLR-4824:

That makes sense. As a test, I restricted the search query to only 30 days of data after I had ingested the additional 11 days. This should have returned the same 362,803 results as before, but it did not: I got 1,263 results.

I also noticed something else. We have a production system that is using Solr 3.5. I also have test systems on Solr 3.6.2 and Solr 4.3 that use a smaller subset of production data. The physical size of the index is very different in 4.3 for the same data, number of fields, configuration, etc.:

Solr 3.5    averages 150 KB / document
Solr 3.6.2  averages 148 KB / document
Solr 4.3    averages  75 KB / document
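The comment does not say how the query was restricted to 30 days. One possible way, sketched here purely as an assumption, is to add a filter query (`fq`) with a range on the `date` field. The function name and the example dates are hypothetical, the range syntax assumes the `date` field accepts values in the `YYYY-MM-DD` form shown in the facet output, and the empty `facet.sort` parameter from the original URL is left out for brevity.

```python
from urllib.parse import urlencode, urlsplit, parse_qs


def build_select_url(base, query, first_day=None, last_day=None):
    """Build a select URL for a fuzzy query faceted on the date field.

    When both days are given, a filter query restricts matches to that
    inclusive date range (an assumed way of limiting the search to 30 days).
    """
    params = [
        ("q", query),
        ("facet", "on"),
        ("facet.field", "date"),
        ("fl", "date"),
    ]
    if first_day and last_day:
        params.append(("fq", f"date:[{first_day} TO {last_day}]"))
    # urlencode percent-encodes reserved characters such as ':' and spaces.
    return base + "?" + urlencode(params)


# Host and core name are the placeholders used in the report.
base = "http://10.100.1.xx:8080/solr/corex/select"
url = build_select_url(base, "cc:worde~1", "2013-01-01", "2013-01-30")
print(url)

# Decode the query string again to confirm the parameters survived encoding.
print(parse_qs(urlsplit(url).query))
```

Running the restricted and unrestricted forms against the same index makes it easy to compare `numFound` for an identical set of days before and after further ingestion.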
[jira] [Created] (SOLR-4824) Faceting results are changed after ingestion of documents past a certain number
Lakshmi Venkataswamy created SOLR-4824:

            Summary: Faceting results are changed after ingestion of documents past a certain number
                Key: SOLR-4824
                URL: https://issues.apache.org/jira/browse/SOLR-4824
            Project: Solr
         Issue Type: Bug
   Affects Versions: 4.3, 4.2
        Environment: Ubuntu 12.04 LTS 12.04.2
                     jre1.7.0_17
                     jboss-as-7.1.1.Final
           Reporter: Lakshmi Venkataswamy

In upgrading from SOLR 3.6 to 4.2/4.3 and comparing results on fuzzy queries, I found that after a certain number of documents were ingested the fuzzy query had a drastically lower number of results. We have approximately 18,000 documents per day, and after ingesting approximately 40 days of documents, the next incremental day of documents results in fewer results for a fuzzy search.

The query:

    http://10.100.1.48:8080/solr/coreTV3/select?q=cc:worde~1&facet=on&facet.field=date&fl=date&facet.sort

produces the following result before the threshold is crossed:

    <response>
    <lst name="responseHeader">
      <int name="status">0</int>
      <int name="QTime">2349</int>
      <lst name="params">
        <str name="facet">on</str>
        <str name="fl">date</str>
        <str name="facet.sort"/>
        <str name="q">cc:worde~1</str>
        <str name="facet.field">date</str>
      </lst>
    </lst>
    <result name="response" numFound="362803" start="0"></result>
    <lst name="facet_counts">
      <lst name="facet_queries"/>
      <lst name="facet_fields">
        <lst name="date">
          <int name="2012-12-31">2866</int>
          <int name="2013-01-01">11372</int>
          <int name="2013-01-02">11514</int>
          <int name="2013-01-03">12015</int>
          <int name="2013-01-04">11746</int>
          <int name="2013-01-05">10853</int>
          <int name="2013-01-06">11053</int>
          <int name="2013-01-07">11815</int>
          <int name="2013-01-08">11427</int>
          <int name="2013-01-09">11475</int>
          <int name="2013-01-10">11461</int>
          <int name="2013-01-11">12058</int>
          <int name="2013-01-12">11335</int>
          <int name="2013-01-13">12039</int>
          <int name="2013-01-14">12064</int>
          <int name="2013-01-15">12234</int>
          <int name="2013-01-16">12545</int>
          <int name="2013-01-17">11766</int>
          <int name="2013-01-18">12197</int>
          <int name="2013-01-19">11414</int>
          <int name="2013-01-20">11633</int>
          <int name="2013-01-21">12863</int>
          <int name="2013-01-22">12378</int>
          <int name="2013-01-23">11947</int>
          <int name="2013-01-24">11822</int>
          <int name="2013-01-25">11882</int>
          <int name="2013-01-26">10474</int>
          <int name="2013-01-27">11051</int>
          <int name="2013-01-28">11776</int>
          <int name="2013-01-29">11957</int>
          <int name="2013-01-30">11260</int>
          <int name="2013-01-31">8511</int>
        </lst>
      </lst>
      <lst name="facet_dates"/>
      <lst name="facet_ranges"/>
    </lst>
    </response>

Once the threshold of 40 days of ingested documents is crossed, the results drop as shown below for the same query:

    <response>
    <lst name="responseHeader">
      <int name="status">0</int>
      <int name="QTime">2</int>
      <lst name="params">
        <str name="facet">on</str>
        <str name="fl">date</str>
        <str name="facet.sort"/>
        <str name="q">cc:worde~1</str>
        <str name="facet.field">date</str>
      </lst>
    </lst>
    <result name="response" numFound="1338" start="0"></result>
    <lst name="facet_counts">
      <lst name="facet_queries"/>
      <lst name="facet_fields">
        <lst name="date">
          <int name="2012-12-31">0</int>
          <int name="2013-01-01">41</int>
          <int name="2013-01-02">21</int>
          <int name="2013-01-03">24</int>
          <int name="2013-01-04">19</int>
          <int name="2013-01-05">9</int>
          <int name="2013-01-06">11</int>
          <int name="2013-01-07">17</int>
          <int name="2013-01-08">14</int>
          <int name="2013-01-09">24</int>
          <int name="2013-01-10">43</int>
          <int name="2013-01-11">14</int>
          <int name="2013-01-12">52</int>
          <int name="2013-01-13">57</int>
          <int name="2013-01-14">25</int>
          <int name="2013-01-15">17</int>
          <int name="2013-01-16">34</int>
          <int name="2013-01-17">11</int>
          <int name="2013-01-18">16</int>
          <int name="2013-01-19">121</int>
          <int name="2013-01-20">33</int>
          <int name="2013-01-21">26</int>
          <int name="2013-01-22">59</int>
          <int name="2013-01-23">27</int>
          <int name="2013-01-24">10</int>
          <int name="2013-01-25">9</int>
          <int name="2013-01-26">6</int>
          <int name="2013-01-27">16</int>
          <int name="2013-01-28">11</int>
          <int name="2013-01-29">15</int>
          <int name="2013-01-30">21</int>
          <int name="2013-01-31">109</int>
          <int name="2013-02-01">11</int>
          <int name="2013-02-02">7</int>
          <int name="2013-02-03">10</int>
          <int name="2013-02-04">8</int>
          <int name="2013-02-05">13</int>
          <int name="2013-02-06">75</int>
          <int name="2013-02-07">77</int>
          <int name="2013-02-08">31</int>
          <int name="2013-02-09">35</int>
          <int name="2013-02-10">22</int>
          <int name="2013-02-11">18</int>
          <int name="2013-02-12">11</int>
          <int name="2013-02-13">68</int>
          <int name="2013-02-14">40</int>
        </lst>
      </lst>
      <lst name="facet_dates"/>
      <lst name="facet_ranges"/>
    </lst>
    </response>

I have also tested this with different months of data and have seen the same issue at around the same number of documents.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org
[jira] [Updated] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number
    [ https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lakshmi Venkataswamy updated SOLR-4824:

    Summary: Fuzzy / Faceting results are changed after ingestion of documents past a certain number  (was: Faceting results are changed after ingestion of documents past a certain number)
[jira] [Updated] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number
    [ https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Lakshmi Venkataswamy updated SOLR-4824:

    Description:
In upgrading from SOLR 3.6 to 4.2/4.3 and comparing results on fuzzy queries, I found that after a certain number of documents were ingested the fuzzy query had drastically lower number of results. We have approximately 18,000 documents per day and after ingesting approximately 40 days of documents, the next incremental day of documents results in a lower number of results of a fuzzy search.

The query: http://10.100.1.xx:8080/solr/corex/select?q=cc:worde~1&facet=on&facet.field=date&fl=date&facet.sort

    was:
In upgrading from SOLR 3.6 to 4.2/4.3 I and comparing results on fuzzy queries, I found that after a certain number of documents were ingested the fuzzy query has drastically lower number of results. We have approximately 18,000 documents per day and after ingesting approximately 40 days of documents, the next incremental day of documents results in a lower number of results of a fuzzy search.

The query: http://10.100.1.48:8080/solr/coreTV3/select?q=cc:worde~1&facet=on&facet.field=date&fl=date&facet.sort
[jira] [Commented] (SOLR-4824) Fuzzy / Faceting results are changed after ingestion of documents past a certain number
    [ https://issues.apache.org/jira/browse/SOLR-4824?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13658732#comment-13658732 ]

Lakshmi Venkataswamy commented on SOLR-4824:

Not sure I understand. When I have 30 days of data I get 362,803 results. When I add another 11 days' worth of data, the same search returns 1,338 results. Even if there is a maximum limit, would I not see the results capped rather than dropping drastically?
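One way a limit could produce a drop instead of a cap is if it applies to the number of index terms a fuzzy query expands to, rather than to the number of hits. The toy model below only illustrates that idea. Nothing in this thread confirms that it is what Solr 4.x does; the function names, the sample terms, the limit of 3, and the alphabetical choice of which terms to keep are all assumptions made for the example.

```python
def within_one_edit(a, b):
    """True if a and b differ by at most one insertion, deletion or substitution."""
    if abs(len(a) - len(b)) > 1:
        return False
    if len(a) > len(b):
        a, b = b, a  # make a the shorter (or equal-length) string
    i = j = edits = 0
    while i < len(a) and j < len(b):
        if a[i] == b[j]:
            i += 1
            j += 1
            continue
        edits += 1
        if edits > 1:
            return False
        if len(a) == len(b):
            i += 1  # substitution consumes a character from both strings
        j += 1      # insertion consumes a character from the longer string only
    # Any unconsumed tail counts as further edits.
    return edits + (len(a) - i) + (len(b) - j) <= 1


def fuzzy_hits(term_doc_counts, query, max_terms):
    """Toy fuzzy search: expand to at most max_terms index terms, then count docs.

    Assumes each document contains at most one of the expanded terms, and
    keeps the alphabetically first candidates when there are too many.
    """
    candidates = sorted(t for t in term_doc_counts if within_one_edit(t, query))
    kept = candidates[:max_terms]
    return sum(term_doc_counts[t] for t in kept)


# Hypothetical index: term -> number of documents containing it.
small_index = {"word": 1000, "words": 500, "worde": 3, "other": 900}
# Ingesting more documents adds a few rare near-miss terms (for example typos).
large_index = dict(small_index, aorde=1, borde=1, corde=2)

print(fuzzy_hits(small_index, "worde", max_terms=3))
print(fuzzy_hits(large_index, "worde", max_terms=3))
```

In this model the larger index returns far fewer hits, because the rare new terms crowd the frequent ones out of the fixed-size expansion. That would also be consistent with the drop appearing on days that had already been indexed.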