Re: Analyzing .pos files using luke

2016-07-11 Thread KNitin
Erick, 1. Yes both indices are optimized. They use lucene_40 version for their solr indices. 2. I discovered a bloat on one of the index. I am trying to figure out what might have caused the bloat. There are some schema changes on the bloated one but I was wondering if there is way to identify som

Re: Analyzing .pos files using luke

2016-07-11 Thread Erick Erickson
Why do you care? Is there a specific problem you're trying to diagnose? Since the merge logic wasn't identical (almost guaranteed), the sizes of the files are unreliable since they include deleted documents which are compacted on merging. So, you can try an optimize and compare immediately afterwa

Analyzing .pos files using luke

2016-07-10 Thread KNitin
Hi, I am trying to diff between 2 versions of solr index. Both the indices have similar .doc, .pay file sizes but their .pos files are extremely different. How do i dig deeper to understand what could be causing this difference? Is there a way to just open/analyze .pos file/compare 2 .pos files