Huh, I didn't spot the where clause the first time around - I usually group all 
my indexes, has, set_property, where (etc) methods together in roughly that 
order.

Let's see if we can get the right value from Sphinx in a search:

 ThinkingSphinx.search_for_ids(
   :index       => 'asset_core',
   :max_matches => 100_000
 ).total_entries

 # vs  
 Asset.search_count
 # vs
 Asset.search_count(:max_matches => 100_000)

-- 
Pat

On 03/03/2011, at 5:30 AM, Jake wrote:

> For a moment just now, I thought it was the where statement in the
> define_index block, but I checked the db and all Assets have the
> 'approved' status. Here is the output from rake ts:index. It says it
> collected 1827 docs for asset_core, which is the right amount.
> 
> Generating Configuration to /Users/jakemack/Work/Repositories/git/
> RightsGenie/config/development.sphinx.conf
> Sphinx 0.9.9-release (r2117)
> Copyright (c) 2001-2009, Andrew Aksyonoff
> 
> using config file '/Users/jakemack/Work/Repositories/git/RightsGenie/
> config/development.sphinx.conf'...
> indexing index 'asset_core'...
> collected 1827 docs, 0.2 MB
> collected 3703 attr values
> sorted 0.0 Mvalues, 100.0% done
> sorted 0.7 Mhits, 98.2% done
> total 1827 docs, 151014 bytes
> total 0.894 sec, 168857 bytes/sec, 2042.87 docs/sec
> distributed index 'asset' can not be directly indexed; skipping.
> indexing index 'company_core'...
> collected 4 docs, 0.0 MB
> sorted 0.0 Mhits, 100.0% done
> total 4 docs, 81 bytes
> total 0.044 sec, 1840 bytes/sec, 90.87 docs/sec
> distributed index 'company' can not be directly indexed; skipping.
> indexing index 'person_core'...
> collected 6 docs, 0.0 MB
> sorted 0.0 Mhits, 100.0% done
> total 6 docs, 74 bytes
> total 0.017 sec, 4197 bytes/sec, 340.32 docs/sec
> distributed index 'person' can not be directly indexed; skipping.
> total 16 reads, 0.009 sec, 267.9 kb/call avg, 0.5 msec/call avg
> total 40 writes, 0.011 sec, 264.9 kb/call avg, 0.2 msec/call avg
> rotating indices: succesfully sent SIGHUP to searchd (pid=28312).
> 
> 
> On Mar 2, 2:00 am, Pat Allan <[email protected]> wrote:
>> Hi Jake
>> 
>> What's the output when you index the data? How many records does Sphinx say 
>> it's processed?
>> 
>> --
>> Pat
>> 
>> On 02/03/2011, at 8:09 AM, Jake wrote:
>> 
>>> I'll post my new numbers so you can see the latest. I'm using MySQL and in 
>>> my sphinx.yml file I've set max_matches: 100000 and a stopwords file for my 
>>> local development. I've also added pid_file and searchd_file_path for 
>>> staging and production, but I'm assuming that's irrelevant. My smallest id 
>>> for the Asset model is 30 and the largest is 5479. Here are the set of ids 
>>> that are not being returned through Asset.search:
>> 
>>> [5318, 5319, 5320, 5321, 5322, 5323, 5324, 5325, 5326, 5327, 5328, 5329, 
>>> 5330, 5331, 5332, 5333, 5334, 5335, 5336, 5337, 5338, 5339, 5340, 5341, 
>>> 5342, 5343, 5344, 5345, 5346, 5347, 5348, 5349, 5350, 5351, 5352, 5353, 
>>> 5354, 5355, 5356, 5357, 5358, 5359, 5360, 5361, 5392, 5393, 5394]
>> 
>>> It seems that there was a time when something about creating the assets 
>>> caused them to not be indexed, but I don't know what would have caused 
>>> that. I've tried comparing the various columns for these un-indexed Assets, 
>>> but haven't been able to notice a pattern. I'm really just at a loss of 
>>> where to look next. Let me know if any other information would be helpful 
>>> and thanks for your help.
>> 
>>> --
>>> You received this message because you are subscribed to the Google Groups 
>>> "Thinking Sphinx" group.
>>> To post to this group, send email to [email protected].
>>> To unsubscribe from this group, send email to 
>>> [email protected].
>>> For more options, visit this group 
>>> athttp://groups.google.com/group/thinking-sphinx?hl=en.
> 
> -- 
> You received this message because you are subscribed to the Google Groups 
> "Thinking Sphinx" group.
> To post to this group, send email to [email protected].
> To unsubscribe from this group, send email to 
> [email protected].
> For more options, visit this group at 
> http://groups.google.com/group/thinking-sphinx?hl=en.
> 

-- 
You received this message because you are subscribed to the Google Groups 
"Thinking Sphinx" group.
To post to this group, send email to [email protected].
To unsubscribe from this group, send email to 
[email protected].
For more options, visit this group at 
http://groups.google.com/group/thinking-sphinx?hl=en.

Reply via email to