I think I figured out the problem - seemed there was a flaw in my logic, I
was using duplicate values for the Key field which arses up Verity. Was
joining 2 tables - Pages and Content (many-to-many), and using the PageID
as the basis for the Key, which of course wasn't unique.
"Snake
Hollywood" To: <[EMAIL PROTECTED]>
<[EMAIL PROTECTED] cc:
kepit.net> Subject: RE: [ cf-dev ] Verity madness
07/07/2004 18:29
Please respond
to dev
You do know that certain words do not get indexed?
I do not have a list for you, you will need to go find that, but it's all
the common stuff like THE, THIS, THAT and some other more obscure things.
> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED]
> Sent: 07 July 2004 17:21
> To: [EMAIL PROTECTED]
> Subject: Re: [ cf-dev ] Verity madness
>
>
> So far, it looks like the 23rd item could be the culprit -
> the text in the Body is relatively long and full of HTML
> (although it seems to be indexed ok itself); could this have
> an affect on the collection not wanting to index anything else?
>
>
>
>
>
>
> [EMAIL PROTECTED]
>
> sign.co.uk To:
> [EMAIL PROTECTED]
> cc:
>
> 07/07/2004 16:46
> Subject: [ cf-dev ] Verity madness
>
> Please respond to dev
>
>
>
>
>
>
>
>
> Here's a strange one. Got a website on CF 5. Uses Verity
> for a search, all works fine. Apart from it doesn't seem to
> have some recent data in the index. Re-indexing doesn't make
> any difference.
> In the CF Administrator, if I Purge the collection, the
> search still works (i.e. returns results) - isn't the Purge
> meant to remove all the data from the collection?
> I then deleted the collection, re-created and re-indexed -
> still not returning certain results that it should be.
>
> My indexing looks roughly like:
>
> <cfquery name="getPages" datasource="dsn">
> SELECT *
> FROM Content
> </cfquery>
>
> <cfdump var="#getPages#">
>
> <cflock name="CollectionName" timeout="100" type="Exclusive">
> <cfindex collection="CollectionName" action="Refresh"
> type="Custom"
> Key="theID" Title="page_title" Body="page_title, menu_title,
> Content" Query ="getPages" Custom1="Pages"> </cflock>
>
> So from the CFDump, I can see what data should be getting
> indexed. I see there are 26 items. When I then do a
> CFSearch on * I get only 23 results, and looking in the
> actual contents of the collection, I can see it's not indexed
> certain words in at least one of the three missing items.
>
> I'm probably being stupid here - any ideas? It's as if there
> are two versions, or perhaps it's failing on the 24th item
> for some reason.
> Normally don't have much problems with Verity stuff, it's
> just a pain in the arse when it goes wrong.
>
>
> --
> These lists are syncronised with the CFDeveloper forum at
> http://forum.cfdeveloper.co.uk/
> Archive: http://www.mail-archive.com/dev%40lists.cfdeveloper.co.uk/
>
> CFDeveloper Sponsors and contributors:-
> *Hosting and support provided by CFMXhosting.co.uk* ::
> *ActivePDF provided by activepdf.com*
> *Forums provided by fusetalk.com* :: *ProWorkFlow provided by
> proworkflow.com*
> *Tutorials provided by helmguru.com* :: *Lists hosted by
> gradwell.com*
>
> To unsubscribe, e-mail: [EMAIL PROTECTED]
>
>
>
>
>
>
> --
> These lists are syncronised with the CFDeveloper forum at
> http://forum.cfdeveloper.co.uk/
> Archive: http://www.mail-archive.com/dev%40lists.cfdeveloper.co.uk/
>
> CFDeveloper Sponsors and contributors:-
> *Hosting and support provided by CFMXhosting.co.uk* ::
> *ActivePDF provided by activepdf.com*
> *Forums provided by fusetalk.com* :: *ProWorkFlow
> provided by proworkflow.com*
> *Tutorials provided by helmguru.com* :: *Lists
> hosted by gradwell.com*
>
> To unsubscribe, e-mail: [EMAIL PROTECTED]
>
--
These lists are syncronised with the CFDeveloper forum at
http://forum.cfdeveloper.co.uk/
Archive: http://www.mail-archive.com/dev%40lists.cfdeveloper.co.uk/
CFDeveloper Sponsors and contributors:-
*Hosting and support provided by CFMXhosting.co.uk* :: *ActivePDF provided
by activepdf.com*
*Forums provided by fusetalk.com* :: *ProWorkFlow provided by
proworkflow.com*
*Tutorials provided by helmguru.com* :: *Lists hosted by
gradwell.com*
To unsubscribe, e-mail: [EMAIL PROTECTED]
--
These lists are syncronised with the CFDeveloper forum at
http://forum.cfdeveloper.co.uk/
Archive: http://www.mail-archive.com/dev%40lists.cfdeveloper.co.uk/
CFDeveloper Sponsors and contributors:-
*Hosting and support provided by CFMXhosting.co.uk* :: *ActivePDF provided by
activepdf.com*
*Forums provided by fusetalk.com* :: *ProWorkFlow provided by proworkflow.com*
*Tutorials provided by helmguru.com* :: *Lists hosted by gradwell.com*
To unsubscribe, e-mail: [EMAIL PROTECTED]