I would create an associative array, where the key is the word and the value
is the count. You would have to loop over the data as a space delimited
list, unless you want to setup some type of full text indexing. I don't know
what database server you are running.

In terms of filters, there is a term for that. They are called stop words.
Building a stop word list is part of building a search engine. If you have
niche data, then you will add certain words to the stop list that other
people wouldn't. For instance, if all your data is about music. You might
want to add CD, listen, song, etc to the list, along with the other
universally common words (the, a, is, etc..).

You may run into needing to group all variations of a word together. For
instance, do you want one count for "box" and another count for "boxes". If
not, you need to implement stemming. There are several great stemming
algorithms out there. I have used the Porter Stemmer before and was pleased
with the output.

- Daniel

-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf
Of Jake McKee
Sent: Wednesday, April 06, 2005 8:19 AM
To: [email protected]
Subject: Re: Here's a new one...

The data contained within the fields of this particular column is 
multiple words (sentences/paragraphs).

I'd want to filter out certain things (like "I" or "an" or "the").

Jake

PC Carraway wrote:

> Does the DB column contain only one word per row or are there multiple 
> words per row?  Please send a little more info.
>
> Precia
>
> ----- Original Message ----- From: "Jake McKee" <[EMAIL PROTECTED]>
> To: <[email protected]>
> Sent: Wednesday, April 06, 2005 8:47 AM
> Subject: Here's a new one...
>
>
>> OK, I've not seen this one discussed anywhere, so perhaps it's a new 
>> question... probably not.
>>
>> If I wanted to generate a report of some sort on which words in a 
>> particular DB column are used the most, how would I do that?
>>
>> Jake
>>
>> ----------------------------------------------------------
>> To post, send email to [email protected]
>> To unsubscribe: http://www.dfwcfug.org/form_MemberUnsubscribe.cfm
>> To subscribe: http://www.dfwcfug.org/form_MemberRegistration.cfm
>>
>>
>>
>
> ----------------------------------------------------------
> To post, send email to [email protected]
> To unsubscribe:   http://www.dfwcfug.org/form_MemberUnsubscribe.cfm
> To subscribe:   http://www.dfwcfug.org/form_MemberRegistration.cfm
>
>
>
>
>
>

----------------------------------------------------------
To post, send email to [email protected]
To unsubscribe: 
   http://www.dfwcfug.org/form_MemberUnsubscribe.cfm
To subscribe: 
   http://www.dfwcfug.org/form_MemberRegistration.cfm



----------------------------------------------------------
To post, send email to [email protected]
To unsubscribe: 
   http://www.dfwcfug.org/form_MemberUnsubscribe.cfm
To subscribe: 
   http://www.dfwcfug.org/form_MemberRegistration.cfm


Reply via email to