There are a couple of options available to you...  neither of which involve 
scraping a Word document.  I have to agree with Keith that it would be way 
too painfull a process to try and do.

1. Use Verity to index and search the files.
2. Dump the full file into a BLOB column in your database and have SQL 
Server search it.  It can do that you know...
http://www.google.com/search?hl=en&q=Searching+Word+Files+in+SQL+Server

Personally, I'd opt for (1) and just join the result list from Verity with a 
search from the database.

HTH,

Jeff Garza

----- Original Message ----- 
From: "Strickland, Douglas M." <[EMAIL PROTECTED]>
To: "CF-Talk" <cf-talk@houseoffusion.com>
Sent: Wednesday, February 09, 2005 9:44 AM
Subject: Uploaded .doc files using cffile, now I need to 'scrape' the data 
from the files


> Using cffile, I have a form that allows users to upload their resume
> (MSWord documents (.doc files)) to our system.
>
> What I wanted to do next was...as each document is uploaded,
> read/retrieve the text out of it, and insert it into a table (SQL Server
> database). The reason I wanted to do this, was to make it easier to
> search through the resume data. Any suggestions on how to accomplish
> this would be appreciated.
>
> I attempted to read the file by using cffile, and was able to read it;
> however, there was also a lot of 'garbage' along with the text that was
> in the file.
>
> Thanks,
> Doug
>
>
>
> 

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~|
Logware (www.logware.us): a new and convenient web-based time tracking 
application. Start tracking and documenting hours spent on a project or with a 
client with Logware today. Try it for free with a 15 day trial account.
http://www.houseoffusion.com/banners/view.cfm?bannerid=67

Message: http://www.houseoffusion.com/lists.cfm/link=i:4:193910
Archives: http://www.houseoffusion.com/cf_lists/threads.cfm/4
Subscription: http://www.houseoffusion.com/lists.cfm/link=s:4
Unsubscribe: http://www.houseoffusion.com/cf_lists/unsubscribe.cfm?user=89.70.4
Donations & Support: http://www.houseoffusion.com/tiny.cfm/54

Reply via email to