[Dspace-tech] Searching of text from PDF files

Gary Browne Mon, 31 Aug 2009 22:56:06 -0700

Hi all,

I have a query about searching of pdf documents which I can't seem to
find a definitive answer for:


When a user searches via the dspace web interface, is the search run
across the content of text pdfs or just the metadata? If so, does the
pdf submitted to the repository need to have been previously OCR'd, or
does the repository attempt to extract & index text from all pdfs?

Any information regarding this would be greatly appreciated.

Thanks
Gary


Gary Browne
Development Programmer
Library IT Services
University of Sydney
ph: 9351-5946
Sent from my plain old desktop computer.

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with 
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech

[Dspace-tech] Searching of text from PDF files

Reply via email to