Hi,

For full-text indexing and searching in DSpace, you need to enable and run
the "Media Filters":
https://wiki.lyrasis.org/display/DSDOC7x/Mediafilters+for+Transforming+DSpace+Content

These are scripts that extract the text from text-based content (OCR'd
PDFs, Word docs, etc.) so that it can be added to the search index. A
scanned PDF with no text layer has to be OCR'd first.

Most sites run them on a schedule (e.g. once a day, or a few times a day)
via a cron job. See this guide:
https://wiki.lyrasis.org/display/DSDOC7x/Scheduled+Tasks+via+Cron
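
As a rough sketch, a crontab for the user that runs DSpace might look like
the one below. The /dspace install path and the run times are assumptions,
so adjust them for your site; filter-media and index-discovery are the
standard DSpace launcher commands.

```
# Assumption: DSpace is installed in /dspace; change this to your [dspace] directory.
DSPACE=/dspace

# Extract text from newly added files every day at 03:00 (example time).
0 3 * * * $DSPACE/bin/dspace filter-media

# Refresh the Discovery (Solr) search index every day at 04:00 (example time).
0 4 * * * $DSPACE/bin/dspace index-discovery
```

Before scheduling it, you can run "[dspace]/bin/dspace filter-media -v" once
by hand as the DSpace user; the -v flag prints the filter details to the
console so you can see whether text is actually being extracted.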

If you have more questions, let us know on this list.

Tim

On Thursday, January 19, 2023 at 12:51:29 AM UTC-6 ruchi...@gmail.com wrote:

> I have installed DSpace 7.3 on Ubuntu 22.04.
> Kindly help me make PDFs and other files full-text searchable, i.e.
> searchable by any content in the file and not only by keywords, title,
> author, etc.
>
> Please let me know the settings or steps to be changed in the 
> configuration files.
>

