Re: [Nutch-general] What is parse-oo and why doesn't parsed PDF content show up in cached.jsp ?

2007-06-12 Thread Manoharam Reddy
How can I change it to read from segment/parse_text instead of segment/content ? On 5/31/07, Doğacan Güney [EMAIL PROTECTED] wrote: Hi, On 5/31/07, Manoharam Reddy [EMAIL PROTECTED] wrote: Some confusions regarding plugins.includes 1. I find a parse-oo in the plugins folder. What is that

Re: [Nutch-general] What is parse-oo and why doesn't parsed PDF content show up in cached.jsp ?

2007-06-12 Thread Doğacan Güney
On 6/12/07, Manoharam Reddy [EMAIL PROTECTED] wrote: How can I change it to read from segment/parse_text instead of segment/content ? If you are using Nutch's web ui, you have to change this part in cached.jsp : % } else { % The cached content has mime type %=contentType%, click this a

[Nutch-general] What is parse-oo and why doesn't parsed PDF content show up in cached.jsp ?

2007-05-31 Thread Manoharam Reddy
Some confusions regarding plugins.includes 1. I find a parse-oo in the plugins folder. What is that for? 2. I have enabled parse-pdf by including in plugins.include of nutch-site.xml. The pages now come in the search result. But when I visit the cached page of the result. It shows a message like

Re: [Nutch-general] What is parse-oo and why doesn't parsed PDF content show up in cached.jsp ?

2007-05-31 Thread Doğacan Güney
Hi, On 5/31/07, Manoharam Reddy [EMAIL PROTECTED] wrote: Some confusions regarding plugins.includes 1. I find a parse-oo in the plugins folder. What is that for? Plugin parse-oo has something to do with parsing OpenOffice.org documents, I am not sure what exactly. 2. I have enabled