Nils Hoeller wrote:
Hi,
actually I thought the content of the pages,
is beeing indexed.
When I have a look with Luke at the
index of a Nutch Crawl, it says
contents not available.
Please try "reconstruct & Edit" button, and you should see some text
from the content. The plain text is NOT
Hi,
actually I thought the content of the pages,
is beeing indexed.
When I have a look with Luke at the
index of a Nutch Crawl, it says
contents not available.
When I search for a word in field "content"
that IS IN A SITE in the index,
it gives me no results.
Now I saw something in config