This is probably caused by an encoding detection problem in Nutch and/or
Tika. If you can share the file on the Tika user’s list, I can take a look.
On Fri, Oct 5, 2018 at 7:11 AM UMA MAHESWAR
wrote:
> HI ALL,
>
> while i am using nutch for crawling and indexing in to solr,while storing
> data i
HI ALL,
while i am using nutch for crawling and indexing in to solr,while storing
data in to solr encoding issue facing
in site having the title
title : ebm-papst Motoren & Ventilatoren GmbH - Axialventilatoren und
Radialventilatoren aus Linz, Österreich
but in solr storing in the below form