First, when switching subjects please start a new thread. It gets confusing to have multiple topics, it's called "thread hijacking".
Second, I have no clue why your Nutch output is outputting invalid characters. Sounds like 1> your custom plugin is doing something weird or 2> something you could configure in Nutch. So I'd recommend asking on the Nutch board. Best Erick On Mon, Jul 15, 2013 at 11:40 AM, glumet <jan.bouch...@gmail.com> wrote: > As I can see, this is the same problem like one from older posts - > http://lucene.472066.n3.nabble.com/strange-utf-8-problem-td3094473.html > ...but it was without any response. > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Apache-Solr-4-after-1st-commit-the-index-does-not-grow-tp4077913p4078079.html > Sent from the Solr - User mailing list archive at Nabble.com.