It was the HTTP header. As soon as I forced an iso-8859-1 header, it worked.

On 12. Sep 2013, at 9:44 AM, Andreas Owen wrote:
> Could it have something to do with the meta encoding tag being iso-8859-1
> while the HTTP header says utf8, so that Firefox interprets it as utf8?
>
> On 12. Sep 2013, at 8:36 AM, Andreas Owen wrote:
>
>> No, Jetty; and yes, for Tomcat I've seen a couple of answers.
>>
>> On 12. Sep 2013, at 3:12 AM, Otis Gospodnetic wrote:
>>
>>> Using Tomcat by any chance? The ML archive has the solution. May be on
>>> Wiki, too.
>>>
>>> Otis
>>> Solr & ElasticSearch Support
>>> http://sematext.com/
>>> On Sep 11, 2013 8:56 AM, "Andreas Owen" <a...@conx.ch> wrote:
>>>
>>>> I'm using Solr 4.3.1 with Tika to index HTML pages. The HTML files are
>>>> iso-8859-1 (ANSI) encoded, and so is the meta tag "content-encoding". The
>>>> server's HTTP header says it's utf8, and Firefox Web Developer agrees.
>>>>
>>>> When I index a page with special chars like ä,ö,ü, Solr outputs
>>>> completely foreign signs, not the normal wrong chars with 1/4 or the Flag in
>>>> them. So it seems that it's not simply the normal utf8/iso-8859-1 discrepancy.
>>>> Has anyone got an idea what's wrong?
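
The header/content mismatch discussed in this thread can be illustrated with a small Python sketch. It only shows what happens when bytes in one charset are decoded with the other; it does not reproduce what Solr, Tika, or Jetty do internally. The sample string and variable names are assumptions made for the example.

```python
# Sample text with the umlauts mentioned in the thread (assumed example).
text = "äöü"

# The same text as it would be stored in each encoding.
latin1_bytes = text.encode("iso-8859-1")  # one byte per character
utf8_bytes = text.encode("utf-8")         # two bytes per character

# Case 1: UTF-8 bytes wrongly decoded as iso-8859-1.
# This gives the familiar "Ã¤Ã¶Ã¼" style of garbling.
utf8_as_latin1 = utf8_bytes.decode("iso-8859-1")

# Case 2: iso-8859-1 bytes wrongly decoded as UTF-8, which is the
# situation when the page is iso-8859-1 but the HTTP header says UTF-8.
# The bytes are not valid UTF-8, so each becomes U+FFFD when errors
# are replaced instead of raised.
latin1_as_utf8 = latin1_bytes.decode("utf-8", errors="replace")

# Decoding with the charset that matches the bytes restores the text,
# which corresponds to sending a header that agrees with the content.
restored = latin1_bytes.decode("iso-8859-1")

print(repr(utf8_as_latin1))
print(repr(latin1_as_utf8))
print(repr(restored))
```

Case 2 matches the setup described in the original post, where the declared charset (UTF-8) disagrees with the actual bytes (iso-8859-1). Making the header agree with the content, as in the fix reported at the top of the thread, corresponds to the last decode.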