Re: recent utf8 problems

2017-11-07 Thread Chris Hostetter
: 1) When looking for Tübingen in the title, I am expecting the 3092484 Just to be clear -- I'm reading that as an 8 character word, where the 2nd character is U+00FC and the other characters are plain ascii: T_bingen Also to be clear: I'm attempting to reproduce the steps you describe using

Re: recent utf8 problems

2017-11-07 Thread Rick Leir
Dr Krell Item 11): It is best to get the solrconfig.xml provided with the new version of Solr, and change it to suit your needs. Do not try to work from the old version's solrconfig.xml. I did not have time to read the other items. Look in solr.log, and compare the successful query with the un

Re: recent utf8 problems

2017-11-06 Thread Dr. Mario Michael Krell
Hi, thank you for your time and trying to narrow down my problem. 1) When looking for Tübingen in the title, I am expecting the 3092484 results. That sounds like a reasonable result. Furthermore, when looking at some of the results, they are exactly what I am looking for. 2) I am testing them

Re: recent utf8 problems

2017-11-06 Thread Rick Leir
Hoss Clearly it is U+00FC ü c3 bc LATIN SMALL LETTER U WITH DIAERESIS As in Tübingen "With the Yahoo Flickr Creative Commons 100 Million (YFCC100m) dataset, a great novel dataset was introduced to the computer vision and multimedia research community." -- cool I think it is strange th

Re: recent utf8 problems

2017-11-06 Thread Chris Hostetter
: We recently discovered issues with solr with converting utf8 code in the search. One or two month ago everything was still working. : : - What might have caused it is a Java update (Java 8 Update 151). : - We are using firefox as well as chrome for displaying results. : - We tested it with So

Re: recent utf8 problems

2017-11-06 Thread Dr. Mario Michael Krell
Hi Rick, Hi Solr Experts, Thank you for this reply! My solr database is supposed to be(come) open source. Hence, I am willing to share any information. Since I am new to solr, I just did not know what to share. But in the mean time, I put some of the information online. My current configuratio

Re: recent utf8 problems

2017-11-06 Thread Rick Leir
Dr. Krell You could look at your /select query handler, and compare it with the /query query handler in the Admin config. Did you upgrade from a previous version of Solr? Or change your config ( no, you must have thought of that). If it is a bug related to the Java upgrade then you need to sho

recent utf8 problems

2017-11-04 Thread Dr. Mario Michael Krell
Hi, We recently discovered issues with solr with converting utf8 code in the search. One or two month ago everything was still working. - What might have caused it is a Java update (Java 8 Update 151). - We are using firefox as well as chrome for displaying results. - We tested it with Solr 6.5