Re: [Dspace-tech] Help with Latin Languages
On Mon, Nov 24, 2014 at 11:57 AM, siriom siriom sir...@gmail.com wrote: Greetings . Hope everyones having a good monday :) Petya Im running Dspace 4.2 , im not sure on which version of Solr is running on it since I cant access it via localhost:8080/solr/search. It says something like 403 access denied even though im accessing from localhost which is odd . Do i need to turn something on to view the admin panel for solr ? That sounds like something is configured incorrectly in your /etc/hosts file on the DSpace server. Anyway, try one of these methods to bypass the restriction: https://wiki.duraspace.org/display/DSPACE/Solr Regards, ~~helix84 Compulsory reading: DSpace Mailing List Etiquette https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] Help with Latin Languages
My hosts file is simple. 127.0.0.1 localhost ::1 localhost thats it. This is a very odd error. I create an index.html and put it in /webapps/jspui I access http://localhost:8080/jspui/index.html just fine same for xmlui but the second i put the index.html in /webapps/solr i get 403 forbidden. The entire dspace works but i cant seem to access solr admin page. Solr is running , i get stats up on the 14 000 items i have added. Im running 4.2 and im out of ideas . any suggestions would be apreciated. Ive opened a prompt , downloaded and installed lynx ... did a lynx http:localhost:8080/solr from within the very machine dspace is running on , in a prompt and i still get 403 forbidden. On Tue, Nov 25, 2014 at 9:57 AM, helix84 heli...@centrum.sk wrote: On Mon, Nov 24, 2014 at 11:57 AM, siriom siriom sir...@gmail.com wrote: Greetings . Hope everyones having a good monday :) Petya Im running Dspace 4.2 , im not sure on which version of Solr is running on it since I cant access it via localhost:8080/solr/search. It says something like 403 access denied even though im accessing from localhost which is odd . Do i need to turn something on to view the admin panel for solr ? That sounds like something is configured incorrectly in your /etc/hosts file on the DSpace server. Anyway, try one of these methods to bypass the restriction: https://wiki.duraspace.org/display/DSPACE/Solr Regards, ~~helix84 Compulsory reading: DSpace Mailing List Etiquette https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] Help with Latin Languages
Greetings . Hope everyones having a good monday :) Petya Im running Dspace 4.2 , im not sure on which version of Solr is running on it since I cant access it via localhost:8080/solr/search. It says something like 403 access denied even though im accessing from localhost which is odd . Do i need to turn something on to view the admin panel for solr ? On Sat, Nov 22, 2014 at 7:17 PM, Petya Kohts petya.ko...@gmail.com wrote: Hello siriom, I think you'd better off starting with specifying DSpace version and solr version (right from the dashboard). Next it would be handy to see some screenshots or at least solr ResponseHeader structure. Generally I have solr-spec 4.4.0 / solr-impl 4.4.0 1504776, query working for Cyrillic symbols. Petya. On Wed, Nov 19, 2014 at 9:01 PM, siriom siriom sir...@gmail.com wrote: Can anyone give me a hand with enabling solr to properly search for non english words ? More specifically portuguese words with ã or é for example. Right now a search for são will find nothing but a search for sao will find são. I was told some changes need to be made to schema.xml ? Anyone out there using solr with a non english language that could send me a schema.xml ? Thanks. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] Help with Latin Languages
First all thanks for your replies but i still havent gotten this fixed. This is a copy from my /dspace/solr/search/conf/schema.xml fieldType name=text class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ !-- in this example, we will only use synonyms at query time filter class=solr.SynonymFilterFactory synonyms=index_synonyms.txt ignoreCase=true expand=false/ -- !-- Case insensitive stop word removal. add enablePositionIncrements=true in both the index and query analyzers to leave a 'gap' for more accurate phrase queries. -- filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt enablePositionIncrements=true / filter class=solr.WordDelimiterFilterFactory generateWordParts=1 generateNumberParts=1 catenateWords=1 catenateNumbers=1 catenateAll=0 splitOnCaseChange=1/ filter class=solr.ICUFoldingFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=English protected=protwords.txt/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ /analyzer analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.SynonymFilterFactory synonyms=synonyms.txt ignoreCase=true expand=true/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt enablePositionIncrements=true As you can see I've added filter class=solr.ASCIIFoldingFilterFactory/filter twice , once to each analyzer. This is whats happening: If i search for accao if find tons of relevant matches including acção in the title, if on the other hand i search for acção i get searching for all of Dspace for screen except its looking for acçao Its all garbled ... and therefore wont find any relevant hits Ive done a re index -b as requested. Im running Dspace 4.2 Please help, On Thu, Nov 20, 2014 at 12:04 PM, Adan adan.ro...@gmail.com wrote: Hi anonimous You can begin searching fieldType name=”text” …… in schema.xml and change filter class=solr.EnglishPorterFilterFactory protected=protwords.txt with filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.EnglishPorterFilterFactory protected=protwords.txt then do a dspace update-discovery-index -b for 3.x or a dspace index-discovery -b for a 4.x version Its explained at http://www.arvo.es/dspace/configurando-solr/ (in spanish) regards Adán Román Ruiz ARVO Consultores Can anyone give me a hand with enabling solr to properly search for non english words ? More specifically portuguese words with ã or é for example. Right now a search for são will find nothing but a search for sao will find são. I was told some changes need to be made to schema.xml ? Anyone out there using solr with a non english language that could send me a schema.xml ? Thanks. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREEhttp://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing listDSpace-tech@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- http://www.avast.com/ El software de antivirus Avast ha analizado este correo electrónico en busca de virus. www.avast.com -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App
Re: [Dspace-tech] Help with Latin Languages
Hi Siriom, You might also have to do this, which is what we figured out in my office: 1) Edit /etc/tomcat7/server.xml and change Connector port=8080 protocol=HTTP/1.1 connectionTimeout=2 redirectPort=8443/ to Connector port=8080 protocol=HTTP/1.1 connectionTimeout=2 redirectPort=8443 URIEncoding=UTF-8/ 2) Restart tomcat That took care of character encoding in the search box. Aaron Helton (Mr.) United Nations Department of Public Information Outreach Division From: siriom siriom sir...@gmail.com To: Adan adan.ro...@gmail.com, Hilton Gibson hilton.gib...@gmail.com, Cc: dspace-tech@lists.sourceforge.net Date: 24/11/2014 01:02 PM Subject: Re: [Dspace-tech] Help with Latin Languages First all thanks for your replies but i still havent gotten this fixed. This is a copy from my /dspace/solr/search/conf/schema.xml fieldType name=text class=solr.TextField positionIncrementGap=100 analyzer type=index tokenizer class=solr.WhitespaceTokenizerFactory/ !-- in this example, we will only use synonyms at query time filter class=solr.SynonymFilterFactory synonyms=index_synonyms.txt ignoreCase=true expand=false/ -- !-- Case insensitive stop word removal. add enablePositionIncrements=true in both the index and query analyzers to leave a 'gap' for more accurate phrase queries. -- filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt enablePositionIncrements=true / filter class=solr.WordDelimiterFilterFactory generateWordParts=1 generateNumberParts=1 catenateWords=1 catenateNumbers=1 catenateAll=0 splitOnCaseChange=1/ filter class=solr.ICUFoldingFilterFactory/ filter class=solr.SnowballPorterFilterFactory language=English protected=protwords.txt/ filter class=solr.RemoveDuplicatesTokenFilterFactory/ /analyzer analyzer type=query tokenizer class=solr.WhitespaceTokenizerFactory/ filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.SynonymFilterFactory synonyms=synonyms.txt ignoreCase=true expand=true/ filter class=solr.StopFilterFactory ignoreCase=true words=stopwords.txt enablePositionIncrements=true As you can see I've added filter class=solr.ASCIIFoldingFilterFactory/filter twice , once to each analyzer. This is whats happening: If i search for accao if find tons of relevant matches including acção in the title, if on the other hand i search for acção i get searching for all of Dspace for screen except its looking for acçao Its all garbled ... and therefore wont find any relevant hits Ive done a re index -b as requested. Im running Dspace 4.2 Please help, On Thu, Nov 20, 2014 at 12:04 PM, Adan adan.ro...@gmail.com wrote: Hi anonimous You can begin searching fieldType name=text in schema.xml and change filter class=solr.EnglishPorterFilterFactory protected=protwords.txt with filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.EnglishPorterFilterFactory protected=protwords.txt then do a dspace update-discovery-index -b for 3.x or a dspace index-discovery -b for a 4.x version Its explained at http://www.arvo.es/dspace/configurando-solr/ (in spanish) regards Adán Román Ruiz ARVO Consultores Can anyone give me a hand with enabling solr to properly search for non english words ? More specifically portuguese words with ã or é for example. Right now a search for são will find nothing but a search for sao will find são. I was told some changes need to be made to schema.xml ? Anyone out there using solr with a non english language that could send me a schema.xml ? Thanks. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette El software de antivirus Avast ha analizado este correo electrónico en busca de virus. www.avast.com -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo
Re: [Dspace-tech] Help with Latin Languages
Hello siriom, I think you'd better off starting with specifying DSpace version and solr version (right from the dashboard). Next it would be handy to see some screenshots or at least solr ResponseHeader structure. Generally I have solr-spec 4.4.0 / solr-impl 4.4.0 1504776, query working for Cyrillic symbols. Petya. On Wed, Nov 19, 2014 at 9:01 PM, siriom siriom sir...@gmail.com wrote: Can anyone give me a hand with enabling solr to properly search for non english words ? More specifically portuguese words with ã or é for example. Right now a search for são will find nothing but a search for sao will find são. I was told some changes need to be made to schema.xml ? Anyone out there using solr with a non english language that could send me a schema.xml ? Thanks. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] Help with Latin Languages
Hi anonimous You can begin searching fieldType name=text .. in schema.xml and change filter class=solr.EnglishPorterFilterFactory protected=protwords.txt with filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.EnglishPorterFilterFactory protected=protwords.txt then do a dspace update-discovery-index -b for 3.x or a dspace index-discovery -b for a 4.x version Its explained at http://www.arvo.es/dspace/configurando-solr/ (in spanish) regards Adán Román Ruiz ARVO Consultores Can anyone give me a hand with enabling solr to properly search for non english words ? More specifically portuguese words with ã or é for example. Right now a search for são will find nothing but a search for sao will find são. I was told some changes need to be made to schema.xml ? Anyone out there using solr with a non english language that could send me a schema.xml ? Thanks. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette -- --- El software de antivirus Avast ha analizado este correo electrónico en busca de virus. http://www.avast.com -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
Re: [Dspace-tech] Help with Latin Languages
Hi anonimous You can begin searching fieldType name=text .. in schema.xml and change filter class=solr.EnglishPorterFilterFactory protected=protwords.txt with filter class=solr.ASCIIFoldingFilterFactory/filter filter class=solr.EnglishPorterFilterFactory protected=protwords.txt then do a dspace update-discovery-index -b for 3.x or a dspace index-discovery -b for a 4.x version Its explained at http://www.arvo.es/dspace/configurando-solr/ (in spanish) regards Adán Román Ruiz ARVO Consultores Can anyone give me a hand with enabling solr to properly search for non english words ? More specifically portuguese words with ã or é for example. Right now a search for são will find nothing but a search for sao will find são. I was told some changes need to be made to schema.xml ? Anyone out there using solr with a non english language that could send me a schema.xml ? Thanks. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk ___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette --- El software de antivirus Avast ha analizado este correo electrónico en busca de virus. http://www.avast.com -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette
[Dspace-tech] Help with Latin Languages
Can anyone give me a hand with enabling solr to properly search for non english words ? More specifically portuguese words with ã or é for example. Right now a search for são will find nothing but a search for sao will find são. I was told some changes need to be made to schema.xml ? Anyone out there using solr with a non english language that could send me a schema.xml ? Thanks. -- Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server from Actuate! Instantly Supercharge Your Business Reports and Dashboards with Interactivity, Sharing, Native Excel Exports, App Integration more Get technology previously reserved for billion-dollar corporations, FREE http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___ DSpace-tech mailing list DSpace-tech@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dspace-tech List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette