Re: [Dspace-tech] Searching : Diacritics Indexing
Hi,

With ASCIIFoldingFilter that query is not expected to fail, so this is probably a configuration problem or a faulty deployment procedure.

Thanks,
DSpace @ Lyncode
DSpace Department
Lyncode: Official website http://www.lyncode.com/

On 9 August 2012 16:12, Claudia Jürgen claudia.juer...@ub.tu-dortmund.de wrote:

Hello Emilio and all,

I have just taken a look at the ASCIIFoldingFilter, which should cover most of the Latin characters (those with reasonable ASCII alternatives are converted); see
http://lucene.apache.org/core/old_versioned_docs/versions/2_9_0/api/all/org/apache/lucene/analysis/ASCIIFoldingFilter.html

I thought Latin Extended-A would be covered, but the first test, with the author name "Petuškova, Jekaterina", failed. Is there a definitive list of which characters are supported, and how each one is mapped?

Cheers
Claudia

--
Claudia Juergen
Universitaetsbibliothek Dortmund
Eldorado
0231/755-4043
https://eldorado.tu-dortmund.de/

On 09.08.2012 09:14, emilio lorenzo wrote:

Hi,

The class ISOLatin1AccentFilter has been deprecated by Lucene (although it can still be found...) and replaced by the ASCIIFoldingFilter class.

For English and Latin-language installations, we suggest the following org.dspace.search.DSAnalyzer configuration (keep the order; it is relevant for the searcher):

    import org.apache.lucene.analysis.ASCIIFoldingFilter;
    ..
    ..
    result = new StandardFilter(result);
    result = new LowerCaseFilter(result);
    result = new StopFilter(result, stopSet);
    result = new ASCIIFoldingFilter(result);
    result = new PorterStemFilter(result);

Note that org.dspace.search.DSAnalyzer corresponds to the Lucene configuration. The SOLR configuration is quite different.

Best of luck.
Emilio

On 08/08/2012 20:14, Hatem Jlassi wrote:

Hi all,

We are running a bilingual (French/English) instance of the latest version of DSpace (1.8.2). We have a problem searching with diacritics: the DSpace searcher does not find words with accented characters when the search term does not include the accents.

We modified (\dspace-1.8.2-src-release\dspace-api\src\main\java\org\dspace\search\DSAnalyzer.java) and added the following two lines:

    ISOLatin1AccentFilter;
    result = new ISOLatin1AccentFilter(result);

We then rebuilt and re-indexed DSpace, but the problem was not resolved. If anyone has solved this problem, please help!

Thank you
Regards,

___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
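For readers who want to see what accent folding buys at index and query time, here is a minimal sketch using only the Java standard library. It is not Lucene's ASCIIFoldingFilter and not DSpace code: it swaps in Unicode NFD decomposition plus removal of combining marks, which covers characters such as é or š but not every character the Lucene filter handles. The class and method names (DiacriticFoldingSketch, fold, matches) are made up for this illustration.

```java
import java.text.Normalizer;
import java.util.Locale;

// Illustration only: a stand-in for the folding step of an analyzer chain.
// Assumption: NFD decomposition + stripping combining marks approximates
// accent folding for characters that have a canonical decomposition.
final class DiacriticFoldingSketch {

    // Decompose each character (e.g. e-acute becomes 'e' + combining acute),
    // drop the combining marks, then lowercase the result.
    static String fold(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        return decomposed.replaceAll("\\p{M}+", "").toLowerCase(Locale.ROOT);
    }

    // A query matches an indexed term only if BOTH sides went through the
    // same folding, which is why a re-index is needed after changing it.
    static boolean matches(String indexedTerm, String query) {
        return fold(indexedTerm).equals(fold(query));
    }

    public static void main(String[] args) {
        // "Petu\u0161kova" is the author name from the thread (s with caron).
        System.out.println(fold("Petu\u0161kova"));
        System.out.println(matches("Petu\u0161kova", "petuskova"));
        // "\u00e9t\u00e9" is the French word for "summer" (two e-acutes).
        System.out.println(matches("\u00e9t\u00e9", "ete"));
    }
}
```

The point of the sketch is that the same transformation has to run on the indexed text and on the query; if only one side is folded, an unaccented query cannot match an accented term.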
Re: [Dspace-tech] Searching : Diacritics Indexing
Hello,

you're right, it works fine; I guess I need some new glasses. The author was "Petuškova, Jekaterina" and I searched for "Petruskova".

I made a couple more tests with characters from http://en.wikipedia.org/wiki/Latin_characters_in_Unicode and all is well. I am still interested in documentation about the mapping.

Have a nice day
Claudia

--
Claudia Juergen
Universitaetsbibliothek Dortmund
Eldorado
0231/755-4043
https://eldorado.tu-dortmund.de/
Re: [Dspace-tech] Searching : Diacritics Indexing
Full list of mappings:
http://grepcode.com/file/repo1.maven.org/maven2/org.apache.lucene/lucene-core/3.0.2/org/apache/lucene/analysis/ASCIIFoldingFilter.java#117
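To give a rough idea of what a table-driven folding step looks like, here is a small standard-library sketch. It is not the Lucene source: the class name FoldingTableSketch and the handful of table entries are chosen for illustration, and although the entries are believed to agree with the linked mapping list, that list remains the reference. A table is useful because some characters (for example ß or ø) have no canonical Unicode decomposition, so they need an explicit replacement.

```java
import java.util.HashMap;
import java.util.Map;

// Illustration only: a tiny lookup-table folder with a few sample entries.
// Assumption: the real filter covers far more characters; see the full list.
final class FoldingTableSketch {

    private static final Map<Character, String> TABLE = new HashMap<>();
    static {
        TABLE.put('\u0161', "s");   // s with caron
        TABLE.put('\u00e9', "e");   // e with acute
        TABLE.put('\u00f8', "o");   // o with stroke
        TABLE.put('\u00df', "ss");  // sharp s, folds to two characters
        TABLE.put('\u00e6', "ae");  // ae ligature, folds to two characters
    }

    // ASCII passes through untouched; mapped characters are replaced;
    // anything else is kept as-is.
    static String fold(String input) {
        StringBuilder out = new StringBuilder(input.length());
        for (int i = 0; i < input.length(); i++) {
            char c = input.charAt(i);
            if (c < 0x80) {
                out.append(c);
                continue;
            }
            String replacement = TABLE.get(c);
            out.append(replacement != null ? replacement : String.valueOf(c));
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(fold("Petu\u0161kova"));
        System.out.println(fold("Stra\u00dfe"));
    }
}
```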
[Dspace-tech] Problem with New Config file structure
Hi All,

I'm in the process of configuring my CAS module to run with DSpace 1.8.2. I'm currently running into a problem where the CAS class is unable to find cas.server.url and throws a java.lang.NullPointerException when it tries to read the configuration parameter. This is the line of code:

    final String authServer = ConfigurationManager.getProperty("cas.server.url");

With the new structure of dspace.cfg, which separates out the authentication parameters, I'm having a difficult time figuring out how to build the configuration parameters for CAS.

Thanks
Keith
Re: [Dspace-tech] Problem with New Config file structure
Hi Keith,

based on that source line, I think the property should simply go in dspace.cfg as cas.server.url, not in the modules configuration. Did you try that?

Regards,
~~helix84
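A defensive lookup makes this kind of failure easier to diagnose. The sketch below uses java.util.Properties as a stand-in for the DSpace configuration, on the assumption (suggested by the NullPointerException in the thread) that a missing key comes back as null. The class CasConfigSketch, its methods, and the example URL are hypothetical; this is not the DSpace ConfigurationManager API.

```java
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.Properties;

// Illustration only: fail early with a clear message instead of letting a
// null configuration value surface later as a NullPointerException.
final class CasConfigSketch {

    // Parse properties-file text, as a stand-in for reading dspace.cfg.
    static Properties parse(String text) {
        Properties config = new Properties();
        try {
            config.load(new StringReader(text));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return config;
    }

    // Return the trimmed value, or throw a descriptive error if the key is
    // missing or blank.
    static String requireProperty(Properties config, String key) {
        String value = config.getProperty(key);
        if (value == null || value.trim().isEmpty()) {
            throw new IllegalStateException("Missing configuration property: " + key);
        }
        return value.trim();
    }

    public static void main(String[] args) {
        // The URL is a placeholder, not a real CAS server.
        Properties config = parse("cas.server.url = https://cas.example.org/cas/login\n");
        System.out.println(requireProperty(config, "cas.server.url"));
    }
}
```

With a check like this, a property placed in the wrong file shows up as "Missing configuration property: cas.server.url" at the point of lookup.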