Re: [Dspace-tech] Searching : Diacritics Indexing

2012-08-10 Thread DSpace @ Lyncode
Hi,

with ASCIIFoldingFilter it not expected for that query to fail.
So, probably that is some configuration problem or some
wrong deployment procedure.

On 9 August 2012 16:12, Claudia Jürgen claudia.juer...@ub.tu-dortmund.dewrote:

 Hello Emilio and all,

 just taken a look at the ASCIIFoldingFilter, which should cover
 most (those characters with reasonable ASCII alternatives are
 converted)of the latin characters see

 http://lucene.apache.org/core/old_versioned_docs/versions/2_9_0/api/all/org/apache/lucene/analysis/ASCIIFoldingFilter.html

 Thought Latin Extended A would be covered, but the first test with
 the author name Petuškova, Jekaterina failed.
 Is there any definite list, which is supported in which way?

 Cheers

 Claudia





 Am 09.08.2012 09:14, schrieb emilio lorenzo:
  Hi,
 
  The class ISOLatin1AccentFilter has been deprecated by Lucene (although
  still can be found...) and substitued by  ASCIIFoldingFilter class
  For english + latin languages installations , we suggest the following
  *org.dspace.search.DSAnalyzer* configuration (keep the order, is
  relevant for the searcher):
 
  import org.apache.lucene.analysis.ASCIIFoldingFilter;
  ..
  ..
  result = new StandardFilter(result);
  result = new LowerCaseFilter(result);
  result = new StopFilter(result, stopSet);
  result = new ASCIIFoldingFilter(result);
  result = new PorterStemFilter(result);
 
 
  Anyway, *org.dspace.search.DSAnalyzer* corresponds to Lucene
  configuration.SOLR conf is quite different.
 
  Best Luck.
  Emilio
 
 
 
  El 08/08/2012 20:14, Hatem Jlassi escribió:
 
  Hi all,
 
  We are running a bilingual (French/English) instance of last version
  of Dspace (1.8.2). We have some problems with the search with
  diacritics. The Dspace's searcher doesn't find words with accented
  characters when the search doesn't include these accents.
 
  We modified
 
 (\dspace-1.8.2-src-release\dspace-api\src\main\java\org\dspace\search\DSAnalyzer.java)
  and we added the followings two lines:
 
  ISOLatin1AccentFilter;
 
  result = new ISOLatin1AccentFilter(result);
 
  Rebuild, Re-index Dspace
 
  But the problem was not resolved.
 
  If anyone has solved this problem - Please Help!!! Thank You
 
  Regards,
 
 
 
 --
 
  Live Security Virtual Conference
  Exclusive live event will cover all the ways today's security and
  threat landscape has changed and how IT managers can respond.
 Discussions
  will include endpoint security, mobile security and the latest in
 malware
  threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 
 
  ___
  DSpace-tech mailing list
  DSpace-tech@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/dspace-tech
 
 
 
 
 --
  Live Security Virtual Conference
  Exclusive live event will cover all the ways today's security and
  threat landscape has changed and how IT managers can respond. Discussions
  will include endpoint security, mobile security and the latest in malware
  threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 
 
 
  ___
  DSpace-tech mailing list
  DSpace-tech@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/dspace-tech
 

 --
 Claudia Juergen
 Universitaetsbibliothek Dortmund
 Eldorado
 0231/755-4043
 https://eldorado.tu-dortmund.de/


 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech




-- 

Thanks, DSpace @ Lyncode
DSpace Department
*Lyncode*: Official website http://www.lyncode.com/
--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Searching : Diacritics Indexing

2012-08-10 Thread Claudia Jürgen
Hello,

you're right, it works fine, guess I need some new glasses
The author was
Petuškova, Jekaterina
and I searched with
Petruskova
Made a couple more test with characters from 
http://en.wikipedia.org/wiki/Latin_characters_in_Unicode and all is well.

I'm still interested in an documentation about the mapping.

Have a nice day

Claudia


Am 10.08.2012 12:09, schrieb DSpace @ Lyncode:
 Hi,

 with ASCIIFoldingFilter it not expected for that query to fail.
 So, probably that is some configuration problem or some
 wrong deployment procedure.

 On 9 August 2012 16:12, Claudia Jürgen 
 claudia.juer...@ub.tu-dortmund.dewrote:

 Hello Emilio and all,

 just taken a look at the ASCIIFoldingFilter, which should cover
 most (those characters with reasonable ASCII alternatives are
 converted)of the latin characters see

 http://lucene.apache.org/core/old_versioned_docs/versions/2_9_0/api/all/org/apache/lucene/analysis/ASCIIFoldingFilter.html

 Thought Latin Extended A would be covered, but the first test with
 the author name Petuškova, Jekaterina failed.
 Is there any definite list, which is supported in which way?

 Cheers

 Claudia





 Am 09.08.2012 09:14, schrieb emilio lorenzo:
 Hi,

 The class ISOLatin1AccentFilter has been deprecated by Lucene (although
 still can be found...) and substitued by  ASCIIFoldingFilter class
 For english + latin languages installations , we suggest the following
 *org.dspace.search.DSAnalyzer* configuration (keep the order, is
 relevant for the searcher):

 import org.apache.lucene.analysis.ASCIIFoldingFilter;
 ..
 ..
 result = new StandardFilter(result);
 result = new LowerCaseFilter(result);
 result = new StopFilter(result, stopSet);
 result = new ASCIIFoldingFilter(result);
 result = new PorterStemFilter(result);


 Anyway, *org.dspace.search.DSAnalyzer* corresponds to Lucene
 configuration.SOLR conf is quite different.

 Best Luck.
 Emilio



 El 08/08/2012 20:14, Hatem Jlassi escribió:

 Hi all,

 We are running a bilingual (French/English) instance of last version
 of Dspace (1.8.2). We have some problems with the search with
 diacritics. The Dspace's searcher doesn't find words with accented
 characters when the search doesn't include these accents.

 We modified

 (\dspace-1.8.2-src-release\dspace-api\src\main\java\org\dspace\search\DSAnalyzer.java)
 and we added the followings two lines:

 ISOLatin1AccentFilter;

 result = new ISOLatin1AccentFilter(result);

 Rebuild, Re-index Dspace

 But the problem was not resolved.

 If anyone has solved this problem - Please Help!!! Thank You

 Regards,



 --

 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond.
 Discussions
 will include endpoint security, mobile security and the latest in
 malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/


 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech




 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/



 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


 --
 Claudia Juergen
 Universitaetsbibliothek Dortmund
 Eldorado
 0231/755-4043
 https://eldorado.tu-dortmund.de/


 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include endpoint security, mobile security and the latest in malware
 threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech





-- 
Claudia Juergen
Universitaetsbibliothek Dortmund
Eldorado
0231/755-4043
https://eldorado.tu-dortmund.de/

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
DSpace-tech mailing list

Re: [Dspace-tech] Searching : Diacritics Indexing

2012-08-10 Thread DSpace @ Lyncode
Full list of mappings

http://grepcode.com/file/repo1.maven.org/maven2/org.apache.lucene/lucene-core/3.0.2/org/apache/lucene/analysis/ASCIIFoldingFilter.java#117

On 10 August 2012 12:37, Claudia Jürgen
claudia.juer...@ub.tu-dortmund.dewrote:

 Hello,

 you're right, it works fine, guess I need some new glasses
 The author was
 Petuškova, Jekaterina
 and I searched with
 Petruskova
 Made a couple more test with characters from http://en.wikipedia.org/wiki/
 **Latin_characters_in_Unicodehttp://en.wikipedia.org/wiki/Latin_characters_in_Unicodeand
  all is well.

 I'm still interested in an documentation about the mapping.

 Have a nice day

 Claudia


 Am 10.08.2012 12:09, schrieb DSpace @ Lyncode:

  Hi,

 with ASCIIFoldingFilter it not expected for that query to fail.
 So, probably that is some configuration problem or some
 wrong deployment procedure.

 On 9 August 2012 16:12, Claudia Jürgen claudia.juer...@ub.tu-**
 dortmund.de claudia.juer...@ub.tu-dortmund.dewrote:

  Hello Emilio and all,

 just taken a look at the ASCIIFoldingFilter, which should cover
 most (those characters with reasonable ASCII alternatives are
 converted)of the latin characters see

 http://lucene.apache.org/core/**old_versioned_docs/versions/2_**
 9_0/api/all/org/apache/lucene/**analysis/ASCIIFoldingFilter.**htmlhttp://lucene.apache.org/core/old_versioned_docs/versions/2_9_0/api/all/org/apache/lucene/analysis/ASCIIFoldingFilter.html

 Thought Latin Extended A would be covered, but the first test with
 the author name Petuškova, Jekaterina failed.
 Is there any definite list, which is supported in which way?

 Cheers

 Claudia





 Am 09.08.2012 09:14, schrieb emilio lorenzo:

 Hi,

 The class ISOLatin1AccentFilter has been deprecated by Lucene (although
 still can be found...) and substitued by  ASCIIFoldingFilter class
 For english + latin languages installations , we suggest the following
 *org.dspace.search.DSAnalyzer* configuration (keep the order, is
 relevant for the searcher):

 import org.apache.lucene.analysis.**ASCIIFoldingFilter;
 ..
 ..
 result = new StandardFilter(result);
 result = new LowerCaseFilter(result);
 result = new StopFilter(result, stopSet);
 result = new ASCIIFoldingFilter(result);
 result = new PorterStemFilter(result);


 Anyway, *org.dspace.search.DSAnalyzer* corresponds to Lucene
 configuration.SOLR conf is quite different.

 Best Luck.
 Emilio



 El 08/08/2012 20:14, Hatem Jlassi escribió:


 Hi all,

 We are running a bilingual (French/English) instance of last version
 of Dspace (1.8.2). We have some problems with the search with
 diacritics. The Dspace's searcher doesn't find words with accented
 characters when the search doesn't include these accents.

 We modified

  (\dspace-1.8.2-src-release\**dspace-api\src\main\java\org\**
 dspace\search\DSAnalyzer.java)

 and we added the followings two lines:

 ISOLatin1AccentFilter;

 result = new ISOLatin1AccentFilter(result);

 Rebuild, Re-index Dspace

 But the problem was not resolved.

 If anyone has solved this problem - Please Help!!! Thank You

 Regards,



  --**--**
 --


 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond.

 Discussions

 will include endpoint security, mobile security and the latest in

 malware

 threats. 
 http://www.accelacomm.com/jaw/**sfrnl04242012/114/50122263/http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/


 __**_
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.**netDSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/**lists/listinfo/dspace-techhttps://lists.sourceforge.net/lists/listinfo/dspace-tech





  --**--**
 --

 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond.
 Discussions
 will include endpoint security, mobile security and the latest in
 malware
 threats. 
 http://www.accelacomm.com/jaw/**sfrnl04242012/114/50122263/http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/



 __**_
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.**net DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/**lists/listinfo/dspace-techhttps://lists.sourceforge.net/lists/listinfo/dspace-tech


 --
 Claudia Juergen
 Universitaetsbibliothek Dortmund
 Eldorado
 0231/755-4043
 https://eldorado.tu-dortmund.**de/ https://eldorado.tu-dortmund.de/


 --**--**
 --
 Live Security Virtual Conference
 Exclusive live event will cover all the ways today's security and
 threat landscape has changed and how IT managers can respond. Discussions
 will include 

[Dspace-tech] Problem with New Config file structure

2012-08-10 Thread Keith Jones

Hi All,

I'm in the process of trying to configure my CAS module to run with DSpace 
1.8.2. I'm currently running into a problem where the CAS class is unable 
to find the cas.server.url, and is getting a 
java.lang.NullPointerException when it is trying to read the configuration 
parameter.

This is the line of code:

final String authServer = 
ConfigurationManager.getProperty(cas.server.url);

With the new structure of the dspace.cfg, which separates the 
authentication  parameters, I'm having a difficult time trying to figure 
out how the build the configuration paramaters for CAS.

Thanks
Keith

--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Problem with New Config file structure

2012-08-10 Thread helix84
Hi Keith, based on that source line I think it should simply go to
dspace.cfg as cas.server.url, not to modules. Did you try that?

Regards,
~~helix84
--
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and 
threat landscape has changed and how IT managers can respond. Discussions 
will include endpoint security, mobile security and the latest in malware 
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech