Re: [Dspace-tech] Help with Latin Languages

2014-11-25 Thread helix84
On Mon, Nov 24, 2014 at 11:57 AM, siriom siriom sir...@gmail.com wrote:
 Greetings .
 Hope everyones having a good monday :)
 Petya Im running Dspace 4.2 , im not sure on which version of Solr is
 running on it since I cant access it via localhost:8080/solr/search.
 It says something like 403 access denied even though im accessing from
 localhost which is odd . Do i need to turn something on to view the admin
 panel for solr ?

That sounds like something is configured incorrectly in your
/etc/hosts file on the DSpace server. Anyway, try one of these methods
to bypass the restriction:

https://wiki.duraspace.org/display/DSPACE/Solr


Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette


Re: [Dspace-tech] Help with Latin Languages

2014-11-25 Thread siriom siriom
My hosts file is simple.
127.0.0.1 localhost
::1 localhost
thats it.
This is a very odd error.
I create an index.html and put it in /webapps/jspui
I access http://localhost:8080/jspui/index.html just fine
same for xmlui
but the second i put the index.html in /webapps/solr i get 403 forbidden.
The entire dspace works but i cant seem to access solr admin page.
Solr is running , i get stats up on the 14 000 items i have added.
Im running 4.2 and im out of ideas . any suggestions would be apreciated.
Ive opened a prompt , downloaded and installed lynx ... did a lynx
http:localhost:8080/solr from within the very machine dspace is running on
, in a prompt and i still get 403 forbidden.


On Tue, Nov 25, 2014 at 9:57 AM, helix84 heli...@centrum.sk wrote:

 On Mon, Nov 24, 2014 at 11:57 AM, siriom siriom sir...@gmail.com wrote:
  Greetings .
  Hope everyones having a good monday :)
  Petya Im running Dspace 4.2 , im not sure on which version of Solr is
  running on it since I cant access it via localhost:8080/solr/search.
  It says something like 403 access denied even though im accessing
 from
  localhost which is odd . Do i need to turn something on to view the admin
  panel for solr ?

 That sounds like something is configured incorrectly in your
 /etc/hosts file on the DSpace server. Anyway, try one of these methods
 to bypass the restriction:

 https://wiki.duraspace.org/display/DSPACE/Solr


 Regards,
 ~~helix84

 Compulsory reading: DSpace Mailing List Etiquette
 https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Help with Latin Languages

2014-11-24 Thread siriom siriom
Greetings .
Hope everyones having a good monday :)
Petya Im running Dspace 4.2 , im not sure on which version of Solr is
running on it since I cant access it via localhost:8080/solr/search.
It says something like 403 access denied even though im accessing from
localhost which is odd . Do i need to turn something on to view the admin
panel for solr ?


On Sat, Nov 22, 2014 at 7:17 PM, Petya Kohts petya.ko...@gmail.com wrote:

 Hello siriom,

 I think you'd better off starting with specifying
 DSpace version and solr version (right from the dashboard).

 Next it would be handy to see some screenshots
 or at least solr ResponseHeader structure.

 Generally I have solr-spec 4.4.0 / solr-impl 4.4.0 1504776,
 query working for Cyrillic symbols.


 Petya.







 On Wed, Nov 19, 2014 at 9:01 PM, siriom siriom sir...@gmail.com wrote:
  Can anyone give me a hand with enabling solr to properly search for non
  english words ? More specifically portuguese words with ã or é for
  example.
  Right now a search for são will find nothing but a search for sao
 will
  find são.
  I was told some changes need to be made to schema.xml ?
  Anyone out there using solr with a non english language that could send
 me a
  schema.xml ?
  Thanks.
 
 
 
 --
  Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
  from Actuate! Instantly Supercharge Your Business Reports and Dashboards
  with Interactivity, Sharing, Native Excel Exports, App Integration  more
  Get technology previously reserved for billion-dollar corporations, FREE
 
 http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk
  ___
  DSpace-tech mailing list
  DSpace-tech@lists.sourceforge.net
  https://lists.sourceforge.net/lists/listinfo/dspace-tech
  List Etiquette:
  https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Help with Latin Languages

2014-11-24 Thread siriom siriom
First all thanks for your replies but i still havent gotten this fixed.
This is a copy from my /dspace/solr/search/conf/schema.xml
fieldType name=text class=solr.TextField positionIncrementGap=100
  analyzer type=index
tokenizer class=solr.WhitespaceTokenizerFactory/
!-- in this example, we will only use synonyms at query time
filter class=solr.SynonymFilterFactory
synonyms=index_synonyms.txt ignoreCase=true expand=false/
--
!-- Case insensitive stop word removal.
  add enablePositionIncrements=true in both the index and query
  analyzers to leave a 'gap' for more accurate phrase queries.
--
filter class=solr.ASCIIFoldingFilterFactory/filter
filter class=solr.StopFilterFactory
ignoreCase=true
words=stopwords.txt
enablePositionIncrements=true
/
filter class=solr.WordDelimiterFilterFactory
generateWordParts=1 generateNumberParts=1 catenateWords=1
catenateNumbers=1 catenateAll=0 splitOnCaseChange=1/
filter class=solr.ICUFoldingFilterFactory/
filter class=solr.SnowballPorterFilterFactory language=English
protected=protwords.txt/
filter class=solr.RemoveDuplicatesTokenFilterFactory/
  /analyzer
  analyzer type=query
tokenizer class=solr.WhitespaceTokenizerFactory/
filter class=solr.ASCIIFoldingFilterFactory/filter
filter class=solr.SynonymFilterFactory synonyms=synonyms.txt
ignoreCase=true expand=true/
filter class=solr.StopFilterFactory
ignoreCase=true
words=stopwords.txt
enablePositionIncrements=true



As you can see I've added filter
class=solr.ASCIIFoldingFilterFactory/filter twice , once to each
analyzer.
This is whats happening:
If i search for accao if find tons of relevant matches including acção
in the title, if on the other hand i search for acção

i get searching for all of Dspace for screen except its looking for
acçao
Its all garbled ... and therefore wont find any relevant hits 
Ive done a re index -b as requested.
Im running Dspace 4.2
Please help,


On Thu, Nov 20, 2014 at 12:04 PM, Adan adan.ro...@gmail.com wrote:

  Hi anonimous

 You can begin searching fieldType name=”text” …… in schema.xml and
 change

 filter class=solr.EnglishPorterFilterFactory protected=protwords.txt

 with

 filter class=solr.ASCIIFoldingFilterFactory/filter
 filter class=solr.EnglishPorterFilterFactory protected=protwords.txt

 then do a

 dspace update-discovery-index -b for 3.x or a dspace index-discovery -b for a 
 4.x version

 Its explained at http://www.arvo.es/dspace/configurando-solr/ (in spanish)

 regards
 Adán Román Ruiz
 ARVO Consultores



  Can anyone give me a hand with enabling solr to properly search for non
 english words ? More specifically portuguese words with ã or é for
 example.
 Right now a search for são will find nothing but a search for sao will
 find são.
 I was told some changes need to be made to schema.xml ?
 Anyone out there using solr with a non english language that could send me
 a schema.xml ?
 Thanks.



 --
 Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
 from Actuate! Instantly Supercharge Your Business Reports and Dashboards
 with Interactivity, Sharing, Native Excel Exports, App Integration  more
 Get technology previously reserved for billion-dollar corporations, 
 FREEhttp://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk



 ___
 DSpace-tech mailing 
 listDSpace-tech@lists.sourceforge.nethttps://lists.sourceforge.net/lists/listinfo/dspace-tech
 List Etiquette: 
 https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette




 --
http://www.avast.com/

 El software de antivirus Avast ha analizado este correo electrónico en
 busca de virus.
 www.avast.com



 --
 Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
 from Actuate! Instantly Supercharge Your Business Reports and Dashboards
 with Interactivity, Sharing, Native Excel Exports, App Integration  more
 Get technology previously reserved for billion-dollar corporations, FREE

 http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech
 List Etiquette:
 https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App 

Re: [Dspace-tech] Help with Latin Languages

2014-11-24 Thread Aaron Helton
Hi Siriom,

You might also have to do this, which
is what we figured out in my office:

1) Edit /etc/tomcat7/server.xml and
change

Connector port=8080
protocol=HTTP/1.1
connectionTimeout=2
redirectPort=8443/

to

Connector port=8080
protocol=HTTP/1.1
connectionTimeout=2
redirectPort=8443
URIEncoding=UTF-8/

2) Restart tomcat

That took care of character encoding
in the search box.

Aaron Helton (Mr.)
United Nations
Department of Public Information
Outreach Division



From:   
siriom siriom sir...@gmail.com
To:   
Adan adan.ro...@gmail.com,
Hilton Gibson hilton.gib...@gmail.com, 
Cc:   
dspace-tech@lists.sourceforge.net
Date:   
24/11/2014 01:02 PM
Subject:  
 Re: [Dspace-tech]
Help with Latin Languages




First all thanks for your replies but i still havent gotten
this fixed.
This is a copy from my /dspace/solr/search/conf/schema.xml
fieldType name=text class=solr.TextField positionIncrementGap=100
 analyzer type=index
 tokenizer class=solr.WhitespaceTokenizerFactory/
 !-- in this example, we
will only use synonyms at query time
 filter class=solr.SynonymFilterFactory
synonyms=index_synonyms.txt ignoreCase=true expand=false/
 --
 !-- Case insensitive stop
word removal.
 add enablePositionIncrements=true
in both the index and query
 analyzers to leave
a 'gap' for more accurate phrase queries.
 --
 filter class=solr.ASCIIFoldingFilterFactory/filter
 filter class=solr.StopFilterFactory

ignoreCase=true

words=stopwords.txt

enablePositionIncrements=true

/
 filter class=solr.WordDelimiterFilterFactory
generateWordParts=1 generateNumberParts=1 catenateWords=1
catenateNumbers=1 catenateAll=0 splitOnCaseChange=1/
 filter class=solr.ICUFoldingFilterFactory/
 filter class=solr.SnowballPorterFilterFactory
language=English protected=protwords.txt/
 filter class=solr.RemoveDuplicatesTokenFilterFactory/
 /analyzer
 analyzer type=query
 tokenizer class=solr.WhitespaceTokenizerFactory/
 filter class=solr.ASCIIFoldingFilterFactory/filter
 filter class=solr.SynonymFilterFactory
synonyms=synonyms.txt ignoreCase=true expand=true/
 filter class=solr.StopFilterFactory

ignoreCase=true

words=stopwords.txt

enablePositionIncrements=true



As you can see I've added filter class=solr.ASCIIFoldingFilterFactory/filter
twice , once to each analyzer.
This is whats happening:
If i search for accao if find tons of relevant
matches including acção in the title, if on the other hand
i search for acção

i get searching for all of Dspace for screen
except its looking for acçao
Its all garbled ... and therefore wont find any relevant
hits 
Ive done a re index -b as requested.
Im running Dspace 4.2
Please help,


On Thu, Nov 20, 2014 at 12:04 PM, Adan adan.ro...@gmail.com
wrote:
Hi anonimous

You can begin searching fieldType name=text  in schema.xml
and change 
filter class=solr.EnglishPorterFilterFactory
protected=protwords.txt

with

filter class=solr.ASCIIFoldingFilterFactory/filter
filter class=solr.EnglishPorterFilterFactory protected=protwords.txt

then do a 

dspace update-discovery-index -b for 3.x or a dspace index-discovery -b
for a 4.x version

Its explained at http://www.arvo.es/dspace/configurando-solr/
(in spanish)

regards
Adán Román Ruiz
ARVO Consultores



Can anyone give me a hand with enabling solr to properly
search for non english words ? More specifically portuguese words with
ã or é for example. 
Right now a search for são will find nothing
but a search for sao will find são.
I was told some changes need to be made to schema.xml
?
Anyone out there using solr with a non english language
that could send me a schema.xml ?
Thanks.



--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration 
more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk


___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette








El
software de antivirus Avast ha analizado este correo electrónico en busca
de virus. 
www.avast.com



--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration 
more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo

Re: [Dspace-tech] Help with Latin Languages

2014-11-22 Thread Petya Kohts
Hello siriom,

I think you'd better off starting with specifying
DSpace version and solr version (right from the dashboard).

Next it would be handy to see some screenshots
or at least solr ResponseHeader structure.

Generally I have solr-spec 4.4.0 / solr-impl 4.4.0 1504776,
query working for Cyrillic symbols.


Petya.







On Wed, Nov 19, 2014 at 9:01 PM, siriom siriom sir...@gmail.com wrote:
 Can anyone give me a hand with enabling solr to properly search for non
 english words ? More specifically portuguese words with ã or é for
 example.
 Right now a search for são will find nothing but a search for sao will
 find são.
 I was told some changes need to be made to schema.xml ?
 Anyone out there using solr with a non english language that could send me a
 schema.xml ?
 Thanks.


 --
 Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
 from Actuate! Instantly Supercharge Your Business Reports and Dashboards
 with Interactivity, Sharing, Native Excel Exports, App Integration  more
 Get technology previously reserved for billion-dollar corporations, FREE
 http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech
 List Etiquette:
 https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Help with Latin Languages

2014-11-20 Thread Adan Roman

Hi anonimous

You can begin searching fieldType name=text .. in schema.xml and 
change


filter class=solr.EnglishPorterFilterFactory protected=protwords.txt

with

filter class=solr.ASCIIFoldingFilterFactory/filter
filter class=solr.EnglishPorterFilterFactory protected=protwords.txt

then do a

dspace update-discovery-index -b for 3.x or a dspace index-discovery -b for a 
4.x version

Its explained at http://www.arvo.es/dspace/configurando-solr/ (in spanish)

regards
Adán Román Ruiz
ARVO Consultores


Can anyone give me a hand with enabling solr to properly search for 
non english words ? More specifically portuguese words with ã or é 
for example.
Right now a search for são will find nothing but a search for sao 
will find são.

I was told some changes need to be made to schema.xml ?
Anyone out there using solr with a non english language that could 
send me a schema.xml ?

Thanks.



--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk


___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette



--



---
El software de antivirus Avast ha analizado este correo electrónico en busca de 
virus.
http://www.avast.com
--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

Re: [Dspace-tech] Help with Latin Languages

2014-11-20 Thread Adan

Hi anonimous

You can begin searching fieldType name=text .. in schema.xml and 
change


filter class=solr.EnglishPorterFilterFactory protected=protwords.txt

with

filter class=solr.ASCIIFoldingFilterFactory/filter
filter class=solr.EnglishPorterFilterFactory protected=protwords.txt

then do a

dspace update-discovery-index -b for 3.x or a dspace index-discovery -b for a 
4.x version

Its explained at http://www.arvo.es/dspace/configurando-solr/ (in spanish)

regards
Adán Román Ruiz
ARVO Consultores


Can anyone give me a hand with enabling solr to properly search for 
non english words ? More specifically portuguese words with ã or é 
for example.
Right now a search for são will find nothing but a search for sao 
will find são.

I was told some changes need to be made to schema.xml ?
Anyone out there using solr with a non english language that could 
send me a schema.xml ?

Thanks.



--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk


___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette




---
El software de antivirus Avast ha analizado este correo electrónico en busca de 
virus.
http://www.avast.com
--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

[Dspace-tech] Help with Latin Languages

2014-11-19 Thread siriom siriom
Can anyone give me a hand with enabling solr to properly search for non
english words ? More specifically portuguese words with ã or é for
example.
Right now a search for são will find nothing but a search for sao will
find são.
I was told some changes need to be made to schema.xml ?
Anyone out there using solr with a non english language that could send me
a schema.xml ?
Thanks.
--
Download BIRT iHub F-Type - The Free Enterprise-Grade BIRT Server
from Actuate! Instantly Supercharge Your Business Reports and Dashboards
with Interactivity, Sharing, Native Excel Exports, App Integration  more
Get technology previously reserved for billion-dollar corporations, FREE
http://pubads.g.doubleclick.net/gampad/clk?id=157005751iu=/4140/ostg.clktrk___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette