Re: [Dspace-tech] Restrict until date

2009-01-14 Thread Claudia Jürgen
Hi Lyn,

there is an embargo patch see 
http://wiki.dspace.org/index.php/User:Emetsger:Embargo

Hope that helps

Claudia Jürgen


Lyn Amery schrieb:
 Hi all,
 
  
 
 I know it's possible to restrict access to an item in DSpace, but is it
 possible to do
 
 so only until a specified date?  Thereafter it would go public?  Or is
 that something 
 
 I'd need to update manually when the date had arrived or perhaps modify
 via some 
 
 command?
 
  
 
 Lyn
 
  
 
 Lyn Amery
 
 Web Analyst/Programmer
 
 State Records Office
 
 (08) 9427 3493
 
 lyn.am...@sro.wa.gov.au mailto:lyn.am...@sro.wa.gov.au 
 
  
 
 
 
 
 
 
 --
 This SF.net email is sponsored by:
 SourcForge Community
 SourceForge wants to tell your story.
 http://p.sf.net/sfu/sf-spreadtheword
 
 
 
 
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Restrict until date

2009-01-14 Thread Obiajulu Odu
Hi,
Have you looked at the embargo on bitstream at
http://wiki.dspace.org/index.php/Embargo_on_Bitstream_v2_(JSP)
http://wiki.dspace.org/index.php/Embargo_on_Bitstream_v2_(JSP) ?
 
Obi



Fra: Lyn Amery [mailto:lyn.am...@sro.wa.gov.au] 
Sendt: 14. januar 2009 08:39
Til: DSpace-tech@lists.sourceforge.net
Emne: [Dspace-tech] Restrict until date



Hi all,

 

I know it's possible to restrict access to an item in DSpace, but is it
possible to do

so only until a specified date?  Thereafter it would go public?  Or is
that something 

I'd need to update manually when the date had arrived or perhaps modify
via some 

command?

 

Lyn

 

Lyn Amery

Web Analyst/Programmer

State Records Office

(08) 9427 3493

lyn.am...@sro.wa.gov.au mailto:lyn.am...@sro.wa.gov.au 

 

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] NetBeans Build problems

2009-01-14 Thread Daniel MD
Hello,

I was following the wiki instruction on how to build DSpace 1.5.1 but
came across this issue:

[#compile]

Reactor Summary:

DSpace Addon Modules .. SUCCESS [0.957s]
DSpace XML-UI (Manakin) :: Web Application  FAILED [1.117s]
DSpace LNI :: Web Application . NOT BUILT
DSpace OAI :: Web Application . NOT BUILT
DSpace JSP-UI :: Web Application .. NOT BUILT
DSpace SWORD :: Web Application ... NOT BUILT
DSpace Assembly and Configuration . NOT BUILT

[ERROR]Transitive dependency resolution for scope: compile has failed
for your project.
[ERROR]Error message: No versions are present in the repository for
the artifact with a range [1.5.0.0,1.6.0.0)
[ERROR]  org.dspace:dspace-api-lang:jar:null
[ERROR]from the specified remote repositories:
[ERROR]  central (http://repo1.maven.org/maven2),
[ERROR]  maven.dspace.org/snapshot (http://maven.dspace.org/snapshot)
[ERROR]Path to dependency:
[ERROR]1) org.dspace.modules:xmlui:war:1.5.1
[ERROR]2) org.dspace:dspace-xmlui-api:jar:1.5.1
[ERROR]Group-Id: org.dspace.modules
[ERROR]Artifact-Id: xmlui
[ERROR]Version: 1.5.1
[ERROR]From file: C:\DSpace1.5.1\dspace-1_5_1\dspace\modules\xmlui\pom.xml

any help would be much appreciated.

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] NetBeans Build problems

2009-01-14 Thread Tim Donohue

Daniel,

A quick question:

Are you building from the DSpace Assembly and Configuration project 
that Netbeans creates (this project corresponds to the 'dspace/' 
subfolder)?  You *must* build from that project the first time, as it 
will initialize and build all the other projects.

When I right click on DSpace Assembly and Configuration and select 
Build, I get a much larger list of DSpace projects than the 7 that 
your reactor summary listed.  I suspect this means you ran Build on 
a different project, or modified one of the pom.xml Maven config files?

Here's my Reactor Summary report from Maven after building DSpace 
1.5.x in Netbeans 6.5 (notice I have 25 projects listed instead of just 
the 7 that yours shows)


Reactor Summary:

DSpace XML-UI (Manakin) ... SUCCESS [2.141s]
DSpace XML-UI (Manakin) :: Wing-Framework . SUCCESS [4.031s]
DSpace Kernel :: API and Implementation ... SUCCESS [12.328s]
DSpace XML-UI (Manakin) :: API and Core Aspects ... SUCCESS [6.532s]
DSpace XML-UI (Manakin) :: Web Application Resources .. SUCCESS [9.062s]
DSpace Addon Modules .. SUCCESS [0.016s]
DSpace XML-UI (Manakin) :: Web Application  SUCCESS [19.219s]
DSpace LNI  SUCCESS [0.015s]
DSpace LNI :: Core Implementation . SUCCESS [9.391s]
DSpace LNI :: Web Application Resources ... SUCCESS [3.125s]
DSpace LNI :: Web Application . SUCCESS [25.078s]
DSpace OAI  SUCCESS [0.109s]
DSpace OAI :: API and Implementation .. SUCCESS [1.985s]
DSpace OAI :: Web Application Resources ... SUCCESS [1.156s]
DSpace OAI :: Web Application . SUCCESS [8.562s]
DSpace JSP-UI . SUCCESS [0.016s]
DSpace JSP-UI :: API and Implementation ... SUCCESS [21.469s]
DSpace JSP-UI :: Web Application Resources  SUCCESS [11.750s]
DSpace JSP-UI :: Web Application .. SUCCESS [26.375s]
DSpace SWORD .. SUCCESS [0.156s]
DSpace SWORD :: API and Implementation  SUCCESS [6.406s]
DSpace SWORD :: Web Application Resources . SUCCESS [2.360s]
DSpace SWORD :: Web Application ... SUCCESS [9.187s]
DSpace LNI :: CLI Client Application .. SUCCESS [1:11.688s]
DSpace Assembly and Configuration . SUCCESS [1:11.625s]


- Tim


Daniel MD wrote:
 Hello,
 
 I was following the wiki instruction on how to build DSpace 1.5.1 but
 came across this issue:
 
 [#compile]
 
 Reactor Summary:
 
 DSpace Addon Modules .. SUCCESS [0.957s]
 DSpace XML-UI (Manakin) :: Web Application  FAILED [1.117s]
 DSpace LNI :: Web Application . NOT BUILT
 DSpace OAI :: Web Application . NOT BUILT
 DSpace JSP-UI :: Web Application .. NOT BUILT
 DSpace SWORD :: Web Application ... NOT BUILT
 DSpace Assembly and Configuration . NOT BUILT
 
 [ERROR]Transitive dependency resolution for scope: compile has failed
 for your project.
 [ERROR]Error message: No versions are present in the repository for
 the artifact with a range [1.5.0.0,1.6.0.0)
 [ERROR]  org.dspace:dspace-api-lang:jar:null
 [ERROR]from the specified remote repositories:
 [ERROR]  central (http://repo1.maven.org/maven2),
 [ERROR]  maven.dspace.org/snapshot (http://maven.dspace.org/snapshot)
 [ERROR]Path to dependency:
 [ERROR]1) org.dspace.modules:xmlui:war:1.5.1
 [ERROR]2) org.dspace:dspace-xmlui-api:jar:1.5.1
 [ERROR]Group-Id: org.dspace.modules
 [ERROR]Artifact-Id: xmlui
 [ERROR]Version: 1.5.1
 [ERROR]From file: C:\DSpace1.5.1\dspace-1_5_1\dspace\modules\xmlui\pom.xml
 
 any help would be much appreciated.
 
 --
 This SF.net email is sponsored by:
 SourcForge Community
 SourceForge wants to tell your story.
 http://p.sf.net/sfu/sf-spreadtheword
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech
 

-- 
Tim Donohue
Research Programmer, IDEALS
http://www.ideals.uiuc.edu/
University of Illinois
tdono...@illinois.edu | (217) 333-4648


[Dspace-tech] Google bots and web crawlers

2009-01-14 Thread Jeffrey Trimble
Is there something simple I can place in the jsp that will prohibit  
the crawlers from

using my server resources?

TIA,

Jeff

Jeffrey Trimble
Systems Librarian
Maag Library
Youngstown State University
330-941-2483 (Office)
jtrim...@cc.ysu.edu
http://www.maag.ysu.edu
http://digital.maag.ysu.edu



--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] Not found FOM_WebContinuation$UserObject even though it is there

2009-01-14 Thread Andrew Marlow
Hello DSpacers,

I am quite desperate to resolve the problem I am having on debian running
any kind of admin function within DSpace. I get the exception shown below.
One way is to try to create a community but it seems like it happens when I
try to create anything. The so-called missing class is in the xmlui
installation in cocoon-2.1.9.jar.

java.lang.NoClassDefFoundError:
org/apache/cocoon/components/flow/javascript/fom/FOM_WebContinuation$UserObject

org.apache.cocoon.components.flow.javascript.fom.FOM_WebContinuation.setPageLocal(FOM_WebContinuation.java:202)

org.apache.cocoon.components.flow.javascript.fom.FOM_Cocoon.jsFunction_sendPage(FOM_Cocoon.java:267)
inv3.invoke()


Here is the output from jar (via grep) that shows the class is there.

   637 Tue Apr 11 11:56:44 BST 2006
org/apache/cocoon/components/flow/javasc/fom/FOM_WebContinuation$UserObject.class


I am using DSpace 1.5.1, tomcat 5.5 and java 1.6.0_11 (and postgres 8.3
FWIW). The jar does have read permission. I did a chmod 777 to be absolutely
sure. Can anybody help me please?

-- 
Regards,

Andrew M.
http://www.andrewpetermarlow.co.uk
--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Google bots and web crawlers

2009-01-14 Thread Shane Beers

Jeff:

We had an issue with our local google instance crawling our DSpace  
installation and causing huge issues. I re-wrote the robots.txt to  
disallow anything besides the item pages themselves - no browsing  
pages or search pages and whatnot. Here is a copy of ours:


User-agent: *
Disallow: /dspace/browse-author
Disallow: /dspace/browse-author*
Disallow: /dspace/items-by-author
Disallow: /dspace/items-by-author*
Disallow: /dspace/browse-date*
Disallow: /dspace/browse-date
Disallow: /dspace/browse-title*
Disallow: /dspace/browse-title
Disallow: /dspace/feedback
Disallow: /dspace/feedback/*
Disallow: /dspace/items-by-subject
Disallow: /dspace/items-by-subject/*
Disallow: /dspace/handle/1920/*/browse-title*
Disallow: /dspace/handle/1920/*/browse-author*
Disallow: /dspace/handle/1920/*/browse-subject*
Disallow: /dspace/handle/1920/*/browse-date*
Disallow: /dspace/handle/1920/*/items-by-subject*
Disallow: /dspace/handle/1920/*/items-by-author*
Disallow: /dspace/bitstream/*
Disallow: /dspace/image/*
Disallow: /dspace/html/*
Disallow: /dspace/simple-search*

This likely would live in your tomcat directory.

Shane Beers
Digital Repository Services Librarian
George Mason University
sbe...@gmu.edu
http://mars.gmu.edu
703-993-3742



On Jan 14, 2009, at 1:09 PM, Jeffrey Trimble wrote:

Is there something simple I can place in the jsp that will prohibit  
the crawlers from

using my server resources?

TIA,

Jeff

Jeffrey Trimble
Systems Librarian
Maag Library
Youngstown State University
330-941-2483 (Office)
jtrim...@cc.ysu.edu
http://www.maag.ysu.edu
http://digital.maag.ysu.edu



--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Google bots and web crawlers

2009-01-14 Thread Robert Tansley
As of DSpace 1.5, sitemaps are supported which allow search engines to
selectively crawl only new items, while massively reducing the server
load:

http://www.dspace.org/1_5_1Documentation/ch03.html#N10B44

Unfortunately, it seems that relatively few DSpace instances actually
use this feature.

I would strongly recommend against blocking  /dspace/bitstream/* and
/dspace/html/*, as these prevent crawlers from accessing the full-text
of items, vital for effective indexing. As of DSpace 1.4.2 (and
possibly earlier), these URLs support the if-modified-after header,
which means that crawlers don't re-retrieve files if they haven't been
changed since the last crawl.

Rob

On Wed, Jan 14, 2009 at 14:20, Shane Beers sbe...@gmu.edu wrote:
 Jeff:
 We had an issue with our local google instance crawling our DSpace
 installation and causing huge issues. I re-wrote the robots.txt to disallow
 anything besides the item pages themselves - no browsing pages or search
 pages and whatnot. Here is a copy of ours:
 User-agent: *
 Disallow: /dspace/browse-author
 Disallow: /dspace/browse-author*
 Disallow: /dspace/items-by-author
 Disallow: /dspace/items-by-author*
 Disallow: /dspace/browse-date*
 Disallow: /dspace/browse-date
 Disallow: /dspace/browse-title*
 Disallow: /dspace/browse-title
 Disallow: /dspace/feedback
 Disallow: /dspace/feedback/*
 Disallow: /dspace/items-by-subject
 Disallow: /dspace/items-by-subject/*
 Disallow: /dspace/handle/1920/*/brow! se-title*
 ace/handle/1920/*/browse-author*
 Disallow: /dspace/handle/1920/*/browse-subject*
 Disallow: /dspace/handle/1920/*/browse-date*
 Disallow: /dspace/handle/1920/*/items-by-subject*
 Disallow: /dspace/handle/1920/*/items-by-author*
 Disallow: /dspace/bitstream/*
 Disallow: /dspace/image/*
 Disallow: /dspace/html/*
 Disallow: /dspace/simple-search*
 This likely would live in your tomcat directory.
 Shane Beers
 Digital Repository Services Librarian
 George Mason University
 sbe...@gmu.edu
 http://mars.gmu.edu
 !
 703- lass=Apple-interchange-newline

 On Jan 14, 2009, at 1:09 PM, Jeffrey Trimble wrote:

 Is there something simple I can place in the jsp that will prohibit the
 crawlers from
 using my server resources?
 TIA,
 Jeff

 Jeffrey Trimble
 Systems Librarian
 Maag Library
 Youngstown State University
 330-941-2483 (Office)
 jtrim...@cc.ysu.edu
 http://www.maag.ysu.edu
 http! ://digita div


 --
 This SF.net email is sponsored by:
 SourcForge Community
 SourceForge wants to tell your story.
 http://p.sf.net/sfu/sf-spreadtheword___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


 --
 This SF.net email is sponsored by:
 SourcForge Community
 SourceForge wants to tell your story.
 http://p.sf.net/sfu/sf-spreadtheword
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech



--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] Make LDAP login default but still allow Password logins

2009-01-14 Thread Eric Luhrs
Our DSpace server has LDAP and Password authentication enabled.  Is
there an easy way to modify the Manakin Reference theme's login
links to point directly to http://server/ldap-login rather than the
login chooser at http://server/login?  I need to leave password
authentication enabled on the system, but I'm trying to simplify login
for general users who will all use LDAP. This way, general users can
click the login link for LDAP, and administrators can manually type
the URL for Password login.

Has anyone done something similar?

Thanks in advance,

Eric Luhrs
Lafayette College

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Google bots and web crawlers

2009-01-14 Thread George Kozak

Jeff:

What I am using is a robots.txt file that I put in the dspace webapps 
directory in tomcat.  I think it's working (at least we haven't 
crashed lately).  If you're interested in seeing my robots.txt file, 
I can send it to you.


At 01:09 PM 1/14/2009, Jeffrey Trimble wrote:
Is there something simple I can place in the jsp that will prohibit 
the crawlers from

using my server resources?

TIA,

Jeff

Jeffrey Trimble
Systems Librarian
Maag Library
Youngstown State University
330-941-2483 (Office)
mailto:jtrim...@cc.ysu.edujtrim...@cc.ysu.edu
http://www.maag.ysu.edu
http://digital.maag.ysu.eduhttp://digital.maag.ysu.edu



--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword

___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


***
George Kozak
Digital Library Information Technology
501 Olin Library
Cornell University
607-255-8924
***
g...@cornell.edu --
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Google bots and web crawlers

2009-01-14 Thread Tom De Mulder
On Wed, 14 Jan 2009, Shane Beers wrote:

 We had an issue with our local google instance crawling our DSpace 
 installation and causing huge issues. I re-wrote the robots.txt to disallow 
 anything besides the item pages themselves - no browsing pages or search 
 pages 
 and whatnot. Here is a copy of ours:

We've had to do that for years; without it DSpace just crumbles under the 
load. I've got a small Perl script which generates a flat html file with 
links to all our item pages, and we put a link to that in the footer.

So we can block all browse pages, but not item or bitstreams, and still 
get indexed.

DSpace 1.x has major scalability issues, alas. No matter how much hardware 
you throw at it.


Best,

--
Tom De Mulder td...@cam.ac.uk - Cambridge University Computing Service
+44 1223 3 31843 - New Museums Site, Pembroke Street, Cambridge CB2 3QH
- 14/01/2009 : The Moon is Waning Gibbous (83% of Full)

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] Question about tables history and historystate in DSpace 1.4.2

2009-01-14 Thread Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
We don't use the data in either the history or historystate tables,
however they both have over a million rows in them.  If I delete all
rows in both tables, is this going to cause us any problems?  It doesn't
look like referential integrity is going to be a problem if I run a SQL
query to do this.  We regularly clear out our /dspace/history directory,
since we don't need the data in here either, and this has not caused any
problems.

 

Thanks,

Sue

 

 

Sue Walker-Thornton

ConITS Contract
NASA Langley Research Center
//Integrated Library Systems Application  Database Administrator

130 Research Drive

Hampton, VA  23666

Office: (757) 224-4074
Fax:(757) 224-4001
Pager: (757) 988-2547 
Email:  susan.m.thorn...@nasa.gov mailto:susan.m.thorn...@nasa.gov 

 

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Not found FOM_WebContinuation$UserObject even thoughit is there

2009-01-14 Thread Andrew Marlow
On Wed, Jan 14, 2009 at 9:30 PM, Thornton, Susan M. (LARC-B702)[NCI
INFORMATION SYSTEMS] susan.m.thorn...@nasa.gov wrote:

  The error message is actually looking for WebContinuation$UserObject in
 org/apache/cocoon/components/flow/*JAVASCRIPT*/fom and your grep found it
 in 
 org/apache/cocoon/components/flow/*JAVASC*/fom/FOM_WebContinuation$UserObject.class
 (I've put the difference in ALL CAPS).

 Hope this helps!


Thanks for the tip but actually that was a cut-and-paste error. And I have
found a horrible way to make the error go away. I have disabled all tomcat
security. Arrrghh. Obviously this is not the proper way to configure a
system. But it does get my DSpace working. And it shows that the problem is
related to tomcat security enforcement.

IMO this problem is going to bite more and more people as more and more use
the combination of tomcat5.5 and java 6.

I am quite desperate to resolve the problem I am having on debian running
 any kind of admin function within DSpace. I get the exception shown below.
 One way is to try to create a community but it seems like it happens when I
 try to create anything. The so-called missing class is in the xmlui
 installation in cocoon-2.1.9.jar.

 java.lang.NoClassDefFoundError: 
 org/apache/cocoon/components/flow/javascript/fom/FOM_WebContinuation$UserObject

 
 org.apache.cocoon.components.flow.javascript.fom.FOM_WebContinuation.setPageLocal(FOM_WebContinuation.java:202)

 
 org.apache.cocoon.components.flow.javascript.fom.FOM_Cocoon.jsFunction_sendPage(FOM_Cocoon.java:267)

 inv3.invoke()

 --
Regards,

Andrew M.
http://www.andrewpetermarlow.co.uk
--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Google bots and web crawlers

2009-01-14 Thread Van Ly

This would be a good opportunity to construct a reasonably good default 
robots.txt file and add it to the documentation set.

At http://ses.library.usyd.edu.au/robots.txt, I have the following:

 User-agent: *
 Crawl-Delay: 11
 Disallow: /browse
 Disallow: /browse?
 Disallow: /browse-title
 Disallow: /bitstream
 Disallow: /dspace/
 Disallow: /feed/
 Disallow: /feedback
 Disallow: /password-login
 #Disallow: /retrieve/
 #Disallow: /handle/
 #Disallow: /oai/

/bitstream is intended to deter crawlers from triggering the catalina error + 
dspace warning.

which lines should I re-use from Jeff's example and why?

The lines I have are based on my best guess at what a crawler ought not to be 
interested in.

Thanks in advance.

--
Van Ly : University of Sydney Library


-Original Message-
From: Robert Tansley [mailto:roberttans...@google.com]
Sent: Thu 15/01/2009 7:52 AM
To: Shane Beers
Cc: dspace-tech Tech; Jeffrey Trimble
Subject: Re: [Dspace-tech] Google bots and web crawlers
 
As of DSpace 1.5, sitemaps are supported which allow search engines to
selectively crawl only new items, while massively reducing the server
load:

http://www.dspace.org/1_5_1Documentation/ch03.html#N10B44

Unfortunately, it seems that relatively few DSpace instances actually
use this feature.

I would strongly recommend against blocking  /dspace/bitstream/* and
/dspace/html/*, as these prevent crawlers from accessing the full-text
of items, vital for effective indexing. As of DSpace 1.4.2 (and
possibly earlier), these URLs support the if-modified-after header,
which means that crawlers don't re-retrieve files if they haven't been
changed since the last crawl.

Rob

On Wed, Jan 14, 2009 at 14:20, Shane Beers sbe...@gmu.edu wrote:
 Jeff:
 We had an issue with our local google instance crawling our DSpace
 installation and causing huge issues. I re-wrote the robots.txt to disallow
 anything besides the item pages themselves - no browsing pages or search
 pages and whatnot. Here is a copy of ours:
 User-agent: *
 Disallow: /dspace/browse-author
 Disallow: /dspace/browse-author*
 Disallow: /dspace/items-by-author
 Disallow: /dspace/items-by-author*
 Disallow: /dspace/browse-date*
 Disallow: /dspace/browse-date
 Disallow: /dspace/browse-title*
 Disallow: /dspace/browse-title
 Disallow: /dspace/feedback
 Disallow: /dspace/feedback/*
 Disallow: /dspace/items-by-subject
 Disallow: /dspace/items-by-subject/*
 Disallow: /dspace/handle/1920/*/brow! se-title*
 ace/handle/1920/*/browse-author*
 Disallow: /dspace/handle/1920/*/browse-subject*
 Disallow: /dspace/handle/1920/*/browse-date*
 Disallow: /dspace/handle/1920/*/items-by-subject*
 Disallow: /dspace/handle/1920/*/items-by-author*
 Disallow: /dspace/bitstream/*
 Disallow: /dspace/image/*
 Disallow: /dspace/html/*
 Disallow: /dspace/simple-search*
 This likely would live in your tomcat directory.
 Shane Beers
 Digital Repository Services Librarian
 George Mason University
 sbe...@gmu.edu
 http://mars.gmu.edu
 !
 703- lass=Apple-interchange-newline

 On Jan 14, 2009, at 1:09 PM, Jeffrey Trimble wrote:

 Is there something simple I can place in the jsp that will prohibit the
 crawlers from
 using my server resources?
 TIA,
 Jeff

 Jeffrey Trimble
 Systems Librarian
 Maag Library
 Youngstown State University
 330-941-2483 (Office)
 jtrim...@cc.ysu.edu
 http://www.maag.ysu.edu
 http! ://digita div


 --
 This SF.net email is sponsored by:
 SourcForge Community
 SourceForge wants to tell your story.
 http://p.sf.net/sfu/sf-spreadtheword___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech


 --
 This SF.net email is sponsored by:
 SourcForge Community
 SourceForge wants to tell your story.
 http://p.sf.net/sfu/sf-spreadtheword
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech



--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


[Dspace-tech] performance problems browing communities with large number of members

2009-01-14 Thread Andrew Marlow
I have created a journals community with over 1,400 sub-communities (phew!).
These sub-communities are the journal titles. Within each sub-community
there will be sub-sub-communities, one per issue (an issue will be a
collection). When viewing the journals community it takes ages to load the
page because it builds the complete list of sub-communities. This is proving
to be a performance problem. I wonder, what do people think to the idea of
DSpace breaking down large sub-community lists? The kind of thing I am
thinking of it a paginated list, so the first page gives sub-communities
1-10, the next page is 11-20, and so on.

-- 
Regards,

Andrew M.
http://www.andrewpetermarlow.co.uk
--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Not found FOM_WebContinuation$UserObject even thoughit is there

2009-01-14 Thread Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
The error message is actually looking for WebContinuation$UserObject in
org/apache/cocoon/components/flow/JAVASCRIPT/fom and your grep found it
in
org/apache/cocoon/components/flow/JAVASC/fom/FOM_WebContinuation$UserObj
ect.class (I've put the difference in ALL CAPS).

 

Hope this helps!

Sue Walker-Thornton

susan.m.thorn...@nasa.gov

 



From: Andrew Marlow [mailto:marlow.and...@googlemail.com] 
Sent: Wednesday, January 14, 2009 1:55 PM
To: dspace-tech@lists.sourceforge.net
Subject: [Dspace-tech] Not found FOM_WebContinuation$UserObject even
thoughit is there

 

Hello DSpacers, 

I am quite desperate to resolve the problem I am having on debian
running any kind of admin function within DSpace. I get the exception
shown below. One way is to try to create a community but it seems like
it happens when I try to create anything. The so-called missing class is
in the xmlui installation in cocoon-2.1.9.jar.

java.lang.NoClassDefFoundError:
org/apache/cocoon/components/flow/javascript/fom/FOM_WebContinuation$Use
rObject

 
org.apache.cocoon.components.flow.javascript.fom.FOM_WebContinuation.set
PageLocal(FOM_WebContinuation.java:202)




 
org.apache.cocoon.components.flow.javascript.fom.FOM_Cocoon.jsFunction_s
endPage(FOM_Cocoon.java:267)

inv3.invoke()


Here is the output from jar (via grep) that shows the class is there.

   637 Tue Apr 11 11:56:44 BST 2006
org/apache/cocoon/components/flow/javasc/fom/FOM_WebContinuation$UserObj
ect.class


I am using DSpace 1.5.1, tomcat 5.5 and java 1.6.0_11 (and postgres 8.3
FWIW). The jar does have read permission. I did a chmod 777 to be
absolutely sure. Can anybody help me please?

-- 
Regards,

Andrew M.
http://www.andrewpetermarlow.co.uk

--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Not found FOM_WebContinuation$UserObject even thoughit is there

2009-01-14 Thread Van Ly
Andrew,

Try exposing the value of CLASSPATH in the context of where java is called for 
DSpace. Locate your .jar file and check its path is listed in CLASSPATH.

I find this command useful for showing all the gory details of java and dspace

 /bin/ps -ww -Heo cmd |grep java

-- Van Ly : University of Sydney Library




From: Thornton, Susan M. (LARC-B702)[NCI INFORMATION SYSTEMS]
Sent: Thu 15/01/2009 8:30 AM
To: Andrew Marlow; dspace-tech@lists.sourceforge.net
Subject: Re: [Dspace-tech] Not found FOM_WebContinuation$UserObject even 
thoughit is there


The error message is actually looking for WebContinuation$UserObject in 
org/apache/cocoon/components/flow/JAVASCRIPT/fom and your grep found it in 
org/apache/cocoon/components/flow/JAVASC/fom/FOM_WebContinuation$UserObject.class
 (I've put the difference in ALL CAPS).

Hope this helps!
Sue Walker-Thornton
susan.m.thorn...@nasa.gov




From: Andrew Marlow [mailto:marlow.and...@googlemail.com] 
Sent: Wednesday, January 14, 2009 1:55 PM
To: dspace-tech@lists.sourceforge.net
Subject: [Dspace-tech] Not found FOM_WebContinuation$UserObject even thoughit 
is there

Hello DSpacers, 

I am quite desperate to resolve the problem I am having on debian running any 
kind of admin function within DSpace. I get the exception shown below. One way 
is to try to create a community but it seems like it happens when I try to 
create anything. The so-called missing class is in the xmlui installation in 
cocoon-2.1.9.jar.
java.lang.NoClassDefFoundError: 
org/apache/cocoon/components/flow/javascript/fom/FOM_WebContinuation$UserObject

org.apache.cocoon.components.flow.javascript.fom.FOM_WebContinuation.setPageLocal(FOM_WebContinuation.java:202)



org.apache.cocoon.components.flow.javascript.fom.FOM_Cocoon.jsFunction_sendPage(FOM_Cocoon.java:267)
inv3.invoke()

Here is the output from jar (via grep) that shows the class is there.

   637 Tue Apr 11 11:56:44 BST 2006 
org/apache/cocoon/components/flow/javasc/fom/FOM_WebContinuation$UserObject.class


I am using DSpace 1.5.1, tomcat 5.5 and java 1.6.0_11 (and postgres 8.3 FWIW). 
The jar does have read permission. I did a chmod 777 to be absolutely sure. Can 
anybody help me please?

-- 
Regards,

Andrew M.
http://www.andrewpetermarlow.co.uk/
--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] Make LDAP login default but still allow Password logins

2009-01-14 Thread Eric Luhrs
I ended up using a Java web filter called UrlRewriteFilter to redirect
requests for /login to /ldap-login (http://tuckey.org/urlrewrite/),
which is similar to using mod_rewrite in Apache.  This solves my
immediate need, but I'd still be interested to know if there is
another way to do this, especially one that doesn't create another
library dependency.

Eric

On Wed, Jan 14, 2009 at 4:09 PM, Eric Luhrs elu...@gmail.com wrote:
 Our DSpace server has LDAP and Password authentication enabled.  Is
 there an easy way to modify the Manakin Reference theme's login
 links to point directly to http://server/ldap-login rather than the
 login chooser at http://server/login?  I need to leave password
 authentication enabled on the system, but I'm trying to simplify login
 for general users who will all use LDAP. This way, general users can
 click the login link for LDAP, and administrators can manually type
 the URL for Password login.

 Has anyone done something similar?

 Thanks in advance,

 Eric Luhrs
 Lafayette College


--
This SF.net email is sponsored by:
SourcForge Community
SourceForge wants to tell your story.
http://p.sf.net/sfu/sf-spreadtheword
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech


Re: [Dspace-tech] HANDLE server / Oracle / Firewall

2009-01-14 Thread Jordi Barrios

Hi Stuart,


Stuart Lewis [sdl] wrote:
 
 Hi Tom,
 
 I have my handle server up and running, it¹s responding over the
 necessary
 ports. But it¹s not able to get the actual handles from my Oracle
 database.
 The database box is located across a firewall, and we¹ve already opened
 a
 port
 so that the DSpace application itself can communicate with it. Does the
 handle
 server require or use a separate port that we will need to open, or does
 it
 use a different protocol?
 Which logs would even contain the errors I'm looking for?
 
 Edit [dspace]/config/log4j-handle-plugin.properties and change it so that
 rather than logging at the INFO level it logs at the DEBUG level. That
 file
 will also tell you where the log file is, probably
 [dspace]/log/handle-plugin.log
 
 If you then restart the handle server (make sure all the old handle server
 processes are killed - there might be two of them) and start it again,
 debug-level logs should appear in that file. Hopefully these will help
 track
 down the problem.
 
 Thanks,
 
 
 Stuart
 _
 
 Gwasanaethau Gwybodaeth  Information Services
 Prifysgol Aberystwyth  Aberystwyth University
 
 E-bost / E-mail: stuart.le...@aber.ac.uk
  Ffon / Tel: (01970) 622860
 _
 
 
 --
 Check out the new SourceForge.net Marketplace.
 It is the best place to buy or sell services for
 just about anything Open Source.
 http://p.sf.net/sfu/Xq1LFB
 ___
 DSpace-tech mailing list
 DSpace-tech@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/dspace-tech
 
 

I'm detecting the same problem.

The installation of dspace works fine.
Like Thomas I can do a telnet to 2641 or 8000 ports and the server respond.
If I use the gui tools of handle server and I do a Home Naming Authority I
can see the response of the server.
If I query any handle from other entity like 10.1000/3 my server redirect to
correct page, but if I query some local handle the client throw this
exception

  sending HDL-TCP request (version=2.1; oc=1; rc=0; snId=0 caCrt noAuth
10363/573 [ ] [ ]) to 213.73.40.217:2641
Saving global values to: /home/jbarrios/.handle/root_info
java.net.SocketTimeoutException: Read timed out
at java.net.SocketInputStream.socketRead0(Native Method)
at java.net.SocketInputStream.read(SocketInputStream.java:129)
at
net.handle.hdllib.HandleResolver.sendHdlTcpRequest(HandleResolver.java:1740)
at
net.handle.hdllib.HandleResolver.sendRequestToInterface(HandleResolver.java:1294)
at
net.handle.hdllib.HandleResolver.sendRequestToServerByProtocol(HandleResolver.java:1033)
at
net.handle.hdllib.HandleResolver.sendRequestToSite(HandleResolver.java:849)
at
net.handle.hdllib.HandleResolver.sendRequestToService(HandleResolver.java:747)
at
net.handle.hdllib.HandleResolver.processRequestGlobally(HandleResolver.java:544)
at
net.handle.hdllib.HandleResolver.processRequest(HandleResolver.java:519)
at
net.handle.hdllib.HandleResolver.processRequest(HandleResolver.java:529)
at
net.handle.apps.gui.hadmin.HandleTool$ProcessRequest.run(HandleTool.java:578)
at net.handle.awt.TaskIndicator.run(TaskIndicator.java:99)
at java.lang.Thread.run(Thread.java:619)
  sending HDL-HTTP request (version=2.1; oc=1; rc=0; snId=0 caCrt noAuth
10363/573 [ ] [ ]) to 213.73.40.217:8000
java.net.SocketTimeoutException: Read timed out
at java.net.SocketInputStream.socketRead0(Native Method)
at java.net.SocketInputStream.read(SocketInputStream.java:129)
at java.io.BufferedInputStream.fill(BufferedInputStream.java:218)
at java.io.BufferedInputStream.read(BufferedInputStream.java:237)
at java.io.DataInputStream.readLine(DataInputStream.java:496)
at
net.handle.hdllib.HandleResolver.sendHttpRequest(HandleResolver.java:1930)
at
net.handle.hdllib.HandleResolver.sendRequestToInterface(HandleResolver.java:1297)
at
net.handle.hdllib.HandleResolver.sendRequestToServerByProtocol(HandleResolver.java:1033)
at
net.handle.hdllib.HandleResolver.sendRequestToSite(HandleResolver.java:849)
at
net.handle.hdllib.HandleResolver.sendRequestToService(HandleResolver.java:747)
at
net.handle.hdllib.HandleResolver.processRequestGlobally(HandleResolver.java:544)
at
net.handle.hdllib.HandleResolver.processRequest(HandleResolver.java:519)
at
net.handle.hdllib.HandleResolver.processRequest(HandleResolver.java:529)
at
net.handle.apps.gui.hadmin.HandleTool$ProcessRequest.run(HandleTool.java:578)
at net.handle.awt.TaskIndicator.run(TaskIndicator.java:99)
at java.lang.Thread.run(Thread.java:619)
HandleException (CANNOT_CONNECT_TO_SERVER) Error talking to 213.73.40.217
at