Re: [Dspace-tech] sitemaps in 1.8

2014-11-12 Thread helix84
Hi Monika, the sitemap helps spiders discover the site, but it does not stop them from crawling other URLs. robots.txt serves that purpose. Therefore, DSpace feeds the spider a list of all item pages, but spiders will index the links from those pages, too. In our case - bitstreams. Recently there
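
The distinction drawn above (a sitemap advertises URLs, robots.txt restricts crawling) can be sketched with Python's standard urllib.robotparser. This is only an illustration: the robots.txt rules, the /bitstream/ path prefix, and the repository.example.org host are assumptions, not taken from the thread.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content; the /bitstream/ prefix is an assumption
# chosen for illustration, not a rule quoted from the thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /bitstream/
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Build a parser from robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical item page and bitstream URLs on a placeholder host.
item_url = "http://repository.example.org/handle/123456789/1"
bitstream_url = "http://repository.example.org/bitstream/123456789/1/file.pdf"

item_allowed = parser.can_fetch("*", item_url)
bitstream_allowed = parser.can_fetch("*", bitstream_url)

print("item page crawlable:", item_allowed)
print("bitstream crawlable:", bitstream_allowed)
```

Under these assumed rules a well-behaved crawler may fetch the item page but not the bitstream, whether or not either URL is listed in a sitemap.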

Re: [Dspace-tech] sitemaps in 1.8

2014-11-12 Thread Monika Mevenkamp
The sitemap generated does not include any links to bitstreams. As I understand it: sitemaps list all the links a crawler should digest, essentially saying these links but no others. If that is how it works crawlers will not index bitstreams when using the standard generated sitemap. As you

Re: [Dspace-tech] sitemaps in 1.8

2014-11-05 Thread Monika Mevenkamp
/jspui/htmlmap works great, and /jspui/sitemap as well for the XML version. Thanks, Monika On 11/4/14, 2:25 PM, Claudia Jürgen wrote: Hello Monika, the link to access the sitemaps is http://asdspace300l.princeton.edu/jspui/htmlmap which is contai

Re: [Dspace-tech] sitemaps in 1.8

2014-11-04 Thread Claudia Jürgen
Hello Monika, the link to access the sitemaps is http://asdspace300l.princeton.edu/jspui/htmlmap which is contained as a relative link "jspui/htmlmap" in your footer. Hope this helps, Claudia On 04.11.2014 20:14, Monika Mevenkamp wrote: I ge

[Dspace-tech] sitemaps in 1.8

2014-11-04 Thread Monika Mevenkamp
I generated sitemaps with the dspace generate-sitemaps command, which created lots of files in /dspace/sitemaps. But I am not sure which URL to use to get to these generated files. I used Apache to Alias /sitemap /dspace/sitemaps so you can have a look at the generated files as they sit on the

Re: [Dspace-tech] Sitemaps

2013-10-30 Thread Sean Carte
On 31 October 2013 04:29, Andrea Schweer wrote: > On 30/10/13 21:20, Sean Carte wrote: > > It would be great if I could find a way to speed up the resolution of > > handle links. Any ideas? > > When I had this issue in one of "my" repositories, it came down to > firewall settings. This is what th

Re: [Dspace-tech] Sitemaps

2013-10-30 Thread Hilton Gibson
I remember. See: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Handle_Server#Step_8_-_Firewall_Ports On 31 October 2013 04:29, Andrea Schweer wrote: > Hi, > > On 30/10/13 21:20, Sean Carte wrote: > > It would be great if I could find a way to speed up the resolution of > > handle links. Any i

Re: [Dspace-tech] Sitemaps

2013-10-30 Thread Andrea Schweer
Hi, On 30/10/13 21:20, Sean Carte wrote: > It would be great if I could find a way to speed up the resolution of > handle links. Any ideas? When I had this issue in one of "my" repositories, it came down to firewall settings. This is what the CNRI people told us back then: > It doesn't look like

Re: [Dspace-tech] Sitemaps

2013-10-30 Thread Sean Carte
On 30 October 2013 10:29, Hilton Gibson wrote: > About DNS lookups, I normally try caching the lookups using "nscd" on > an Ubuntu server. > The other culprit can be the campus DNS server. > We have a "race" between the MS DNS servers and the internet DNS servers > at the moment. > That's def

Re: [Dspace-tech] Sitemaps

2013-10-30 Thread Hilton Gibson
Hi Sean, About DNS lookups, I normally try caching the lookups using "nscd" on an Ubuntu server. The other culprit can be the campus DNS server. We have a "race" between the MS DNS servers and the internet DNS servers at the moment. Cheers hg On 30 October 2013 10:20, Sean Carte wrote: > >

Re: [Dspace-tech] Sitemaps

2013-10-30 Thread Sean Carte
On 29 October 2013 16:50, helix84 wrote: > On Tue, Oct 29, 2013 at 3:31 PM, Sean Carte wrote: > > Thanks, Ivan. I've updated dspace.cfg, and regenerated the sitemaps. > > Sitemap looks good. > > Since you've likely been using the duplicate > http://ir.dut.ac.za/xmlui/ URL for a while and you mig

Re: [Dspace-tech] Sitemaps

2013-10-29 Thread helix84
Hi Stefanie, first, when starting a new topic on the mailing list, please start a new email thread. On Tue, Oct 29, 2013 at 8:19 PM, Stefanie Behnke wrote: > I have created a file "contents" with: > 009-013.pdf permissions:-r eg-member > > but it does not work. I have got the error message: >

Re: [Dspace-tech] Sitemaps

2013-10-29 Thread Stefanie Behnke
Dear all, I am referring to the following page: https://wiki.duraspace.org/display/DSDOC3x/Importing+and+Exporting+Items+via+Simple+Archive+Format I have created a file "contents" with: 009-013.pdf permissions:-r eg-member but it does not work. I have got the error message: at java.lang.r

Re: [Dspace-tech] Sitemaps

2013-10-29 Thread helix84
On Tue, Oct 29, 2013 at 3:31 PM, Sean Carte wrote: > Thanks, Ivan. I've updated dspace.cfg, and regenerated the sitemaps. Sitemap looks good. Since you've likely been using the duplicate http://ir.dut.ac.za/xmlui/ URL for a while and you might want to migrate away from it without breaking links,

Re: [Dspace-tech] Sitemaps

2013-10-29 Thread Sean Carte
On 29 October 2013 11:25, helix84 wrote: > I do see it at http://ir.dut.ac.za/sitemap > Here's what the URL returns: > > > http://www.sitemaps.org/schemas/sitemap/0.9";> > http://ir.dut.ac.za:8080/xmlui/sitemap?map=0 > 2013-10-29T08:08:14Z > > > Even though everything works, you might want to
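
The XML tags in the quoted response appear to have been stripped by the archive. As a rough sketch, assuming the response was a standard sitemaps.org 0.9 sitemap index, the following Python reads such an index and reports each entry's port, which makes a leftover :8080 in the advertised URL easy to spot. The element names in SITEMAP_INDEX are an assumed reconstruction; only the namespace, the URL, and the timestamp come from the message.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlsplit

# Namespace from the quoted response.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}

# Assumed reconstruction of the index document; the tags follow the
# sitemaps.org 0.9 schema and were not visible in the archived message.
SITEMAP_INDEX = """\
<sitemapindex xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <sitemap>
    <loc>http://ir.dut.ac.za:8080/xmlui/sitemap?map=0</loc>
    <lastmod>2013-10-29T08:08:14Z</lastmod>
  </sitemap>
</sitemapindex>
"""


def list_sitemaps(index_xml):
    """Return (loc, lastmod) pairs for every <sitemap> entry in the index."""
    root = ET.fromstring(index_xml)
    entries = []
    for node in root.findall("sm:sitemap", SITEMAP_NS):
        loc = node.findtext("sm:loc", default="", namespaces=SITEMAP_NS)
        lastmod = node.findtext("sm:lastmod", default="", namespaces=SITEMAP_NS)
        entries.append((loc.strip(), lastmod.strip()))
    return entries


entries = list_sitemaps(SITEMAP_INDEX)
# An explicit port is an int; None means the URL uses the scheme's default port.
ports = [urlsplit(loc).port for loc, _ in entries]

for (loc, lastmod), port in zip(entries, ports):
    print(loc, lastmod, "port:", port)
```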

Re: [Dspace-tech] Sitemaps

2013-10-29 Thread helix84
On Tue, Oct 29, 2013 at 8:45 AM, Sean Carte wrote: > Google tries to find them at , > which generates a 'org.apache.cocoon.ResourceNotFoundException: Page cannot > be found' error. I'm not sure why that is. I do see it at http://ir.dut.ac.za/sitem

Re: [Dspace-tech] Sitemaps

2013-10-29 Thread Hilton Gibson
Also see: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Repository_Website_Metrics#Google_Sitemap On 29 October 2013 09:45, Sean Carte wrote: > Our old server died and I've managed to get everything functioning apart > from sitemaps. > > The sitemaps are generated by /dspace/bin/dspace genera

Re: [Dspace-tech] Sitemaps

2013-10-29 Thread Hilton Gibson
Try: http://ir.dut.ac.za/htmlmap Cheers hg On 29 October 2013 09:45, Sean Carte wrote: > Our old server died and I've managed to get everything functioning apart > from sitemaps. > > The sitemaps are generated by /dspace/bin/dspace generate-sitemaps : > > $ ls -l /dspace/sitemaps/ > total 116

[Dspace-tech] Sitemaps

2013-10-29 Thread Sean Carte
Our old server died and I've managed to get everything functioning apart from sitemaps. The sitemaps are generated by /dspace/bin/dspace generate-sitemaps : $ ls -l /dspace/sitemaps/ total 116 -rw-rw-r-- 1 tomcat7 tomcat7 51165 Oct 29 08:08 sitemap0.html -rw-rw-r-- 1 tomcat7 tomcat7 7062 Oct 29