Re: [CODE4LIB] code4lib mailing list
Cary Gordon writes > You can get enough server for this from AWS for $5-10/mo. The problem is not the ram or the disk space. It's in the policy issues around email. Many organizations have outsourced their email to gmail these days. I doubt "enough server" was the deciding factor. -- Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] code4lib mailing list
Paul Hoffman writes > If you're interested, Eric, I have some experience with Mailman (though > not with Listserv) and would be happy if I can -- I have some scripts to > do bulk operations (add or remove subscribers, etc.) and could also help > to migrate the list archive. I find that this is the most important contribution I have seen here in this thread. I have run Mailman over ten years for NEP http://nep.repec.org I am also running it for NYLUG http://mail.nylug.org/mailman/listinfo It's not just a case of running a box that has Mailman on it. It's also important to have an infrastructure that sends bulk email and that is not landing up in spam filters. And it's a matter of spam filtering on the list email sending box. The NEP server has a sender score https://www.senderscore.org/ score of 99/100 last time I looked but you don't get there instantaneously. You also need a hoster that is email friendly. So the list of tasks as I see it is 1. Find a sponsor for a dedicated root server, have them pay for the server. You can get a server for about $50 a month. 2. Decide on a domain and set up access for server admin to domain records, including SPF and DKIM. 3. Set up the server with linux. 4. Set email software (exim or postfix or ...) and mailman or sympa, as well as say spam assassin. 5. Migrate members and email archives. For somebody who knows what (s)he is doing 2-4 is not a big deal but it needs a few hours of work and a commitment to some maintenance. 5 is the job that dwarfs everything else. But if Paul is volunteering (or could be sponsored) to lead that forward then you have a realistic case to run it on a community and open-source base. -- Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] Job: Wine Loving Developer at University of California, Davis
j...@code4lib.org writes > **PROJECT DETAILS** > The UC Davis University Library is launching a project to digitize the > [Amerine wine label collection](https://www.flickr.com/photos/brantley/sets/72 > 157655817440104/with/21116552632/) Some look like hard to read. > and engage the public to transcribe the information contained on the > labels and associated annotations. This may take a long time. I suggest rather than doing that, take somebody in a low-income country who speaks French, say, and who will type all the data in. That way you get consistency in the data. I live in Siberia, I can find somebody there. Once this data is in a simple text file, you can use in-house staff to attach it to the label images in your systems. Crowdsource sounds cool, but for 4000 label it makes no sense. If the typist gets $10/h, and gets 20 labels done in 1h, we are talking $200. The visit you are planning for your developer will cost that much. -- Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
[CODE4LIB] OAI9 registrations close on 30 May
The OAI9 Workshop on Current Developments in Scholarly Communication, 17-19 June 2015, is fast approaching. For a detailed view of the Tutorials and Programme for the event, see https://indico.cern.ch/event/332370/timetable/#20150617 Applicants for the poster session have now been confirmed and over 30 posters will be available for viewing and discussion with the poster submitters in Geneva. Registration for the Workshop closes on 30 May. The OAI Workshops are well known for providing a setting where developments in the world of scholarly communication are displayed and discussed. Do join us if you would like to be part of this conversation by registering to attend the Workshop at https://indico.cern.ch/event/332370/registration/ Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
[CODE4LIB] OAI9 poster submission deadline
The OAI Workshop on Current Developments in Scholarly Communication is being held in the University of Geneva on 17-19 June 2015. It has a call for posters. The deadline is 17 April 2015. See http://indico.cern.ch/event/332370/page/6 for more details. The Workshop will contain 6 plenary session, focussing on the following topics: 1. A Technical Open Access/Open Science session led by Herbert Van de Sompel 2. Barriers and Impact 3. Open Science Workflows: CHORUS and SHARE 4. Quality Assurance 5. Institution as Publisher 6. Digital Curation and preservation of large and complex scientific objects Use https://indico.cern.ch/event/332370/registration/ to register. Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
[CODE4LIB] OAI9: Call for Posters
You are invited to submit a description in the form of a short abstract if you wish to bring a poster to the workshop giving details of your project. The poster should be of interest to OAI9 participants and directly related to the general themes of the workshop (http://indico.cern.ch/event/332370/page/6). Posters will be displayed in Campus Biotech and an extended coffee break will take place on Thursday 18 June 2015. This will give attendees the chance to view these and discuss them with the author. Attendees will also have the opportunity to vote for the poster which delivers the most impact. Posters should be A0 in size (841 x 1189 mm) for portrait or A1 (594 x 841 mm) for landscape. Any special equipment requests should be addressed to the workshop organisers when a poster has been accepted. If your poster is accepted you should still register for the workshop as normal and you will be expected to pay your own expenses. Owing to the large demand on accommodation, we advise you to register early - you may cancel your registration later if your submission is not successful. Poster abstracts can be submitted between 16 March 2015 - 17 April 2015 after a quick Lightweight Accounts registration process (different from the conference registration). Decisions will be made on an ongoing basis (and no later than the end April) and communicated to the submitters (http://indico.cern.ch/event/332370/call-for-abstracts/) Key dates == Abstract submission opening day: 16 March 2015 Abstract submission deadline: 17 April 2015 Print service == If you wish, your poster can be printed by the University of Geneva print service and delivered at Campus Biotech on Thursday 18th June. If you are interested, send your work in PDF format to dimitri.do...@unige.ch before 17th May 2015. (Please note that posters created with Microsoft PowerPoint should be sent in PDF and PPT formats.) Cost of this service is CHF 33.-, to be paid on delivery at the main desk. We look forward to receiving your abstracts – and seeing your posters. Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
[CODE4LIB] OAI9 Workshop in Geneva 17-19 June 2015
The OAI9 Workshop on Current Developments in Scholarly Communication is taking place in the University of Geneva and in CERN, Geneva, on 17-19 June 2015. The meeting's web site is http://indico.cern.ch/event/332370/ There are six plenary sessions * Technical developments * Barriers and impact * CHORUS and SHARE * Quality assurance * The institution as publisher * Digital curation and preservation of large and complex scientific objects The tutorials, which start the Workshop, are devoted to: * The institution as publisher: getting started * Author identification systems * Open Monograph Press * Hiberlink project * Managing a digitization project * Open Access Café 2015 Five breakout groups have been arranged so far for group discussions: * OA policy * Legal framework for innovative science - text and data mining * Research data management * Open annotations * Managing APC payments There will also be 20+ posters in the timetabled poster session. We will soon issue a call for posters. The OAI Workshops provide a space for all those interested in developments in scholarly communication to come together to learn from each other, to exchange ideas, and to hear papers from leading experts in the field. They are rather prominent European events in the year in which they are held. Registration is open at http://indico.cern.ch/event/332370/registration/register#/register The OAI Organisers (see http://indico.cern.ch/event/332370/page/7) look forward to meeting you all in Geneva in June. For the OAI9 Organising Committee with cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
[CODE4LIB] announcing OAI9 in Geneva 17-19 June 2015
The CERN Workshop on Innovations in Scholarly Communication (OAI9) University of Geneva June 17th-19th 2015 This series of Workshops in Geneva has become the major community event in Europe in the year in which it is held. For these three days, librarians, IT professionals, publishers and researchers come together to network, hear presentations from keynote speakers, attend tutorials on cutting-edge themes, and congtribute their ideas through breakout/technical sessions and poster displays. The workshop is designed to provide a focus for the interchange of ideas, the building of new partnerships, the annoucement of new developments and the celebration of success in innovation in the whole scholarly communications process. The workshop will be held in the University of Geneva at the Institute of Graduate Studies and Campus Biotech. Both locations are close to each other and easily accessible on the Geneva tram network. The Programme Committee is currently drawing up an innovative programme for the meeting. Please reserve the dates for OAI9 in your diaries now. Keep an eye on the Workshop website at http://indico.cern.ch/e/oai9, which also lists the Twitter feed and hashtag for the meeting. On behalf of the OAI9 Programme Committee, I look forward to seeing you in the University of Geneva to hear news of current developments in scholarly communication. Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] Research Gate vs. Institutional Repositories
Matthew Sherman writes I have run into an assortment of faculty that are convinced the Research Gate should replace the institutional repository at their schools. It will only be a short time until ResearchGate is sold to an established player. I heard from a well-informed source that Elsevier are interested in buying it. -- Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] Canberra event -- Ed Summers at NLA, 2 December
Rosalyn Metz writes not sure if i should be jealous of nla for getting ed to speak, or of ed for getting to go to australia. For me, definitely the former. I have much respect for Ed, and for me Australia is a boring place to go to. -- Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] ERIC mirror during government shutdown
Jared Camins-Esakov writes Thanks to the USA's federal government shutdown, the ERIC (Educational Resources Information Center) database is unavailable. Gets me thinking: I have a copy of PubMed XML data... in case anybody needs it just contact me. Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] LibGuides: I don't get it
Galen Charlton writes Lock-in doesn't have to be absolute to be effective, it just has to has raise the bar sufficiently high to make users think twice about migrating away. I fully agree with this. In general, lock-in is pervasive in the use of any information product. It even appears in the informational use of non-information products. Example: a supermarket provides you with your groceries. It's not an information product. Yet, you will prefer to use a supermarket that you are familiar with because you know where to find what you want. Lock-in reduces competitive forces. An important advantage of open-source solutions is that they reduce lock-in. They can't eliminate it because it is generic to the nature of information. As a general statement -- and I know that this battle has been bitterly fought in the ILS space -- It is not bitterly fought elsewhere because people just don't think this far. They think, say, oh Google gives me such a great infrastructure for my email. And I don't care about the spying thrown in for good measure. So let me go for it. But twenty years from now will you have an archive of your mails? If you change providers, do you migrate the email archives? These are important questions to ask. I have not used Google mail, neither have I used libguides, so I have no idea how easy or how hard it is to migrate. But it is important to keep this is in mind when choosing between informational products. I believe that *all* library software services, whether based on F/LOSS software or proprietary software, should provide a way for the library to obtain a full dump of their data, in an accessible format, at no additional charge. I could not agree more. I don't think this is given enough prominence. I see that LibGuides advertises the ability to make local backups of individual pages and also provides (via a paid add-on module) an XML export function. I don't know if SpringShare will also provide free one-time exports on request, but I would hope they do. Spot on Galen, you raise the important (IMHO) issue. Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] LibGuides: I don't get it
Andrew Darby writes I don't get this argument at all. I breathe a sigh of relief. I didn't understand it either, but I blamed my brain fog. Maybe the vendor option makes sense, maybe the open source option does. The vendor option may be based on it just hosting the open source option. I do that sort of thing. LibGuides don't seem to do that, as they appear to have their own proprietary software. Wilhelmina Randtke writes: I also don't see vendor lock in issues in LibGuides, since the research guides concept includes routine change and replacing content. No lock in because you can rewrite everything? Hmm... Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] LibGuides: I don't get it
stuart yeates writes If you have no in-house technical capability, the cost of looking at an open source alternative can easily outweigh the multi-year licensing fee. Yeah, but if you don't have an in-house technical capability you condemn yourself to history. I bet that in the middle of the 21st century, no in-house technical capability will be the same thing as having no space for books in the middle of the twentieth century. Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] Python and Ruby
Ed Summers writes Ok, I think I'm going to have nightmares about that. It will have to support tippex on screens. Cheers, Thomas Krichelhttp://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] Anyone have access to well-disambiguated sets of publication data?
Paul Albert paa2...@med.cornell.edu writes I am exploring methods for author disambiguation, and I would like to have access to one or more set of well-disambiguated data set containing: – a unique author identifier (email address, institutional identifier) – a unique article identifier (PMID, DOI, etc.) – a unique journal identifier (ISSN) The RePEc Author service (created by yours truly in the late 90s) was the first author claiming service. Its data is freely available, bar email addresses. You probably want to get the entire RePEc dataset and then filter what you want. Cheers, Thomas Krichelhttp://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] Anyone have access to well-disambiguated sets of publication data?
Thomas Krichel writes The RePEc Author service (created by yours truly in the late 90s) was the first author claiming service. Its data is freely available, bar email addresses. Actually DBLP can also be of interest to you. The site has identified authors. I presume they are done by Michael Ley himself. Unforturnately I have not seen that data in the DBLP XML dump, so you would have to crawl the web site. But between these two sets, you sholud have enough for months of fun. Cheers, Thomas Krichelhttp://openlib.org/home/krichel skype:thomaskrichel
Re: [CODE4LIB] Get building with Trove at GovHack (AU) 2013
Tim Sherratt writes What could you do with 90 million newspaper articles, 7 million photos or objects, or the details of more than 17 million books? Nothing unless I can access to the full copy of the data on the 17 million books, of which I would then proceed to extract a subset of data and reduced elements that I need. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] On-going support for DL projects
Kevin Hawkins writes Digital humanities centers have been dealing with these questions as they've accumulated projects, and sometimes they have dumped them on libraries to try to preserve. dumped maybe but I would still try to see this as an opportunity for libraries to move into a growing activity. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] programme for OAI8 in Geneva 19-21 June 2013
Thomas Krichel writes The Programme can be found at http://indico.cern.ch/conferenceDisplay.py?confId=3D211600. This contains an extra 3D from quoted-printable encoding. The correct URL is http://indico.cern.ch/conferenceDisplay.py?confId=211600. Sorry! Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
[CODE4LIB] programme for OAI8 in Geneva 19-21 June 2013
OAI8 Workshop in Geneva, 19-21 June 2013 The OAI8 Workshop on Current Developments in Scholarly Communication is taking place in the University of Geneva and in CERN, Geneva, on 19-21 June 2013. The Programme can be found at http://indico.cern.ch/conferenceDisplay.py?confId=3D211600. There are six plenary sessions on: * Technical developments * Metrics * Semantic Indexing * Research Data * Arts, Humanities and Social Sciences * Gold Open Access Infrastructures The Tutorials, which start the Workshop, are devoted to: * Research Data Services * OJS, beyond editorial tradition * The NISO/OAI ResourceSync Synchronization Framework * Open Access Café 2013 * Metrics * Metadata for the Research Lifecycle Five Breakout Groups have been arranged so far for group discussions: * Altmetrics * Open Access Policy developments * How to make your university a monograph publisher * Open Annotations * Gold Open Access infrastructures There will also be 20+ posters in the timetabled poster session. OAI Workshops are prominent European OA events in the year in which they are held. Places are still available and registration is open at http://indico.cern.ch/confRegistrationFormDisplay.py/display?confId=3D211600. The OAI Workshops provide a space for all those interested in developments in Scholarly Communication to come together to learn from each other, to exchange ideas, and to hear papers from leading experts in the field. The OAI Organising Committee (see http://indico.cern.ch/internalPage.py?pageId=3D7confId=3D211600) looks forward to meeting you all in Geneva in June For the OAI8 Organising Committee with cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
[CODE4LIB] call for posters at OAI8 in Geneva 19-21 June 2013
OAI8, the 8th Workshop on Innovations in Scholarly Communication will be held in Geneva, Switzerland, from Wednesday 19th to Friday 21st June 2013. Program details, registration and the call for posters are now available at https://indico.cern.ch/event/oai8. It is possible to register for a part or all of the programme. You are invited to submit a description in the form of a short abstract if you wish to bring a poster to the workshop giving details of your project. The poster should be of interest to OAI8 participants and directly related to the general themes of the workshop. Posters will be displayed in the Uni Mail main hall and an extended coffee break will take place on Thursday 20 June 2013. This will give attendees the chance to view these and discuss them with the author. Attendees will also have the opportunity to vote for the poster which delivers the most impact. Posters should be A0 in size (841 x 1189 mm) for portrait or A1 (594 x 841 mm) for landscape. Any special equipment requests should be addressed to the workshop organisers when a poster has been accepted. If your poster is accepted you should still register for the workshop as normal and you will be expected to pay your own expenses. Owing to the large demand on accommodation, we advise you to register early - you may cancel your registration later if your submission is not successful. Poster abstracts can be submitted until 1 April 2013 after a quick registration process (different from the conference registration). Decisions will be made on an ongoing basis (and no later than the end April) and communicated to the submitters. The poster submission form is available at https://indico.cern.ch/abstractSubmission.py?confId=211600 The committee looks forward to welcoming you to Geneva. For the OAI8 Organising Committee with cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] back to minorities question, seeking guidance
Wilhelmina Randtke writes Pretty much the whole entire entry level programming class for the average class covers using code to do things that you can do much more easily without code. Probably it was the wrong course. I think coding should start with building web pages. A calculator can't do that. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
[CODE4LIB] registration open for OAI8 in Geneva 19-21 June 2013
OAI8, the 8th Workshop on Innovations in Scholarly Communication will be held in Geneva, Switzerland, from Wednesday 19th to Friday 21st June 2013. Program details and registration are now available at https://indico.cern.ch/event/oai8. It will be possible to register for a part or all of the programme. The workshop will follow the successful format of previous sessions mixing practical tutorials, presentations from cutting-edge projects and research, discussion groups, posters, and an intense social programme to maximise interaction and communication. Previous workshops have built a strong community spirit. The event is a unique opportunity to exchange ideas and contact details with a large public connected to the OA movement. The OAI workshops are a series of the most important international meetings in this field and take place roughly every two years. Each iteration of the workshop series has dealt with issues relevant to today. This year, research data will be one of the topics tackled. In the light of the Royal Society Report Science as an Open Enterprise, European universities are beginning themselves to consider the impact of the data deluge. The workshop will also revisit the topic of metrics and suggest new approaches. Thanks to the continued support of our sponsors, the organisers have been able to maintain the modest registration fee at the same level as for the previous workshop, i.e. CHF 275. Moreover, a special early bird fee of CHF 230 is offered until Wednesday 27th of March. Further information will be added to the website https://indico.cern.ch/event/oai8, including details of the Breakout Groups, of a pre-conference day about Duraspace, and the Call for posters. The committee looks forward to welcoming you to Geneva. For the OAI8 Organising Committee with cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] How to configure subdomain hosting without domain hosting?
Wilhelmina Randtke writes I'm trying to get a subdomain of my university's domain pointed at content on a cheapie hosting account. To do this, I can get main campus to put in a CNAME record with the IP address matching where the DNS for my cheapie hosting account is currently located in the cheapie hosting company's system. The problem is, this IP will periodically change, meaning main campus IT will have to be involved periodically down the line in order to cut and paste the new IP into their system, and meaning that the hosted services could go unavailable for a few days when this happens. I am probably something missing here, as my experience is with root servers rather than web hosting. But I do know a bit about DNS. My expernienc suggests that once you have a CNAME, in BIND notation foo IN CNAME bar the name foo is replaced by name bar. There is no IP address involved. If bar changes changes IP address, the IP address of foo also changes. In fact, all record types attached to bar carry over to foo. So you can't say foo IN CNAME bar foo IN NS widget as the NS (nameserver) for foo is the same as the NS for bar, not widget. Am I doing this the hard way? You have not told us what you do. *How would you go about getting a subdomain of your university's URL to point at your cheapie webhosting account? * If your webhoster gives you a URL at http://randtke.webhoster.com your uni DNS can just say randtke IN CNAME randtke.webhoster.com. Subdomain forwarding with masking then storing content at a random URL but having it appear to be on the university's subdomain does not work, because this causes problems responding to XML queries. I don't understand that approach, so I suspect my answer is off the mark but it may still be helpful. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
[CODE4LIB] OAI8 at the University of Geneva. 19-21 June 2013
2013 sees the 8th OAI Workshop on Innovations in Scholarly Communication at the University of Geneva and at CERN. These Workshops are major international gatherings where those interested in Scholarly Communication developments can meet, discuss and network. OAI8 will be taking place in the context of the EU’s launch of its €80 billion Horizon 2020 programme, which has Open Access as a key deliverable of the outputs from its funded research programs. New areas to be discussed at OAI8 will be Alternative Metrics and a special focus on Scholarly Communication developments in the Arts and Humanities. An introductory video, announcing the Conference, can be seen at http://indico.cern.ch/conferenceDisplay.py?confId=211600 Please reserve 19-21 June 2013 in your diaries. On behalf of the Programme Committee, we look forward to seeing you in Geneva. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] PDF manipulation
Yong Tang writes I was recently thrown into a file dumpster That sounds really painful. The original text format was lost. Extracting text from PDF is difficult. I'd try to use pdftohtml http://sourceforge.net/projects/pdftohtml/ I have used that in the past. Then use XML::LibXML's HTML parser to read the resulting HTML (if any) into Perl. Maybe I am heading in a wrong direction for this project? Direction seems right but the task is tough. PDF is where text goes to die. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Recommendations for a teaching OPAC?
Margaret Kipp writes I've been using Omeka for about a year now in my information organisation and metadata classes. So have I but mainly for classes on repository building. This is locally called the building digital libraries class, whatever that means ;-). The way I work with Omeka in the course is that I have a set of Perl scripts. They create a separate Omeka installation for each students. Separate installation meaning complete PHP code and separate databases for each student. Students are free to install whatever modules and themes they wish. There is no interference with other students. I'm currently trying out a copy of a Koha Virtual Appliance (http://kylehall.info/index.php/projects/koha/koha-virtual-appliance/) If I were to do teach the opac I would give each student an installation of Koha. The Debian packaging of Koha allows me to build several instances of Joha on one set of perl scripts. Each student just gets a separate mySQL database. Since a lot of things in Koha can be configured through the database---I am not aware of Koha themes and modules---it would be sufficient. Just run koha-create for each student. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Recommendations for a teaching OPAC?
Cary Gordon writes I think that having each student use their own Koha instance is a great way to go. This is what I planned to do if I had to do the course again. In the past, I was less advanced, I just had a library branch for each student. It's because when I ran the class in 2010, the option to create several koha installation on one machine wan set available. Assuming that they all have computers with reasonable specs, they can use VirtualBox (free) to import the Koha .ova file. I'd rather have them use a single server I have root access to so I can find out when the shit has hit the fan as and when it does. I used to run this Koha on server space that I had arranged with the university, they hosted a box that I had root access to. Recently a new director declared that this was a rogue server and threw it out. The obvious caveat is that the teacher would presumably need to be conversant with Koha. The obvious reply is that, in theory, as an LIS prof these days you have to be very adaptable to technology changes, and you got to have the capability to roll out such technology, as I do with Omeka, Drupal, Koha. In practice, not a lot of technology is taught/used in the LIS curriculum. But that's a topic for another day. Margaret Kipp and I have talked about this a lot in the past. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Any libraries have their sites hosted on Amazon EC2?
Roy Tennant writes I'd also be interested in getting some real world cost information. I installed an app on EC2 that went mostly unused for a couple months but meanwhile racked up over $300 in charges. Color me surprised. I am not impressed by Amazon either. I have an instance given to me by a sponsor, and there I have been taken aback by the old Debian kernel version this puts me in. I rent three root servers with Hetzner.de. That's for large-scale work. To run a blog, a 3TB disk 16 Gig ram box from Hetzner is overkill. With Hetzner you have the exchange rate risk but the cost structure is much simpler. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Any libraries have their sites hosted on Amazon EC2?
Erik Hetzner writes Another satisfied customer. Actually I did not write that I was/am satisfied. ;-) They once managed to disassemble my server and I lost all the data on it. They were so embarrassed that they gave my sponsor the box for free for a year. I was fine because I had a backup so not much of a problem. The lesson learnt is that in any case you always need a backup, and it better be a local one or something hosted with a different company. There is no substitute for system administration skills. PS: But seriously, no relation. Neither do I have with them, other than being a customer. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Why are we afraid to criticize library software in public?
Kyle Banerjee writes Our profession is very risk averse. But fortunately we don't have any stereotypes around here. ;-) Isn't the vendor the one the name of which starts with O, and the product name ends with M? Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Open datasets
Alexander Johannesen writes So in order to test the integrity and performance of my system so far I'm wondering if there's a suitable open dataset of bibliographic records that aren't too obscure (meaning, I can find the titles at amazon or Open Library) that you could recommend? More than 1000 records, but less than a million, maybe? Look at RePEc at http://repec.org. You can mirror the dataset from ftp://repec.org but for efficiency we can set up some rsync delivery for you. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
[CODE4LIB] conference on The Value of Unique Scholarly Identifiers to Academics, Institutions and Countries
On behalf of the organizers, I am pleased to announce the conference The Value of Unique Scholarly Identifiers to Academics, Institutions and Countries. The conference is organized by the Association of Lithuanian Serials (see http://serials.lt). It will take place in Parliament of Lithuania (see http://lrs.lt), in Vilnius, Lithuania on February 14th, 2012. We think that the topic of researcher identification is relevant not only to librarians and publishers but also to researchers of all subject areas. Therefore, the main goal of the conference is to explain to the wide academic community how the latest technologies have been changing their profile in the scholarly universe and what they can expect in the future. Well-known speakers will give insight on the future of the scholarly universe and the image of academics in it. Read more about the conference at http://uniqueids.org. If you wish to contact the organizers write to Eleonora Dagiene, Chair of Council, Director of VGTU Press, eleon...@serials.lt. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] What software for a digital library
Lars Aronsson writes To be clear: I need a platform where regular users, logged in or not, can upload new books through a web interface. Does that leave me with anything else than Mediawiki? Try http://omeka.org. I use it for teaching purposes. http://openlib.org/home/krichel/courses/lis654.html It's small enough that I can install a copy for each student, with a script that I run as root http://openlib.org/home/krichel/courses/lis654/bin/maintain_omeka Although primarily designed for image-based repositories, omeka has a bunch of plugins that you may find help you what you want to do. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Patents and open source projects
Joann Ransom writes LibLime Koha is not Koha. The rest of the community use Koha. Misunderstanding of this issue is wide-spread. Case in point http://lists.webjunction.org/wjlists/web4lib/2010-September/052195.html Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Automatic Content Classification recommendations?
Peter Neish writes Just wondering if anyone has any recommendations for systems that will do automatic content classification through machine learning? I use LibSVM in AuthorClaim (http://authorclaim.org) and svm_light in NEP (http://nep.repec.org). I found both very helpful. I would switch to LibSVM in NEP since it LibSVM is still actively being developed. Just using a simple binary term weighing scheme and default SVM parameters should get you a long way. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Plea for help from Horowhenua Library Trust to Koha Community
Kåre Fiedler Christiansen quotes PTFS/LibLime is prepared to transfer the trademark to a non-profit Koha Foundation with the provision that the Foundation hold the trademark in trust and not enforce it against any individual, organization, or company who chooses to promote services around Koha in New Zealand. PTFS/LibLime encourages a direct dialog with Koha stakeholders to determine an equitable solution for the disposition of the trademark that serves the best interests of the libraries who use Koha. That organization has existed since the start of Koha. It is called the Horowhenua Library Trust. That sounds promising. Has LibLime seen reason, or am I misinterpreting things? As much reason as somebody who comes to steal your belongings and then offers them to hand them back to you may be at some stage. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Domain lookup madness
Roy Tennant writes I'm mystified as well, since a curl request from one of my servers (not an OCLC one) resolves just fine. I get the same result as Yitzchak krichel@sahure:~$ curl http://www.questionpoint.org/crs/servlet/org.oclc.chat.QPWOnlineStatus?library=10253rid=0; -m 3 -v * name lookup timed out * Couldn't resolve host 'www.questionpoint.org' * Closing connection #0 curl: (6) name lookup timed out Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Job Posting: Digital Library Repository Developer, Boston Public Library (Boston, MA)
Roy Tennant writes So BPL is developing its own digital repository system? Mind if I ask why? Why not? Do you suggest they should stick to print? Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
[CODE4LIB] open bibliographic principles
On behalf of the Open Bibilographic Working Group of the Open Knowledge Foundation, I would like to bring your attention to the Principles on Open Bibliographic data at http://openbiblio.net/principles/ The group continues to offer the opportunity, for both individuals and groups, to sign up to the principles. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Net::OAI::Harvester
Westman, Stephen writes Thanks! I'll look into it. I've put out a sample script using it at http://wotan.liu.edu/home/mamf/tmp/westman In this work I used OpenDOAR to get repository sources. I am not maintaining this any more becaues I now use a source from BASE for feeding repository data into AuthorClaim. That source is documented at http://wotan.liu.edu/base Take care, Enjoy, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Net::OAI::Harvester
Westman, Stephen writes I'm working on a Perl-based OAI harvester and have run a problem. The module that I'm using - Net::OAI::Harvester - does a great job of parsing out the different OAI tagged fields so that they can be put into a MySQL table of retrieved OAI records for searching. I suggest you use HTTP::OAI instead. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorprofile.org/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Is there a (current) CONTENTdm support group?
Patrick Berry writes I'm going to be hacking on CDM6 this summer, so I'm +1 for github and will do whatever I can to contribute to a wiki. Well, would it not be better to hack on an open-source competitor for ContentDM, such as Omeka? I hasten to add I have no affilitation with Omeka. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] unwanted (bogus) characters in marc
stuart yeates writes Thomas Krichel wrote: ... It will try to guess between UTF-8 and ISO-8859-1. This can be done because UTF-8 has many invalid byte sequences. But say if you wanted to guess between ISO-8859-1 and ISO-8859-2, you'd be out of luck. Not necessarily. I meant you would be out of luck with the tool I proposed. There are tools such as http://www.let.rug.nl/~vannoord/TextCat/ which provide very reliable guessing of languages. I am happy to read this, I had requirements for language detection several times already. But the detection of languages is a bit of a different problem than the detection of character codes. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] unwanted (bogus) characters in marc
Ere Maijala writes On 7.10.2010 15:17, Thomas Krichel wrote: ... use Encode::Guess qw/latin-1/; $decoded=decode(Guess, $dodgy_input); $decoded then should be a utf-8 string with utf8 flag on. Would that work for a predominantly proper utf-8 input with some mistakes thrown in? It will try to guess between UTF-8 and ISO-8859-1. This can be done because UTF-8 has many invalid byte sequences. But say if you wanted to guess between ISO-8859-1 and ISO-8859-2, you'd be out of luck. The module seems to do a good job for me. I use it for a robot on CrossRef's sigg API. The engine is reliable, but the data there is poorly character coded and marked up. I'd be happy to share the robot with anyone who wants to go out there get the character creeps. After all, we have Halloween coming up. ;-) Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] unwanted (bogus) characters in marc
Ere Maijala writes # Fix non-UTF-8 characters with two highest bits set (we assume they are actually ISO-8859-1) What about use Encode::Guess qw/latin-1/; $decoded=decode(Guess, $dodgy_input); $decoded then should be a utf-8 string with utf8 flag on. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Running a repository on Debian Stable
Mike Taylor writes I was surprised to find that there seems to be no package for DSpace, EPrints, http://wiki.eprints.org/w/Installing_EPrints_3_via_apt_%28Debian/Ubuntu%29 Fedora, The problem there, as I understand it is that Fedora expects everything to be in one directory. This setup in inimical to the Debian setup. Most of all, I want something that I can install from the standard operating system packages, using apt-get. I suggest you use aptitude instead. It has superior dependency resolution. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] valueforkey in javascript
Eric Lease Morgan writes Unfortunately, I am unable to access the water_id attribute. While the water_id attribute displays as a part of my data source and list views, whenever I try to actually access the water_id attribute my application crashes as illustrated by the linked screen shot. [1] I don't use JavaScript, but DOMscripting. var water_elements=d.getElementsByTagName('water'); for (count=0; count water_elements.length; count++) { var my_water_element = water_elements[count]; var id_attribute = my_water_element.getAttribute('id'); alert id_attribute; } Disclaimer: I had a class of mead with mixed with sherry before typing this. I don't know what an iphone is. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] Online PHP course?
Joe Hourcle writes ps. yes, I could've used this response as an opportunity to bash PHP ... and I didn't, because they might be learning PHP to migrate it to something else. controversial ;-) what's the problem(s) with PHP? Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] good and best open source software
Brett Bonfield writes I think Jonathan and Nicole nailed it with community health, I beg to differ. If you requiree a healthy community to start working with a piece of software, how do you want a grassroots project to start? Obviously a small project will start with one or two developers, and it won't grow, until a few people work with it despite the fact that it's a small thing to start with. Requiring an upfront healthy community is particurly problematic is a small community such as digital library work. On the other kind, there is widely adopted software that I got cajoled into maintaining, that consider bad. Apache is one of them. I run maybe 50 virtual servers an a bunch of boxes, I am still puzzled how it works and it's trial and error with each software upgrade, where goes that NameVirtualServer thing into, the constant croaks server foo has no virtualserver. I'm not a dunce, but Apache makes me feel I am one. When I look at these config files that are half-baked XML, I wonder what weed the guy smoked who invented this. If I could do it allover again, I would do it in lighttpd. Oh well it was not there in 1995 where I started running web servers. Other problematic case: Mailman. I run about 130 mailing lists, over 80 have a non-standard config, I am running every few months into problems with onne of them, despite the fact that I wrote a script to configure all the non-standard lists the same way. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel
Re: [CODE4LIB] good and best open source software
Nicole Engard writes That's why I added in 'user' to the community. No matter how many people use Apache based web sites, it does not make it Apache software better. Telling people to use what others are using is just simple propaganda to stifle competition. Cheers, Thomas Krichelhttp://openlib.org/home/krichel http://authorclaim.org/profile/pkr1 skype: thomaskrichel