[CODE4LIB] Two developer positions at Cornell University Library

2015-09-14 Thread Simeon Warner
We (Cornell University Library IT -- CUL-IT) are looking for two software developers to work on arXiv.org [1], Project Euclid [2] and other repository systems. The successful candidates will help develop the library’s evolving repository information architecture and services, and will work in a sm

Re: [CODE4LIB] Plagiarism checker

2015-01-23 Thread Simeon Warner
The code used for overlap detection within the arXiv corpus (see [1] which significantly extended earlier work [2]) does a matching based on a sliding window of hashed 7-word sequences on extracted ASCII text. Perhaps more the required for the case in question, but this approach scales to a cor

Re: [CODE4LIB] OAI harvesters, please use URLs in User Agent Strings

2014-11-17 Thread Simeon Warner
Way back when, in the harvester guidelines, we suggested using both the User-Agent (along the lines proposed but without URI suggestion) and the From (for email contact) headers: http://www.openarchives.org/OAI/2.0/guidelines-harvester.htm#AgentInfo So, a long winded +1 to a URI being a useful

[CODE4LIB] IIIF Image and Presentation API Version 2.0 Published

2014-09-11 Thread Simeon Warner
are welcome but (free) registration is required. Feedback, comments and questions are welcomed on the discussion list at iiif-disc...@googlegroups.com Sincerely and on behalf of the community, Benjamin Albritton Michael Appleby Robert Sanderson Stuart Snydman Jon Stroop Simeon Warner [1

Re: [CODE4LIB] very large image display? (static files/tiles)

2014-07-29 Thread Simeon Warner
I'll chime in a bit late on this that static file support is also a use case for IIIF and something we've kept in mindd while working on the APIs. This is the model for "level0" compliance [6]. A while ago I made a little demo of OpenSeadragon over IIIF Image API 1.1 using tiles and included a

Re: [CODE4LIB] Question about OAI Harvesting via Perl

2014-01-15 Thread Simeon Warner
On 1/14/14 10:45 PM, Edward Summers wrote: Just out of curiosity, does it work for a little bit then stop working? I know arXiv throttle crawlers, and am not sure if they throttle oai-pmh clients. Simeon Warner who helps run arXiv has been know to post code4lib, so maybe this will cross his

Re: [CODE4LIB] Discovery layer for Primo

2013-12-04 Thread Simeon Warner
At Cornell we are developing a Blacklight based discovery interface over Voyager (currently in beta [1]). We are planning to migrate the LMS to Alma yet provide continuity in our discovery environment through the Blacklight based discovery interface. IMO, the notion of separating the "stock con

Re: [CODE4LIB] The lie of the API

2013-12-02 Thread Simeon Warner
On 12/2/13 10:50 AM, Robert Sanderson wrote: On Sun, Dec 1, 2013 at 4:25 PM, Richard Wallis < richard.wal...@dataliberate.com> wrote: As to discovering then using the (currently implemented) URI returned from a content-negotiated call - The standard http libraries take care of that, like any ot

[CODE4LIB] Should linked data URIs be https?

2013-11-20 Thread Simeon Warner
Connecting two recent c4l threads... It seems that the web is rapidly moving toward https. I'm tempted to wonder how soon it will be before https is the default protocol when you type a bare domain name into your browser? [1] With linked data we want cool URIs, where one element of coolness is

Re: [CODE4LIB] mass convert jpeg to pdf

2013-11-08 Thread Simeon Warner
I agree with Ed that going to PDF seems unfortunate. Check out Jon Stroop's Loris [1] for a lightweight implementation of tiling using IIIF [2,3] that the Open Seadragon zoom-pan viewer works over. Cool demo at: http://libimages.princeton.edu/osd-demo/ Cheers, Simeon [1] https://github.com/

Re: [CODE4LIB] CODE4LIB equivalent in UK?

2012-10-02 Thread Simeon Warner
Have a look at http://devcsi.ukoln.ac.uk/ . This is mainly focused on repositories but seems somewhat similar from an outside view. Cheers, Simeon (lurking expat Brit) On 10/2/12 4:11 AM, Michael Hopwood wrote: Yes - my question was implicitly aimed at lurking UKavians. -Original Message-

[CODE4LIB] Repository developer job @ Cornell Library

2011-12-20 Thread Simeon Warner
ampus in Ithaca, NY. Located in the beautiful Finger Lakes region, Ithaca consistently appears in Top-10 lists of desirable places to live. Feel free to contact me if you have questions about the position. -- Simeon Warner Director of Repos