[CODE4LIB] METS question for @USE in file/fileGrp

2011-08-03 Thread Ethan Gruber
Hi all,

According to the METS schema, the USE attribute is  A tagging attribute to
indicate the intended use of files within this file group (e.g., master,
reference, thumbnails for image files). I'm using the mets:fileSec to
capture images of a coin within a larger numismatic XML schema.  The @USE is
an unrestricted xs:string, and this example document -
http://fedora.library.northwestern.edu/fedora/get/inu:inu-afrmap-4333843/source-
uses recto as a USE for images of a map.  By this same token, is it
semantically acceptable to have obverse and reverse as a USE?

Example:

fileGrp use=obverse
  file use=master
  file use=reference
  file use=thumbnail
fileGrp use=reverse
  file use=master
  file use=reference
  file use=thumbnail

Thanks,
Ethan


Re: [CODE4LIB] [lita-l] Re: Seeking feedback on database design for an open source software registry

2011-08-03 Thread Peter Murray
In the E-R diagram, there is a place for a URL and a chunk of HTML code in each 
Release entity.  I was thinking that is where release-specific information 
would go, and I will update the document to call it out more explicitly.

Thanks again, Lori.


Peter

On Aug 2, 2011, at 7:25 PM, Lori Bowen Ayre wrote:
 
 I think providing a URL for information about specific features is a good 
 idea.  You may want to allow for listing more than one version of a 
 particular software package and allow for a link to the features listing 
 associated with each version (or perhaps release notes) for each supported 
 version.
 


-- 
Peter Murray peter.mur...@lyrasis.orgtel:+1-678-235-2955
 
Ass't Director, Technology Services Development   http://dltj.org/about/
LYRASIS   --Great Libraries. Strong Communities. Innovative Answers.
The Disruptive Library Technology Jesterhttp://dltj.org/ 
Attrib-Noncomm-Share   http://creativecommons.org/licenses/by-nc-sa/2.5/


Re: [CODE4LIB] Programmer Orientation to Library/Lib Sci

2011-08-03 Thread Jonathan Rochkind

  I'm especially interested in anything which
  gave you an ah-ha! moment when you were working with library data --
  the implicit things which didn't make sense until you knew why those
  crazy librarians did things the way they did.



I'd add that you should be open to accepting that some of those things 
STILL won't make sense once you know why librarians do things the way 
they do.


Much of what we do indeed simply doesn't make sense -- I mean, we got 
here somehow for certain reasons, and understanding the history can help 
understand how we got to where we are, but often where we are is really 
really unfortunate.  (And even many of us librarians don't entirely 
understand how we got here, if we're under a certain age!)


But anyway, you've probably already seen it, but Jason Thomale's 
Code4Lib Journal article might be useful for providing some of that 
background for Marc (although I don't neccesarily think his 
explanation/analysis/solutions are airtight, they're an introduction).


Re: [CODE4LIB] Seeking feedback on database design for an open source software registry

2011-08-03 Thread Eric Lease Morgan
On Aug 1, 2011, at 4:22 PM, Peter Murray wrote:

 As part of the Mellon Foundation grant funding the start-up of LYRASIS 
 Technology Services, LTS is to produce a series of tools that enable 
 libraries to decide whether open source is right for their environments.  
 I’ve put a page up on the Code4Lib wiki describing the kinds of tools that 
 will initially fall into this area.  After review by the LTS Advisory Panel 
 and comments from the community, statements of work will be drafted for 
 consultants to create these tools and the work will be let out for contract. 
 The completed tools will be turned into web documents in the form of 
 whitepapers, checklists, spreadsheets, etc., and published along with the 
 open source software registry now under development. To encourage consultants 
 to share their knowledge, we are considering allowing consultants to identify 
 themselves in the text of the document (e.g. “Prepared for LYRASIS with 
 funding from the 2011-2012 Mellon Foundation! Open Source Support Grant by 
 name of consultant.”)
 
 With this background in mind, answers to these questions would be helpful:
 
   • Based on your experience and/or knowledge of open source software 
 adoption, are there other tools or techniques that would be useful to 
 document and make available?


Peter, if I understand you correctly, your approach seems novel. Usually open 
source software developers have scratched their itch, made their software 
available to the world, and if so inclined, spent time and effort building a 
community around the software. Your approach seems RFP-like. Statements of work 
will be drafted by LTS. Developers (consultants) will respond, be selected, and 
contracted. Software will be created.

-- 
Eric Lease Morgan


[CODE4LIB] Job Posting: Senior Systems Librarian, Brandeis University, Waltham MA

2011-08-03 Thread Tania Fersenheim
Senior Systems Librarian

Brandeis University, a private research university in the metropolitan
Boston area, seeks an experienced, innovative and service-oriented
Senior Systems Librarian to serve as an integral member of the library
systems team.  The primary responsibility of the position is to
implement, develop and support complex software systems for the
Libraries.  The Senior Systems Librarian will serve as the primary
administrator and support person for a variety of commercial and open
source systems, and develop, maintain and enhance integration between
library systems, other campus systems and cloud services.


Responsibilities include:
  * Providing project management and oversight of technical
operations, in partnership with the Manager of Library Systems
  * Developing integration between library systems, other campus
systems and cloud services
  * Providing administration, support and enhancement of a variety of
library systems, including but not limited to the integrated library
system, link resolver, proxy server, federated search system, archives
management system, interlibrary loan management system
  * Perform ad-hoc scripting, report writing and data munging

Qualifications include:
  * Master's degree in library science or related field and 3-5 years
demonstrated experience administering library systems, preferably in
an academic setting.  Relevant work experience may be substituted for
some of the required education.
  * Strong knowledge of Linux and Windows operating systems, Apache,
and relational databases
  * Proficiency with HTML and XML, Javascript, Perl (or PHP), SQL
  * Experience maintaining and supporting library automated systems
and resources, especially including, but not limited to, Aleph,
Metalib, SFX, EZProxy, ILLiad.
  * Strong interpersonal and communication skills ; ability to
establish good working relationships with colleagues and vendors
  * Strong analytical, technical and troubleshooting skills for
complex applications

The successful candidate will be flexible, creative, and enthusiastic,
with a strong interest in exploring and implementing emerging
technologies and innovative services.  S/he will have a demonstrated
ability to work collaboratively and possess a strong service
commitment, with a demonstrated ability to plan, coordinate and carry
out complex projects.  Preference will be given to candidates who
demonstrate the ability to operate and maintain library systems,
knowledge of current issues and trends in library technology,
knowledge of contemporary web design and development, and project
management abilities.


How to apply:
Submit cover letter and resume as a single document at
http://www.brandeis.edu/humanresources/jobs/external.html
Job ID 520056

Closing Statement:
Brandeis University operates under an affirmative action plan and
encourages minorities, women, disabled individuals, and eligible
veterans to apply. It is the policy of the University not to
discriminate against any applicant or employee on the basis of race,
ancestry, color, religion, sex, sexual orientation, age, genetic
information, national origin, disability, veteran status, or on the
basis of any other legally protected category.


[CODE4LIB] Job Announcement: Web Librarian, subject to award, to perform e-reference duties and manage web-based content (Washington, DC)

2011-08-03 Thread Patty De Anda Gates
LAC Group seeks experienced Web Librarian on behalf of our client, a 
prestigious government agency library. Web Librarian will have thorough 
knowledge of web-based library resources (electronic resources), web page 
management, OPACs, and Internet-based services. Position subject to contract 
award.  Work location Washington, D.C.



Primary Responsibilities:



* Perform web-based library services including electronic reference, 
e-resources accessed via the Internet, managing web-based content, and other 
related functions.

* Provides general input and suggestions for improvement of processes 
and products for the benefit of the library's clientele

* Assists users in formulating search strategies and in locating 
materials using all available sources and tools

* Maintains current awareness about new products and sources of 
relevant materials or information.  Coordinates with government counterparts 
when information requests include extensive research



Qualifications:

* Bachelor's Degree or equivalent and 5 years relevant experience are 
mandatory

* MLS from an ALA accredited institution preferred

* Work experience with Millennium's Innovative Interfaces Inc (III) 
integrated library system, OCLC Worldcat, EBSCO, Innopac, Webpac Pro, or other 
similar library automation software is highly desired

* Knowledge of HTML (XHTML, HTML5), CSS, RSS, and web-accessibility 
issues are strongly preferred.

* Experience with electronic resources management (ERM) and content 
management systems (CMS) are strongly preferred

* Knowledge of or familiarity with any other web-development languages 
/ scripts including Java, Javascript, PHP, Perl, SQL, XML or other would be a 
plus




To Apply: To submit your application please visit:  http://bit.ly/WebLibrarianDC


To view all of our currently open positions please visit: 
http://lac-group.com/lac-group/careers/

LAC Group is an Equal Opportunity / Affirmative Action Employer who values 
diversity in the workplace.





Patty De Anda Gates
Communications  Projects Associate
323.302.9439 - direct
323.852.1083 - main
323.852.1093 - fax

LAC Group, 6500 Wilshire Boulevard, Suite 2240, Los Angeles, CA 90048
LAC on LinkedInhttp://www.linkedin.com/groups?mostPopular=gid=1235317 | LAC 
on 
Facebookhttp://www.facebook.com/pages/LAC-Group/136401033040717?v=app_4949752878ref=ts#!/pages/LAC-Group/136401033040717?v=wallref=ts
 | LAC on Twitterhttp://twitter.com/libassociates  | LAC Group Newsletter 
Sign-uphttp://lac-group.com/lac-group/newsletter/
lac-group.comhttp://lac-group.com/

The information contained in this e-mail message is confidential.  If you are 
not the intended recipient, any dissemination or copying is strictly 
prohibited.  If you think that you have received this e-mail message in error, 
please contact the sender.


Re: [CODE4LIB] Seeking feedback on database design for an open source software registry

2011-08-03 Thread stuart yeates

On 02/08/11 08:22, Peter Murray wrote:

Colleagues -- please excuse the cross-posting; I've found the circle of people 
potentially interested in this was wider than I thought.


As part of the Mellon Foundation grant funding the start-up of LYRASIS 
Technology Services, LTS is to produce a series of tools that enable libraries 
to decide whether open source is right for their environments.  I’ve put a page 
up on the Code4Lib wiki 
(http://wiki.code4lib.org/index.php/Decision_Support_Tools) describing the 
kinds of tools that will initially fall into this area.  After review by the 
LTS Advisory Panel and comments from the community, statements of work will be 
drafted for consultants to create these tools and the work will be let out for 
contract. The completed tools will be turned into web documents in the form of 
whitepapers, checklists, spreadsheets, etc., and published along with the open 
source software registry now under development. To encourage consultants to 
share their knowledge, we are considering allowing consultants to identify 
themselves in the text of the document (e.g. “Prepared for LYRASIS with funding 
from the 2011-2012 Mellon Found

ation Open Source Support Grant by name of consultant.”)

Two points:

(1) The model seems appears not to capture Project A builds on Project 
B This will make the model less-than-optimal for comparing (for 
example) an open source Project A with an propriety Project B when B is 
a fork of A.


(2) Standards. They appear not to be mentioned at all.

cheers
stuart
--
Stuart Yeates
Library Technology Services http://www.victoria.ac.nz/library/


[CODE4LIB] Apps to reduce large file on the fly when it's requested

2011-08-03 Thread Ranti Junus
Dear All,

My colleague came with this query and I hope some of you could give us some
ideas or suggestion:

Our Digital Multimedia Center (DMC) scanning project can produce very large
PDF files. They will have PDFs that are about 25Mb and some may move into
the 100Mb range. If we provide a link to a PDF of that large, a user may not
want to try to download it even though she really needs to see the
information. In the past, DMC has created a lower quality, smaller versions
to the original file to reduce the size. Some thoughts have been tossed
around to reduce the duplication or the work (e.g. no more creating the
lower quality PDF manually.)

They are wondering if there is an application that we could point to the end
user, who might need it due to poor internet access, that if used will
simplify the very large file transfer for the end user. Basically:
- a client software that tells the server to manipulate and reduce the file
on the fly
- a server app that would to the actual manipulation of the file and then
deliver it to the end user.

Personally, I'm not really sure about the client software part. It makes
more sense to me (from the user's perspective) that we provide a download
the smaller size of this large file link that would trigger the server-side
apps to manipulate the big file. However, we're all ears for any suggestions
you might have.


thanks,
ranti.


-- 
Bulk mail.  Postage paid.


Re: [CODE4LIB] Apps to reduce large file on the fly when it's requested

2011-08-03 Thread Gabriel Farrell
I agree that your client software should be nothing more than a link
or button in the web browser. As for the server, it sounds akin to
image servers that resize on the fly. I would probably just proxy
requests to a script or cgi that compresses/converts the files,
especially if you're not planning to get a lot of hits per second. If
that's not robust enough, there are a number of results from a search
for pdf server that might work for you.


On Wed, Aug 3, 2011 at 7:36 PM, Ranti Junus ranti.ju...@gmail.com wrote:
 Dear All,

 My colleague came with this query and I hope some of you could give us some
 ideas or suggestion:

 Our Digital Multimedia Center (DMC) scanning project can produce very large
 PDF files. They will have PDFs that are about 25Mb and some may move into
 the 100Mb range. If we provide a link to a PDF of that large, a user may not
 want to try to download it even though she really needs to see the
 information. In the past, DMC has created a lower quality, smaller versions
 to the original file to reduce the size. Some thoughts have been tossed
 around to reduce the duplication or the work (e.g. no more creating the
 lower quality PDF manually.)

 They are wondering if there is an application that we could point to the end
 user, who might need it due to poor internet access, that if used will
 simplify the very large file transfer for the end user. Basically:
 - a client software that tells the server to manipulate and reduce the file
 on the fly
 - a server app that would to the actual manipulation of the file and then
 deliver it to the end user.

 Personally, I'm not really sure about the client software part. It makes
 more sense to me (from the user's perspective) that we provide a download
 the smaller size of this large file link that would trigger the server-side
 apps to manipulate the big file. However, we're all ears for any suggestions
 you might have.


 thanks,
 ranti.


 --
 Bulk mail.  Postage paid.



[CODE4LIB] Job Announcement: Systems Librarian

2011-08-03 Thread Salwa Ismail
Georgetown University, Lauinger Library has the following new position open. : 
Systems Librarian
Lauinger Library/Library Information Technology
Georgetown University
Washington D.C.
Date Position Posted: July 27, 2011

Duties And Responsibilities  Minimum Requirements:
Responsibilities for the Systems Librarian include: •administer, maintain and 
support the Innovative Interfaces, Inc. (III) hardware, software and associated 
peripherals which make up the Library’s primary computing system; •Manage and 
support the Library’s suite of information management systems like TDNet, 
WebBridge, AquaBrowser discovery platform and ILLiad. •act as liaison between 
III and the Library for problem resolution and enhancement requests and act as 
Library’s representative to the national user group; •provide end user support 
in the form of report production and distribution of reports, database 
maintenance and system modifications as required; and •consult with library 
staff with regard to system capabilities and functionality.

Minimum Requirements: Master’s degree in Library Science with four years of 
extensive experience managing III integrated library systems, platforms and 
computer operations; two yeas experience with ERM systems and Link Resolvers; 
Familiarity with library technology standards and protocols such as FRBR, 
Z39.50, OAI/PHM, RDA. Familiarity with EZProxy, digital library tools and 
standards, RSS Web 2.0 etc.

To apply visit:
http://www12.georgetown.edu/hr/employment_services/joblist/job_description.cfm?CategoryID=7RequestNo=20110625

http://www12.georgetown.edu/hr/employment_services/joblist/jobs.cfm


Re: [CODE4LIB] Seeking feedback on database design for an open source software registry

2011-08-03 Thread Peter Murray
Thanks for the reply, Stuart.  With the first question, I've updated the 
diagram to add an Association entity.  (Technically, I don't think this is an 
entity but rather a specialization of a relationship.)  This is based off some 
great work I saw at the NITRC.  Take a look at the Associations section of 
these page:

  http://www.nitrc.org/projects/fcon_1000/
  http://www.nitrc.org/projects/fsl/

This fits the use case you describe and that of modules that would be a part of 
a Drupal installation or how djatoka can be a component of several different 
projects.

Regarding the second question, I think of standards as a kind of technology.  
I've added standard to the list of enumerations at 
http://wiki.code4lib.org/index.php/Registry_E-R_Diagram


Peter

On Aug 3, 2011, at 7:06 PM, stuart yeates wrote:
 Two points:
 
 (1) The model seems appears not to capture Project A builds on Project 
 B This will make the model less-than-optimal for comparing (for 
 example) an open source Project A with an propriety Project B when B is 
 a fork of A.
 
 (2) Standards. They appear not to be mentioned at all.
 
 cheers
 stuart



-- 
Peter Murray peter.mur...@lyrasis.orgtel:+1-678-235-2955
 
Ass't Director, Technology Services Development   http://dltj.org/about/
LYRASIS   --Great Libraries. Strong Communities. Innovative Answers.
The Disruptive Library Technology Jesterhttp://dltj.org/ 
Attrib-Noncomm-Share   http://creativecommons.org/licenses/by-nc-sa/2.5/


Re: [CODE4LIB] Apps to reduce large file on the fly when it's requested

2011-08-03 Thread Joe Hourcle
On Aug 3, 2011, at 7:36 PM, Ranti Junus wrote:

 Dear All,
 
 My colleague came with this query and I hope some of you could give us some
 ideas or suggestion:
 
 Our Digital Multimedia Center (DMC) scanning project can produce very large
 PDF files. They will have PDFs that are about 25Mb and some may move into
 the 100Mb range. If we provide a link to a PDF of that large, a user may not
 want to try to download it even though she really needs to see the
 information. In the past, DMC has created a lower quality, smaller versions
 to the original file to reduce the size. Some thoughts have been tossed
 around to reduce the duplication or the work (e.g. no more creating the
 lower quality PDF manually.)
 
 They are wondering if there is an application that we could point to the end
 user, who might need it due to poor internet access, that if used will
 simplify the very large file transfer for the end user. Basically:
 - a client software that tells the server to manipulate and reduce the file
 on the fly
 - a server app that would to the actual manipulation of the file and then
 deliver it to the end user.
 
 Personally, I'm not really sure about the client software part. It makes
 more sense to me (from the user's perspective) that we provide a download
 the smaller size of this large file link that would trigger the server-side
 apps to manipulate the big file. However, we're all ears for any suggestions
 you might have.


I've been dealing with related issues for a few years, and if you have
the file locally, it's generally not too difficult to have a CGI or similar
that you can call that will do some sort of transformation on the fly.

Unfortunately, what we've run into is that in some cases, in part because
it tends to be used by people with slow connections, and for very large
files, they'll keep restarting to the process, and because it's a generated
on-the-fly, the webserver can't just pick up where it left off, so has to
re-start the process.

The alternative is to write it out to disk, and then let the webserver
handle it as a normal file.  Depending on how many of these you're
dealing with, you may have to have something manage the scratch
space and remove the generated files that haven't been viewed in
some time.

What I've been hoping to do is:

1. Assign URLs to all of the processed forms, of the format:
http://server/processing/ID
(where 'ID' includes some hashing in it, so it's not 10mil 
files in a directory)

2. Write a 404 handler for each processing type, so that
should a file not exist in that directory, it will:
(a) verify that the ID is valid, otherwise, return a 404.
(b) check to see if the ID's being processed, otherwise, kick
off a process for the file to be generated
(c) return a 503 status.

Unfortunately, my initial testing (years ago) suggested that no
clients at the time properly handled 503 requests (effectively,
try back in (x) minutes, and you give 'em a time)

The alternative is to just basically sleep for a period of time, and
then return the file once it's been generated ... but for ones
that take some time (some of my processing might take hours,
as the files that it needs as input are stored near-line, and we're
at the mercy of a tape robot)

...

You might also be able to sleep and then use one of the various
30x status codes, but I don't know what a client might do if you
returned the same URL.  (they might abort, to prevent looping)

-Joe