[CODE4LIB] Job posting: Metadata Librarian, Princeton, NJ
Metadata Librarian Princeton Theological Seminary is seeking a Metadata Librarian to assist with the production of metadata for a variety of digital projects. Responsibilities Responsibilities include analyzing metadata requirements and specifications, creating and editing metadata documents, developing metadata crosswalks, facilitating outsourcing workflows, and performing quality control. Requirements The successful candidate will be enthusiastic about collaborating with a small, cross-functional, agile team using the scrum framework. This position requires a master’s degree in Library Science, Information Science, or the equivalent, as well as understanding a wide variety of metadata standards (including AACR2, DOI, Dublin Core, EAD, HTML, MARCXML, METS, MODS, and TEI). Preferred qualifications include experience with XML and related standards such as CSS, DTD, SRU, XML Schema, XPath, XQuery, and XSLT as well as familiarity with relational and XML-native databases. Application Information We offer a pleasant, academic work environment in an attractive campus setting and a benefits package that includes 20 days vacation after one year of service. Interested candidates should send resume with salary requirement to: PRINCETON THEOLOGICAL SEMINARY Human Resources Office 64 Mercer Street Princeton, NJ 08540 Fax: (609) 924-2973 E-Mail: ap...@ptsem.edu We Are An Equal Opportunity Employer Christine Schwartz XML Database Administrator Princeton Theological Seminary Library christine.schwa...@ptsem.edu
Re: [CODE4LIB] Cataloging4Coders @ C4L12 - We need your brains
Hi Becky, Is this pre-conference going to be cataloging/metadata training for coders? Thanks, Chris Christine Schwartz Metadata Librarian Princeton Theological Seminary Library > From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Becky > Yoose > Sent: Thursday, November 10, 2011 11:14 AM > To: CODE4LIB@LISTSERV.ND.EDU > Subject: [CODE4LIB] Cataloging4Coders @ C4L12 - We need your brains > > Hey folks, > > There's been increasing discussion and interest about cataloging around > this community (and others like it) for quite a while. I found some > co-conspirators and we are planning to propose a pre-conference on > cataloging/library metadata creation geared towards the huddled code4lib > masses (otherwise known as coders) who are yearning for knowledge of this > Darkest of Library Arts. > > We need you help before we post our proposal. We realize that there's a > wide range of cataloging knowledge and experience in the community, and we > want to make sure that those interested get the most out of the > pre-conference. If this pre-conference has perked your interest, can you > help us in letting us know: > > - What experience do you have with cataloging/library metadata creation? > - What do you want us to cover? Do you have any questions that you want > covered? > > This information will help us greatly in how we structure the > pre-conference both in content and schedule. For now, we're planning a > half-day pre-conference, but if there's enough interest between beginners > and more experienced folks, we will consider offering two half-day > preconferences in order to focus on specific participant needs. > > Feel free to ask questions as well - I'll try to answer them as best as > possible given what our group has brainstormed so far. > > Thanks for reading, > Becky > Official cat[aloger] herder > > > --- > Becky Yoose > Systems Librarian > Grinnell College Libraries > yoose...@grinnell.edu >
[CODE4LIB] XML Database Administrator, Princeton, NJ
Princeton Theological Seminary Location: Princeton, NJ http://www.ptsem.edu Princeton Theological Seminary is seeking a XML Database Administrator in the library. Reporting to the Curator of Special Collections, the XML Database Administrator administers the MarkLogic Content Server for the Digital Library Team: diagnosing and resolving issues related to performance, availability, and scalability; managing security and data integrity; assisting with data modeling and indexing; creating lexicons and thesauri; carrying out regular backups and ensuring disaster recovery plans; maintaining technical infrastructure, including the source control server and the image server. This entry-level position requires a bachelor's degree in Computer Science, Information Technology, Software Engineering, or a master's degree in Library and Information Science, or equivalent experience. Candidates should have experience with database administration, a solid grasp of NoSQL/XML databases, excellent knowledge of XML and related technologies, particularly XPath, XQuery, and XSLT. Acquaintance with library metadata formats (EAD, MARCXML, METS, MODS, TEI) preferred. The successful candidate will be enthusiastic about collaborating with a small, cross-functional, agile team using the scrum framework. We offer a pleasant, academic work environment in an attractive campus setting and a generous benefits package that includes 20 days’ vacation after one year of service. Apply for this position Interested candidates should send resume with salary requirement to: PRINCETON THEOLOGICAL SEMINARY Human Resources Office 64 Mercer Street Princeton, NJ 08540 Fax: (609) 924-2973 E-Mail: ap...@ptsem.edu We Are An Equal Opportunity Employer Christine Schwartz Metadata Librarian Princeton Theological Seminary Library
[CODE4LIB] Job posting: Digital Library Application Developer, Princeton Theological Seminary
Princeton Theological Seminary Library is seeking a Digital Library Application Developer to join a small team of content and technical specialists building web-based digital collections of religious texts. The position reports to the Curator of Special Collections and is primarily responsible for developing XML-based applications in XQuery and XSLT on a MarkLogic Content Server. For more information: http://jobs.37signals.com/jobs/6517 Christine Schwartz Metadata Librarian Princeton Theological Seminary Libraries christine.schwa...@ptsem.edu http://digital.library.ptsem.edu/
Re: [CODE4LIB] METS and TEI?
Mark Logic is a native XML database and most of the code is XQuery. So, it's operating directly on the XML. It has specific optimized queries that it runs. I like you idea about considering what comes after Mark Logic. I've never been one for making metadata decisions based on the current system. Thanks, Chris On Wed, Nov 4, 2009 at 3:50 PM, Andrew Ashton wrote: > Chris, > > Does MarkLogic operate directly on the XML or does it index it? If > it is > running Xqueries or something like that, you may not see much of a > performance increase by splitting them out. In fact I'd say that Xqueries > are typically a lot faster when they're operating on a single document or > collection of similar document. You may also want to consider what comes > after MarkLogic.
Re: [CODE4LIB] METS and TEI?
Hi Mike, I'm pretty comfortable with METS, but very new to TEI. So, what I write here is just an attempt to reflect what my more experienced colleagues are saying: * It seems the structure of TEI documents can be problematic since they follow a logical structure, by paragraphs/sections. And the structMap of all our METS documents, so far, are divided up by pages of text, not paragraphs. So the TEI structure does not fit nicely into METS the way we're using METS. * We're also concerned with not having redundant metadata in the TEI header and the dmdSec of the METS document. So, we're considering keeping the TEI header very brief and relying on the METS doc for descriptive/administrative/technical metadata. (We won't be deriving METS from TEI which is another issue.) * The other issue has already been raised by Liza Daly: performance. We've been told by one of the programmers at Mark Logic that we should embed the TEI docs into METS for good performance, but we have other reasons why we don't want to embed the TEI (editing, maintenance, etc.). So, we are considering writing a script that would integrate the METS and TEI at the point a search is deployed. * From the metadata standpoint, I want to keep the TEI docs separate and link out to them from the METS docs, because I'm not convinced that library metadata standards are stable. If we move away from using METS in the next 5-10 years, I think it would be easier if all the text/image files remained separate from the metadata. So, I'd prefer links in the fileSec of METS that link out to external TEI files. Chris On Wed, Nov 4, 2009 at 2:32 PM, Michael J. Giarlo < leftw...@alumni.rutgers.edu> wrote: > On Wed, Nov 4, 2009 at 14:38, stuart yeates > wrote: > > Christine Schwartz wrote: > > > >> Should we consider embedding the TEI in the METS documents, or just link > >> out > >> to them? > > > > It depends on what you're doing and who the likely users of the METS are. > > > > The trouble with separate files, is that they inevitably get separated. > > Not to threadjack, but I am curious, Christine: how would you handle > linking between the TEI and the METS? > > It could be there's an obvious answer and I'm having a "duh" moment > (or lifetime), of course. > > -Mike >
[CODE4LIB] METS and TEI?
Hi, I've received one good answer on Twitter, but want to get a sense of what others are doing. We've been using METS/DC/MIX for awhile, but now were are going to integrate full-text searching with texts marked up in TEI (our platform is the MarkLogic Server). Should we consider embedding the TEI in the METS documents, or just link out to them? Any help would be greatly appreciated. Thanks, Chris Christine Schwartz Metadata Librarian Princeton Theological Seminary Libraries
[CODE4LIB] Recommend book scanner?
We are looking into buying a book scanner which we'll probably use for archival papers as well--probably something in the $1,000.00 range. Any advice? Thanks, Chris Christine Schwartz Metadata Librarian Princeton Theological Seminary Libraries christine.schwa...@ptsem.edu
[CODE4LIB] Code4Lib Journal: new issue 6 now available
Editorial Introduction - Issue 6 Christine Schwartz http://journal.code4lib.org/articles/1376 The intelligent use of technology in libraries continues to be one of our most crucial challenges. For those of us who became librarians because we loved to explore the book stacks, we are now finding new ways to explore both old and new content in digital form. With issue 6 of the Code4Lib Journal we hope you will find new ways to explore, experiment, and bring to your library users what they want and need. Using OAI-ORE to Transform Digital Repositories into Interoperable Storage and Services Applications David Tarrant, Ben O'Steen, Tim Brody, Steve Hitchcock, Neil Jefferies and Leslie Carr http://journal.code4lib.org/articles/1062 In the digital age libraries are required to manage large numbers of diverse objects. One advantage of digital objects over fixed physical objects is the flexibility of ‘binding’ them into publications or other useful aggregated intellectual entities while retaining the ability to reuse them independently in other contexts. An emerging framework for managing flexible aggregations of digital objects is provided by the Open Archives Initiative (OAI) with its work on Object Reuse and Exchange (ORE). This paper will show how OAI-ORE is being used to manage content in digital repositories, in particular institutional repositories, and has the potential ultimately to transform the conception of digital repositories. Semi-automatic Citation Correction with Lemon8-XML MJ Suhonos http://journal.code4lib.org/articles/1011 The Lemon8-XML software application, developed by the Public Knowledge Project (PKP), provides an open-source, computer-assisted interface for reliable citation structuring and validation. Lemon8-XML combines citation parsing algorithms with freely-available online indexes such as PubMed, WorldCat, and OAIster. Fully-automated markup of entire bibliographies may be a genuine possibility using this approach. Automated markup of citations would increase bibliographic accuracy while reducing copyediting demands. The Wise Use of Statistics in a Library-Oriented Environment Mathias Weyland http://journal.code4lib.org/articles/1275 As with most businesses, libraries use statistics to justify expenses, to monitor the library’s expansion and to predict prospective developments. This article describes SQL and shell techniques for data retrieval as well as further processing of the data using the open source statistical environment R. The article emphasizes some of the pitfalls and reasoning errors librarians could easily slip into. Having an academic background on statistics, the author is appointed to projects and tasks which need mathematical and statistical methods to be successfully accomplished. Tree Representations: Graphics Libraries for Displaying Hierarchical Data Mark Wilhelm http://journal.code4lib.org/articles/1083 Tree representations can be useful for presenting hierarchical data on the screen. In this article I’ll briefly describe building trees using the Dojo, Yahoo User Interface, Java Server Faces, and Google Web Toolkit libraries. Visualizing Media Archives: A Case Study Chris Beer, Courtney Michael, and Mayo Todorovic http://journal.code4lib.org/articles/1119 The WGBH Media Library and Archives is piloting an online media archive for scholarly research. In conversation with users, we have discovered they want to quickly pinpoint items relevant to their work and get an overview of collections and their relationships to other materials. To demonstrate the size and complexity of our collection to users in a meaningful way, WGBH is employing data visualization techniques to provide an interactive, graphical representation of the various relationships between items. This article discusses the techniques employed in implementing our relationship map, emphasizes the cataloging techniques required for this effort, and offers code and examples to spark discussion about ways to improve or extend this effort. Course Views: A Scalable Approach to Providing Course-Based Access to Library Resources Jason Casden, Kim Duckett, Tito Sierra and Joseph Ryan http://journal.code4lib.org/articles/1218 The NCSU Libraries’ Course Views project, along with a locally developed widget web service, improves course-based access to library collections and services by dynamically generating library course pages for all 6000+ courses at NCSU. By automatically generating custom content when possible and showcasing authored content when available, Course Views is able to achieve full course coverage without significantly increasing staff time to create and manage content. This paper will describe the system and the use of web services to achieve scalable and sustainable delivery of course-related library content. Integrating Process Management with Archival Management Systems: Lessons Learned J. Gordon Daines, III and Cory L. Nimer http://journal.code4lib.org/articles/1016
[CODE4LIB] Code4Lib Journal: call for proposals, March issue
Call for Submissions: The Code4Lib Journal (C4LJ) exists to foster community and share information among those interested in the intersection of libraries, technology, and the future. The Code4Lib Journal is now accepting proposals for publication in its 6th issue. Don't miss out on this opportunity to share your ideas and experiences. To be included in the 6th issue, which is scheduled for publication in late March 2009, please submit articles, abstracts, or proposals to [EMAIL PROTECTED] by Friday, December 12, 2008. When submitting, please include the title or subject of the proposal in the subject line of the email message. C4LJ encourages creativity and flexibility, and the editors welcome submissions across a broad variety of topics that support the mission of the journal. Possible topics include, but are not limited to: * Practical applications of library technology (both actual and hypothetical) * Technology projects (failed, successful, proposed, or in-progress), including how they were done and challenges faced * Case studies * Best practices * Reviews * Comparisons of third party software or libraries * Analyses of library metadata for use with technology * Project management and communication within the library environment * Assessment and user studies C4LJ strives to promote professional communication by minimizing the barriers to publication. While articles should be of a high quality, they need not follow any formal structure. Writers should aim for the middle ground between blog posts and articles in traditional refereed journals. Where appropriate, we encourage authors to submit code samples, algorithms, and pseudo-code. For more information, visit C4LJ's Article Guidelines or browse articles from the first 4 issues published on our website: http://journal.code4lib.org. Remember, for consideration for the 6th issue, please send proposals, abstracts, or draft articles to [EMAIL PROTECTED] no later than Friday, December 12, 2008. (If accepted: first draft due, January 23, 2009.) Send in a submission. Your peers would like to hear what you are doing. Code4Lib Journal Editorial Committee
[CODE4LIB] Question about SRU APIs
Does anyone have a web services contract for the SRU base profile (http://www.loc.gov/standards/sru/specs/base-profile.html)? We're looking to implement SRU in XQuery and it would really help to see how others have designed the interfaces, whether or not they have implemented them in XQuery. I see that the Library of Congress has WSDL files available for SRU (at least for 1.1 http://www.loc.gov/standards/sru/sru1-1archive/xml-files.html), but I am not sure how to use these to design a REST-based SRU service. Any advice about how to get started would be most welcome! Thanks, Chris Christine Schwartz Metadata Librarian Princeton Theological Seminary Libraries [EMAIL PROTECTED] http://www.catalogingfutures.com