[CODE4LIB] Code4Lib Journal Issue 33 is now available!

2016-07-19 Thread Ron PETERSON
The Code4Lib Journal, Issue 33 is now available!

http://journal.code4lib.org/issues/issue33

The Editorial Committee is pleased to submit issue 33 for your summer
reading pleasure. We encourage you to explore this issue, engage in the
comments, and reach out to the authors that contributed their work.

Editorial Introduction – Summer Reading List
by Ron Peterson
http://journal.code4lib.org/articles/11859
New additions for your summer reading list!

Emflix – Gone Baby Gone
by Netanel Ganin
http://journal.code4lib.org/articles/11762
Enthusiasm is no replacement for experience. This article describes a tool
developed at the Emerson College Library by an eager but overzealous
cataloger. Attempting to enhance media-discovery in a familiar and
intuitive way, he created a browseable and searchable Netflix-style
interface. Though it may have been an interesting idea, many of the crucial
steps that are involved in this kind of high-concept work were neglected.
This article will explore and explain why the tool ultimately has not been
maintained or updated, and what should have been done differently to ensure
its legacy and continued use.

Introduction to Text Mining with R for Information Professionals
by Monica Maceli
http://journal.code4lib.org/articles/11626
The ‘tm: Text Mining Package’ in the open source statistical software R has
made text analysis techniques easily accessible to both novice and expert
practitioners, providing useful ways of analyzing and understanding large,
unstructured datasets. Such an approach can yield many benefits to
information professionals, particularly those involved in text-heavy
research projects. This article will discuss the functionality and
possibilities of text mining, as well as the basic setup necessary for
novice R users to employ the RStudio integrated development environment
(IDE). Common use cases, such as analyzing a corpus of text documents or
spreadsheet text data, will be covered, as well as the text mining tools
for calculating term frequency, term correlations, clustering, creating
wordclouds, and plotting.

Data for Decision Making: Tracking Your Library’s Needs With TrackRef
by Michael Carlozzi
http://journal.code4lib.org/articles/11740
Library services must adapt to changing patron needs. These adaptations
should be data-driven. This paper reports on the use of TrackRef, an open
source and free web program for managing reference statistics.

Are games a viable solution to crowdsourcing improvements to faulty OCR? –
The Purposeful Gaming and BHL experience
by Max J. Seidman; Dr. Mary Flanagan;Trish Rose-Sandler; Mike Lichtenberg
http://journal.code4lib.org/articles/11781
The Missouri Botanical Garden and partners from Dartmouth, Harvard, the New
York Botanical Garden, and Cornell recently wrapped up a project funded by
IMLS called Purposeful Gaming and BHL: engaging the public in improving and
enhancing access to digital texts (
http://biodivlib.wikispaces.com/Purposeful+Gaming). The goals of the
project were to significantly improve access to digital texts through the
applicability of purposeful gaming for the completion of data enhancement
tasks needed for content found within the Biodiversity Heritage Library
(BHL). This article will share our approach in terms of game design choices
and the use of algorithms for verifying the quality of inputs from players
as well as challenges related to transcriptions and marketing. We will
conclude by giving an answer to the question of whether games are a
successful tool for analyzing and improving digital outputs from OCR and
whether we recommend their uptake by libraries and other cultural heritage
institutions.

>From Digital Commons to OCLC: A Tailored Approach for Harvesting and
Transforming ETD Metadata into High-Quality Records
by Marielle Veve
http://journal.code4lib.org/articles/11676
The library literature contains many examples of automated and
semi-automated approaches to harvest electronic theses and dissertations
(ETD) metadata from institutional repositories (IR) to the Online Computer
Library Center (OCLC). However, most of these approaches could not be
implemented with the institutional repository software Digital Commons
because of various reasons including proprietary schema incompatibilities
and high level programming expertise requirements our institution did not
want to pursue. Only one semi-automated approach was found in the library
literature which met our requirements for implementation, and even though
it catered to the particular needs of the DSpace IR, it could be
implemented to other IR software if further customizations were applied.
The following paper presents an extension of this semi-automated approach
originally created by Deng and Reese, but customized and adapted to address
the particular needs of the Digital Commons community and updated to
integrate the latest Resource Description & Access (RDA) content standards
for ETDs. Advantages and disadvantages of this w

[CODE4LIB] Code4Lib Journal, Issue 24 is now available!

2014-04-16 Thread Ron Peterson
Apologies for crossposting:

The Code4Lib Journal, Issue 24 is now available!

http://journal.code4lib.org/issues/issue24

Here is what you will find inside:  
Editorial Introduction: Seeking a Diversity of Voices
Ron Peterson
Making the Journal the best that it can be.
EgoSystem: Where are our Alumni?
James Powell, Harihar Shankar, Marko Rodriguez, Herbert Van de Sompel
Comprehensive social search on the Internet remains an unsolved problem. Social 
networking sites tend to be isolated from each other, and the information they 
contain is often not fully searchable outside the confines of the site. 
EgoSystem, developed at Los Alamos National Laboratories (LANL), explores the 
problems associated with automated discovery of public online identities for 
people, and the aggregation of the social, institution, conceptual, and 
artifact data connected to these identities. EgoSystem starts with basic 
demographic information about former employees and uses that information to 
locate person identities in various popular online systems. Once identified, 
their respective social networks, institutional affiliations, artifacts, and 
associated concepts are retrieved and linked into a graph containing other 
found identities. This graph is stored in a Titan graph database and can be 
explored using the Gremlin graph query/traversal
 language and with the EgoSystem Web interface. 
Enhancing Descriptive Metadata Records with Freely-Available APIs
Mark Phillips and Hannah Tarver
This article describes how the University of North Texas Libraries’ Digital 
Projects Unit used simple, freely-available APIs to add place names to metadata 
records for over 8,000 maps in two digital collections. These textual place 
names enable users to easily find maps by place name and to find other maps 
that feature the same place, thus increasing the accessibility and usage of the 
collections. This project demonstrates how targeted large-scale, automated 
metadata enhancement can have a significant impact with a relatively small 
commitment of time and staff resources.
Using Open Source Tools to Create a Mobile Optimized,  Crowdsourced Translation 
Tool
Evviva Weinraub Lajoie, Trey Terrell, Susan McEvoy, Eva Kaplan, Ariel Schwartz, 
and Esther Ajambo
In late 2012, OSU Libraries and Press partnered with Maria’s Libraries, an NGO 
in Rural Kenya, to provide users the ability to crowdsource translations of 
folk tales and existing children’s books into a variety of African languages, 
sub-languages, and dialects. Together, these two organizations have been 
creating a mobile optimized platform using open source libraries such as Wink 
Toolkit (a library which provides mobile-friendly interaction from a website) 
and Globalize3 to allow for multiple translations of database entries in a Ruby 
on Rails application. Research regarding successes of similar tools has been 
utilized in providing a consistent user interface. The OSU Libraries  Press 
team delivered a proof-of-concept tool that has the opportunity to promote 
technology exploration, improve early childhood literacy, change the way we 
approach foreign language learning, and to provide opportunities for 
cost-effective, multi-language publishing.
EPUB as Publication Format in Open Access Journals: Tools and Workflow
Trude Eikebrokk, Tor Arne Dahl, and Siri Kessel
In this article, we present a case study of how the main publishing format of 
an Open Access journal was changed from PDF to EPUB by designing a new workflow 
using JATS as the basic XML source format. We state the reasons and discuss 
advantages for doing this, how we did it, and the costs of changing an 
established Microsoft Word workflow. As an example, we use one typical 
sociology article with tables, illustrations and references. We then follow the 
article from JATS markup through different transformations resulting in XHTML, 
EPUB and MOBI versions. In the end, we put everything together in an automated 
XProc pipeline. The process has been developed on free and open source tools, 
and we describe and evaluate these tools in the article. The workflow is 
suitable for non-professional publishers, and all code is attached and free for 
reuse by others.
Customizing Android Tablets for a Shared Environment
Jane Nichols, Uta Hussong-Christian and Ryan Ordway
The Valley Library at Oregon State University Libraries  Press supports access 
to technology by lending laptops and e-readers. As a newcomer to tablet 
lending, The Valley Library chose to implement its service using Google Nexus 
tablets and an open source custom firmware solution, CyanogenMod, a free, 
community-built Android distribution.  They created a custom build of 
CyanogenMod featuring wireless updates, website shortcuts, and the ability to 
quickly and easily wipe devices between patron uses.  This article shares code 
that simplifies Android tablet maintenance and addresses Android application 
licensing issues for shared devices.
An Introduction

Re: [CODE4LIB] Call for Proposals: Code4Lib Journal

2014-01-05 Thread Ron Peterson
Just a reminder that this Friday, January 10th, is the deadline for submitting 
article proposals for issue 24 of the Code4Lib Journal.
 


  

Call for Proposals (and apologies for cross-posting):

The Code4Lib Journal (C4LJ) exists to foster community and share information 
among those interested in the intersection of libraries, technology, and the 
future.

We are now accepting proposals for publication in our 24th issue. Don't miss 
out on this opportunity to share your ideas and experiences. To be included in 
the 24th issue, which is scheduled for publication in mid April 2014, please 
submit articles, abstracts, or proposals at 
http://journal.code4lib.org/submit-proposal or to jour...@code4lib.org by 
Friday, January 10, 2014.  When submitting, please include the title or subject 
of the proposal in the subject line of the email message.

C4LJ encourages creativity and flexibility,
 and the editors welcome submissions across a broad variety of topics that 
support the mission of the journal.  Possible topics include, but are not 
limited to:

* Practical applications of library technology (both actual and hypothetical)
* Technology projects (failed, successful, or proposed), including how they 
were done and challenges faced
* Case studies
* Best practices
* Reviews
* Comparisons of third party software or libraries
* Analyses of library metadata for use with technology
* Project management and communication within the library environment
* Assessment and user studies

C4LJ strives to promote professional communication by minimizing the barriers 
to publication.  While articles should be of a high quality, they need not 
follow any formal structure.  Writers should aim for the middle ground between 
blog posts and articles in traditional refereed journals.  Where
 appropriate, we encourage authors to submit code samples, algorithms, and 
pseudo-code.  For more information, visit C4LJ's Article Guidelines or browse 
articles from the first 23 issues published on our website: 
http://journal.code4lib.org.

Remember, for consideration for the 24th issue, please send proposals, 
abstracts, or draft articles to jour...@code4lib.org no later than Friday, 
January 10, 2014.

Send in a submission.  Your peers would like to hear what you are doing.


[CODE4LIB] Call for Proposals: Code4Lib Journal

2013-12-10 Thread Ron Peterson
Call for Proposals (and apologies for cross-posting):

The Code4Lib Journal (C4LJ) exists to foster community and share information 
among those interested in the intersection of libraries, technology, and the 
future.

We are now accepting proposals for publication in our 24th issue. Don't miss 
out on this opportunity to share your ideas and experiences. To be included in 
the 24th issue, which is scheduled for publication in mid April 2014, please 
submit articles, abstracts, or proposals at 
http://journal.code4lib.org/submit-proposal or to jour...@code4lib.org by 
Friday, January 10, 2014.  When submitting, please include the title or subject 
of the proposal in the subject line of the email message.

C4LJ encourages creativity and flexibility, and the editors welcome submissions 
across a broad variety of topics that support the mission of the journal.  
Possible topics include, but are not limited to:

* Practical applications of library technology (both actual and hypothetical)
* Technology projects (failed, successful, or proposed), including how they 
were done and challenges faced
* Case studies
* Best practices
* Reviews
* Comparisons of third party software or libraries
* Analyses of library metadata for use with technology
* Project management and communication within the library environment
* Assessment and user studies

C4LJ strives to promote professional communication by minimizing the barriers 
to publication.  While articles should be of a high quality, they need not 
follow any formal structure.  Writers should aim for the middle ground between 
blog posts and articles in traditional refereed journals.  Where appropriate, 
we encourage authors to submit code samples, algorithms, and pseudo-code.  For 
more information, visit C4LJ's Article Guidelines or browse articles from the 
first 23 issues published on our website: http://journal.code4lib.org.

Remember, for consideration for the 24th issue, please send proposals, 
abstracts, or draft articles to jour...@code4lib.org no later than Friday, 
January 10, 2014.

Send in a submission.  Your peers would like to hear what you are doing. 



[CODE4LIB] Effective Data Visualization Practices | CORS Workshop at ALA Annual Conference, 2012 Anaheim

2012-06-18 Thread Ron Peterson
Apologies for any duplication 
 
Effective Data Visualization Practices | CORS Workshop | Monday, June 25, 2012, 
Anaheim, CA
Anaheim Convention Center, Room 203A

by Ray Lyons, MLIS, MPA
The workshop will provide an overview of best practices in the graphical 
presentation of quantitative information, emphasizing the need for clarity, 
fairness, and efficiency.
 
The purpose of data visualization is to display data impartially in order to 
uncover the ‘stories’ data have to tell.
 
Attendees will learn to adopt practices that enhance effective portrayal of 
these stories and to avoid those that interfere. 
 
Guidelines for choosing chart styles, labeling and captioning, determining axis 
scaling, use of color, improving readability, and highlighting key data 
patterns will be presented.
 
Examples of poor practices lurking in popular graphical software features will 
be included.
 
Ray Lyons is a statistical programmer and independent consultant and trainer in 
library performance measurement. He is the co-creator of the Library Journal 
Index of Public Library Service.  His articles on library assessment have 
appeared in Public Library Quarterly, Public Libraries, Library  Information 
Science Research, and Evidence Based Library and Information Practice.  Ray has 
masters degrees in library and information science and in public 
administration, with a specialty in quantitative methods.   He writes at 
http://libperformance.com.
 
Presented by the Committee on Research and Statistics.


[CODE4LIB] Code4Lib Journal - Call for Proposals

2012-05-29 Thread Ron Peterson
 Call for Proposals (and apologies for cross-posting): 
 
The Code4Lib Journal (C4LJ) exists to foster community and share information 
among those interested in the intersection of libraries, technology, and the 
future. 
 
We are now accepting proposals for publication in our 18th issue. Don't miss 
out on this opportunity to share your ideas and experiences. To be included in 
the 18th issue, which is scheduled for publication in October 2012, please 
submit articles, abstracts, or proposals at 
http://journal.code4lib.org/submit-proposal or to jour...@code4lib.org by 
Friday, June 29, 2012.  When submitting, please include the title or subject of 
the proposal in the subject line of the email message. 
 
C4LJ encourages creativity and flexibility, and the editors welcome submissions 
across a broad variety of topics that support the mission of the 
journal. Possible topics include, but are not limited to: 
 
* Practical applications of library technology (both actual and hypothetical)
* Technology projects (failed, successful, or proposed), including how they 
were done and challenges faced
* Case studies
* Best practices
* Reviews
* Comparisons of third party software or libraries
* Analyses of library metadata for use with technology
* Project management and communication within the library environment
* Assessment and user studies 
 
C4LJ strives to promote professional communication by minimizing the barriers 
to publication.  While articles should be of a high quality, they need not 
follow any formal structure.  Writers should aim for the middle ground between 
blog posts and articles in traditional refereed journals.  Where appropriate, 
we encourage authors to submit code samples, algorithms, and pseudo-code. For 
more information, visit C4LJ's Article Guidelines or browse articles from the 
first 17 issues published on our website: http://journal.code4lib.org. 
 
Remember, for consideration for the 18th issue, please send proposals, 
abstracts, or draft articles to jour...@code4lib.org no later than Friday, June 
29, 2012. 
 
Send in a submission.  Your peers would like to hear what you are doing. 
 
Code4Lib Journal Editorial Committee


[CODE4LIB] Code4Lib Journal issue 12 now available!

2010-12-21 Thread Ron Peterson
Issue 12 of the Code4Lib Journal is now available. The contents are as follows:

Editorial Introduction: The Code4Lib Journal isn’t just for Coders
Ron Peterson
Although the primary goal of the Code4Lib Journal is to provide practical 
solutions for technologists working in libraries, it has a lot to offer 
non-technologists. Technology affects all of the work that our libraries are 
doing and will define what the future of libraries will look like.  

http://journal.code4lib.org/articles/4587

Practical Ways to Promote and Support Collaborative Data Analysis Projects
Joyce Chapman and Cory Lown
This article is an appeal to technically-oriented library staff to initiate 
collaborative, bottom-up data-analysis efforts across their libraries. We 
discuss successful strategies used at North Carolina State University (NCSU) 
Libraries for initiating cross-departmental outreach for data-analysis work, as 
well as structuring and storing data, and disseminating findings. We present 
several specific examples of collaborative data-analysis projects undertaken at 
NCSU Libraries. 

http://journal.code4lib.org/articles/4258

How To Build a Computer Availability Map
Kim Griggs
Most libraries house one or more computer labs. Wouldn’t it be nice to be able 
to let your patrons view how many and what type of computers are available at 
any given time? Well, now you can. Follow along in this tutorial that takes you 
through the stages of implementing a real-time computer availability map that 
works for a mobile and full website. The complete code package is provided 
under 
the GPL v3 license, and is available at: 
http://github.com/griggsk/availability-map.
http://journal.code4lib.org/articles/4067

Creating Library Websites with Joomla: Not Too Big, Not Too Small, Just Right
Ellen Bahr and Matt Speed
Many organizations, including libraries, are turning to content management 
systems to simplify the management of their websites. Alfred University‘s 
Herrick Memorial Library recently implemented a new website using Joomla, an 
open-source content management system. While Drupal has received significant 
attention in the library community, Joomla may be a more practical choice for 
some libraries. The purpose of this paper is to share our experience with 
Joomla 
so that other libraries can more easily evaluate its suitability to their 
environment.
http://journal.code4lib.org/articles/4226

Improving the Drupal User Experience
Rachel Vacek, Sean Watkins, Christina M. Morris, and Derek Keller
Drupal is a powerful, but complex, Web Content Management System, being adopted 
by many libraries. Installing Drupal typically involves adding additional 
modules for flexibility and increased functionality. Although installing 
additional modules does increase functionality, it inevitably complicates 
usability. At the University of Houston Libraries, the Web Services department 
researched what modules work well together to accomplish a simpler interface 
while simultaneously providing the flexibility and advanced tools needed to 
create a successful user experience within Drupal. This article explains why 
particular modules were chosen or developed, how the design enhanced the user 
experience, how the CMS architecture was created, and how other library systems 
were integrated into Drupal.
http://journal.code4lib.org/articles/4578

Using Amazon S3 in Digital Preservation in a mid sized academic library: A case 
study of CCSU ERIS digital archive system
Edward Iglesias and Wittawat Meesangnil
With the increasing numbers of born digital and digitized objects in academic 
libraries from sources such as digital collections and institutional 
repositories many academic libraries need to seriously consider implementing 
some form of digital preservation system. In 2009 the Central Connecticut State 
University Library decided to use Amazon S3 for digital preservation storage 
despite some drawbacks. The library has developed a system, ERIS Digital 
Archive, to manage all digital preservation processes and to make the system as 
compliant with the OAIS model and “Trustworthy Digital Repositories” as 
possible.
http://journal.code4lib.org/articles/4468

FRBRizing an E-Library : Migrating from Dublin Core to FRBR and MODS
Jeremy Nelson and Alan Cleary
Western State College in Gunnison, Colorado developed an open-source eCataloger 
Framework, based on Dublin Core metadata, on Google’s App Engine to manage and 
serve electronic resources to the library’s patrons. Pressed to find new 
solutions for failing manual workflows for serials and government document 
resource management, the eCataloger Framework was extended to FRBR to automate 
and enhance serials management and government documents receiving. Based on 
successfully FRBRizing the eCataloger, Western State College converted their 
e-Library management from Dublin Core to FRBR and MODS. This paper examines the 
processes of each of these implementations using Python, AJAX

[CODE4LIB] Call for Papers: Code4Lib Journal

2010-08-16 Thread Ron Peterson
Call for Papers (and apologies for cross-posting): 
 
The Code4Lib Journal (C4LJ) exists to foster community and share information 
among those interested in the intersection of libraries, technology, and the 
future. 
 
The Code4Lib Journal is now accepting proposals for publication in its 12th 
issue. Don't miss out on this opportunity to share your ideas and experiences. 
To be included in the 12th issue, which is scheduled for publication in mid 
December 2010, please submit articles, abstracts, or proposals at 
http://journal.code4lib.org/submit-proposal or to 
c4lj-artic...@googlegroups.com 
by Friday, September 17, 2010. When submitting, please include the title or 
subject of the proposal in the subject line of the email message. 

C4LJ encourages creativity and flexibility, and the editors welcome submissions 
across a broad variety of topics that support the mission of the journal. 
Possible topics include, but are not limited to: 

* Practical applications of library technology (both actual and hypothetical)   
* Technology projects (failed, successful, or proposed), including how they 
were 
done and challenges faced   

* Case studies   
* Best practices   
* Reviews   
* Comparisons of third party software or libraries   
* Analyses of library metadata for use with technology   
* Project management and communication within the library environment   
* Assessment and user studies 
 
C4LJ strives to promote professional communication by minimizing the barriers 
to 
publication. While articles should be of a high quality, they need not follow 
any formal structure. Writers should aim for the middle ground between blog 
posts and articles in traditional refereed journals. Where appropriate, we 
encourage authors to submit code samples, algorithms, and pseudo-code. For more 
information, visit C4LJ's Article Guidelines or browse articles from the 
first 10 issues published on our website: http://journal.code4lib.org. 


Remember, for consideration for the 12th issue, please send proposals, 
abstracts, or draft articles to c4lj-artic...@googlegroups.com no later than 
Friday, September 17, 2010. 


Send in a submission. Your peers would like to hear what you are doing. 
 
Code4Lib Journal Editorial Committee 


[CODE4LIB] Code4Lib Journal, Issue 3

2008-06-23 Thread Ron Peterson
The third issue of the Code4Lib Journal is now available at 
http://journal.code4lib.org/.  Issue 3 continues the breadth and depth of 
articles that you found in the first 2 issues.  Check out the table of contents 
included below:

Code4Lib Journal
* Editorial Introduction - Issue - http://journal.code4lib.org/articles/104
Ron Peterson

* Alternative Solutions for Off-Campus Authentication - 
http://journal.code4lib.org/articles/73
Rebekah Kilzer, Elizabeth L. Black and James Muir
The Ohio State University Libraries created scripts to overcome the local 
limitations of the proxy server and to offer resource connections at the point 
of need. All libraries struggle to provide seamless authentication for access 
to paid resources, such as research databases and electronic journals. In order 
to obtain access to this content, the libraries must sign contracts promising 
to limit access to these resources to their user community. The challenge then 
comes in balancing the patron’s need for easy access to these rich data sources 
from any computer and the vendors’ desire to protect their assets.

* Distributed Version Control and Library Metadata - 
http://journal.code4lib.org/articles/86
Galen M. Charlton
Distributed version control systems (DVCSs) are effective tools for managing 
source code and other artifacts produced by software projects with multiple 
contributors. This article describes DVCSs and compares them with traditional 
centralized version control systems, then describes extending the DVCS model to 
improve the exchange of library metadata.

* The Planets Testbed: Science for Digital Preservation - 
http://journal.code4lib.org/articles/83
Brian Aitken (HATII), Petra Helwig (NANETH), Andrew Jackson (BL), Andrew 
Lindley (ARC), Eleonora Nicchiarelli (ONB), Seamus Ross (HATII)
The preservation of digital objects requires specific software tools or 
services. These can be characterisation tools that abstract the essential 
characteristics of a digital object from a file, migration tools that convert 
digital objects to different formats, or emulation tools that render digital 
objects in their original context on a new infrastructure. Until recently 
digital preservation has been characterised by practices and processes that 
could best be described as more art and craft than science. The Planets Testbed 
provides a controlled environment where preservation tools can be tested and 
evaluated, and where experiment results can be empirically compared. This paper 
presents an overview of the Testbed application, an analysis of the experiment 
methodology and a description of the Testbed’s web service approach.

* Bringing Sheet Music to Life: My Experiences with OMR - 
http://journal.code4lib.org/articles/84
Andrew Bullen
This article describes the process of digitizing sheet music celebrating 
Pullman porters and rail travel from the 1870s-1920s. The process involves 1) 
digitizing sheet music, 2) running the digitized sheet music through an Optical 
Musical Recognition (OMR) software package, 3)cleaning up the resulting file, 
4) converting it into an .mp3/MIDI file, and 5) tweaking it to use the 
voices/instruments of a music editing software program. The pros and cons of 
some popular OMR programs are discussed.

* Building an Archival Collections Portal - 
http://journal.code4lib.org/articles/77
Terry Catapano, Joanna DiPasquale, and Stuart Marquis
Columbia University Libraries has developed the Archival Collections Portal, a 
unified search system helping users discover archival resources in a 
streamlined way. We combined the power of Lucene and Solr to search XML, parse 
JSON objects, create EAD-compliant documents, and deliver results in an 
easy-to-use interface. By reusing MARC records and employing new search engine 
features and techniques, we are able to bring important and hard-to-find 
collections to researchers and archivists. The canonical home page of the 
Portal is http://www.columbia.edu/library/archival/.

* Developing an Academic Image Collection with Flickr - 
http://journal.code4lib.org/articles/74
Jeremy McWilliams
A group at Lewis  Clark College in Portland are in the process of developing 
an educational collection of contemporary ceramics images using the photo 
sharing site Flickr as a back end. This article discusses the evolution of the 
project, Flickr machine tags, and the concept of Flickr as an application 
database layer. The article includes code samples for creating and querying 
machine tags using the Flickr API.

* Making Patron Data Work Harder: User Search Terms as Access Points? - 
http://journal.code4lib.org/articles/78
Jason A. Clark
Montana State University (MSU) Libraries are experimenting with re-using 
patron-generated data to create browseable access points for the Electronic 
Theses and Dissertations (ETD) collection. A beta QueryCatcher module logs 
recent search terms and the number of associated hits. These terms are used to 
create browseable

[CODE4LIB] Code4Lib Journal: Call for Submissions

2008-03-10 Thread Ron Peterson
Call for Submissions:

The Code4Lib Journal (C4LJ) provides a forum to foster community and share 
information among those interested in the intersection of libraries, 
technology, and the future.

The deadline for submissions for the third issue of the Code4Lib Journal is 
approaching quickly.  To be included in the third issue, scheduled to be 
published in
late
June
2008, please submit articles, abstracts, or proposals for articles to [EMAIL 
PROTECTED] by Friday, March 14, 2008.

Possible topics for articles include, but are not limited to:

*
 Practical applications of library technology. Both actual and hypothetical 
applications invited.
* Technology projects (failed, successful, proposed, or  in-progress), how 
they were done, and challenges faced
* Case studies
* Best practices
* Reviews
* Comparisons of third party software or libraries
* Analyses of library metadata for use with technology
* Project management and communication within the library environment
* Assessment and user studies

Above all, C4LJ encourages creativity and flexibility, and the editors welcome 
submissions across a broad variety of topics. Anything that supports the 
mission of C4LJ is welcome.

The goal of the journal
 is to promote
 professional communication by minimizing the barriers to publication. While 
articles in the journal should be of a high quality, they need not follow any 
formal structure or guidelines. Writers should aim for the middle ground 
between, on the one hand, blog or mailing-list posts, and, on the other hand, 
articles in traditional journals. We want publishing in the journal to be easy 
and painless, helping the community to share timely, relevant information that 
is currently shared all too rarely.

Articles need not include comprehensive literature reviews and bibliographies, 
although pointing the reader to useful work that has gone before can certainly 
be helpful. Authors are encouraged to include code samples, algorithms, and 
pseudo-code where appropriate.

The
first
issue
of
the
Code4Lib
Journal
was
published
in
December
2007 and
is
available
at http://journal.code4lib.org/issues/issue1.
The second
issue
of
Code4Lib
Journal
is
well
underway
and
set
for publication
in
March.

The Journal will be electronic only and is edited rather than refereed.

Please contact us with proposals or queries, as well as draft articles,at
 [EMAIL PROTECTED] (a private list read only by C4LJ editors) no later than 
Friday, March 14, 2007.

For more information, you can find information on our mission, processes and 
structures, and guidelines for authors at: http://journal.code4lib.org/

We look forward to hearing from interested people,

Code4Lib Journal Editorial Committee








Call for Submissions


[CODE4LIB] Code4Lib Journal: Call for Submissions

2008-02-13 Thread Ron Peterson
Code4Lib Journal: Call for Submissions



Consider submitting proposals for articles, book  software reviews, code 
snippets  algorithms, conference reports, opinion pieces, etc. to Code4Lib 
Journal.



  Code4Lib Journal strives to fill a communication niche above simple 
description of software implementations but below pure theoretical digital 
library investigation. The editorial board is looking for content that is 
practical, demonstrates how to exploit technology to create digital library 
collections and services, or offers insight and forethought regarding the use 
of computers in any type of library setting. While articles in the journal 
should be of a high quality, they need not follow any formal structure or 
guidelines. Writers should aim for the middle ground between, on the one hand, 
blog or mailing-list posts, and, on the other hand, articles in traditional 
journals. We want publishing in the journal to be easy and painless, helping 
the community to share timely, relevant information that is currently shared 
all too rarely.



The first issue of the Code4Lib Journal was published in December 2007 and is 
available at http://journal.code4lib.org/issues/issue1.  The second issue of 
Code4Lib Journal is well underway and set for publication in March. The third 
issue is not quite full. The Journal welcomes submissions at any time on a 
rolling acceptance basis. Proposals received by Friday, March 14, 2008 will 
receive consideration for publication in late June 2008. Upon provisional 
acceptance, authors will be given an article deadline, typically 6-8 weeks 
before journal publication.



  Send in a submission. Your peers would like to hear what you are doing.



 Submission of articles proposals and articles should be submitted 
electronically to [EMAIL PROTECTED] (a private list read only by C4LJ editors) 
in a format likely to be widely readable. The article should be accompanied by 
some brief author metadata and, if appropriate, disclosure statement (see 
below). We have no more precise specs at this time, but may develop some at a 
later date, and reserve the right to ask for submission in a different format 
if we have trouble with yours. Authors are welcome to contact the Editorial 
Committee (http://journal.code4lib.org/editorial-committee) with a proposal, 
rather than a complete article.

 For more information, see our Article Guidelines 
(http://journal.code4lib.org/article-guidelines/) and  the Journal's Process 
and Structure (http://journal.code4lib.org/process-and-structure/).






Re: [CODE4LIB] pspell aspell: make your own word lists/dictionaries

2007-04-03 Thread Ron Peterson
Kevin,

At my former place of work I did something similar to what you are trying to 
do.  I never was able to get pspell in PHP to work with the custom dictionary.  
As a work-around I used the shell_exec command to run aspell as if from the 
command line, e.g.,:
$return = shell_exec(cat $temptext | /usr/local/bin/aspell -a -d[custom 
dictionary file]);

From there you need to parse the $return string.  You can see more details on 
how this is done at http://chxo.com/scripts/spellcheck.php

Unfortunately, I don't have access to the code that I actually used, but 
hopefully that will get you started.

Ron Peterson
University of Delaware Library

- Original Message 
From: Kevin Kierans [EMAIL PROTECTED]
To: CODE4LIB@listserv.nd.edu
Sent: Tuesday, April 3, 2007 12:40:53 PM
Subject: [CODE4LIB] pspell aspell: make your own word lists/dictionaries

Has anyone created their own dictionaries
for aspell?  We've created blank delimited
lists of words from our opac.  One for title,
one for subjects, and one for authors.  (We're thinking
of a series one as well)

We would like to use
one of these word lists to offer suggestions
depending on which search the patron is making.
We're assuming we can make better suggestions
if the words come from our actual opac.

We've got it working with the dictionary that
comes with aspell, but having problems (we can't do it!)
substituting our own  dictionaries.

Does anyone have any experience/knowledge/hints/pointers
they can share with us?

We are using linux, php 5,  aspell 0.50.5, and
php - pspell functions.

Thanks,
Kevin
TNRD Library System, Kamloops, British Columbia, Canada