[CODE4LIB] Job opening: Digital Data Repository Architect

2007-01-31 Thread Hilmar Lapp

(I apologize in advance if it is considered spam to post job openings
to this list, but my understanding was that it is acceptable.)

The National Evolutionary Synthesis Center (NESCent)
(http://www.nescent.org) seeks a highly motivated Digital Data
Repository Architect. The incumbent will deploy an initial repository
implementation and further develop systems for preservation,
discovery, sharing and integration of scientific data.

The context of the opening is the Digital Repository of Information
And Data for Evolution (DRIADE) project, of which a synopsis can be
found at
http://driade.nescent.org.

Further details on the job description and qualifications can be
found on the NESCent employment page at
http://www.nescent.org/about/
employment.php#Digital_Data_Repository_Architect

(or http://tinyurl.com/2bwwkf if your email client breaks up the URL
above)

About NESCent: NESCent is an NSF-funded research center for
evolutionary biology located in Durham, North Carolina. The Center is
jointly supported by Duke University (Durham), UNC Chapel Hill, and
NC State University (Raleigh). Durham is part of the so-called
Triangle area, which consistently ranks among the best places to live
in the US, and has one of the highest PhD densities in the country.
We offer competitive salary and the job will be a staff position
eligible for full Duke benefits.

Please feel free to distribute this ad to whomever you think might be
interested. Thanks,

   -hilmar
--
===
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===


Re: [CODE4LIB] not munging reply-to (was Re: [CODE4LIB] E-Resource Access & Management Services)

2007-03-30 Thread Hilmar Lapp

It's not a charged issue, it's simply a harmful but entirely
unnecessary practice. For a much more eloquent explanation, see for
example

http://www.unicom.com/pw/reply-to-harmful.html

Besides, not all email clients have a reply-to-sender feature (mine -
Apple Mail - for example doesn't), but practically all have a reply-
to-all feature.

   -hilmar


On Mar 30, 2007, at 1:45 PM, Erik Hatcher wrote:


On Mar 30, 2007, at 1:40 PM, Daniel Chudnov wrote:


On Fri, 30 Mar 2007, Ross Singer wrote:


Well that probably didn't need to go to the whole world, but there
you go.


/me votes for turning off reply-to munging on this list.


-1   - replies should go to lists.  :)   i know, i know, its a very
charged issue, but i feel strongly that an e-mail list is about
community and i can easily hit reply-to-sender in my mail interface
if i want to send something privately.   and i am very aware that
others feel strongly on the opposite side of this issue.

   Erik


--
=======
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===


Re: [CODE4LIB] not munging reply-to (was Re: [CODE4LIB] E-Resource Access & Management Services)

2007-03-30 Thread Hilmar Lapp

On Mar 30, 2007, at 3:08 PM, Erik Hatcher wrote:


On Mar 30, 2007, at 2:40 PM, Hilmar Lapp wrote:


It's not a charged issue, it's simply a harmful but entirely
unnecessary practice. For a much more eloquent explanation, see for
example

http://www.unicom.com/pw/reply-to-harmful.html


bah!reply-all sucks, it ends up duplicating mails unless you
manually munge the sender list.


First, I find identifying and deleting duplicate emails rather
trivial than noticing it as an issue. Second, decent mailing list
managers will do that for you, for example mailman.




Besides, not all email clients have a reply-to-sender feature (mine -
Apple Mail - for example doesn't), but practically all have a reply-
to-all feature.


not true.  click on the From and "Reply to Sender".


I'm a button feeder, sorry. Obscure features aren't obscure because
they are meant for frequent and everyone's consumption.

I'm also a huge fan of simplicity in life. Reply-to munging makes
things difficult (or call it obscure) that ought to (and can) be very
simply, whereas not making things even simpler that are already simple.

Finally, I'm also too old to engage in an argument about whether the
decision should be mine where a reply to an email that I receive
should go to, or that of a mailing list administrator. I also feel
fortunate that virtually all of the communities I interact with (no,
I'm not a librarian) don't think that there's any debate about that
question.

   -hilmar

--
=======
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===


Re: [CODE4LIB] mysql to postgres

2007-08-15 Thread Hilmar Lapp

SQL::Translator (written in Perl) does this.

http://sqlfairy.sourceforge.net/

   -hilmar

On Aug 15, 2007, at 2:28 PM, Eric Lease Morgan wrote:


Can anybody here recommend a MySQL database schema to Postgres
database schema converter?

I have a (MyLibrary) MySQL database schema. Do y'all know of some
sort of script I can run against this text file to create a Postgres
database schema?

--
Eric Lease Morgan
University Libraries of Notre Dame


--
===
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===


[CODE4LIB] CfP: Dublin Core 2008/Berlin, Germany

2008-01-07 Thread Hilmar Lapp

(apologies for any cross posting)

  CONFERENCE ANNOUNCEMENT AND INITIAL CALL FOR PAPERS

  DC-2008 -- International Conference on Dublin Core
 and Metadata Applications
 http://dc2008.de/

 "Metadata for Semantic and Social Applications"

 22-26 September 2008
 Berlin

The annual Dublin Core conferences bring together leading
metadata researchers and professionals from around the world.
DC-2008 in Berlin will be the eighth in a series of conferences
held previously in Tokyo, Florence, Seattle, Shanghai, Madrid,
Manzanillo, and Singapore.  The conference is organized jointly
by the Competence Centre for Interoperable Metadata (KIM),
Max Planck Digital Library, Göttingen State and University
Library, the German National Library, Humboldt Universität zu
Berlin, and Dublin Core Metadata Initiative with sponsorship
from Wikimedia Deutschland.

CONFERENCE THEME

Metadata is a key aspect of our evolving infrastructure for
information management, social computing, and scientific
collaboration.

DC-2008 will focus on metadata challenges, solutions, and
innovation in initiatives and activities underlying semantic
and social applications.  Metadata is part of the fabric of
social computing, which includes the use of wikis, blogs,
and tagging for collaboration and participation.   Metadata
also underlies the development of semantic applications,
and the Semantic Web -- the representation and integration
of multimedia knowledge structures on the basis of semantic
models.  These two trends flow together in applications such
as Wikipedia, where authors collectively create structured
information that can be extracted and used to enhance access
to and use of information sources.

Recent discussion has focused on how existing bibliographic
standards can be expressed as Semantic Web vocabularies
to facilitate the integration of library and cultural
heritage data with other types of data.  Harnessing the
efforts of content providers and end-users to link, tag,
edit, and describe their information in interoperable ways
("participatory metadata") is a key step towards providing
knowledge environments that are scalable, self-correcting,
and evolvable.

DC-2008 will explore conceptual and practical issues in the
development and deployment of semantic and social applications
to meet the needs of specific communities of practice.

Papers, reports, and poster submissions are welcome on a wide
range of metadata topics, such as:

+ Metadata generation (methods, tools, and practices)

+ Semantic Web metadata and applications

+ Conceptual models and frameworks (e.g., RDF, DCAM, OAIS)

+ Social tagging

+ Knowledge Organization Systems (KOS) and Simple Knowledge
  Organization Systems (SKOS) (e.g., ontologies, taxonomies,
  authority files, folksonomies, and thesauri)

+ Metadata in e-Science and grid applications

+ Metadata interoperability and internationalization

+ Metadata quality, normalization, and mapping

+ Cross-domain metadata uses (e.g., recordkeeping,
  preservation, institutional repositories)

+ Vocabulary registries and registry services

+ Domain metadata (e.g., for corporations, cultural memory
  institutions, education, government, and scientific fields)

+ Application profiles

+ Accessibility metadata

+ Search engines and metadata

+ Metadata principles, guidelines, and best practices

+ Bibliographic standards (e.g., Resource Description and Access
  (RDA), Functional Requirements for Bibliographic Records (FRBR),
  subject headings) as Semantic Web vocabularies

SUBMISSIONS:

All submissions  will be peer-reviewed by the International
Program Committee and published in the conference proceedings.
The Committee is soliciting paper contributions of the
following three types:

-- FULL PAPERS (8 to 10 pages)

   Full papers either describe innovative original work in
   detail or provide critical, well-referenced overviews of key
   developments or good practice in the areas outlined above.
   Full papers will be assessed using the following criteria:

   o  Originality of the approach to implementation

   o  Generalizability of the methods and results described

   o  Quality of the contribution to the implementation community

   o  Significance of the results presented

   o  Clarity of presentation

-- PROJECT REPORTS (4 pages)

   Project reports describe a specific model, application, or
   activity in a concise, prescribed format.  Project reports
   will be assessed using the following criteria:

   o  Conciseness and completeness of technical description

   o  Usability of the technical description by other potential
implementers

   o  Clarity of presentation

Paper submissions in both categories must be in English and
will be published in both the print and the official electronic
versions of the conference proceedings.  Accepted papers must
be presented in Berlin by at least one of their authors.

-- POSTERS (1 pa

Re: [CODE4LIB] Shibboleth

2008-03-21 Thread Hilmar Lapp

We use Shibboleth to allow resident scientists to authenticate to our
wikis using the university ID/pwd, and to secure other parts of our
website to certain (resident) people.

It's been a pain - there are caching issues and users need to reload
the landing page so their identity is recognized, you need
certificates, and they need to match the hostname exactly, which
makes virtual host naming a pain.

   -hilmar

On Mar 21, 2008, at 10:23 AM, K.G. Schneider wrote:

If you have been involved in investigating or implementing
Shibboleth --
or alternative approaches -- I'd like to hear from you.

Wearing my official chapeau I am,

Karen G. Schneider
Research & Development
College Center for Library Automation
http://www.cclaflorida.org
Voice: 850-922-6044
AIM/Email: [EMAIL PROTECTED]


--
===
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===


[CODE4LIB] extended deadline/Dublin Core 2008

2008-03-30 Thread Hilmar Lapp

Due to a number of requests, the deadline for submissions for Dublin
Core 2008 in Berlin, Germany, has been extended to April 13.

The formal CfP is below.

We'd appreciate help from colleagues and friends forwarding this post
to appropriate lists, and apologize for any duplication in advance.

thank you very much, jane and wolfgang





CONFERENCE ANNOUNCEMENT AND INITIAL CALL FOR PAPERS

  DC-2008 -- International Conference on Dublin Core
 and Metadata Applications
 http://dc2008.de/

 "Metadata for Semantic and Social Applications"

 22-26 September 2008
 Berlin

The annual Dublin Core conferences bring together leading
metadata researchers and professionals from around the world.
DC-2008 in Berlin will be the eighth in a series of conferences
held previously in Tokyo, Florence, Seattle, Shanghai, Madrid,
Manzanillo, and Singapore.  The conference is organized jointly
by the Competence Centre for Interoperable Metadata (KIM),
Max Planck Digital Library, Gttingen State and University
Library, the German National Library, Humboldt Universitt zu
Berlin, and Dublin Core Metadata Initiative with sponsorship
from Wikimedia Deutschland.

CONFERENCE THEME

Metadata is a key aspect of our evolving infrastructure for
information management, social computing, and scientific
collaboration.

DC-2008 will focus on metadata challenges, solutions, and
innovation in initiatives and activities underlying semantic
and social applications.  Metadata is part of the fabric of
social computing, which includes the use of wikis, blogs,
and tagging for collaboration and participation.   Metadata
also underlies the development of semantic applications,
and the Semantic Web -- the representation and integration
of multimedia knowledge structures on the basis of semantic
models.  These two trends flow together in applications such
as Wikipedia, where authors collectively create structured
information that can be extracted and used to enhance access
to and use of information sources.

Recent discussion has focused on how existing bibliographic
standards can be expressed as Semantic Web vocabularies
to facilitate the integration of library and cultural
heritage data with other types of data.  Harnessing the
efforts of content providers and end-users to link, tag,
edit, and describe their information in interoperable ways
("participatory metadata") is a key step towards providing
knowledge environments that are scalable, self-correcting,
and evolvable.

DC-2008 will explore conceptual and practical issues in the
development and deployment of semantic and social applications
to meet the needs of specific communities of practice.

Papers, reports, and poster submissions are welcome on a wide
range of metadata topics, such as:

+ Metadata generation (methods, tools, and practices)

+ Semantic Web metadata and applications

+ Conceptual models and frameworks (e.g., RDF, DCAM, OAIS)

+ Social tagging

+ Knowledge Organization Systems (KOS) and Simple Knowledge
  Organization Systems (SKOS) (e.g., ontologies, taxonomies,
  authority files, folksonomies, and thesauri)

+ Metadata in e-Science and grid applications

+ Metadata interoperability and internationalization

+ Metadata quality, normalization, and mapping

+ Cross-domain metadata uses (e.g., recordkeeping,
  preservation, institutional repositories)

+ Vocabulary registries and registry services

+ Domain metadata (e.g., for corporations, cultural memory
  institutions, education, government, and scientific fields)

+ Application profiles

+ Accessibility metadata

+ Search engines and metadata

+ Metadata principles, guidelines, and best practices

+ Bibliographic standards (e.g., Resource Description and Access
  (RDA), Functional Requirements for Bibliographic Records (FRBR),
  subject headings) as Semantic Web vocabularies

SUBMISSIONS:

All submissions  will be peer-reviewed by the International
Program Committee and published in the conference proceedings.
The Committee is soliciting paper contributions of the
following three types:

-- FULL PAPERS (8 to 10 pages)

   Full papers either describe innovative original work in
   detail or provide critical, well-referenced overviews of key
   developments or good practice in the areas outlined above.
   Full papers will be assessed using the following criteria:

   o  Originality of the approach to implementation

   o  Generalizability of the methods and results described

   o  Quality of the contribution to the implementation community

   o  Significance of the results presented

   o  Clarity of presentation

-- PROJECT REPORTS (4 pages)

   Project reports describe a specific model, application, or
   activity in a concise, prescribed format.  Project reports
   will be assessed using the following criteria:

   o  Conciseness and completeness of technical description

   o  Usability of the techn

[CODE4LIB] Cyberinfrastructure Summer Traineeships for repository interoperability

2009-03-29 Thread Hilmar Lapp

*** Please disseminate widely to students at your institution ***

CYBERINFRASTRUCTURE SUMMER TRAINEESHIPS 2009

  VIRTUAL DATA CENTER FOR
   BIODIVERSITY, EARTH, ECOLOGICAL, AND EVOLUTIONARY SCIENCE DATA

http://hackathon.nescent.org/Cyberinfrastructure_Summer_Traineeships_2009

Summer traineeships are available for up to four students and postdocs  
interested in informatics as applied to scientific data ranging from  
the fields of biodiversity, ecology, and evolutionary biology. The  
program provides a unique opportunity for undergraduate, masters, and  
PhD students as well as postdocs to obtain hands-on experience writing  
and extending open-source software as part of a distributed  
collaborative software development team building a Virtual Data Center  
(VDC) that includes major data and metadata repositories in those  
fields.


Trainees accepted into the program will receive a stipend ($4,500),  
and with the exception of attending one meeting near the beginning and  
one near the end of the 3-month program period may work from their  
home, or home institution. Travel costs incurred in connection with  
the meetings will be reimbursed. Each student will have at least one  
dedicated mentor to show them the ropes and help them complete their  
project.


Initial project ideas are listed on the website. These range from  
validation of metadata and identifier resolution, to supporting LSID  
and semantic-web compliant PURLs for digital data objects, to  
implementing modern web-service APIs, to cataloging the diversity of  
metadata schemas. The project ideas are flexible and can be adjusted  
in scope to match the skills of the student. We also welcome novel  
project ideas that dovetail with student interests.


The traineeships are supported by a National Science Foundation (NSF)  
grant to a consortium of major repositories for biodiversity, earth  
and environmental, ecological, and evolutionary science. The  
consortium includes the LTER Network Office, the U.S. Geological  
Survey, NASA and Oak Ridge National Laboratory, the Global  
Biodiversity Information Facility (GBIF), the National Evolutionary  
Synthesis Center(NESCent), and the National Center for Ecological  
Analysis and Synthesis (NCEAS). It aims to develop the  
cyberinfrastructure and technologies necessary to build a Virtual Data  
Center (VDC) based on a network of existing and new physical  
repositories ("nodes") that interoperate using open standards and  
protocols. The network will enable discovery of as well as open,  
stable, and secure access to data in any of its member nodes.


TO APPLY: Students apply online. Instructions for applying are at the  
website (see "When you apply"), along with program rules and  
eligibility requirements.  The 15-day application period for students  
opens on Monday March 30th and runs through Monday, April 13th, 2009.


INQUIRIES: vdc-twg {at} ecoinformatics {dot} org. We strongly  
encourage all interested students to get in touch with us with their  
ideas as early as possible.


Cyberinfrastructure Traineeships Website:
http://hackathon.nescent.org/Cyberinfrastructure_Summer_Traineeships_2009

2009 NESCent Phyloinformatics Summer of Code (NESCent's participation  
in the Google Summer of Code; managed separately; postdocs not  
eligible; ***student application period ends April 3rd***)

http://hackathon.nescent.net/Phyloinformatics_Summer_of_Code_2009

To sign up for quarterly NESCent newsletters: 
http://www.nescent.org/about/contact.php

-----

Todd Vision and Hilmar Lapp
National Evolutionary Synthesis Center
http://nescent.org


Re: [CODE4LIB] registering info: uris?

2009-03-30 Thread Hilmar Lapp

On Mar 30, 2009, at 11:18 AM, Ray Denenberg, Library of Congress wrote:


From: "Ross Singer" 

nobody gives a damn about info:uris outside of
libraries,


Nor do people outside of libraries care about identifiers.


You might be surprised: http://www.lsrn.org/

-hilmar
--
===
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===


[CODE4LIB] Cyberinfrastructure Summer Internships for repository interoperability: application deadline reminder

2009-04-07 Thread Hilmar Lapp

*** Please disseminate widely to students at your institution ***

CYBERINFRASTRUCTURE SUMMER INTERNSHIPS 2009 -
  REMINDER: Student Application Deadline is April 13, 2009

http://hackathon.nescent.org/ 
Cyberinfrastructure_Summer_Traineeships_2009


Summer training internships are available for up to four students and  
postdocs interested in informatics as applied to scientific data in  
such fields as biodiversity, ecology, and evolutionary biology. The  
program provides a unique opportunity for undergraduate, masters, and  
PhD students as well as postdocs to obtain hands-on experience  
writing and extending open-source software as part of a distributed  
collaborative software development team building a Virtual Data  
Center (VDC) that includes major data and metadata repositories in  
those fields.


The application deadline for students (April 13, 2009) is approaching  
rapidly.


Trainees accepted into the program will receive a stipend ($4,500),  
and with the exception of attending one meeting near the beginning  
and one near the end of the 3-month program period may work from  
their home, or home institution. Travel costs incurred in connection  
with the meetings will be reimbursed. Each student will have at least  
one dedicated mentor to show them the ropes and help them complete  
their project.


Initial project ideas are listed on the website. These range from  
validation of metadata and identifier resolution, to supporting LSID  
and semantic-web compliant PURLs for digital data objects, to  
implementing modern web-service APIs, to cataloging the diversity of  
metadata schemas. The project ideas are flexible and can be adjusted  
in scope to match the skills of the student. We also welcome novel  
project ideas that dovetail with student interests.


The program is supported by a National Science Foundation (NSF) grant  
to a consortium of major repositories for biodiversity, earth and  
environmental, ecological, and evolutionary science. The consortium  
includes the LTER Network Office, the U.S. Geological Survey, NASA  
and Oak Ridge National Laboratory, the Global Biodiversity  
Information Facility (GBIF), the National Evolutionary Synthesis  
Center(NESCent), and the National Center for Ecological Analysis and  
Synthesis (NCEAS). It aims to develop the cyberinfrastructure and  
technologies necessary to build a Virtual Data Center (VDC) based on  
a network of existing and new physical repositories ("nodes") that  
interoperate using open standards and protocols. The network will  
enable discovery of as well as open, stable, and secure access to  
data in any of its member nodes.


TO APPLY: Students apply online. Instructions for applying are at the  
website (see "When you apply"), along with program rules and  
eligibility requirements.  The 15-day application period for students  
end on Monday, April 13th, 2009.



INQUIRIES: vdc-twg {at} ecoinformatics {dot} org. We strongly  
encourage all interested students to get in touch with us with their  
ideas as early as possible.


Cyberinfrastructure Traineeships Website:
http://hackathon.nescent.org/ 
Cyberinfrastructure_Summer_Traineeships_2009


To sign up for quarterly NESCent newsletters: http://www.nescent.org/ 
about/contact.php


-----

Todd Vision and Hilmar Lapp
National Evolutionary Synthesis Center
http://nescent.org


Re: [CODE4LIB] "code 4 museums"

2009-04-14 Thread Hilmar Lapp

There is the Specify software for natural history collections:

http://specifysoftware.org/

The source code has apparently just recently been deposited on  
SourceForge.


-hilmar

On Apr 14, 2009, at 3:12 PM, Ethan Gruber wrote:


Hi all,

I've been a software developer in a research library for several  
years, and
I have worked with objects typically viewed as museum collections to  
a large
degree (particularly ancient coins and eighteenth century European  
sheet
music).  Since I'm from a library and am familiar with library  
technological
standards as far as metadata practices and software applications go,  
I tend
to apply library standards toward the museum collections I have been  
in
contact with--which involves Encoded Archival Description for  
metadata,
opensource applications like tomcat, cocoon, and lucene/solr.  My  
knowledge
of museum practices is fairly limited, but I have noticed that many  
museums
have tended to adopt proprietary databases to describe their  
collections.  I
feel museums tend to lag behind their library counterparts with  
respect to
the adoption of opensource frameworks and open standards, but if you  
think

about it, museums are scarcely different than many archives/special
collections libraries in content and organization.  I'm thinking of
PastPerfect in particular.  It's quite common in the museum world  
and costs

almost $1000 per license.

I'm wondering if anyone else on code4lib actually works for a museum  
or has
first-hand experience in providing access to museum collections and  
has
noticed the same general differences between libraries and museums  
that I

have.

Ethan Gruber
University of Virginia Library


--
=======
: Hilmar Lapp  -:-  Durham, NC  -:- hlapp at duke dot edu :
===