Re: [CODE4LIB] Regex Question

2015-07-07 Thread Katherine N. Deibel
To add on a few things that others have said in this thread:

- Another good online regex tool is https://regex101.com/. I really like the 
testing tools it provides.

- Although it's not exactly what you need, Word does have an ability to search 
by format (it's under the Select menu on the Home tab of the ribbon).

Kate Deibel, PhD | Web Applications Specialist
Information Technology Services 
University of Washington Libraries 
http://staff.washington.edu/deibel

--

"When Thor shows up, it's always deus ex machina."

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Matt 
Sherman
Sent: Tuesday, July 7, 2015 9:45 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Regex Question

Thanks everyone, this really helps.  I'll have to work out the italicized 
stuff, but this gets me much closer.

On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee 
wrote:

> Y'all are doing this the hard way. Word allows regex replacements as 
> well as format based criteria.
>
> For this particular use case:
>
>    1. Open the find/replace dialog (Ctrl+H)
>    2. In the "Find what" box, put (<*>) -- make sure the option for "Use
>    Wildcards" is selected, and for the format, specify italic
>    3. For the "Replace with" box, just put \1 and specify All caps
>
> And you're done
>
> kyle
>
> On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel 
> wrote:
>
> >   Eric Phetteplace writes
> >
> > > You can match a string of all caps letters like "[A-Z]"
> >
> >   This works if you are limited to English. But in a multilingual
> >   setting, you need to watch out for other uppercases, such as
> >   крихель vs КРИХЕЛЬ. It then depends on the Unicode implementation
> >   of your regex application. In Perl, for example, you would use
> >   [[:upper:]].
> >
> >
> > --
> >
> >   Cheers,
> >
> >   Thomas Krichel  http://openlib.org/home/krichel
> >   skype:thomaskrichel
> >
>


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
For clarity, Word does regex, not just wildcards.  It's not quite as
complete as what you'd get with some other environments such as OpenOffice
Writer: matching is lazy rather than greedy, which can be a big deal
depending on what you're doing, and there are a couple of other catches --
notably no support for "|" -- but it's reasonably powerful. There is no
regexp capability in Excel unless you're willing to use VBA.
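
To make the lazy/greedy distinction and the "|" operator concrete, here is a
minimal sketch using Python's re module rather than Word or Writer. The sample
string and the <i> markers standing in for italic spans are made up for
illustration; this is not how either word processor behaves internally.

```python
import re

# Hypothetical sample text; <i>...</i> just stands in for italic spans.
sample = "<i>First Title</i> by SMITH and <i>Second Title</i> by JONES"

# Greedy: .* takes as much as it can, so the match runs to the LAST </i>.
greedy = re.findall(r"<i>(.*)</i>", sample)

# Lazy: .*? takes as little as it can, so each match stops at the FIRST </i>.
lazy = re.findall(r"<i>(.*?)</i>", sample)

# Alternation with "|", the operator noted above as missing from Word.
names = re.findall(r"SMITH|JONES", sample)

print(greedy)
print(lazy)
print(names)
```

With a greedy quantifier the two titles collapse into one oversized match,
which is why the difference matters when a document has several marked spans
on one line.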

kyle

On Tue, Jul 7, 2015 at 1:10 PM, Gordon, Bonnie  wrote:

> OpenOffice Writer (or a similar program) may be useful for this. It would
> allow you to search by format while using a more controlled regular
> expression than MS Word's wildcards.
>
> -Original Message-
> From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
> Matt Sherman
> Sent: Tuesday, July 07, 2015 12:45 PM
> To: CODE4LIB@LISTSERV.ND.EDU
> Subject: Re: [CODE4LIB] Regex Question
>
> Thanks everyone, this really helps.  I'll have to work out the italicized
> stuff, but this gets me much closer.
>
> On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee 
> wrote:
>
> > Y'all are doing this the hard way. Word allows regex replacements as
> > well as format based criteria.
> >
> > For this particular use case:
> >
> > >    1. Open the find/replace dialog (Ctrl+H)
> > >    2. In the "Find what" box, put (<*>) -- make sure the option for "Use
> > >    Wildcards" is selected, and for the format, specify italic
> > >    3. For the "Replace with" box, just put \1 and specify All caps
> >
> > And you're done
> >
> > kyle
> >
> > On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel 
> > wrote:
> >
> > >   Eric Phetteplace writes
> > >
> > > > You can match a string of all caps letters like "[A-Z]"
> > >
> > >   This works if you are limited to English. But in a multilingual
> > >   setting, you need to watch out for other uppercases, such as
> > > >   крихель vs КРИХЕЛЬ. It then depends on the Unicode implementation
> > >   of your regex application. In Perl, for example, you would use
> > >   [[:upper:]].
> > >
> > >
> > > --
> > >
> > >   Cheers,
> > >
> > >   Thomas Krichel  http://openlib.org/home/krichel
> > >   skype:thomaskrichel
> > >
> >
>


[CODE4LIB] Job: Senior Consultant at AVPreserve

2015-07-07 Thread jobs
 Senior Consultant
AVPreserve
New York City

AVPreserve is currently accepting applications for the role
of Senior Consultant. Senior Consultants at AVPreserve are the face of the
company, responsible for using their expertise and all available resources to
contribute to envisioning, formulating, planning, synthesizing, and
implementing strategies and solutions in alignment with the goals and
objectives of our clients and the AVPreserve vision. Senior Consultants are
responsible for identifying opportunities for AVPreserve to support the goals
and objectives of past, present and prospective clients, and are also expected
to be a leader in the company, mentoring and developing AVPreserve staff and
providing support on AVPreserve projects where needed, as well as a knowledge
leader among colleagues and within professional organizations. This
versatility and cross-fertilization among business lines is an important
factor underlying AVPreserve's success.

  
AVPreserve provides a wide array of services, and our team members require
diverse skill-sets in order to contribute across a range of those offerings.
At this time we are particularly looking for people who can immediately step
into roles providing services in the areas of data management and governance,
data migration, data flow, and data modeling; the implementation of digital
preservation standards, policies, and best practices; and the identification
and integration of digital asset management procedures and systems in both
production and archival environments. Applicants should have 5 years'
experience in at least one of those areas as well as existing proficiency in
the use of applicable hardware, software, and computing language(s), along
with the willingness to gain expertise with additional skills as needed.

  
At AVPreserve, you will find challenging and meaningful work through
collaboration with talented and engaged colleagues and clients, tackling real-
world problems through development and implementation of innovative solutions
and strategies. You will work alongside others in a culture of continual
improvement that supports and motivates constant growth. You will be
contributing to the continued evolution of AVPreserve at an exciting time for
the company and for the clients we serve. Over time, you will be given greater
responsibilities, autonomy and the ability to be creative and innovative in
your work, with AVPreserve as a platform for support and promotion in the
attainment of your own goals.

  
To Apply

  
Email a PDF of a cover letter and resume to careers[at]avpreserve[dot]com. No
phone calls please.

  
General goals and objectives of this position include:

  * Work in support of the AVPreserve vision;
  * Be an active participant in establishing and maintaining AVPreserve as the 
premier consulting and software development firm focused on leveraging a deep 
understanding of technology, information management, business, and people to 
advance the ways in which information is used for the benefit of individuals, 
organizations, and causes;
  * Provide the most effective and meaningful support possible to our clients;
  * Develop and retain clientele;
  * Take responsibility for planning, attaining, managing and reporting on 
AVPreserve budgetary goals;
  * Continually learn and consistently contribute to the AVPreserve skill-set 
and knowledge base;
  * Increase awareness of and promote the AVPreserve brand and services; and
  * Promote increased communication and collaboration within AVPreserve to aid 
in effective coordination and implementation of project management, marketing 
and revenue generation efforts.

Specific responsibilities of this position include:

  * Contributing to the successful completion of consulting engagements, 
including but not limited to:
    * Researching, compiling, synthesizing and analyzing information;
    * Conducting interviews, surveys, assessments and other forms of discovery;
    * Leading the analysis and writing of project reports, including the 
creation of diagrams, spreadsheets and other supporting documents;
    * Development of specifications and guidelines;
    * Working with AVPreserve team members to generate project deliverables;
    * Preparing and making presentations; and
    * Leading meetings
  * Leading and managing projects, including but not limited to:
    * Developing and managing project timelines, schedules, milestones, 
deliverables, budgets and resource allocation;
    * Identifying resources required;
    * Managing work plan execution;
    * Tracking actual vs. budgeted hours and expenditures;
    * Overseeing project team members including subcontractors;
    * Serving as the client liaison; and
    * Familiarizing staff with projects and activities, training staff when 
necessary and delegating work so that staff are able to easily assist with 
completion of work.
  * Reporting on projects and activities including:
    * Regular meetings to discuss and present on business, projects and 
foreca

[CODE4LIB] Job: Digital Initiative Librarian/Assistant Professor at Idaho State University

2015-07-07 Thread jobs
Digital Initiative Librarian/Assistant Professor
Idaho State University
Pocatello

Supports digital scholarship initiatives including: providing assistance in
the digitization of special collections, managing existing digital library
services, and informing the creation of new digital services. Reporting to the
Head of Collection Management, the Digital Initiatives Librarian works with
staff and faculty throughout the library to provide innovative digital library
services.

  
Key Responsibilities

  * Manage and coordinate digital library projects among faculty, librarians, 
and staff; develop project plans, general project documentation, 
specifications, workflows, and timelines for successful completion of digital 
library projects.
  * Supervise a classified and professional staff of 3 FTE to support library 
technology including workstations, servers, and other technology. Lead staff in 
the identification of digital resource needs and solutions.
  * Identify and lead new digital initiatives.
  * Assist with staff training related to digital collections and technologies.
  * Develop and implement new digital library services in partnership with 
staff across the libraries such as mobile applications or cloud based 
integrated library systems
  * Formulate policies, procedures and best practices for the production, 
management, and preservation of digital content and metadata, technical 
workflow, quality control, and associated intellectual property issues.
  * Assist in setting up new collections or content in the various service 
areas including institutional repository and digital archives; assist in 
improving workflows for these services as needed.
  * Provide technical expertise in digital applications and lead the evaluation 
and implementation of technologies related to the management and dissemination 
of digital content, e.g. digital asset management system, institutional 
repository, and online exhibition tools.
  * Leverage technical skills in managing content migration, sharing 
collections with aggregators, improving and normalizing encoding and 
descriptive practices
  * Engage with and stay current on digital trends, technologies, and industry 
standards to provide direction and leadership for the library in the rapidly 
changing digital environment. Maintains in-depth knowledge of the 
identification, assessment, and implementation of emerging technologies that 
strategically advance the Library's mission.
  * Contribute to the campus discussion about digital initiatives and 
information.
  * Participates in reference desk duty. Assists faculty, students, staff and 
other users on a one-on-one basis by answering specific information requests, 
offering research assistance and guiding users in the effective use of a broad 
range of information resources offered by the Library.
  * Serves as a bibliographer to designated academic departments and oversees 
the selection of materials in assigned subject areas. Assists in collection 
development and evaluation; monitors pertinent portions of the acquisitions 
budget; works to develop and maintain good relations with faculty members.
  * Participates in professional activities, including university and library 
committees and task forces, and attends professional meetings.
  * Participates in the Library's instruction program.
  
Minimum Qualifications

  * M.L.S., M.L.I.S. or equivalent degree from an ALA-accredited program by 
date of hire
  * Demonstrated ability to plan, coordinate and manage existing digital 
library projects
  * Demonstrated knowledge or awareness of tools and technology standards 
typically used in digital library environments
  * Demonstrated experience with multiple computer platforms including 
UNIX/Linux, and Windows and associated hardware.
  * Demonstrated excellent written and oral communication skills
  * Demonstrated good organizational skills and an aptitude for analytical and 
detailed work
  * Demonstrated ability to work independently as well as collaboratively in a 
complex, rapidly changing and professionally diverse environment.
  * Demonstrated ability to participate in library faculty activities as needed 
such as public services, instruction, bibliography, and library committee work.
  * Demonstrated ability to represent the Library to appropriate campus units, 
to participate in professional activities and committee assignments within the 
Library and on campus, and to participate in local, regional, and national 
outreach and professional activities.
  
Preferred Qualifications

  * Demonstrated experience with relational databases such as SQL, MySQL, SOLR 
and MSAccess.
  * Demonstrated experience and knowledge of metadata and content standards, 
e.g., MARC, EAD, Dublin Core, and AACR2.
  * Familiarity with digital objects in various formats (text, image, audio 
and/or video) and best practices and standards associated with each format
  * Demonstrated experience with markup languages suc

[CODE4LIB] Job: Video Preservation Engineer at Indiana University Bloomington

2015-07-07 Thread jobs
Video Preservation Engineer
Indiana University Bloomington
Bloomington

The Video Preservation Engineer works in the IU media digitization facility
operated by the Media Digitization and Preservation Initiative (MDPI). The
Video Preservation Engineer is responsible for the preservation transfer
(digitization) of analog videotape recordings including the Betamax, Hi8, 8mm,
U-matic, VHS, and EIAJ formats, to preservation master digital files at the
highest quality possible. This includes optimal playback of actively degrading
media on highly obsolete formats, quality control procedures,
alignment/calibration/manipulation of playback machines, and verification of
signal chain performance. Manipulation of degrading media for preservation
work requires the independent application of ethical principles while making
subjective judgments related to accuracy and quality. The Engineer is also
responsible for collecting extensive documentation on both the source
recordings and the transfer process. This position will participate in
research related to the development of methodologies and workflows for the
digitization of particularly difficult formats such as EIAJ, gaining
efficiencies within a 1:1 workflow, developing and using workflow management
software, and performing quality control for the products of large-scale
digitization.

  
Qualifications:

  
REQUIRED: Bachelor's degree required in telecommunications or related field
and at least three years' experience with the playback of legacy video formats
in a studio, post house, archive, or production setting including experience
with analysis and diagnostic tools such as waveform monitors and vectorscopes.

  
Strongly preferred experience with:

  
a) video archiving or preservation transfer work,

b) deteriorating or otherwise problematic tapes

c) documenting video projects in detail,

d) lengthy projects requiring meticulous attention to detail,

e) application of preservation principles to digitization work.

  
Well-developed critical viewing skills, knowledge of legacy video formats.

  
Indiana University is an equal opportunity employer. All qualified applicants
will receive consideration for employment without regard to race, color,
religion, age, sex, sexual orientation or identity, national origin,
disability status, or protected veteran status. This institution is also a
provider of ADA services.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21759/
To post a new job please visit http://jobs.code4lib.org/


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Gordon, Bonnie
OpenOffice Writer (or a similar program) may be useful for this. It would allow 
you to search by format while using a more controlled regular expression than 
MS Word's wildcards.

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Matt 
Sherman
Sent: Tuesday, July 07, 2015 12:45 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Regex Question

Thanks everyone, this really helps.  I'll have to work out the italicized 
stuff, but this gets me much closer.

On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee 
wrote:

> Y'all are doing this the hard way. Word allows regex replacements as 
> well as format based criteria.
>
> For this particular use case:
>
>    1. Open the find/replace dialog (Ctrl+H)
>    2. In the "Find what" box, put (<*>) -- make sure the option for "Use
>    Wildcards" is selected, and for the format, specify italic
>    3. For the "Replace with" box, just put \1 and specify All caps
>
> And you're done
>
> kyle
>
> On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel 
> wrote:
>
> >   Eric Phetteplace writes
> >
> > > You can match a string of all caps letters like "[A-Z]"
> >
> >   This works if you are limited to English. But in a multilingual
> >   setting, you need to watch out for other uppercases, such as
> >   крихель vs КРИХЕЛЬ. It then depends on the Unicode implementation
> >   of your regex application. In Perl, for example, you would use
> >   [[:upper:]].
> >
> >
> > --
> >
> >   Cheers,
> >
> >   Thomas Krichel  http://openlib.org/home/krichel
> >   skype:thomaskrichel
> >
>


[CODE4LIB] Job: Digital Repository Librarian at University of British Columbia

2015-07-07 Thread jobs
Digital Repository Librarian
University of British Columbia
Vancouver

The Digital Repository Librarian is responsible for developing, supporting,
and assessing recruitment activities as well as managing the
content in cIRcle, UBC's open access digital repository. The role is
also responsible for managing the repository software and
technical system administration of the cIRcle
managing the development and implementation of content policies and
procedures; ensuring copyright compliance of repository
content; and undertaking promotion of cIRcle and open access.
The Digital Repository Librarian is responsible for many of
the day-to-day activities of cIRcle, including liaising with
reference librarians and with faculty regarding content recruitment, assuring
the application of interoperability standards for metadata;
assisting with ongoing development of an assessment program
for the digital repository; overseeing internal records management activities
in the cIRcle office, and developing and maintaining the
cIRcle web site.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21761/


[CODE4LIB] Job: Digitization Manager at Indiana University Bloomington

2015-07-07 Thread jobs
Digitization Manager
Indiana University Bloomington
Bloomington

Job Summary: Manages production digitization activities of the Lilly Library;
and hires, trains, and supervises appointed and temporary employees involved
in the digitization process. Coordinates the movement of Lilly Library
materials through in-house digitization workflows; and prepares and transports
Lilly Library materials to be digitized at the Wells Library or elsewhere.
Maintains records related to materials being digitized; and serves as hardware
and software administrator for digital imaging computers. Develops
digitization, workflow, and quality control procedures for text, photographic,
manuscript, and other materials.

  
Qualifications: Review your qualifications prior to applying to ensure that
you meet the minimum qualifications for the position. Resume and cover letter
required.

  
REQUIRED: Bachelor's degree in audio/visual art, informatics, computer
science, photography, or related field, two years of digital imaging
experience and one year of supervisory and/or training experience.

  
Expertise with the following: hardware/software setup, maintenance, and
troubleshooting; multiple computer platforms including UNIX, Windows, and
Macintosh OS; varied imaging technologies such as photographic equipment
including digital camera and lenses, video editing, imaging software, and file
conversions. Knowledge of: digital imaging software such as Adobe Photoshop,
sufficient to create and manipulate digital images; Optical Character
Recognition (OCR) software and tools; XML-editors; digital objects standards
and formats and associated best practices; and rare and fragile materials
handling and preservation practices. Familiarity with metadata and markup
standards such as EAD, MODS and TEI; and knowledge of and experience with
copystand and studio lighting.

  
Excellent interpersonal and communication skills, written and verbal including
ability to interact with a wide variety of individuals. Solid problem solving,
time, and project management skills; demonstrated ability to work
independently in a complex, rapidly changing environment; and excellent
attention to detail.

  
Physical Requirements: This position requires a full range of movement and can
be physically demanding. Individual must be able to stand for long periods of
time; lift heavy books or bulky or oversize items up to 25 lbs. on a regular
basis, and 50 lbs. occasionally; push and pull heavy book trucks; and bend,
stretch, stoop and climb. This position also requires the ability to work at a
computer many hours each day with much mental and visual concentration.

  
Preferred: Photography experience; some experience working in a special
collections library or archives; experience with digital library workflows.

  
Applications accepted until July 9, or until position is filled.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21760/


[CODE4LIB] Job: Paid Digitization Project Internship at Densho: The Japanese American Legacy Project

2015-07-07 Thread jobs
Paid Digitization Project Internship
Densho: The Japanese American Legacy Project
Seattle

Densho is a Seattle-based non-profit organization started in 1996, with the
initial goal of documenting oral histories from Japanese Americans who were
incarcerated during World War II. This evolved into a mission to educate,
preserve, collaborate and inspire action for equity. Densho uses digital
technology to preserve and make accessible primary source materials on the
World War II incarceration of Japanese Americans. We present these materials
and related resources for their historic value and as a means of exploring
issues of democracy, intolerance, wartime hysteria, civil rights and the
responsibilities of citizenship in our increasingly global society.

  
Job Description

  
Densho is seeking 2 to 3 interns to support our Digitizing Photos for Japanese
American Oral Histories Project through the following activities:
corresponding with narrators and their families to identify and collect
materials, digitizing the materials (newspapers, photographs, documents,
etc.), editing and performing quality control on digital images, transcribing
documents, cataloging, data management in digital repository software, and
other duties as needed.

  
The internships run from September 2015 through December 2015 or January 2016
at $12 an hour, no benefits.

  
Current students or recent graduates from library and information science or
archival studies programs preferred. The ideal candidates are self-directed,
detail-oriented, and comfortable with technology. Knowledge of Japanese
American history is welcome but not required. The internships are funded by an
award from 4Culture.

  
Apply

  
To apply for the position, please send your resume and cover letter to
caitlin.oiye(at)densho.org by Wednesday, August 12, 2015. All applications
will be held in confidence. All submissions and questions should be sent via
email - no phone inquiries please.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21751/


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Jason R Peak
In the case of XML, I think XPath is the simpler tool.
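
As a rough sketch of that XPath-style approach, the Python below pulls the
text of italic runs out of a .docx file's main XML part using only the
standard library. It rests on several assumptions: that the text lives in
word/document.xml inside the zip, that italics are set directly on the run
with a w:i element (italics inherited from a style would be missed), and the
function names are made up for illustration.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by the main document part of a .docx file.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W_NS}


def italic_runs(document_xml):
    """Return the text of each run (w:r) that has a direct italic flag (w:i)."""
    root = ET.fromstring(document_xml)
    found = []
    for run in root.iterfind(".//w:r", NS):
        flag = run.find("w:rPr/w:i", NS)
        if flag is None:
            continue  # no italic property on this run
        # An explicit w:val of 0/false/off switches italics off again.
        if flag.get("{%s}val" % W_NS) in ("0", "false", "off"):
            continue
        found.append("".join(t.text or "" for t in run.iterfind("w:t", NS)))
    return found


def italic_runs_in_docx(path):
    """Read word/document.xml from a .docx (a zip archive) and list italic runs."""
    with zipfile.ZipFile(path) as archive:
        return italic_runs(archive.read("word/document.xml"))
```

Usage would look something like italic_runs_in_docx("bibliography.docx"),
where the file name is hypothetical. A single title can be split across
several runs, so adjacent results may need joining.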


 Brian Zelip wrote 

Hi Matt.

Re: finding words in all caps, yes it's possible. See this SO answer to
help: http://stackoverflow.com/a/4255225/2145103

Re: italics, my hunch is that you could do so if you got hold of the xml
behind the word doc, which I'd assume would have something like an
`` tags or attribute values of `italic` in the markup.


good luck!

Brian Zelip

---

Emerging Technologies Librarian

Health Sciences & Human Services Library

University of Maryland, Baltimore

bze...@hshsl.umaryland.edu

410-706-8865


On Tue, Jul 7, 2015 at 11:56 AM, Matt Sherman 
wrote:

> Hi all,
>
> I am working my way through teaching myself regex to parse an annotated
> bibliography docx file and had a question, as I can't seem to get a succinct
> answer from Google.  Is it possible to have regex find words, or in this
> case names, displayed in all caps?  Similarly, is it possible to
> have regex find words, or in this case titles, that are italicized?  Given
> how the document is formatted, doing both would be nice so that I could
> parse them into a table or database, but I cannot find a clear answer on
> that. I am very new to regex, so this is probably jumping into the deep
> end.  Any answers are appreciated.
>
> Matt Sherman
>


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Matt Sherman
Thanks everyone, this really helps.  I'll have to work out the italicized
stuff, but this gets me much closer.

On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee 
wrote:

> Y'all are doing this the hard way. Word allows regex replacements as well
> as format based criteria.
>
> For this particular use case:
>
>    1. Open the find/replace dialog (Ctrl+H)
>    2. In the "Find what" box, put (<*>) -- make sure the option for "Use
>    Wildcards" is selected, and for the format, specify italic
>    3. For the "Replace with" box, just put \1 and specify All caps
>
> And you're done
>
> kyle
>
> On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel 
> wrote:
>
> >   Eric Phetteplace writes
> >
> > > You can match a string of all caps letters like "[A-Z]"
> >
> >   This works if you are limited to English. But in a multilingual
> >   setting, you need to watch out for other uppercases, such as
> >   крихель vs КРИХЕЛЬ. It then depends on the Unicode implementation
> >   of your regex application. In Perl, for example, you would use
> >   [[:upper:]].
> >
> >
> > --
> >
> >   Cheers,
> >
> >   Thomas Krichel  http://openlib.org/home/krichel
> >   skype:thomaskrichel
> >
>


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
Y'all are doing this the hard way. Word allows regex replacements as well
as format based criteria.

For this particular use case:

   1. Open the find/replace dialog (Ctrl+H)
   2. In the "Find what" box, put (<*>) -- make sure the option for "Use
   Wildcards" is selected, and for the format, specify italic
   3. For the "Replace with" box, just put \1 and specify All caps

And you're done
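
For anyone working outside Word, the steps above can be sketched roughly in
Python. This is an analogy, not what Word does internally: it assumes the
document has already been reduced to a list of (text, is_italic) pairs, which
is a made-up structure for illustration, and it uses \b(\w+)\b as an
approximate stand-in for the (<*>) wildcard pattern.

```python
import re

# Approximate analog of Word's (<*>): one whole word, captured as group 1.
WORD = re.compile(r"\b(\w+)\b")


def uppercase_italic_words(runs):
    """Upper-case every word in italic runs; leave other runs untouched.

    `runs` is a hypothetical list of (text, is_italic) tuples.
    """
    result = []
    for text, is_italic in runs:
        if is_italic:
            # The lambda plays the role of "\1 with the All caps format".
            text = WORD.sub(lambda match: match.group(1).upper(), text)
        result.append((text, is_italic))
    return result


runs = [("Smith, J. ", False), ("A tale of two cities", True)]
print(uppercase_italic_words(runs))
```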

kyle

On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel  wrote:

>   Eric Phetteplace writes
>
> > You can match a string of all caps letters like "[A-Z]"
>
>   This works if you are limited to English. But in a multilingual
>   setting, you need to watch out for other uppercases, such as
>   крихель vs КРИХЕЛЬ. It then depends on the Unicode implementation
>   of your regex application. In Perl, for example, you would use
>   [[:upper:]].
>
>
> --
>
>   Cheers,
>
>   Thomas Krichel  http://openlib.org/home/krichel
>   skype:thomaskrichel
>


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Brian Zelip
I think I figured out the all-caps need, see http://regexr.com/3bbfi

Cheers


bzelip

On Tue, Jul 7, 2015 at 12:32 PM, Thomas Krichel  wrote:

>   Eric Phetteplace writes
>
> > You can match a string of all caps letters like "[A-Z]"
>
>   This works if you are limited to English. But in a multilingual
>   setting, you need to watch out for other uppercases, such as
>   крихель vs КРИХЕЛЬ. It then depends on the Unicode implementation
>   of your regex application. In Perl, for example, you would use
>   [[:upper:]].
>
>
> --
>
>   Cheers,
>
>   Thomas Krichel  http://openlib.org/home/krichel
>   skype:thomaskrichel
>


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Thomas Krichel
  Eric Phetteplace writes

> You can match a string of all caps letters like "[A-Z]"

  This works if you are limited to English. But in a multilingual
  setting, you need to watch out for other uppercases, such as 
  крихель vs КРИХЕЛЬ. It then depends on the Unicode implementation
  of your regex application. In Perl, for example, you would use
  [[:upper:]].
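
A minimal sketch of the same point in Python rather than Perl (a swapped-in
language, chosen only for illustration). Python's built-in re module has no
[[:upper:]] class, so this version matches runs of Unicode letters and then
checks case with str.isupper(). The function name, the sample string, and the
min_length cutoff are all assumptions.

```python
import re

# Runs of Unicode letters: "word characters" minus digits and underscore.
LETTERS = re.compile(r"[^\W\d_]+")


def all_caps_words(text, min_length=2):
    """Return words written entirely in uppercase letters, in any script."""
    return [
        word
        for word in LETTERS.findall(text)
        if len(word) >= min_length and word.isupper()
    ]


sample = "KRICHEL, Thomas and КРИХЕЛЬ, Томас wrote to Matt"

# ASCII-only class: finds the Latin name but misses the Cyrillic one.
print(re.findall(r"\b[A-Z]{2,}\b", sample))

# Unicode-aware check: finds both.
print(all_caps_words(sample))
```

The min_length cutoff is there so that single capital letters such as
initials are not reported as all-caps names; drop it if those are wanted.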


-- 

  Cheers,

  Thomas Krichel  http://openlib.org/home/krichel
  skype:thomaskrichel


[CODE4LIB] Job: Curator of Manuscripts & Digital Projects at Bowling Green State University

2015-07-07 Thread jobs
Curator of Manuscripts & Digital Projects
Bowling Green State University
Bowling Green

The Curator of Manuscripts and Digital Projects provides the oversight and
coordination of digital projects for the Center for Archival Collections (CAC)
and participates in departmental digital initiatives. Develops and oversees
innovative, digital projects that enhance access, discovery and retrieval of
collections, and coordinates the promotion and preservation of digital
collections and manuscripts. Initiates and supervises archival processing and
description of manuscript collections. Provides reference services for
students, faculty, and off-campus researchers utilizing primary and secondary
research materials.

  * Coordinates and assists in the processing of manuscript collections; 
develops policies, procedures and guidelines for the accessioning, description, 
loan, handling and care of manuscript collections. Provides acquisition and 
deaccessioning recommendations, and participates in consultation activities for 
the conservation and preservation of manuscripts.
  * Develops and oversees innovative, collaborative digital projects for the 
CAC that enhance online access to primary source materials. Participates in 
departmental digital initiatives sometimes in a leadership role. Consults with 
the Head Librarian and Reference Archivist on selection and scope of digital 
projects.
  * Develops partnerships with key stakeholders on and off campus on projects 
that will promote, enhance and preserve the CAC's print and digital manuscript 
holdings. Serves as curator for outward facing exhibits. Researches, writes, 
and designs exhibition publications and grants. Assists in developing and 
implementing educational tours, programs, and activities including exhibition 
receptions and openings.
  * Provides reference services, including in-depth knowledge of the collection 
to assist patrons in discovering, accessing and effectively using the full 
range of resources available in the CAC and the University Libraries (UL). 
Writes new finding aids and updates existing ones. Facilitates the transfer of and service to 
organizational archives not within the scope of the National Student Affairs 
Archives.
  * Participates in public outreach activities of the Center for Archival 
Collections, including: providing tours and instructional presentations on 
CAC/UL holdings and their research value to BGSU classes and off-campus groups.
  * Assists in the selection, training, and supervision of student employees 
and interns.

Knowledge, Skills or Abilities

  * Strong technology skills including knowledge of digitization practices and 
Encoded Archival Description (EAD).
  * Strong oral and written communication skills.
  * Experience processing manuscript collections preferred.
  * Knowledge of best practices for managing and preserving digital collections.
  * Knowledge of archival management issues and trends including applications 
for organizing and managing archival research data.
  * Demonstrated presentation experience.
  * Demonstrated project management experience.
  * Demonstrated experience mounting exhibits.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21747/
To post a new job please visit http://jobs.code4lib.org/


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Ivan Goldsmith
Hi Matt, 

I don't know much about your docx file, but I've also recently been learning & 
using regular expressions, and I thought I'd send you a link to a handy tool in 
case you hadn't seen it yet: http://regexr.com/ 

I've found regexr extremely helpful while trying to create useful regular 
expressions. You can tweak your regular expression in regexr and instantly see 
the results. (They provide some default sample text to search, though you're 
free to type/paste in your own.) 
If you hover your cursor over pieces of the regular expression, hints pop up 
and tell you what each part of the expression does; I've found it useful for 
learning how regular expressions work. There's also a nice cheatsheet on the 
left, which sometimes cuts down on how much Googling you need to do. 

Also, in case this is potentially helpful... here is a regular expression that 
matches groups of two or more capital letters: http://regexr.com/3bbet 
Perhaps this will do the trick when searching for words that are in all caps? 
(I make no guarantees; you might need to fiddle with it a bit.) 
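
For anyone who would rather test this in a script than in the browser, here is a minimal Python sketch of the same idea. The pattern and the sample citation are my own assumptions for illustration, not necessarily what is behind the regexr link above.

```python
import re

# Two or more consecutive ASCII capitals, bounded on both sides so that
# the leading capital of a word like "Smith" is not picked up.
ALL_CAPS = re.compile(r"\b[A-Z]{2,}\b")

def find_all_caps(text):
    """Return every all-caps word of two or more letters in text."""
    return ALL_CAPS.findall(text)

# Made-up sample entry, for illustration only
sample = "SHERMAN, Matt. A Study of Regex. New York: ACME Press, 2015."
print(find_all_caps(sample))
```

Note that this only covers unaccented English capitals, and any all-caps word (such as a publisher acronym) will match alongside the names.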

As for searching for italicized words, I have no idea how to search for them 
unless they are surrounded by certain tags or signifiers. For instance, perhaps 
all italicized words are surrounded by tags like this: <i>Some Nice 
Title</i>. You could search for all phrases surrounded by those tags. But 
without a textual signifier like that, it's beyond me. 
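
If the text did carry such tags, the search could look something like the Python sketch below. It assumes the italic titles are wrapped in `<i>` and `</i>`, which is purely a guess about the markup; the sample string is made up.

```python
import re

# Assumption: italic titles are wrapped in <i>...</i>. The non-greedy
# (.*?) stops at the first closing tag, and DOTALL lets a title that
# wraps across lines still match.
ITALIC = re.compile(r"<i>(.*?)</i>", re.DOTALL)

def find_italic_titles(marked_up_text):
    """Return the text between each pair of <i> tags, whitespace collapsed."""
    return [" ".join(m.split()) for m in ITALIC.findall(marked_up_text)]

# Made-up sample, for illustration only
sample = "SHERMAN, Matt. <i>Some Nice\nTitle</i>. 2015. <i>Another One</i>."
print(find_italic_titles(sample))
```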

Best, 

-- Ivan Goldsmith 
Web Project Analyst 
University of Pennsylvania Libraries 


- Original Message -

From: "Matt Sherman"  
To: CODE4LIB@LISTSERV.ND.EDU 
Sent: Tuesday, July 7, 2015 11:56:15 AM 
Subject: [CODE4LIB] Regex Question 

Hi all, 

I am working my way through teaching myself regex to parse an annotated 
bibliography docx file and had a question as I can't seem to get a succinct 
answer from Google. Is it possible to have regex find words, or in this 
case names, displayed in all caps? Also similarly is it possible to 
have regex find words, or in this case titles, that are italicized? Given 
how the document is formatted doing both would be nice so that I could 
parse them into a table or database, but I cannot find a clear answer on 
that, though I am very new to regex so it is probably jumping into the deep 
end on this. Any answers are appreciated. 

Matt Sherman 


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Eric Phetteplace
Hi Matt!

You can match a string of all caps letters like "[A-Z]+". The brackets say
"match any one character inside," the hyphen indicates the full range of
capital letters, and the plus sign means "one or more in a row."
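
A quick way to see what a quantifier adds to a character class, sketched in Python (the sample string is made up):

```python
import re

text = "SMITH, John"

# A bare character class matches one capital letter at a time...
singles = re.findall(r"[A-Z]", text)
# ...while adding + matches each unbroken run of capitals.
runs = re.findall(r"[A-Z]+", text)

print(singles)
print(runs)
```

The second pattern still picks up the lone "J" from "John", so in practice you would probably want to require a minimum length or a word boundary as well.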

You cannot, unfortunately, match italics since that's formatting and not
text. Regex is really only meant for strings of characters and not their
formatting.

Lastly, I'd be remiss if I didn't point you to Bohyun Kim's nice intro to
regex: http://acrl.ala.org/techconnect/?p=3549

Good luck!
On Tue, Jul 7, 2015 at 08:56 Matt Sherman  wrote:

> Hi all,
>
> I am working my way through teaching myself regex to parse an annotated
> bibliography docx file and had a question as I can't seem to get a succinct
> answer from Google.  Is it possible to have regex find words, or in this
> case names, displayed in all caps?  Also similarly is it possible to
> have regex find words, or in this case titles, that are italicized?  Given
> how the document is formatted doing both would be nice so that I could
> parse them into a table or database, but I cannot find a clear answer on
> that, though I am very new to regex so it is probably jumping into the deep
> end on this.  Any answers are appreciated.
>
> Matt Sherman
>


Re: [CODE4LIB] Regex Question

2015-07-07 Thread Brian Zelip
Hi Matt.

Re: finding words in all caps, yes it's possible. See this SO answer to
help: http://stackoverflow.com/a/4255225/2145103

Re: italics, my hunch is that you could do so if you got hold of the xml
behind the word doc, which I'd assume would have something like an
`` tags or attribute values of `italic` in the markup.


good luck!

Brian Zelip

---

Emerging Technologies Librarian

Health Sciences & Human Services Library

University of Maryland, Baltimore

bze...@hshsl.umaryland.edu

410-706-8865


On Tue, Jul 7, 2015 at 11:56 AM, Matt Sherman 
wrote:

> Hi all,
>
> I am working my way through teaching myself regex to parse an annotated
> bibliography docx file and had a question as I can't seem to get a succinct
> answer from Google.  Is it possible to have regex find words, or in this
> case names, displayed in all caps?  Also similarly is it possible to
> have regex find words, or in this case titles, that are italicized?  Given
> how the document is formatted doing both would be nice so that I could
> parse them into a table or database, but I cannot find a clear answer on
> that, though I am very new to regex so it is probably jumping into the deep
> end on this.  Any answers are appreciated.
>
> Matt Sherman
>


[CODE4LIB] Regex Question

2015-07-07 Thread Matt Sherman
Hi all,

I am working my way through teaching myself regex to parse an annotated
bibliography docx file and had a question as I can't seem to get a succinct
answer from Google.  Is it possible to have regex find words, or in this
case names, displayed in all caps?  Also similarly is it possible to
have regex find words, or in this case titles, that are italicized?  Given
how the document is formatted doing both would be nice so that I could
parse them into a table or database, but I cannot find a clear answer on
that, though I am very new to regex so it is probably jumping into the deep
end on this.  Any answers are appreciated.

Matt Sherman


[CODE4LIB] Job: Rails Developer at University of Maryland, College Park

2015-07-07 Thread jobs
Rails Developer
University of Maryland, College Park
College Park

Reposted with updated deadline of **July 17, 2015**.

  
The central focus for this Rails Developer contract position will be standing
up our first Hydra framework application against Fedora 4 at the University of
Maryland Libraries.

  
--  
**Title**: Ruby on Rails Software Developer  
**Category**: Exempt, Contingent I (Hourly, 6 months contract renewable up to 
12 months)  
**Department**: Software Systems Development and Research  
**Benefit**: Eligible for Telework  
**Salary**: Up to $60/hr, rate will be determined based on experience  
  
As the largest university library system in the Washington D.C.-Baltimore
area, the University of Maryland Libraries serve more than 37,500 students and
4,200 faculty of the flagship College Park campus. The University of Maryland
Libraries share the teaching, learning and research goals of the university.
Its role as a key academic resource is evident in its service to the academic
community and its actionable strategic plan. Recent membership in the
Committee on Institutional Cooperation, a robust organization of Big Ten
member institutions, is particularly meaningful to the University Libraries
and will further propel the university's ascendancy in academic excellence.

  
The UMD Libraries are upgrading their Digital Collections and Asset Management
system to Fedora Commons Repository 4. The Software Developer will stand up
our first Ruby on Rails and Hydra framework based application that will serve
as the primary interface for staff administration of metadata and content in
the Fedora 4 repository. In addition to agile development
of the application, the developer will establish the initial code base, local
development environment, and RHEL/VM server deployment environment.

  
The UMD Libraries would like to leverage the growing number of Ruby on Rails
based applications that are created and supported by the community of open
source development for library applications. The software
developer will jumpstart adoption of Ruby on Rails as a core supported
technology among the permanent developer team who have Java and Fedora 4
experience. This involves documentation, training, and
support for the full software lifecycle for building Rails applications from
development through promotion to production systems.

  
Additional assignments will depend on the experience and availability of the
successful candidate, possibly including: development of a tool suite for
batch ingestion into the Fedora 4 repository; standing up a new instance of
the Avalon Media System; customization for an existing JRuby/Rails based
ArchivesSpace application for archives information management.

  
Required Qualifications:

  * Bachelor's Degree
  * Two years programming experience
  * Experience programming with the Ruby language and developing web 
applications using Ruby on Rails
  * Experience using standard software development tools for version control, 
integrated development, etc.
  * Excellent interpersonal skills
  * Excellent written and verbal communication skills

For the full position description, please go to
http://www.lib.umd.edu/hr/employment-opportunities/staff-faculty-positions.

  
**APPLICATIONS**: Electronic applications required. Please apply online at 
[https://ejobs.umd.edu/postings/34420](https://ejobs.umd.edu/postings/34420). 
No relocation assistance will be provided. You must be legally able to work in 
the United States; the University of Maryland Libraries will not sponsor 
individuals for employment. An application consists of a cover letter which 
includes the source of advertisement, a resume, and names/e-mail addresses of 
three references.  
  
Applications will be reviewed as they are received and accepted until **July
17, 2015**.

  
The University of Maryland, College Park, actively subscribes to a policy of
equal employment opportunity, and will not discriminate against any employee
or applicant because of race, age, sex, color, sexual orientation, physical or
mental disability, religion, ancestry or national origin, marital status,
genetic information, political affiliation, or gender identity and
expression. Minorities and women are encouraged to apply.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21743/
To post a new job please visit http://jobs.code4lib.org/