Re: [CODE4LIB] Regex Question
To add on a few things that others have said in this thread: - Another good online regex tool is https://regex101.com/ I really like the testing tools it provides. - Although it's not exactly what you need, Word does have an ability to search by format (it's under the Select menu on the Home tab of the ribbon). Kate Deibel, PhD | Web Applications Specialist Information Technology Services University of Washington Libraries http://staff.washington.edu/deibel -- "When Thor shows up, it's always deus ex machina." -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Matt Sherman Sent: Tuesday, July 7, 2015 9:45 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Regex Question Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee wrote: > Y'all are doing this the hard way. Word allows regex replacements as > well as format based criteria. > > For this particular use case: > >1. Open the find/replace dialog (CTL+H) >2. In the "Find what" box, put (<*>) -- make sure the option for "Use >Wildcards" is selected, and for the format, specify italic >3. For the"Replace box," just put \1 and specify All caps > > And you're done > > kyle > > On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel > wrote: > > > Eric Phetteplace writes > > > > > You can match a string of all caps letters like "[A-Z]" > > > > This works if you are limited to English. But in a multilingual > > setting, you need to watch out for other uppercases, such as > > крихель vs КРИХЕЛЬ. It then depends in the unicode implementation > > of your regex application. In Perl, for example, you would use > > [[:upper:]]. > > > > > > -- > > > > Cheers, > > > > Thomas Krichel http://openlib.org/home/krichel > > skype:thomaskrichel > > >
Re: [CODE4LIB] Regex Question
For clarity, Word does regex, not just wildcards. It's not quite as complete as what you'd get with some other environments such as OpenOffice Writer since matching is lazy rather than greedy which can be a big deal depending on what you're doing and there are a couple other catches -- notably no support for "|" -- but it's reasonably powerful. There is no regexp capability in Excel unless you're willing to use VBA. kyle On Tue, Jul 7, 2015 at 1:10 PM, Gordon, Bonnie wrote: > OpenOffice Writer (or a similar program) may be useful for this. It would > allow you to search by format while using a more controlled regular > expression than MS Word's wildcards. > > -Original Message- > From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of > Matt Sherman > Sent: Tuesday, July 07, 2015 12:45 PM > To: CODE4LIB@LISTSERV.ND.EDU > Subject: Re: [CODE4LIB] Regex Question > > Thanks everyone, this really helps. I'll have to work out the italicized > stuff, but this gets me much closer. > > On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee > wrote: > > > Y'all are doing this the hard way. Word allows regex replacements as > > well as format based criteria. > > > > For this particular use case: > > > >1. Open the find/replace dialog (CTL+H) > >2. In the "Find what" box, put (<*>) -- make sure the option for "Use > >Wildcards" is selected, and for the format, specify italic > >3. For the"Replace box," just put \1 and specify All caps > > > > And you're done > > > > kyle > > > > On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel > > wrote: > > > > > Eric Phetteplace writes > > > > > > > You can match a string of all caps letters like "[A-Z]" > > > > > > This works if you are limited to English. But in a multilingual > > > setting, you need to watch out for other uppercases, such as > > > крихель vs КРИХЕЛЬ. It then depends in the unicode implementation > > > of your regex application. In Perl, for example, you would use > > > [[:upper:]]. > > > > > > > > > -- > > > > > > Cheers, > > > > > > Thomas Krichel http://openlib.org/home/krichel > > > skype:thomaskrichel > > > > > >
[CODE4LIB] Job: Senior Consultant at AVPreserve
Senior Consultant AVPreserve New York City AVPreserve is currently accepting applications for the role of Senior Consultant. Senior Consultants at AVPreserve are the face of the company, responsible for using their expertise and all available resources to contribute to envisioning, formulating, planning, synthesizing, and implementing strategies and solutions in alignment with the goals and objectives of our clients and the AVPreserve vision. Senior Consultants are responsible for identifying opportunities for AVPreserve to support the goals and objectives of past, present and prospective clients, and are also expected to be a leader in the company, mentoring and developing AVPreserve staff and providing support on AVPreserve projects where needed, as well as a knowledge leader among colleagues and within professional organizations. This versatility and cross-fertilization among business lines is an important factor underlying AVPreserve's success. AVPreserve provides a wide array of services, and our team members require diverse skill-sets in order to contribute across a range of those offerings. At this time we are particularly looking for people who can immediately step into roles providing services in the areas of data management and governance, data migration, data flow, and data modeling; the implementation of digital preservation standards, policies, and best practices; and the identification and integration of digital asset management procedures and systems in both production and archival environments. Applicants should have 5 years experience in at least one of those areas as well as existing proficiency in the use of applicable hardware, software, and computing language(s), along with the willingness to gain expertise with additional skills as needed. At AVPreserve, you will find challenging and meaningful work through collaboration with talented and engaged colleagues and clients, tackling real- world problems through development and implementation of innovative solutions and strategies. You will work alongside others in a culture of continual improvement that supports and motivates constant growth. You will be contributing to the continued evolution of AVPreserve at an exciting time for the company and for the clients we serve. Over time, you will be given greater responsibilities, autonomy and the ability to be creative and innovative in your work, with AVPreserve as a platform for support and promotion in the attainment of your own goals. To Apply Email a PDF of a cover letter and resume to careers[at]avpreserve[dot]com. No phone calls please. General goals and objectives of this position include: * Work in support of the AVPreserve vision; * Be an active participant in establishing and maintaining AVPreserve as the premiere consulting and software development firm focused on leveraging a deep understanding of technology, information management, business, and people to advance the ways in which information is used for the benefit of individuals, organizations, and causes; * Provide the most effective and meaningful support possible to our clients; * Develop and retain clientele; * Take responsibility for planning, attaining, managing and reporting on AVPreserve budgetary goals; * Continually learn and consistently contribute to the AVPreserve skill-set and knowledge base; * Increase awareness of and promote the AVPreserve brand and services; and * Promote increased communication and collaboration within AVPreserve to aid in effective coordination and implementation of project management, marketing and revenue generation efforts. Specific responsibilities of this position include: * Contributing to the successful completion of consulting engagements, including but not limited to: * Researching, compiling, synthesizing and analyzing information; * Conducting interviews, surveys, assessments and other forms of discovery; * Leading the analysis and writing of project reports, including the creation of diagrams, spreadsheets and other supporting documents; * Development of specifications and guidelines; * Working with AVPreserve team members to generate project deliverables; * Preparing and making presentations; and * Leading meetings * Leading and managing projects, including but not limited to: * Developing and managing project timelines, schedules, milestones, deliverables, budgets and resource allocation; * Identifying resources required; * Managing work plan execution; * Tracking actual vs. budgeted hours and expenditures; * Overseeing project team members including subcontractors; * Serving as the client liaison; and * Familiarizing staff with projects and activities, training staff when necessary and delegating work so that staff are able to easily assist with completion of work. * Reporting on projects and activities including: * Regular meetings to discuss and present on business, projects and foreca
[CODE4LIB] Job: Digital Initiative Librarian/Assistant Professor at Idaho State University
Digital Initiative Librarian/Assistant Professor Idaho State University Pocatello Supports digital scholarship initiatives including: providing assistance in the digitization of special collections, managing existing digital library services, and informing the creation of new digital services. Reporting to the Head of Collection Management, the Digital Initiatives Librarian works with staff and faculty throughout the library to provide innovative digital library services. Key Responsibilities * Manage and coordinate digital library projects among faculty, librarians, and staff; develop project plans, general project documentation, specifications, workflows, and timelines for successful completion of digital library projects. * Supervise a classified and professional staff of 3 FTE to support library technology including workstations, servers, and other technology. Lead staff in the identification of digital resource needs and solutions. * Identify and lead new digital initiatives. * Assist with staff training related to digital collections and technologies. * Develop and implement new digital library services in partnership with staff across the libraries such as mobile applications or cloud based integrated library systems * Formulate policies, procedures and best practices for the production, management, and preservation of digital content and metadata, technical workflow, quality control, and associated intellectual property issues. * Assist in setting up new collections or content in the various service areas including institutional repository and digital archives; assist in improving workflows for these services as needed. * Provide technical expertise in digital applications and lead the evaluation and implementation of technologies related to the management and dissemination of digital content, e.g. digital asset management system, institutional repository, and online exhibition tools. * Leverage technical skills in managing content migration, sharing collections with aggregators, improving and normalizing encoding and descriptive practices * Engage with and stay current on digital trends, technologies, and industry standards to provide direction and leadership for the library in the rapidly changing digital environment. Maintains in-depth knowledge of the identification, assessment, and implementation of emerging technologies that strategically advance the Library's mission. * Contribute to the campus discussion about digital initiatives and information. * Participates in reference desk duty. Assists faculty, students, staff and other users on a one-on-one basis by answering specific information requests, offering research assistance and guiding users in the effective use of a broad range of information resources offered by the Library. * Serves as a bibliographer to designated academic departments and oversees the selection of materials in assigned subject areas. Assists in collection development and evaluation; monitors pertinent portions of the acquisitions budget; works to develop and maintain good relations with faculty members. * Participates in professional activities, including university and library committees and task forces, and attends professional meetings. * Participates in the Library's instruction program. Minimum Qualifications * M.L.S., M.L.I.S. or equivalent degree from an ALA-accredited program by date of hire * Demonstrated ability to plan, coordinate and manage existing digital library projects * Demonstrated knowledge or awareness of tools and technology standards typically used in digital library environments * Demonstrated experience with multiple computer platforms including UNIX/Linux, and Windows and associated hardware. * Demonstrated excellent written and oral communication skills * Demonstrated good organizational skills and an aptitude for analytical and detailed work * Demonstrated ability to work independently as well as collaboratively in a complex, rapidly changing and professionally diverse environment. * Demonstrated ability to participate in library faculty activities as needed such as public services, instruction, bibliography, and library committee work. * Demonstrated ability to represent the Library to appropriate campus units, to participate in professional activities and committee assignments within the Library and on campus, and to participate in local, regional, and national outreach and professional activities. Preferred Qualifications * Demonstrated experience with relational databases such as SQL, MySQL, SOLR AND MSAccess. * Demonstrated experience and knowledge of metadata and content standards i.e., MARC, EAD, Dublin Core, and AACR2, etc. * Familiarity with of digital objects in various formats (text, image, audio and/or video) and best practices and standards associated with each format * Demonstrated experience with markup languages suc
[CODE4LIB] Job: Video Preservation Engineer at Indiana University Bloomington
Video Preservation Engineer Indiana University Bloomington Bloomington The Video Preservation Engineer works in the IU media digitization facility operated by the Media Digitization and Preservation Initiative (MDPI). The Video Preservation Engineer is responsible for the preservation transfer (digitization) of analog videotape recordings including the Betamax, Hi8, 8mm, U-matic, VHS, and EIAJ formats, to preservation master digital files at the highest quality possible. This includes optimal playback of actively degrading media on highly obsolete formats, quality control procedures, alignment/calibration/manipulation of playback machines, and verification of signal chain performance. Manipulation of degrading media for preservation work requires the independent application of ethical principles while making subjective judgments related to accuracy and quality. The Engineer is also responsible for collecting extensive documentation on both the source recordings and the transfer process. This position will participate in research related to the development of methodologies and workflows for the digitization of particularly difficult formats such as EIAJ, gaining efficiencies within a 1:1 workflow, developing and using workflow management software, and performing quality control for the products of large-scale digitization. Qualifications: REQUIRED: Bachelora€™s degree required in telecommunications or related field and at least three years' experience with the playback of legacy video formats in a studio, post house, archive, or production setting including experience with analysis and diagnostic tools such as waveform monitors and vectorscopes. Strongly preferred experience with: a) video archiving or preservation transfer work, b) deteriorating or otherwise problematic tapes c) documenting video projects in detail, d) lengthy projects requiring meticulous attention to detail, e) application of preservation principles to digitization work. Well-developed critical viewing skills, knowledge of legacy video formats. Indiana University is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, age, sex, sexual orientation or identity, national origin, disability status, or protected veteran status. This institution is also a provider of ADA services. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21759/ To post a new job please visit http://jobs.code4lib.org/
Re: [CODE4LIB] Regex Question
OpenOffice Writer (or a similar program) may be useful for this. It would allow you to search by format while using a more controlled regular expression than MS Word's wildcards. -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Matt Sherman Sent: Tuesday, July 07, 2015 12:45 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Regex Question Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee wrote: > Y'all are doing this the hard way. Word allows regex replacements as > well as format based criteria. > > For this particular use case: > >1. Open the find/replace dialog (CTL+H) >2. In the "Find what" box, put (<*>) -- make sure the option for "Use >Wildcards" is selected, and for the format, specify italic >3. For the"Replace box," just put \1 and specify All caps > > And you're done > > kyle > > On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel > wrote: > > > Eric Phetteplace writes > > > > > You can match a string of all caps letters like "[A-Z]" > > > > This works if you are limited to English. But in a multilingual > > setting, you need to watch out for other uppercases, such as > > крихель vs КРИХЕЛЬ. It then depends in the unicode implementation > > of your regex application. In Perl, for example, you would use > > [[:upper:]]. > > > > > > -- > > > > Cheers, > > > > Thomas Krichel http://openlib.org/home/krichel > > skype:thomaskrichel > > >
[CODE4LIB] Job: Digital Repository Librarian at University of British Columbia
Digital Repository Librarian University of British Columbia Vancouver The Digital Repository Librarian is responsible for developing, supporting, and assessing recruitmentactivities as well as managing the content in cIRcle, UBC's open access digital repository. The role is alsoresponsible for managing the repository software and technical system administration of the cIRcle managing the development and implementation of content policies and procedures; ensuringcopyright compliance of repository content; and undertaking promotion of cIRcle and open access. TheDigital Repository Librarian is responsible for many of the day-to-day activities of cIRcle, includingliaising with reference librarians and with faculty regarding content recruitment, assuring the applicationof interoperability standards for metadata; assisting with ongoing development of an assessmentprogram for the digital repository; overseeing internal records management activities in the cIRcle office,and developing and maintaining the cIRcle web site. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21761/ To post a new job please visit http://jobs.code4lib.org/
[CODE4LIB] Job: Digitization Manager at Indiana University Bloomington
Digitization Manager Indiana University Bloomington Bloomington Job Summary: Manages production digitization activities of the Lilly Library; and hires, trains, and supervises appointed and temporary employees involved in the digitization process. Coordinates the movement of Lilly Library materials through in-house digitization workflows; and prepares and transports Lilly Library materials to be digitized at the Wells Library or elsewhere. Maintains records related to materials being digitized; and serves as hardware and software administrator for digital imaging computers. Develops digitization, workflow, and quality control procedures for text, photographic, manuscript, and other materials. Qualifications: Review your qualifications prior to applying to ensure that you meet the minimum qualifications for the position. Resume and cover letter required. REQUIRED: Bachelor's degree in audio/visual art, informatics, computer science, photography, or related field, two years of digital imaging experience and one year of supervisory and/or training experience. Expertise with the following: hardware/software setup, maintenance, and troubleshooting; multiple computer platforms including UNIX, Windows, and Macintosh OS; varied imaging technologies such as photographic equipment including digital camera and lenses, video editing, imaging software, and file conversions. Knowledge of: digital imaging software such as Adobe Photoshop, sufficient to create and manipulate digital images; Optical Character Recognition (OCR) software and tools; XML-editors; digital objects standards and formats and associated best practices; and rare and fragile materials handling and preservation practices. Familiarity with metadata and markup standards such as EAD, MODS and TEI; and knowledge of and experience with copystand and studio lighting. Excellent interpersonal and communication skills, written and verbal including ability to interact with a wide variety of individuals. Solid problem solving, time, and project management skills; demonstrated ability to work independently in a complex, rapidly changing environment; and excellent attention to detail. Physical Requirements: This position requires a full range of movement and can be physically demanding. Individual must be able to stand for long periods of time; lift heavy books or bulky or oversize items up to 25 lbs. on a regular basis, and 50 lbs. occasionally; push and pull heavy book trucks; and bend, stretch, stoop and climb. This position also requires the ability to work at a computer many hours each day with much mental and visual concentration. Preferred: Photography experience; some experience working in a special collections library or archives; experience with digital library workflows. Applications accepted until July 9, or until position is filled. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21760/ To post a new job please visit http://jobs.code4lib.org/
[CODE4LIB] Job: Paid Digitization Project Internship at Densho: The Japanese American Legacy Project
Paid Digitization Project Internship Densho: The Japanese American Legacy Project Seattle Densho is a Seattle-based non-profit organization started in 1996, with the initial goal of documenting oral histories from Japanese Americans who were incarcerated during World War II. This evolved into a mission to educate, preserve, collaborate and inspire action for equity. Densho uses digital technology to preserve and make accessible primary source materials on the World War II incarceration of Japanese Americans. We present these materials and related resources for their historic value and as a means of exploring issues of democracy, intolerance, wartime hysteria, civil rights and the responsibilities of citizenship in our increasingly global society. Job Description Densho is seeking 2 to 3 interns to support our Digitizing Photos for Japanese American Oral Histories Project through the following activities: corresponding with narrators and their families to identify and collect materials, digitizing the materials (newspapers, photographs, documents, etc.), editing and performing quality control on digital images, transcribing documents, cataloging, data management in digital repository software, and other duties as needed. The internships run from September 2015 through December 2015 or January 2016 at $12 an hour, no benefits. Current students or recent graduates from library and information science or archival studies programs preferred. The ideal candidates are self-directed, detail-oriented, and comfortable with technology. Knowledge of Japanese American history is welcome but not required. The internships are funded by an award from 4Culture. Apply To apply for the position, please send your resume and cover letter to caitlin.oiye(at)densho.org by Wednesday, August 12, 2015. All applications will be held in confidence. All submissions and questions should be sent via email - no phone inquiries please. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21751/ To post a new job please visit http://jobs.code4lib.org/
Re: [CODE4LIB] Regex Question
In the case of xml, I think xpath is the simpler tool. Brian Zelip wrote Hi Matt. Re: finding words in all caps, yes it's possible. See this SO answer to help: http://stackoverflow.com/a/4255225/2145103 Re: italics, my hunch is that you could do so if you got hold of the xml behind the word doc, which I'd assume would have something like an `` tags or attribute values of `italic` in the markup. good luck! Brian Zelip --- Emerging Technologies Librarian Health Sciences & Human Services Library University of Maryland, Baltimore bze...@hshsl.umaryland.edu 410-706-8865 On Tue, Jul 7, 2015 at 11:56 AM, Matt Sherman wrote: > Hi all, > > I am working my way through teaching myself regex to parse an annotated > bibliography docx file and had a question as I can't seem to get a succinct > answer from Google. Is it possible to have regex find words, or in the > case names, in displayed in all caps? Also similarly is it possible to > have regex find words, or in this case titles, that are italicized? Given > how the document is formatted doing both would be nice so that I could > parse them into a table or or database, but I cannot find a clear answer on > that, though I am very new to regex so it is probably jumping into the deep > end on this. Any answers are appreciated. > > Matt Sherman >
Re: [CODE4LIB] Regex Question
Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee wrote: > Y'all are doing this the hard way. Word allows regex replacements as well > as format based criteria. > > For this particular use case: > >1. Open the find/replace dialog (CTL+H) >2. In the "Find what" box, put (<*>) -- make sure the option for "Use >Wildcards" is selected, and for the format, specify italic >3. For the"Replace box," just put \1 and specify All caps > > And you're done > > kyle > > On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel > wrote: > > > Eric Phetteplace writes > > > > > You can match a string of all caps letters like "[A-Z]" > > > > This works if you are limited to English. But in a multilingual > > setting, you need to watch out for other uppercases, such as > > крихель vs КРИХЕЛЬ. It then depends in the unicode implementation > > of your regex application. In Perl, for example, you would use > > [[:upper:]]. > > > > > > -- > > > > Cheers, > > > > Thomas Krichel http://openlib.org/home/krichel > > skype:thomaskrichel > > >
Re: [CODE4LIB] Regex Question
Y'all are doing this the hard way. Word allows regex replacements as well as format based criteria. For this particular use case: 1. Open the find/replace dialog (CTL+H) 2. In the "Find what" box, put (<*>) -- make sure the option for "Use Wildcards" is selected, and for the format, specify italic 3. For the"Replace box," just put \1 and specify All caps And you're done kyle On Tue, Jul 7, 2015 at 9:32 AM, Thomas Krichel wrote: > Eric Phetteplace writes > > > You can match a string of all caps letters like "[A-Z]" > > This works if you are limited to English. But in a multilingual > setting, you need to watch out for other uppercases, such as > крихель vs КРИХЕЛЬ. It then depends in the unicode implementation > of your regex application. In Perl, for example, you would use > [[:upper:]]. > > > -- > > Cheers, > > Thomas Krichel http://openlib.org/home/krichel > skype:thomaskrichel >
Re: [CODE4LIB] Regex Question
I think I figured out the all-caps need, see http://regexr.com/3bbfi Cheers bzelip On Tue, Jul 7, 2015 at 12:32 PM, Thomas Krichel wrote: > Eric Phetteplace writes > > > You can match a string of all caps letters like "[A-Z]" > > This works if you are limited to English. But in a multilingual > setting, you need to watch out for other uppercases, such as > крихель vs КРИХЕЛЬ. It then depends in the unicode implementation > of your regex application. In Perl, for example, you would use > [[:upper:]]. > > > -- > > Cheers, > > Thomas Krichel http://openlib.org/home/krichel > skype:thomaskrichel >
Re: [CODE4LIB] Regex Question
Eric Phetteplace writes > You can match a string of all caps letters like "[A-Z]" This works if you are limited to English. But in a multilingual setting, you need to watch out for other uppercases, such as крихель vs КРИХЕЛЬ. It then depends in the unicode implementation of your regex application. In Perl, for example, you would use [[:upper:]]. -- Cheers, Thomas Krichel http://openlib.org/home/krichel skype:thomaskrichel
[CODE4LIB] Job: Curator of Manuscripts & Digital Projects at Bowling Green State University
Curator of Manuscripts & Digital Projects Bowling Green State University Bowling Green The Curator of Manuscripts and Digital Projects provides the oversight and coordination of digital projects for the Center for Archival Collections (CAC) and participates in departmental digital initiatives. Develops and oversees innovative, digital projects that enhance access, discovery and retrieval of collections, and coordinates the promotion and preservation of digital collections and manuscripts. Initiates and supervises archival processing and description of manuscript collections. Provides reference services for students, faculty, and off-campus researchers utilizing primary and secondary research materials. * Coordinates and assists in the processing of manuscript collections; develops policies, procedures and guidelines for the accessioning, description, loan, handling and care of manuscript collections. Provides acquisition and deaccessioning recommendations, and participates in consultation activities for the conservation and preservation of manuscripts. * Develops and oversees innovative, collaborative digital projects for the CAC that enhance online access to primary source materials. Participates in departmental digital initiatives sometimes in a leadership role. Consults with the Head Librarian and Reference Archivist on selection and scope of digital projects. * Develops partnerships with key stakeholders on and off campus on projects that will promote, enhance and preserve the CAC's print and digital manuscript holdings. Serves as curator for outward facing exhibits. Researches, writes, and designs exhibition publications and grants. Assists in developing and implementing educational tours, programs, and activities including exhibition receptions and openings. * Provides reference services, including in-depth knowledge of the collection to assist patrons in discovering, accessing and effectively using the full range of resources available in the CAC and the University Libraries (UL). Writes new and updates finding aids. Facilitates the transfer of and service to organizational archives not within the scope of the National Student Affairs Archives. * Participates in public outreach activities of the Center for Archival Collections, including: providing tours and instructional presentations on CAC/UL holdings and their research value to BGSU classes and off-campus groups. * Assists in the selection, training, and supervision of student employees and interns. Knowledge, Skills or Abilities * Strong technology skills including knowledge of digitization practices and Encoded Archival Description (EAD). * Strong oral and written communication skills. * Experience processing manuscript collections preferred. * Knowledge of best practices for managing and preserving digital collections. * Knowledge of archival management issues and trends including applications for organizing and managing archival research data. * Demonstrated presentation experience. * Demonstrated project management experience. * Demonstrated experience mounting exhibits. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21747/ To post a new job please visit http://jobs.code4lib.org/
Re: [CODE4LIB] Regex Question
Hi Matt, I don't know much about your docx file, but I've also recently been learning & using regular expressions, and I thought I'd send you a link to a handy tool in case you hadn't seen it yet: http://regexr.com/ I've found regexr extremely helpful while trying to create useful regular expressions. You can tweak your regular expression in regexr and instantly see the results. (They provide some default sample text to search, though you're free to type/paste in your own.) If you hover your cursor over pieces of the regular expression, hints pop up and tell you what each part of the expression does; I've found it useful for learning how regular expressions work. There's also a nice cheatsheet on the left, which sometimes cuts down on how much Googling you need to do. Also, in case this is potentially helpful... here is a regular expression that matches groups of two or more capital letters: http://regexr.com/3bbet Perhaps this will do the trick when searching for words that are in all caps? (I make no guarantees; you might need to fiddle with it a bit.) As for searching for italicized words, I have no idea how to search for them unless they are surrounded by certain tags or signifiers. For instance, perhaps all italicized words are surrounded by tags like this: Some Nice Title. You could search for all phrases surrounded by those tags. But without a textual signifier like that, it's beyond me. Best, -- Ivan Goldsmith Web Project Analyst University of Pennsylvania Libraries - Original Message - From: "Matt Sherman" To: CODE4LIB@LISTSERV.ND.EDU Sent: Tuesday, July 7, 2015 11:56:15 AM Subject: [CODE4LIB] Regex Question Hi all, I am working my way through teaching myself regex to parse an annotated bibliography docx file and had a question as I can't seem to get a succinct answer from Google. Is it possible to have regex find words, or in the case names, in displayed in all caps? Also similarly is it possible to have regex find words, or in this case titles, that are italicized? Given how the document is formatted doing both would be nice so that I could parse them into a table or or database, but I cannot find a clear answer on that, though I am very new to regex so it is probably jumping into the deep end on this. Any answers are appreciated. Matt Sherman
Re: [CODE4LIB] Regex Question
Hi Matt! You can match a string of all caps letters like "[A-Z]". Those brackets say "match anything inside" and the hyphen indicates the full range of capital letters. You cannot, unfortunately, match italics since that's formatting and not text. Regex is really only meant for strings of characters and not their formatting. Lastly, I'd be remiss if I didn't point you to Bohyun Kim's nice intro to regex: http://acrl.ala.org/techconnect/?p=3549 Good luck! On Tue, Jul 7, 2015 at 08:56 Matt Sherman wrote: > Hi all, > > I am working my way through teaching myself regex to parse an annotated > bibliography docx file and had a question as I can't seem to get a succinct > answer from Google. Is it possible to have regex find words, or in the > case names, in displayed in all caps? Also similarly is it possible to > have regex find words, or in this case titles, that are italicized? Given > how the document is formatted doing both would be nice so that I could > parse them into a table or or database, but I cannot find a clear answer on > that, though I am very new to regex so it is probably jumping into the deep > end on this. Any answers are appreciated. > > Matt Sherman >
Re: [CODE4LIB] Regex Question
Hi Matt. Re: finding words in all caps, yes it's possible. See this SO answer to help: http://stackoverflow.com/a/4255225/2145103 Re: italics, my hunch is that you could do so if you got hold of the xml behind the word doc, which I'd assume would have something like an `` tags or attribute values of `italic` in the markup. good luck! Brian Zelip --- Emerging Technologies Librarian Health Sciences & Human Services Library University of Maryland, Baltimore bze...@hshsl.umaryland.edu 410-706-8865 On Tue, Jul 7, 2015 at 11:56 AM, Matt Sherman wrote: > Hi all, > > I am working my way through teaching myself regex to parse an annotated > bibliography docx file and had a question as I can't seem to get a succinct > answer from Google. Is it possible to have regex find words, or in the > case names, in displayed in all caps? Also similarly is it possible to > have regex find words, or in this case titles, that are italicized? Given > how the document is formatted doing both would be nice so that I could > parse them into a table or or database, but I cannot find a clear answer on > that, though I am very new to regex so it is probably jumping into the deep > end on this. Any answers are appreciated. > > Matt Sherman >
[CODE4LIB] Regex Question
Hi all, I am working my way through teaching myself regex to parse an annotated bibliography docx file and had a question as I can't seem to get a succinct answer from Google. Is it possible to have regex find words, or in the case names, in displayed in all caps? Also similarly is it possible to have regex find words, or in this case titles, that are italicized? Given how the document is formatted doing both would be nice so that I could parse them into a table or or database, but I cannot find a clear answer on that, though I am very new to regex so it is probably jumping into the deep end on this. Any answers are appreciated. Matt Sherman
[CODE4LIB] Job: Rails Developer at University of Maryland, College Park
Rails Developer University of Maryland, College Park College Park Reposted with updated deadline of **July 17, 2015**. The central focus for this Rails Developer contract position will be standing up our first Hydra framework application against Fedora 4 at the University of Maryland Libraries. -- **Title**: Ruby on Rails Software Developer **Category**: Exempt, Contingent I (Hourly, 6 months contract renewable up to 12 months) **Department**: Software Systems Development and Research **Benefit**: Eligible for Telework **Salary**: Up to $60/hr, rate will be determined based on experience As the largest university library system in the Washington D.C.-Baltimore area, the University of Maryland Libraries serve more than 37,500 students and 4,200 faculty of the flagship College Park campus. The University of Maryland Libraries share the teaching, learning and research goals of the university. Its role as a key academic resource is evident in its service to the academic community and its actionable strategic plan. Recent membership in the Committee on Institutional Cooperation, a robust organization of Big Ten member institutions, is particularly meaningful to the University Libraries and will further propel the university's ascendancy in academic excellence. The UMD Libraries are upgrading their Digital Collections and Asset Management system to Fedora Commons Repository 4. The Software Developer will stand up our first Ruby on Rails and Hydra framework based application that will serve as the primary interface for staff administration of metadata and content in the Fedora 4 repository. In addition to agile development of the application, the developer will establish the initial code base, local development environment, and RHEL/VM server deployment environment. The UMD Libraries would like to leverage the growing number of Ruby on Rails based applications that are created and supported by the community of open source development for library applications. The software developer will jumpstart adoption of Ruby on Rails as a core supported technology among the permanent developer team who have Java and Fedora 4 experience. This involves documentation, training, and support for the full software lifecycle for building Rails applications from development through promotion to production systems. Additional assignments will depend on the experience and availability of the successful candidate, possibly including: development of a tool suite for batch ingestion into the Fedora 4 repository; standing up a new instance of the Avalon Media System; customization for an existing JRuby/Rails based ArchivesSpace application for archives information management. Required Qualifications: * Bachelor's Degree * Two years programming experience * Experience programming with the Ruby language and developing web applications using Ruby on Rails * Experience using standard software development tools for version control, integrated development, etc. * Excellent interpersonal skills * Excellent written and verbal communication skills For the full position description, please go to [http://www.lib.umd.edu/hr /employment-opportunities/staff-faculty-positions](http://www.lib.umd.edu/hr /employment-opportunities/staff-faculty-positions). **APPLICATIONS**: Electronic applications required. Please apply online at [https://ejobs.umd.edu/postings/34420](https://ejobs.umd.edu/postings/34420). No relocation assistance will be provided. You must be legally able to work in the United States; the University of Maryland Libraries will not sponsor individuals for employment. An application consists of a cover letter which includes the source of advertisement, a resume, and names/e-mail addresses of three references. Applications will be reviewed as they are received and accepted until **July 17, 2015**. The University of Maryland, College Park, actively subscribes to a policy of equal employment opportunity, and will not discriminate against any employee or applicant because of race, age, sex, color, sexual orientation, physical or mental disability, religion, ancestry or national origin, marital status, genetic information, political affiliation, or gender identity and expression. Minorities and women are encouraged to apply. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/21743/ To post a new job please visit http://jobs.code4lib.org/