Re: [CODE4LIB] Regex Question

2015-07-07 Thread Ivan Goldsmith
Hi Matt, I don't know much about your docx file, but I've also recently been learning using regular expressions, and I thought I'd send you a link to a handy tool in case you hadn't seen it yet: http://regexr.com/ I've found regexr extremely helpful while trying to create useful regular

[CODE4LIB] Job: Curator of Manuscripts Digital Projects at Bowling Green State University

2015-07-07 Thread jobs
Curator of Manuscripts Digital Projects Bowling Green State University Bowling Green The Curator of Manuscripts and Digital Projects provides the oversight and coordination of digital projects for the Center for Archival Collections (CAC) and participates in departmental digital initiatives.

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Eric Phetteplace
Hi Matt! You can match a string of all caps letters like [A-Z]. Those brackets say match anything inside and the hyphen indicates the full range of capital letters. You cannot, unfortunately, match italics since that's formatting and not text. Regex is really only meant for strings of characters

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Brian Zelip
I think I figured out the all-caps need, see http://regexr.com/3bbfi Cheers bzelip On Tue, Jul 7, 2015 at 12:32 PM, Thomas Krichel kric...@openlib.org wrote: Eric Phetteplace writes You can match a string of all caps letters like [A-Z] This works if you are limited to English. But

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Jason R Peak
In the case of xml, I think xpath is the simpler tool. Brian Zelip wrote Hi Matt. Re: finding words in all caps, yes it's possible. See this SO answer to help: http://stackoverflow.com/a/4255225/2145103 Re: italics, my hunch is that you could do so if you got hold of the xml behind

[CODE4LIB] Job: Rails Developer at University of Maryland, College Park

2015-07-07 Thread jobs
Rails Developer University of Maryland, College Park College Park Reposted with updated deadline of **July 17, 2015**. The central focus for this Rails Developer contract position will be standing up our first Hydra framework application against Fedora 4 at the University of Maryland

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Brian Zelip
Hi Matt. Re: finding words in all caps, yes it's possible. See this SO answer to help: http://stackoverflow.com/a/4255225/2145103 Re: italics, my hunch is that you could do so if you got hold of the xml behind the word doc, which I'd assume would have something like an `italic` tags or attribute

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Matt Sherman
Thanks everyone, this really helps. I'll have to work out the italicized stuff, but this gets me much closer. On Tue, Jul 7, 2015 at 12:43 PM, Kyle Banerjee kyle.baner...@gmail.com wrote: Y'all are doing this the hard way. Word allows regex replacements as well as format based criteria. For

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
Y'all are doing this the hard way. Word allows regex replacements as well as format based criteria. For this particular use case: 1. Open the find/replace dialog (CTL+H) 2. In the Find what box, put (*) -- make sure the option for Use Wildcards is selected, and for the format, specify

[CODE4LIB] Job: Paid Digitization Project Internship at Densho: The Japanese American Legacy Project

2015-07-07 Thread jobs
Paid Digitization Project Internship Densho: The Japanese American Legacy Project Seattle Densho is a Seattle-based non-profit organization started in 1996, with the initial goal of documenting oral histories from Japanese Americans who were incarcerated during World War II. This evolved into a

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Gordon, Bonnie
OpenOffice Writer (or a similar program) may be useful for this. It would allow you to search by format while using a more controlled regular expression than MS Word's wildcards. -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Matt Sherman

[CODE4LIB] Job: Digitization Manager at Indiana University Bloomington

2015-07-07 Thread jobs
Digitization Manager Indiana University Bloomington Bloomington Job Summary: Manages production digitization activities of the Lilly Library; and hires, trains, and supervises appointed and temporary employees involved in the digitization process. Coordinates the movement of Lilly Library

[CODE4LIB] Job: Digital Repository Librarian at University of British Columbia

2015-07-07 Thread jobs
Digital Repository Librarian University of British Columbia Vancouver The Digital Repository Librarian is responsible for developing, supporting, and assessing recruitmentactivities as well as managing the content in cIRcle, UBC's open access digital repository. The role is alsoresponsible for

[CODE4LIB] Job: Senior Consultant at AVPreserve

2015-07-07 Thread jobs
Senior Consultant AVPreserve New York City AVPreserve is currently accepting applications for the role of Senior Consultant. Senior Consultants at AVPreserve are the face of the company, responsible for using their expertise and all available resources to contribute to envisioning, formulating,

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Katherine N. Deibel
To add on a few things that others have said in this thread: - Another good online regex tool is https://regex101.com/ I really like the testing tools it provides. - Although it's not exactly what you need, Word does have an ability to search by format (it's under the Select menu on the Home

Re: [CODE4LIB] Regex Question

2015-07-07 Thread Kyle Banerjee
For clarity, Word does regex, not just wildcards. It's not quite as complete as what you'd get with some other environments such as OpenOffice Writer since matching is lazy rather than greedy which can be a big deal depending on what you're doing and there are a couple other catches -- notably no