Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th
I also like this idea, and it also conflicts for my schedule, but I was contemplating skipping my other commitment to attend this. Hadn't decided for sure, as I was waiting to see a few of the Virtual LT talk titles that signed up (hah!). Also, I am a bit of a newcomer to code4lib, having never attended the conference, and coming from a related but somewhat different community (environmental informatics). I thought this might be a good way to hear a little more about what is developing in the code4lib community. I considered giving a Virtual LT as well, but thought it best to hear a few to gauge what might be of interest from my projects. Matt On Fri, Apr 1, 2011 at 2:48 PM, Edward M. Corrado wrote: > I agree with Luciano that the lead time was a bit short for me. Well, > maybe not specifically because it was short, but it does conflicts > with something else I have to do and I don't have time to reschedule. > I really like this idea and I hope it can be successful, so I hope > this message brought a rash of sign-ups and it goes on, or it is > rescheduled. > > Edward > > On Fri, Apr 1, 2011 at 6:42 PM, Luciano Ramalho > wrote: > > On Fri, Apr 1, 2011 at 3:01 PM, Peter Murray > wrote: > >> So far no one has signed up to present on Monday and only one person has > signed up to attend. It sounds like the idea of virtual lightning talks > isn't going to fly. If you have feedback (e.g., not interesting, not enough > lead time to prepare, wrong time of day/week/year), I'd appreciate hearing > it. > > > > As a Pythonista I am a huge fan of lightning talks, a staple of PyCons > > all over the World. > > > > Virtual lightning talks is a novel idea to me, but sounds great. > > > > I'd be interested in attending and even presenting, but I think the > > lead time was too short, particularly for an activity intended for > > business hours (1:30pm EDT is 2:30pm BRT / UTC-3). > > > > How about trying again, but aiming at a date in late April? > > > > -- > > Luciano Ramalho > > programador repentista || stand-up programmer > > Twitter: @luciano > > >
Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th
I agree with Luciano that the lead time was a bit short for me. Well, maybe not specifically because it was short, but it does conflicts with something else I have to do and I don't have time to reschedule. I really like this idea and I hope it can be successful, so I hope this message brought a rash of sign-ups and it goes on, or it is rescheduled. Edward On Fri, Apr 1, 2011 at 6:42 PM, Luciano Ramalho wrote: > On Fri, Apr 1, 2011 at 3:01 PM, Peter Murray wrote: >> So far no one has signed up to present on Monday and only one person has >> signed up to attend. It sounds like the idea of virtual lightning talks >> isn't going to fly. If you have feedback (e.g., not interesting, not enough >> lead time to prepare, wrong time of day/week/year), I'd appreciate hearing >> it. > > As a Pythonista I am a huge fan of lightning talks, a staple of PyCons > all over the World. > > Virtual lightning talks is a novel idea to me, but sounds great. > > I'd be interested in attending and even presenting, but I think the > lead time was too short, particularly for an activity intended for > business hours (1:30pm EDT is 2:30pm BRT / UTC-3). > > How about trying again, but aiming at a date in late April? > > -- > Luciano Ramalho > programador repentista || stand-up programmer > Twitter: @luciano >
Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th
On Fri, Apr 1, 2011 at 3:01 PM, Peter Murray wrote: > So far no one has signed up to present on Monday and only one person has > signed up to attend. It sounds like the idea of virtual lightning talks > isn't going to fly. If you have feedback (e.g., not interesting, not enough > lead time to prepare, wrong time of day/week/year), I'd appreciate hearing it. As a Pythonista I am a huge fan of lightning talks, a staple of PyCons all over the World. Virtual lightning talks is a novel idea to me, but sounds great. I'd be interested in attending and even presenting, but I think the lead time was too short, particularly for an activity intended for business hours (1:30pm EDT is 2:30pm BRT / UTC-3). How about trying again, but aiming at a date in late April? -- Luciano Ramalho programador repentista || stand-up programmer Twitter: @luciano
Re: [CODE4LIB] regexp for LCC?
At one point, much to my surprise, someone told me that 050 is defined for numbers assigned by LC not for LCC numbers per se. It doesn't really sound like that from the current definition (http://www.loc.gov/marc/bibliographic/bd050.html), but if you look on the ITS page (http://www.itsmarc.com/crs/edit7592.htm), which I think is not up-to-date, you'll see a discussion of "Pseudo call numbers and other forms of LC call numbers" As someone pointed out, only a very few classes start with three letters (off the top of my head; a couple in D and a number in K; see http://library.duke.edu/services/instruction/libraryguide/lcclass.html, but there are more in K than are listed here). The pseudo or shelf numbers I've seen most often in 050 are MLC and SD (which unfortunately is the same as the class for forestry). Look for SD on musical recording records (it used to really mess up the attempts of the catalog where I used to work to facet music CDs on LC class; there were a few other common ones, but I've forgotten). Depending what you're doing, you might try to prefer a call number in 090 if there is one. These are more likely to reflect local preference. Looking up 090 (http://www.oclc.org/bibformats/en/0xx/090.shtm) produced some other examples of non-LCC 050's: PAR, Newspaper, UNC, or NOT IN LC. Good luck! Kelley *** Except now I wonder if those annoying MLCS call numbers might actually be properly MATCHED by this regex, when I need em excluded. They are annoying _similar_ to a classified call number. Well, one way to find out. And the reason this matters is to try and use an LCC to map to a 'discipline' or other broad category, either directly from the LCC schedule labels, or using a mapping like umich's: http://www.lib.umich.edu/browse/categories/ But if it's not really an LCC at all, and you try to map it, you'll get bad postings. On 3/31/2011 1:03 PM, Jonathan Rochkind wrote: > > Thanks, that looks good! > > It's hosted on Google Code, but I don't think that code is anything > "Google uses", it looks like it's from our very own Bill Dueber. > > On 3/31/2011 12:38 PM, Tod Olson wrote: >> >> Check the regexp that Google uses in their call number normalization: >> >> http://code.google.com/p/library-callnumber-lc/wiki/Home >> >> You may want to remove the prefix part, and allow for a fourth cutter. >> >> The folks at UNC pointed me to this a few months ago. >> >> -Tod >> >> On Mar 31, 2011, at 11:29 AM, Jonathan Rochkind wrote: >> >>> Does anyone have a good regular expression that will match all legal >>> LC Call Numbers from the LC Classified Schedule, but will generally >>> not match things that could not possibly be an LC Call Number from >>> the LC Classified Schedule? >>> >>> In particular, I need it to NOT match an "MLC" call number, which is >>> an LC assigned call number that shows up in an 050 with no way to >>> distinguish based on indicators, but isn't actually from the LC >>> Schedules. Here's an example of an "MLC" call number: >>> >>> "MLCS 83/5180 (P)" >>> >>> Hmm, maybe all MLC call numbers begin with MLC, okay I guess I can >>> exclude them just like that. But it looks like there are also OTHER >>> things that can show up in the 050 but aren't actually from the >>> classified schedule, the OCLC documentation even contains an example >>> of "Microfilm 19072 E". >>> >>> What a mess, huh? So, yeah, regex anyone? >>> >>> [You can probably guess why I care if it's from the LC Classified >>> Schedule or not]. >> >> Tod Olson >> Systems Librarian >> University of Chicago Library >>
[CODE4LIB] The Islandora project announces Q1 - RC1 Release
The Islandora project announces an RC1 candidate for our Q1 2011 release. We invite the community to download and help us test this candidate in anticipation of our full release. Highlights: RC1 introduces a new solution pack architecture, which allows for simplified content model installation. We have also added a tabs-based UI for content models. If you are upgrading from previous Islandora modules you *must* run update.php in order to use this new release candidate. Download the distribution from here: https://github.com/islandora/islandora-dist or from http://islandora.ca/download List of commits to master: https://github.com/Islandora/islandora/commits/master List of resolved issues is available here: https://jira.duraspace.org/browse/ISLANDORA/fixforversion/10350 Documentation available here: https://wiki.duraspace.org/display/ISLANDORA/Islandora Feedback on the release can be submitted at http://islandora.ca/contact. We are looking for members of the community willing to submit issues directly to our JIRA ticketing system. If you are willing to provide feedback to the project, please create an account at https://jira.duraspace.org/secure/Signup!default.jspa and send your username to kstapelfe...@upei.ca The JIRA project is visible here: https://jira.duraspace.org/browse/ISLANDORA -- Kirsta Stapelfeldt, MA, MLIS Islandora Project/Repository Manager Robertson Library University of Prince Edward Island kstapelfe...@upei.ca Skype Name: Kirsta.Stapelfeldt 902.620.5096
Re: [CODE4LIB] PDF to Word converter?
Yeah - I just found out about the Save As trick from another list. In Acrobat 9, which is what I'm using, it doesn't render much better than Word Perfect (both have formatting issues, but they're different). But I also just learned from another post that Acrobat X does the job beautifully. Re: Digital Asset Management, Yeah, I have a lot of ground to (re)cover and post on, especially before I lose my audience of one! ;-) At least it's back on the (current) project list. :-) Thanks, Carol On Fri, Apr 1, 2011 at 2:06 PM, Nathan Tallman wrote: > This may only be available in Acrobat and not Reader, but I can choose File > --> Save As and choose to save the PDF as a Word .doc file. Results are > usually okay, but the formating is done with text boxes instead of the > normal way. > > Off topic: How's your Digital Asset Management software comparison going? I > always check your blog periodically for an update! > > Nathan Tallman > > On Fri, Apr 1, 2011 at 12:53 PM, Carol Bean wrote: > > > I have WordPerfect X4, which does a questionable job of (natively) > > converting pdf's, and from which one could then copy and paste the > > questionable results into a Word document. > > > > Has anyone been down this road before (trying to find something better > than > > what WordPerfect does, and that works directly with Word)? Or maybe > > someone > > can offer words of advice (or condolences...)? > > > > I am aware of these, but haven't tested any of them (recommendations or > > pans > > welcome): > > http://www.investintech.com/ > > http://www.hellopdf.com/ > > http://www.anypdftools.com/ > > http://finereader.abbyy.com/full_feature_list/ocr_accuracy/ > > http://www.123fileconvert.com/microsoft_fileconverter_word.asp > > http://www.pdfconverter.com/downloads/ > > http://www.somepdf.com/downloads.html > > http://www.tenorshare.com/pdf-converter.html > > http://www.docudesk.com/ > > > > And also MS Word Marketplace listings: > > > > > http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0 > > > > And this Portable App: > > http://www.pendriveapps.com/free-pdf-to-word-converter/ > > > > Thanks, > > Carol > > -- > > Carol Bean > > beanwo...@gmail.com > > > -- Carol Bean beanwo...@gmail.com
Re: [CODE4LIB] PDF to Word converter?
This may only be available in Acrobat and not Reader, but I can choose File --> Save As and choose to save the PDF as a Word .doc file. Results are usually okay, but the formating is done with text boxes instead of the normal way. Off topic: How's your Digital Asset Management software comparison going? I always check your blog periodically for an update! Nathan Tallman On Fri, Apr 1, 2011 at 12:53 PM, Carol Bean wrote: > I have WordPerfect X4, which does a questionable job of (natively) > converting pdf's, and from which one could then copy and paste the > questionable results into a Word document. > > Has anyone been down this road before (trying to find something better than > what WordPerfect does, and that works directly with Word)? Or maybe > someone > can offer words of advice (or condolences...)? > > I am aware of these, but haven't tested any of them (recommendations or > pans > welcome): > http://www.investintech.com/ > http://www.hellopdf.com/ > http://www.anypdftools.com/ > http://finereader.abbyy.com/full_feature_list/ocr_accuracy/ > http://www.123fileconvert.com/microsoft_fileconverter_word.asp > http://www.pdfconverter.com/downloads/ > http://www.somepdf.com/downloads.html > http://www.tenorshare.com/pdf-converter.html > http://www.docudesk.com/ > > And also MS Word Marketplace listings: > > http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0 > > And this Portable App: > http://www.pendriveapps.com/free-pdf-to-word-converter/ > > Thanks, > Carol > -- > Carol Bean > beanwo...@gmail.com >
Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th
Hey, folks -- So far no one has signed up to present on Monday and only one person has signed up to attend. It sounds like the idea of virtual lightning talks isn't going to fly. If you have feedback (e.g., not interesting, not enough lead time to prepare, wrong time of day/week/year), I'd appreciate hearing it. Peter On Mar 28, 2011, at 10:08 AM, Peter Murray wrote: > > One of the highlights of the Code4Lib annual meeting is the “lightning talk” > rounds. A lightning talk is a fast-paced 5 minute talk on a topic of the > presenter’s choosing. They are usually scheduled on an ad-hoc, > first-come-first-served basis on the day of the event. They are an > opportunity to provide a platform for someone who is just getting started > with public speaking, who wants to ask a question or invite people to help > with a project, or for someone to boast about something he or she did or tell > a short cautionary story. These things are all interesting and worth talking > about, but there might not be enough to say about them to fill up a full > session timeslot. > > “Virtual Lightning Talks” replicates this conference activity online in a > virtual meeting environment. Each one-hour block consists of 10 six-minute > sessions (one minute for the presenter to take control of the virtual meeting > environment and test audio followed by a five minute presentation). > Presenters show their work by sharing their entire desktop; the presentation > can consist of slides, web browser, command-line shell, or any other > application that can be shown on the desktop. > > The first round will be on April 4th at 1:30pm Eastern U.S. Daylight Time. > You can read more information and sign up to be a presenter or attendee at: > > http://wiki.code4lib.org/index.php/Virtual_Lightning_Talks > > > Peter -- Peter Murray peter.mur...@lyrasis.orgtel:+1-678-235-2955 Ass't Director, Technology Services Development http://dltj.org/about/ Lyrasis --Great Libraries. Strong Communities. Innovative Answers. The Disruptive Library Technology Jesterhttp://dltj.org/ Attrib-Noncomm-Share http://creativecommons.org/licenses/by-nc-sa/2.5/
[CODE4LIB] Register now to ensure your spot at code4lib Northwest!
REGISTER NOW at http://www.surveymonkey.com/s/CS6KQ5F The draft schedule is now posted at http://sites.google.com/site/code4libnorthwest and there's a lot you won't want to miss! Learn how to start programming in Android, develop social applications, and work with HTML5. Hear about projects throughout the PNW, and learn more about automating operations, building repositories, and other topics. The format is fun and fast. By keeping the group size low and presentations short in a highly interactive environment, it's easy for busy people to share their ideas so you get to learn about what really interests you. Plus, the PNW code4lib folks are always a good bunch to hang out with (if you can, I recommend attending the preconference activity which will be held at a local watering hole). EVENT INFORMATION: When: Monday June 13th, 2011 Start: 9 AM End: 4:00 PM, with evening gathering for those interested at one of Portland's many local pubs/establishments Where: White Stag, Portland, Oregon Cost: $60 for early registration which covers snacks, refreshments and lunch. Registrations after April 29 are $75. Size: Registration is capped at 60 participants. Website: http://sites.google.com/site/code4libnorthwest PRE-CONFERENCE ACTIVITIES When: Sunday, June 12th, 2011 Start: 6 PM End: ??? Where: TBA REGISTRATION DETAILS: Early registration is $60 (after April 29, it becomes $75), and lunch as well as refreshments throughout the day will be provided. The price has risen due to increased costs for putting on the conference. QUESTIONS/COMMENTS: If you have questions, you can post them to the Google Group at http://groups.google.com/group/pnwcode4lib or send them to the code4lib NW coordinators (Mike Flakus mfla...@pdx.edu and Doug Eriksen eriks...@seattleu.edu) -- -- Kyle Banerjee Digital Services Program Manager Orbis Cascade Alliance baner...@uoregon.edu / 503.877.9773
Re: [CODE4LIB] PDF to Word converter?
I know you're looking for PDF to Word, but just as an fyi, I recently used Office Convert to convert PDF to JPG and it worked very well: http://download.cnet.com/Office-Convert-PDF-to-JPG-JPEG-TIFF-Free/3000-10743_4-10900900.html -Cheryl On Fri, Apr 1, 2011 at 12:53 PM, Carol Bean wrote: > I have WordPerfect X4, which does a questionable job of (natively) > converting pdf's, and from which one could then copy and paste the > questionable results into a Word document. > > Has anyone been down this road before (trying to find something better than > what WordPerfect does, and that works directly with Word)? Or maybe > someone > can offer words of advice (or condolences...)? > > I am aware of these, but haven't tested any of them (recommendations or > pans > welcome): > http://www.investintech.com/ > http://www.hellopdf.com/ > http://www.anypdftools.com/ > http://finereader.abbyy.com/full_feature_list/ocr_accuracy/ > http://www.123fileconvert.com/microsoft_fileconverter_word.asp > http://www.pdfconverter.com/downloads/ > http://www.somepdf.com/downloads.html > http://www.tenorshare.com/pdf-converter.html > http://www.docudesk.com/ > > And also MS Word Marketplace listings: > > http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0 > > And this Portable App: > http://www.pendriveapps.com/free-pdf-to-word-converter/ > > Thanks, > Carol > -- > Carol Bean > beanwo...@gmail.com > -- Cheryl Kohen | Emerging Technology Librarian | Daytona State College
[CODE4LIB] PDF to Word converter?
I have WordPerfect X4, which does a questionable job of (natively) converting pdf's, and from which one could then copy and paste the questionable results into a Word document. Has anyone been down this road before (trying to find something better than what WordPerfect does, and that works directly with Word)? Or maybe someone can offer words of advice (or condolences...)? I am aware of these, but haven't tested any of them (recommendations or pans welcome): http://www.investintech.com/ http://www.hellopdf.com/ http://www.anypdftools.com/ http://finereader.abbyy.com/full_feature_list/ocr_accuracy/ http://www.123fileconvert.com/microsoft_fileconverter_word.asp http://www.pdfconverter.com/downloads/ http://www.somepdf.com/downloads.html http://www.tenorshare.com/pdf-converter.html http://www.docudesk.com/ And also MS Word Marketplace listings: http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0 And this Portable App: http://www.pendriveapps.com/free-pdf-to-word-converter/ Thanks, Carol -- Carol Bean beanwo...@gmail.com
Re: [CODE4LIB] MARC magic for file
"I'm sure any decent MARC tool can deal with them, since decent MARC tools are certainly going to be forgiving enough to deal with four characters that apparently don't even really matter." You say that, but I'm pretty sure Marc4J throws errors MARC records where these characters are incorrect Owen On Fri, Apr 1, 2011 at 3:51 AM, William Denton wrote: > On 28 March 2011, Ford, Kevin wrote: > > I couldn't get Simon's MARC 21 Magic file to work. Among other issues, I >> received "line too long" errors. But, since I've been curious about this >> for sometime, I figured I'd take a whack at it myself. Try this: >> > > This is very nice! Thanks. I tried it on a bunch of MARC files I have, > and it recognized almost all of them. A few it didn't, so I had a closer > look, and they're invalid. > > For example, the Internet Archive's Binghamton catalogue dump: > > http://ia600307.us.archive.org/6/items/marc_binghamton_univ/ > > $ file -m marc.magic bgm*mrc > bgm_openlib_final_0-5.mrc: data > bgm_openlib_final_10-15.mrc: MARC Bibliographic > bgm_openlib_final_15-18.mrc: data > bgm_openlib_final_5-10.mrc:MARC Bibliographic > > But why? Aha: > > $ head -c 25 bgm_openlib_final_*mrc > ==> bgm_openlib_final_0-5.mrc <== > 01812cas 2200457 45x00 > ==> bgm_openlib_final_10-15.mrc <== > 01008nam 2200289ua 45000 > ==> bgm_openlib_final_15-18.mrc <== > 01614cam00385 45 0 > ==> bgm_openlib_final_5-10.mrc <== > 00887nam 2200265v 45000 > > As you say, the leader should end with 4500 (as defined at > http://www.loc.gov/marc/authority/adleader.html) but two of those files > don't. So they're not valid MARC. I'm sure any decent MARC tool can deal > with them, since decent MARC tools are certainly going to be forgiving > enough to deal with four characters that apparently don't even really > matter. > > So on the one hand they're usable MARC but file wouldn't say so, and on the > other that's a good indication that the files have failed a basic validity > test. I wonder if there are similar situations for JPEGs or MP3s. > > I think you should definitely submit this for inclusion in the magic file. > It would be very useful for us all! > > Bill > > P.S. I'd never used head -c (to show a fixed number of bytes) before. > Always nice to find a new useful option to an old command. > > > # >> # MARC 21 Magic (Second cut) >> >> # Set at position 0 >> 0 short >0x >> >> # leader ends with 4500 >> >>> 20 string 4500 >>> >> >> # leader starts with 5 digits, followed by codes specific to MARC format >> >>> 0 regex/1 (^[0-9]{5})[acdnp][^bhlnqsu-z] MARC Bibliographic 0 regex/1 (^[0-9]{5})[acdnosx][z] MARC Authority 0 regex/1 (^[0-9]{5})[cdn][uvxy] MARC Holdings 0 regex/1 (^[0-9]{5})[acdn][w]MARC Classification 0 regex/1 (^[0-9]{5})[cdn][q] MARC Community >>> > > -- > William Denton, Toronto : miskatonic.org www.frbr.org openfrbr.org > -- Owen Stephens Owen Stephens Consulting Web: http://www.ostephens.com Email: o...@ostephens.com