Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th

2011-04-01 Thread Matt Jones
I also like this idea, and it also conflicts for my schedule, but I was
contemplating skipping my other commitment to attend this.  Hadn't decided
for sure, as I was waiting to see a few of the Virtual LT talk titles that
signed up (hah!).

Also, I am a bit of a newcomer to code4lib, having never attended the
conference, and coming from a related but somewhat different community
(environmental informatics).  I thought this might be a good way to hear a
little more about what is developing in the code4lib community.  I
considered giving a Virtual LT as well, but thought it best to hear a few
to gauge what might be of interest from my projects.

Matt

On Fri, Apr 1, 2011 at 2:48 PM, Edward M. Corrado wrote:

> I agree with Luciano that the lead time was a bit short for me. Well,
> maybe not specifically because it was short, but it does conflicts
> with something else I have to do and I don't have time to reschedule.
> I really like this idea and I hope it can be successful, so I hope
> this message brought a rash of sign-ups and it goes on, or it is
> rescheduled.
>
> Edward
>
> On Fri, Apr 1, 2011 at 6:42 PM, Luciano Ramalho 
> wrote:
> > On Fri, Apr 1, 2011 at 3:01 PM, Peter Murray 
> wrote:
> >> So far no one has signed up to present on Monday and only one person has
> signed up to attend.  It sounds like the idea of virtual lightning talks
> isn't going to fly.  If you have feedback (e.g., not interesting, not enough
> lead time to prepare, wrong time of day/week/year), I'd appreciate hearing
> it.
> >
> > As a Pythonista I am a huge fan of lightning talks, a staple of PyCons
> > all over the World.
> >
> > Virtual lightning talks is a novel idea to me, but sounds great.
> >
> > I'd be interested in attending and even presenting, but I think the
> > lead time was too short, particularly for an activity intended for
> > business hours (1:30pm EDT is 2:30pm BRT / UTC-3).
> >
> > How about trying again, but aiming at a date in late April?
> >
> > --
> > Luciano Ramalho
> > programador repentista || stand-up programmer
> > Twitter: @luciano
> >
>


Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th

2011-04-01 Thread Edward M. Corrado
I agree with Luciano that the lead time was a bit short for me. Well,
maybe not specifically because it was short, but it does conflicts
with something else I have to do and I don't have time to reschedule.
I really like this idea and I hope it can be successful, so I hope
this message brought a rash of sign-ups and it goes on, or it is
rescheduled.

Edward

On Fri, Apr 1, 2011 at 6:42 PM, Luciano Ramalho  wrote:
> On Fri, Apr 1, 2011 at 3:01 PM, Peter Murray  wrote:
>> So far no one has signed up to present on Monday and only one person has 
>> signed up to attend.  It sounds like the idea of virtual lightning talks 
>> isn't going to fly.  If you have feedback (e.g., not interesting, not enough 
>> lead time to prepare, wrong time of day/week/year), I'd appreciate hearing 
>> it.
>
> As a Pythonista I am a huge fan of lightning talks, a staple of PyCons
> all over the World.
>
> Virtual lightning talks is a novel idea to me, but sounds great.
>
> I'd be interested in attending and even presenting, but I think the
> lead time was too short, particularly for an activity intended for
> business hours (1:30pm EDT is 2:30pm BRT / UTC-3).
>
> How about trying again, but aiming at a date in late April?
>
> --
> Luciano Ramalho
> programador repentista || stand-up programmer
> Twitter: @luciano
>


Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th

2011-04-01 Thread Luciano Ramalho
On Fri, Apr 1, 2011 at 3:01 PM, Peter Murray  wrote:
> So far no one has signed up to present on Monday and only one person has 
> signed up to attend.  It sounds like the idea of virtual lightning talks 
> isn't going to fly.  If you have feedback (e.g., not interesting, not enough 
> lead time to prepare, wrong time of day/week/year), I'd appreciate hearing it.

As a Pythonista I am a huge fan of lightning talks, a staple of PyCons
all over the World.

Virtual lightning talks is a novel idea to me, but sounds great.

I'd be interested in attending and even presenting, but I think the
lead time was too short, particularly for an activity intended for
business hours (1:30pm EDT is 2:30pm BRT / UTC-3).

How about trying again, but aiming at a date in late April?

-- 
Luciano Ramalho
programador repentista || stand-up programmer
Twitter: @luciano


Re: [CODE4LIB] regexp for LCC?

2011-04-01 Thread Kelley McGrath
At one point, much to my surprise, someone told me that 050 is defined for
numbers assigned by LC not for LCC numbers per se. It doesn't really sound
like that from the current definition
(http://www.loc.gov/marc/bibliographic/bd050.html), but if you look on the
ITS page (http://www.itsmarc.com/crs/edit7592.htm), which I think is not
up-to-date, you'll see a discussion of "Pseudo call numbers and other forms
of LC call numbers"

As someone pointed out, only a very few classes start with three letters
(off the top of my head; a couple in D and a number in K; see
http://library.duke.edu/services/instruction/libraryguide/lcclass.html, but
there are more in K than are listed here).

The pseudo or shelf numbers I've seen most often in 050 are MLC and SD
(which unfortunately is the same as the class for forestry). Look for SD on
musical recording records (it used to really mess up the attempts of the
catalog where I used to work to facet music CDs on LC class; there were a
few other common ones, but I've forgotten).

Depending what you're doing, you might try to prefer a call number in 090 if
there is one. These are more likely to reflect local preference.

Looking up 090 (http://www.oclc.org/bibformats/en/0xx/090.shtm) produced
some other examples of non-LCC 050's: PAR, Newspaper, UNC, or NOT IN LC.

Good luck!

Kelley

***
Except now I wonder if those annoying MLCS call numbers might actually be
properly MATCHED by this regex, when I need em excluded. They are annoying
_similar_ to a classified call number. Well, one way to find out.

And the reason this matters is to try and use an LCC to map to a
'discipline' or other broad category, either directly from the LCC schedule
labels, or using a mapping like umich's:
http://www.lib.umich.edu/browse/categories/

But if it's not really an LCC at all, and you try to map it, you'll get bad
postings.

On 3/31/2011 1:03 PM, Jonathan Rochkind wrote:
>
> Thanks, that looks good!
>
> It's hosted on Google Code, but I don't think that code is anything 
> "Google uses", it looks like it's from our very own Bill Dueber.
>
> On 3/31/2011 12:38 PM, Tod Olson wrote:
>>
>> Check the regexp that Google uses in their call number normalization:
>>
>>        http://code.google.com/p/library-callnumber-lc/wiki/Home
>>
>> You may want to remove the prefix part, and allow for a fourth cutter.
>>
>> The folks at UNC pointed me to this a few months ago.
>>
>> -Tod
>>
>> On Mar 31, 2011, at 11:29 AM, Jonathan Rochkind wrote:
>>
>>> Does anyone have a good regular expression that will match all legal 
>>> LC Call Numbers from the LC Classified Schedule, but will generally 
>>> not match things that could not possibly be an LC Call Number from 
>>> the LC Classified Schedule?
>>>
>>> In particular, I need it to NOT match an "MLC" call number, which is 
>>> an LC assigned call number that shows up in an 050 with no way to 
>>> distinguish based on indicators, but isn't actually from the LC 
>>> Schedules.  Here's an example of an "MLC" call number:
>>>
>>> "MLCS 83/5180 (P)"
>>>
>>> Hmm, maybe all MLC call numbers begin with MLC, okay I guess I can 
>>> exclude them just like that. But it looks like there are also OTHER 
>>> things that can show up in the 050 but aren't actually from the 
>>> classified schedule, the OCLC documentation even contains an example 
>>> of "Microfilm 19072 E".
>>>
>>> What a mess, huh?  So, yeah, regex anyone?
>>>
>>> [You can probably guess why I care if it's from the LC Classified 
>>> Schedule or not].
>>
>> Tod Olson
>> Systems Librarian
>> University of Chicago Library
>>


[CODE4LIB] The Islandora project announces Q1 - RC1 Release

2011-04-01 Thread Kirsta Stapelfeldt
The Islandora project announces an RC1 candidate for our Q1 2011 release. We
invite the community to download and help us test this candidate in
anticipation of our full release.

Highlights: RC1 introduces a new solution pack architecture, which allows
for simplified content model installation. We have also added a tabs-based
UI for content models.

If you are upgrading from previous Islandora modules you *must* run
update.php in order to use this new release candidate.

Download the distribution from here:
https://github.com/islandora/islandora-dist or from
http://islandora.ca/download

List of commits to master:
https://github.com/Islandora/islandora/commits/master

List of resolved issues is available here:
https://jira.duraspace.org/browse/ISLANDORA/fixforversion/10350

Documentation available here:
https://wiki.duraspace.org/display/ISLANDORA/Islandora

Feedback on the release can be submitted at http://islandora.ca/contact.  We
are looking for members of the community willing to submit issues directly
to our JIRA ticketing system. If you are willing to provide feedback to the
project, please create an account at
https://jira.duraspace.org/secure/Signup!default.jspa and send your username
to kstapelfe...@upei.ca

The JIRA project is visible here:
https://jira.duraspace.org/browse/ISLANDORA

-- 
Kirsta Stapelfeldt, MA, MLIS
Islandora Project/Repository Manager
Robertson Library
University of Prince Edward Island
kstapelfe...@upei.ca
Skype Name: Kirsta.Stapelfeldt
902.620.5096


Re: [CODE4LIB] PDF to Word converter?

2011-04-01 Thread Carol Bean
Yeah - I just found out about the Save As trick from another list. In
Acrobat 9, which is what I'm using, it doesn't render much better than Word
Perfect (both have formatting issues, but they're different).  But I also
just learned from another post that Acrobat X does the job beautifully.

Re: Digital Asset Management, Yeah, I have a lot of ground to (re)cover and
post on, especially before I lose my audience of one! ;-) At least it's back
on the (current) project list. :-)

Thanks,
Carol

On Fri, Apr 1, 2011 at 2:06 PM, Nathan Tallman  wrote:

> This may only be available in Acrobat and not Reader, but I can choose File
> --> Save As and choose to save the PDF as a Word .doc file. Results are
> usually okay, but the formating is done with text boxes instead of the
> normal way.
>
> Off topic: How's your Digital Asset Management software comparison going? I
> always check your blog periodically for an update!
>
> Nathan Tallman
>
> On Fri, Apr 1, 2011 at 12:53 PM, Carol Bean  wrote:
>
> > I have WordPerfect X4, which does a questionable job of (natively)
> > converting pdf's, and from which one could then copy and paste the
> > questionable results into a Word document.
> >
> > Has anyone been down this road before (trying to find something better
> than
> > what WordPerfect does, and that works directly with Word)?  Or maybe
> > someone
> > can offer words of advice (or condolences...)?
> >
> > I am aware of these, but haven't tested any of them (recommendations or
> > pans
> > welcome):
> > http://www.investintech.com/
> > http://www.hellopdf.com/
> > http://www.anypdftools.com/
> > http://finereader.abbyy.com/full_feature_list/ocr_accuracy/
> > http://www.123fileconvert.com/microsoft_fileconverter_word.asp
> > http://www.pdfconverter.com/downloads/
> > http://www.somepdf.com/downloads.html
> > http://www.tenorshare.com/pdf-converter.html
> > http://www.docudesk.com/
> >
> > And also MS Word Marketplace listings:
> >
> >
> http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0
> >
> > And this Portable App:
> > http://www.pendriveapps.com/free-pdf-to-word-converter/
> >
> > Thanks,
> > Carol
> > --
> > Carol Bean
> > beanwo...@gmail.com
> >
>



-- 
Carol Bean
beanwo...@gmail.com


Re: [CODE4LIB] PDF to Word converter?

2011-04-01 Thread Nathan Tallman
This may only be available in Acrobat and not Reader, but I can choose File
--> Save As and choose to save the PDF as a Word .doc file. Results are
usually okay, but the formating is done with text boxes instead of the
normal way.

Off topic: How's your Digital Asset Management software comparison going? I
always check your blog periodically for an update!

Nathan Tallman

On Fri, Apr 1, 2011 at 12:53 PM, Carol Bean  wrote:

> I have WordPerfect X4, which does a questionable job of (natively)
> converting pdf's, and from which one could then copy and paste the
> questionable results into a Word document.
>
> Has anyone been down this road before (trying to find something better than
> what WordPerfect does, and that works directly with Word)?  Or maybe
> someone
> can offer words of advice (or condolences...)?
>
> I am aware of these, but haven't tested any of them (recommendations or
> pans
> welcome):
> http://www.investintech.com/
> http://www.hellopdf.com/
> http://www.anypdftools.com/
> http://finereader.abbyy.com/full_feature_list/ocr_accuracy/
> http://www.123fileconvert.com/microsoft_fileconverter_word.asp
> http://www.pdfconverter.com/downloads/
> http://www.somepdf.com/downloads.html
> http://www.tenorshare.com/pdf-converter.html
> http://www.docudesk.com/
>
> And also MS Word Marketplace listings:
>
> http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0
>
> And this Portable App:
> http://www.pendriveapps.com/free-pdf-to-word-converter/
>
> Thanks,
> Carol
> --
> Carol Bean
> beanwo...@gmail.com
>


Re: [CODE4LIB] Sign up to present at the Code4Lib Virtual Lightning Talks on April 4th

2011-04-01 Thread Peter Murray
Hey, folks --

So far no one has signed up to present on Monday and only one person has signed 
up to attend.  It sounds like the idea of virtual lightning talks isn't going 
to fly.  If you have feedback (e.g., not interesting, not enough lead time to 
prepare, wrong time of day/week/year), I'd appreciate hearing it.


Peter


On Mar 28, 2011, at 10:08 AM, Peter Murray wrote:
> 
> One of the highlights of the Code4Lib annual meeting is the “lightning talk” 
> rounds. A lightning talk is a fast-paced 5 minute talk on a topic of the 
> presenter’s choosing. They are usually scheduled on an ad-hoc, 
> first-come-first-served basis on the day of the event. They are an 
> opportunity to provide a platform for someone who is just getting started 
> with public speaking, who wants to ask a question or invite people to help 
> with a project, or for someone to boast about something he or she did or tell 
> a short cautionary story. These things are all interesting and worth talking 
> about, but there might not be enough to say about them to fill up a full 
> session timeslot.
> 
> “Virtual Lightning Talks” replicates this conference activity online in a 
> virtual meeting environment. Each one-hour block consists of 10 six-minute 
> sessions (one minute for the presenter to take control of the virtual meeting 
> environment and test audio followed by a five minute presentation). 
> Presenters show their work by sharing their entire desktop; the presentation 
> can consist of slides, web browser, command-line shell, or any other 
> application that can be shown on the desktop.
> 
> The first round will be on April 4th at 1:30pm Eastern U.S. Daylight Time.  
> You can read more information and sign up to be a presenter or attendee at:
> 
>  http://wiki.code4lib.org/index.php/Virtual_Lightning_Talks
> 
> 
> Peter


-- 
Peter Murray peter.mur...@lyrasis.orgtel:+1-678-235-2955
 
Ass't Director, Technology Services Development   http://dltj.org/about/
Lyrasis   --Great Libraries. Strong Communities. Innovative Answers.
The Disruptive Library Technology Jesterhttp://dltj.org/ 
Attrib-Noncomm-Share   http://creativecommons.org/licenses/by-nc-sa/2.5/ 


[CODE4LIB] Register now to ensure your spot at code4lib Northwest!

2011-04-01 Thread Kyle Banerjee
REGISTER NOW at http://www.surveymonkey.com/s/CS6KQ5F

The draft schedule is now posted at
http://sites.google.com/site/code4libnorthwest and there's a lot you won't
want to miss!

Learn how to start programming in Android, develop social applications, and
work with HTML5. Hear about projects throughout the PNW, and learn more
about automating operations, building repositories, and other topics.

The format is fun and fast. By keeping the group size low and presentations
short in a highly interactive environment, it's easy for busy people to
share their ideas so you get to learn about what really interests you. Plus,
the PNW code4lib folks are always a good bunch to hang out with (if you can,
I recommend attending the preconference activity which will be held at a
local watering hole).

EVENT INFORMATION:

When:  Monday June 13th, 2011
Start: 9 AM
End: 4:00 PM, with evening gathering for those interested at one of
Portland's many local pubs/establishments
Where:  White Stag, Portland, Oregon
Cost:  $60 for early registration which covers snacks, refreshments and
lunch.  Registrations after April 29 are $75.
Size:  Registration is capped at 60 participants.
Website: http://sites.google.com/site/code4libnorthwest

PRE-CONFERENCE ACTIVITIES

When:  Sunday, June 12th, 2011
Start: 6 PM
End:  ???
Where:  TBA

REGISTRATION DETAILS:

Early registration is $60 (after April 29, it becomes $75), and lunch as
well as refreshments throughout the day will be provided. The price has
risen due to increased costs for putting on the conference.

QUESTIONS/COMMENTS:

If you have questions, you can post them to the Google Group at
http://groups.google.com/group/pnwcode4lib or send them to the code4lib NW
coordinators (Mike Flakus mfla...@pdx.edu and Doug Eriksen
eriks...@seattleu.edu)

-- 
--
Kyle Banerjee
Digital Services Program Manager
Orbis Cascade Alliance
baner...@uoregon.edu / 503.877.9773


Re: [CODE4LIB] PDF to Word converter?

2011-04-01 Thread Cheryl Kohen
I know you're looking for PDF to Word, but just as an fyi, I recently used
Office Convert to convert PDF to JPG and it worked very well:
http://download.cnet.com/Office-Convert-PDF-to-JPG-JPEG-TIFF-Free/3000-10743_4-10900900.html

-Cheryl

On Fri, Apr 1, 2011 at 12:53 PM, Carol Bean  wrote:

> I have WordPerfect X4, which does a questionable job of (natively)
> converting pdf's, and from which one could then copy and paste the
> questionable results into a Word document.
>
> Has anyone been down this road before (trying to find something better than
> what WordPerfect does, and that works directly with Word)?  Or maybe
> someone
> can offer words of advice (or condolences...)?
>
> I am aware of these, but haven't tested any of them (recommendations or
> pans
> welcome):
> http://www.investintech.com/
> http://www.hellopdf.com/
> http://www.anypdftools.com/
> http://finereader.abbyy.com/full_feature_list/ocr_accuracy/
> http://www.123fileconvert.com/microsoft_fileconverter_word.asp
> http://www.pdfconverter.com/downloads/
> http://www.somepdf.com/downloads.html
> http://www.tenorshare.com/pdf-converter.html
> http://www.docudesk.com/
>
> And also MS Word Marketplace listings:
>
> http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0
>
> And this Portable App:
> http://www.pendriveapps.com/free-pdf-to-word-converter/
>
> Thanks,
> Carol
> --
> Carol Bean
> beanwo...@gmail.com
>



-- 
Cheryl Kohen | Emerging Technology Librarian | Daytona State College


[CODE4LIB] PDF to Word converter?

2011-04-01 Thread Carol Bean
I have WordPerfect X4, which does a questionable job of (natively)
converting pdf's, and from which one could then copy and paste the
questionable results into a Word document.

Has anyone been down this road before (trying to find something better than
what WordPerfect does, and that works directly with Word)?  Or maybe someone
can offer words of advice (or condolences...)?

I am aware of these, but haven't tested any of them (recommendations or pans
welcome):
http://www.investintech.com/
http://www.hellopdf.com/
http://www.anypdftools.com/
http://finereader.abbyy.com/full_feature_list/ocr_accuracy/
http://www.123fileconvert.com/microsoft_fileconverter_word.asp
http://www.pdfconverter.com/downloads/
http://www.somepdf.com/downloads.html
http://www.tenorshare.com/pdf-converter.html
http://www.docudesk.com/

And also MS Word Marketplace listings:
http://office.microsoft.com/en-us/downloads/downloads-marketplace-categories-FX102300516.aspx?Category=CL102416887&CTT=5&origin=HA001168333#last=;Index=0

And this Portable App:
http://www.pendriveapps.com/free-pdf-to-word-converter/

Thanks,
Carol
-- 
Carol Bean
beanwo...@gmail.com


Re: [CODE4LIB] MARC magic for file

2011-04-01 Thread Owen Stephens
"I'm sure any decent MARC tool can deal with them, since decent MARC tools
are certainly going to be forgiving enough to deal with four characters that
apparently don't even really matter."

You say that, but I'm pretty sure Marc4J throws errors MARC records where
these characters are incorrect

Owen

On Fri, Apr 1, 2011 at 3:51 AM, William Denton  wrote:

> On 28 March 2011, Ford, Kevin wrote:
>
>  I couldn't get Simon's MARC 21 Magic file to work.  Among other issues, I
>> received "line too long" errors.  But, since I've been curious about this
>> for sometime, I figured I'd take a whack at it myself.  Try this:
>>
>
> This is very nice!  Thanks.  I tried it on a bunch of MARC files I have,
> and it recognized almost all of them.  A few it didn't, so I had a closer
> look, and they're invalid.
>
> For example, the Internet Archive's Binghamton catalogue dump:
>
> http://ia600307.us.archive.org/6/items/marc_binghamton_univ/
>
> $ file -m marc.magic bgm*mrc
> bgm_openlib_final_0-5.mrc: data
> bgm_openlib_final_10-15.mrc:   MARC Bibliographic
> bgm_openlib_final_15-18.mrc:   data
> bgm_openlib_final_5-10.mrc:MARC Bibliographic
>
> But why?  Aha:
>
> $ head -c 25 bgm_openlib_final_*mrc
> ==> bgm_openlib_final_0-5.mrc <==
> 01812cas  2200457   45x00
> ==> bgm_openlib_final_10-15.mrc <==
> 01008nam  2200289ua 45000
> ==> bgm_openlib_final_15-18.mrc <==
> 01614cam00385   45  0
> ==> bgm_openlib_final_5-10.mrc <==
> 00887nam  2200265v  45000
>
> As you say, the leader should end with 4500 (as defined at
> http://www.loc.gov/marc/authority/adleader.html) but two of those files
> don't.  So they're not valid MARC.  I'm sure any decent MARC tool can deal
> with them, since decent MARC tools are certainly going to be forgiving
> enough to deal with four characters that apparently don't even really
> matter.
>
> So on the one hand they're usable MARC but file wouldn't say so, and on the
> other that's a good indication that the files have failed a basic validity
> test.  I wonder if there are similar situations for JPEGs or MP3s.
>
> I think you should definitely submit this for inclusion in the magic file.
> It would be very useful for us all!
>
> Bill
>
> P.S. I'd never used head -c (to show a fixed number of bytes) before.
> Always nice to find a new useful option to an old command.
>
>
>  #
>> # MARC 21 Magic  (Second cut)
>>
>> # Set at position 0
>> 0   short   >0x
>>
>> # leader ends with 4500
>>
>>> 20  string  4500
>>>
>>
>> # leader starts with 5 digits, followed by codes specific to MARC format
>>
>>> 0   regex/1 (^[0-9]{5})[acdnp][^bhlnqsu-z]  MARC Bibliographic
 0   regex/1 (^[0-9]{5})[acdnosx][z] MARC Authority
 0   regex/1 (^[0-9]{5})[cdn][uvxy]  MARC Holdings
 0   regex/1 (^[0-9]{5})[acdn][w]MARC Classification
 0   regex/1 (^[0-9]{5})[cdn][q] MARC Community

>>>
>
> --
> William Denton, Toronto : miskatonic.org www.frbr.org openfrbr.org
>



-- 
Owen Stephens
Owen Stephens Consulting
Web: http://www.ostephens.com
Email: o...@ostephens.com