Aereo[1] makes clear that viewing streamed material is a performance .
«In 1976 Congress amended the Copyright Act in large part to reject the
Court's holdings in *Fortnightly* and *Teleprompter.* See H.R.Rep. No.
94-1476, pp. 86-87 (1976) (hereinafter H.R. Rep.) (The 1976 amendments
completely
There are published papers on MD5 collisions, with associated examples.
Researchers at http://isi.jhu.edu are quite likely to have read and
downloaded them.
E. G.
http://www.forensicfocus.com/Content/pid=87/page=2/
On Oct 3, 2014 3:05 PM, Alexander Duryee alexanderdur...@gmail.com
wrote:
Simon
Intel skylake processors have dedicated sha instructions.
See: https://software.intel.com/en-us/articles/intel-sha-extensions
Using a tree hash approach (which is inherently embarrassingly parallel)
will leave io time dominant. This approach is used by Amazon glacier - see
SHA-256 is authorized up to SECRET.
SHA-384+ is required for TOP SECRET.
Algorithms approved for more stringent requirements such as FOUO-SCI (SES
Covering-up Incompetence) have not been revealed, though CARPA has funded
research into plausible repudiability.
.
We use SHA 512 for most work. We don't do finance or national security, so
it is a good fit for us.
Cary
On Oct 2, 2014, at 12:30 PM, Simon Spero sesunc...@gmail.com wrote:
Intel skylake processors have dedicated sha instructions.
See: https://software.intel.com/en-us/articles/intel-sha
Was this rushed due to the ALA meeting?
https://www.youtube.com/watch?v=DLNI0AND4rU
THIS IS SLIGHTLY DIFFERENT THAN WHAT WE DISCUSSED
On Sun, Jun 29, 2014 at 6:03 AM, Rebecca Dillmeier rdillme...@gmail.com
wrote:
On Jun 28, 2014 11:00 PM, CODE4LIB automatic digest system
lists...@listserv.nd.edu wrote:
There are 8 messages totaling 378 lines in this issue.
Topics of
Two strings denote the same public identity if both names are inten*t*ionally
a linguistic or orthographic variant of each other and the public identity
identifiers the same party.
Intentionality is important because of cases like Ian Banks and Ian M
Banks, which are different public identities
On Mon, Jun 9, 2014 at 10:30 AM, Eric Hellman e...@hellman.net wrote:
I assume you've discovered the word is usually spelled administrivia.
It's fun making up words. My latest is bibliopotheosis
It's also handy when the OED adds them too - less need to update local
spell check files on new
Who died and made you boss?
Not this thread.
On Wed, Jun 4, 2014 at 5:08 PM, Adam Wead amsterda...@gmail.com wrote:
Do it in the next 20 minutes and we’ll throw in 10 karma points from Zoia,
ABSOLUTELY FREE
…adam
On Jun 4, 2014, at 17:05, Michael J. Giarlo leftw...@alumni.rutgers.edu
Log analysis can sometimes tell you more than a survey; however it requires
clickstream analysis, and limited cognitive modeling.
User studies with eye tracking can be very revealing ( especially if you
trust a model like EZ-reader to proxy for cognitive load ).
EEG can also give very useful
This book might be useful (it's a year old)
Anonymizing Health Data http://shop.oreilly.com/product/0636920029229.do
Case Studies and Methods to Get You Started
By Khaled El Emam, Luk Arbuckle
http://shop.oreilly.com/product/0636920029229.do#tab_03_0
Publisher: O'Reilly Media
Released: December
I thought IT support for NCSU scavenger hunts involved electronic crow
lures
On Wed, May 28, 2014 at 8:48 AM, Scherbak, Loren scherb...@si.edu wrote:
I much prefer the new format, but I see I am in the minority. I get the
digest and cannot filter on Job.
user-agent: Microsoft-MacOutlook/14.4.1.140326
==
You ought to be able to create a rule (filter) using these
, Simon Spero sesunc...@gmail.com
wrote:
On Wed, May 28, 2014 at 8:48 AM, Scherbak, Loren scherb...@si.edu
wrote:
I much prefer the new format, but I see I am in the minority. I get
the
digest and cannot filter on Job.
user-agent: Microsoft-MacOutlook/14.4.1.140326
/**about.me
http://www.about.me/matthewmckinley*
On Wed, May 28, 2014 at 8:34 AM, Simon Spero
sesunc...@gmail.com
wrote:
On Wed, May 28, 2014 at 8:48 AM, Scherbak, Loren
scherb...@si.edu
wrote:
I much prefer the new format, but I see I am
s4T2573P017562
V/R,
Simon Spero
(I think...At least an Identity Crisis is better than a Civil War)
On May 16, 2014 3:46 PM, Andreas Orphanides akorp...@ncsu.edu wrote:
THIS IS SLIGHTLY DIFFERENT THAN WHAT WE DISCUSSED.
This has meme potential.
(Over a screen grab of the bibframe homepage?)
(royt photoshopped into a picture of people climbing Ben Nevis?)
Is the Selenium php driver any use?
https://github.com/facebook/php-webdriver
On May 16, 2014 4:54 PM, Karen Coombs librarywebc...@gmail.com wrote:
I'm trying to mock HTTP requests for a code library I'm working on by using
the PHP-VCR library.
https://github.com/php-vcr/php-vcr
I'm having
FTW!
[It's all fun and games till someone ends up in a cone]
https://drupal.org/project/bad_judgement
On Thu, May 8, 2014 at 12:07 PM, Joe Hourcle
onei...@grace.nascom.nasa.govwrote:
On May 8, 2014, at 11:35 AM, Ben Brumfield wrote:As this is an an actual
LISTSERV(tm) mailing list, it's possible for the list owner to define
'topics', and then for people to set up their subscription to exclude
educational activities such as the conference, mailing list,
website,
IRC
channel, etc.). So a 501(c)(3) seems like a better fit to me.
-Esme
On 04/13/2014, at 12:57 PM, Simon Spero sesunc...@gmail.com
wrote:
[Note that 501 (c)(6) only applies to membership
Whilst it might be possible that code4lib might qualify under IRC
501(c)(3) it is also possible that code4lib might be a professional
organization under IRC 501(c)(6) .
6. An organization formed to stimulate the development and
free interchange of information pertaining to systems and
[Note that 501 (c)(6) only applies to membership organizations]
On Sun, Apr 13, 2014 at 12:55 PM, Simon Spero sesunc...@gmail.com wrote:
Whilst it might be possible that code4lib might qualify under IRC
501(c)(3) it is also possible that code4lib might be a professional
organization under
the the advancement of education category (i.e. to support
educational activities such as the conference, mailing list, website, IRC
channel, etc.). So a 501(c)(3) seems like a better fit to me.
-Esme
On 04/13/2014, at 12:57 PM, Simon Spero sesunc...@gmail.com wrote:
[Note that 501 (c)(6) only applies
If DEATH were holding the severed head of MARC I would get this as a prison
tattoo.
On Wed, Apr 9, 2014 at 5:20 PM, Jay Gattuso jay.gatt...@dia.govt.nz wrote:
Luckily we made the graphic in such a way we can easily change the
text Any of the text.
The maker is on leave exploring Europe,
Bob Jones University library?
On Fri, Mar 28, 2014 at 10:30 PM, Andreas Orphanides akorp...@ncsu.eduwrote:
Be sure to specify WHICH Greenville. Greenville NC isn't exactly central,
but people might not be paying attention.
On Fri, Mar 28, 2014 at 10:15 PM, Kevin S. Clarke
On Mon, Mar 31, 2014 at 12:45 PM, Kevin S. Clarke kscla...@gmail.comwrote:
rsinger++
Runs until next Monday at ~11:45PM PDT
Though I am a bit curious why an East coast meeting gets a PDT deadline...
:-)
Says the man who leaves the N out of UNC
I would structure the book by task, showing how different languages would
implement the same task.
For example,
using a marc parsing library in java, groovy, python, ruby, perl,
c/c++/objective c, Haskell.
Implementing same.
Using a rest API
Implementing a rest API
Doing statistical analysis
Alfresco uses apache tika to extract exif metadata from images. The tika
plugin to support is on github at https://github.com/Alfresco/tika-exiftool
.
oh.
On Dec 17, 2013 4:55 PM, Edward Summers e...@pobox.com wrote:
I remember hearing somewhere that ExifTool is pretty good for extracting
I've did some mining of DDC/LCSH correlations from the scriblio.net LC
files (Dec 2006 - ~ 7m records) as an experiment a long time ago; I should
see if I can find the right repo.
I was looking at 1st 6XX - 082 associations as evidence for or against
LCSH syndetic structure. I didn't do
Atlassian Crowd (if you have jira or confluence you may already have this
licensed).
SocialAuth (supports, but relies on oauth sources).
Apache Shiro (requires external account provider- see demo).
Shibboleth?
On Dec 1, 2013 4:09 PM, LeVan,Ralph le...@oclc.org wrote:
OCLC Research is
On Dec 1, 2013 6:42 PM, Joe Hourcle onei...@grace.nascom.nasa.gov wrote:
So that you don't screw up web proxies, you have to specify the 'Vary'
header to tell which parameters you consider significant so that it knows
what is or isn't cacheable.
I believe that if a Vary isn't specified, and the
Seth (and commenters) -
The basic point is sound, but there are some important issues that are
averted or are elided in the original article in order to make the
underlying point more clearly.
1: It should be quite clear that there is no need to develop an API for
the sole purpose of
[If a library building is open for all but a few hours, but contains
servers, it is bad practice for cleaning staff to shutoff building power
when finishing their shifts. Not that this has ever happened, especially
not at UNC]
There are some generalizations in the data shown on your blog post
http?X and https?X are different URIs. You may fetch a document containing
a serialized graph using TLS but that is quite separate from the URIs that
may be used as identifiers.
In fact, the denotation of a IRI used to name a graph is explicitly
unspecified (I am having to craft a semantics for
Anyone thought about doing a code4lib in Asheville?
What about Raleigh?
:-P
On Nov 12, 2013 8:42 PM, Kevin S. Clarke kscla...@gmail.com wrote:
I'd be interested. I'm in Boone... not too far a drive. :)
Kevin
On Nov 12, 2013 6:35 PM, Riley Childs ri...@tfsgeo.com wrote:
Is anyone in
I brought marc4j under ivy; I haven't bothered with the maven artifact
generation tasks, but that would be the way to go.
On Nov 13, 2013 6:16 PM, Kevin S. Clarke kscla...@gmail.com wrote:
On Wed, Nov 13, 2013 at 6:04 PM, Robert Haschart rh...@virginia.edu
wrote:
If you mean re-organize the
On Tue, Jul 30, 2013 at 11:07 AM, Marc Chantreux m...@unistra.fr wrote:
On Tue, Jul 30, 2013 at 10:27:01AM -0400, Mark A. Matienzo wrote:
i don't know why we're not talking about Haskell
I did to tell there is a lack of libraries and it is not as convenient
as perl when it comes to use
On Sun, Apr 28, 2013 at 2:43 AM, Kyle Banerjee kyle.baner...@gmail.comwrote:
Every choice people make is about loss. Equipment, optics, lighting,
you name it. But for some reason, the instant we're talking about bits of
data
on a disk, people plan as though capacity were unlimited when most
I cast Arrow's Impossibility Theorem on the ogre closest to the paladin...
For a forced-choice selection between two options, First Past The Post is
equivalent to instant run-off with no RON available.
A system that satisfies many useful properties, (one of which is there's
code out there
[Since you're getting good performance using a relational database, these
may not be necessary for you, but since I've been looking at some of the
tricks I've used in my own code to see how they can be fitted into the
revived marc4j project, I thought I'd write them down]
If the Tennant/Dean
There are several known algorithms for Secret Sharing - see
http://en.wikipedia.org/wiki/Secret_sharing
Simon
On Tue, Mar 5, 2013 at 1:35 PM, Joe Hourcle
onei...@grace.nascom.nasa.govwrote:
On Mar 5, 2013, at 8:29 AM, Adam Constabaris wrote:
An option is to use a password management program
On Thu, Feb 14, 2013 at 12:59 PM, Andreas Orphanides akorp...@ncsu.eduwrote:
Nuh-uh, remember that whole Reformation thing?
You nailed *what* to the door?
XMP uses a subset of RDF/XML, with a few limitations thrown in to make
reification and provenance tracking impossible, but hey who needs metadata.
I'm not sure if XSLT is particularly well suited to anything, but it ought
to be possible to cruft something up. I would still recommend following
On a vaguely related note: I was talking to some people at the ABBYY booth
in the exhibit hall for KMWorld/Enterprise Search Summit/Taxonomy
Bootcamp, and asked whether the Linux version of their software was not
going to be updated. Apparently it will be skipping a major, and will be
syncing
An interesting reference is this:
High, W. M. (1990). Editing Changes to Monographic Cataloging Records in the
OCLC Database: An Analysis of the Practice in Five University Libraries. PhD
thesis, University of North Carolina at Chapel Hill.
It's in UMI (and Heavy Trussed).
Simon
On Aug
On Aug 28, 2012, at 2:17 PM, Joe Hourcle wrote:
I seem to recall seeing a presentation a couple of years ago from someone in
the intelligence community, where they'd keep all of their intelligence, but
they stored RDF quads so they could track the source.
They'd then assign a confidence
BLUF: don't embed the scheme, and don't bother with a ./ path component.
See section 5 of http://www.rfc-editor.org/rfc/rfc3986.txt for more details.
On Jul 19, 2012 1:53 PM, Esha Datta e...@nyu.edu wrote:
Hi,
I am working on a METS document and want to point to files using paths
relative
very much like to apply.
On Thu, Jul 19, 2012 at 5:54 PM, Simon Spero sesunc...@gmail.com wrote:
On Thu, Jul 19, 2012 at 6:35 PM, j...@code4lib.org wrote:
* May be required to assist with disaster recovery efforts.
PREFERRED EDUCATION, EXPERIENCE AND SKILLS
* Advanced degree
On Jul 9, 2012 1:27 PM, Joshua Gomez jngo...@gwu.edu wrote:
WE NEED A CAT LOVER WHO IS ALSO A FEDERAL EMPLOYEE TO DO THIS JOB!
Must have active TS/SCI clearance with FS Poly.
All applicants must complete the attached 20 page KSA.
On Mon, Jul 9, 2012 at 2:22 PM, Joe Hourcle
onei...@grace.nascom.nasa.govwrote:
So, the real question is why it must specifically be federal
employee librarians. (and I don't know of any librarians with TS/SCI/Poly
... but I *have* heard that some of the archivists at the National Archives
On Mon, Jun 18, 2012 at 7:01 PM, Stephen Westman srwest...@gmail.comwrote:
Is it just me. or is there a problem the jobs.code4lib.org Web site? I
try
going there using different browsers - and even different operating systems
- and all I see is gibberish - as if the character encoding is
-- Forwarded message --
From: Trish Whetzel plwhet...@gmail.com
Date: Mon, Jun 18, 2012 at 9:24 PM
Subject: [ontolog-forum] NCBO Webinar: Yolanda Gil, June 20 - Semantic
Workflows and Provenance-Aware Software
To: obo-disc...@lists.sourceforge.net,
://www.ctan.org/tex-archive/macros/latex/contrib/pdfx/
[2] https://github.com/bencomp/**pdfx-exthttps://github.com/bencomp/pdfx-ext
On 12/6/'12 12:15 AM, Simon Spero wrote:
You can include XMP packets in pdftex files using the \pdfcatalog
primitive ; there are a couple of macro packages
You can include XMP packets in pdftex files using the \pdfcatalog
primitive ; there are a couple of macro packages that add a little bit of a
wrapper:
xmpincl basically just wraps and inserts the contents of an external
RDF+XML file as the value of /Metadata .
Having done my time working for in both the research
and administrative side university computing, I would also have to ask if
the development is within the library's competence, or if it is something
that would normally be handled by one of the other groups.
If it's administrative computing,
I've messed around with UIMA, and it's a nice general architecture, but
don't even think about trying to use it without the eclipse workflow gui.
I have a slight preference for GATE (http://gate.ac.uk), but there is
literally no difference in functionality between the two, since UIMA has a
plugin
On Tue, May 29, 2012 at 4:22 PM, Ravi Shankar rshan...@stanford.edu wrote:
We were currently leaning towards open-source triple stores. As far as
inferencing goes, I suspect we will be doing at least transitive closures
on rdfs:subClassOf and rdfs:subPropertyOf properties. I will look into
On Tue, May 29, 2012 at 5:31 AM, Jakob Voss jakob.v...@gbv.de wrote:
What are your major complaints with NCIP?
2. NCIP is rarely implemented in total, so you never know what you get
2(a) Each implementation is usually of a different subset :-/
One existing protocol that might be
The latest version of Jena TDB adds atomic transactions (version 0.9.0+)
See http://jena.apache.org/documentation/tdb/tdb_transactions.html for
documentation:
The following limitations are listed:
- Bulk loads: the TDB bulk loader is not transactional
- Nested transactions are not
Is OCLC controlling sandwich meats now? Where will it end?
The correct answer to the original question is - go to court.
On May 23, 2012 1:27 PM, LeVan,Ralph le...@oclc.org wrote:
Looks like baloney to me. I wouldn't touch it.
Ralph
-Original Message-
From: Code for Libraries
The file format magic format magic changed between versions; I think the
OSX version was not compatible with more up to date versions (in the
original thread, this caused me some confusion).
Simon
On Wed, May 23, 2012 at 4:34 PM, Ross Singer rossfsin...@gmail.com wrote:
On May 23, 2012, at
Arash - you might not want to use a straight dump of worldcat catalog
records- at least not without the associated holdings information.*
There are a lot of quasi-duplicate records that are sufficiently broken
that the worldcat de-duplication algorithm refuses to merge them. These
records will
On Tue, Apr 17, 2012 at 7:55 PM, Jonathan Rochkind rochk...@jhu.edu wrote:
Okay, forget XML for a moment, let's just look at marc 'binary'.
First, for Anglophone-centric MARC21.
Actually Anglo and Francophone centric. And the USMARC style 245 was a poor
replacement for the UKMARC approach
If you are using a linux system, you can get the information by looking at
/proc/net/wireless, or by running iwlist scan.
If you're running mac os, there's a command line program for doing
wirelessy things -
/System/Library/PrivateFrameworks/Apple80211.framework/Resources/airport
You can scan for
On Mon, Apr 9, 2012 at 7:13 PM, Ethan Gruber ewg4x...@gmail.com wrote:
Ancient geographic entities. Athens is in Attica. Sardis is in Lydia (in
Anatolia, for example). If these were modern geopolitical entities, I
would use geonames. We're linking cities to Pleiades, but Pleiades does
not
Are you talking about geographical entities, or geopolitical ones? For
example, is there an answer to the question what country is
constantinople located in?
Simon
On Apr 8, 2012 8:02 PM, Ethan Gruber ewg4x...@gmail.com wrote:
CIDOC-CRM may be the answer here. I will look over the
On Wed, Mar 14, 2012 at 9:37 AM, Owen, Will o...@email.unc.edu wrote:
Endeca provides a public interface to library holdings: it is not
and could not be an ILS, performing functions like circulation, accounting
control, etc. In this respect it's more akin to Blacklight that to an ILS.
One
On Tue, Mar 13, 2012 at 12:58 PM, Ed Summers e...@pobox.com wrote:
On Tue, Mar 13, 2012 at 12:49 PM, Chad Benjamin Nelson
cnelso...@gsu.edu wrote:
I think it is just some examples of the weird and interesting data in
scraperwiki.
Yeah, I guess it would be kind of pointless spam eh? :-)
Indoctrination is probably the correct term; it's the background briefing
before being read in to the compartment.
Simon
On Feb 22, 2012 11:52 AM, Bohyun Kim k...@fiu.edu wrote:
In my defense, I didn't pick the term, 'indoctrination.' =) But it shows
something about the community, eh? The
On Thu, Feb 16, 2012 at 7:13 AM, suzanne.pilsk suzanne.pi...@gmail.comwrote:
(Pssst: Does it matter if you call it data and I call it metadata?)
It may matter only to the extent that we each use the terms appropriately
relative to our own use and context.
On a related note, I would make a
I have had several theoretical changes of opinion on this question, and
have come to the considered opinion that there is no principled *essential*
difference between Metadata and Data. It all depends on the
context/theory/background assumptions to which the data is being applied.
The property of
/nick edsu
On Feb 7, 2012 6:51 AM, Jay Luker lb...@reallywow.com wrote:
Hi all,
Just a reminder: the #code4lib IRC channel will be logged and the logs made
available for viewing at http://irc.code4lib.org at some point during or
just after the conference.
Keep it civil, keep it weird.
Anyone interested in getting together to watch the big game on weds?
Simon
On Feb 4, 2012, at 16:19, Cary Gordon listu...@chillco.com wrote:
Probably their cat… They need this: http://www.bitboost.com/pawsense/
2012/2/4 Erik Hatcher erikhatc...@mac.com
Looks like some MARC records I've seen.
Next up on Library Planet: My Cataloger from
cancellations at this time.
Elizabeth
Elizabeth Duell
Orbis Cascade Alliance
edu...@uoregon.edu
(541) 346-1883
On 1/30/2012 3:02 PM, Simon Spero wrote:
My partner isn't going to be able to make it to the conference next
week,
so she is offering her paid registration to a deserving cause
Barcode Scanner?
https://market.android.com/details?id=com.google.zxing.client.androidhl=en
Simon
On Tue, Jan 31, 2012 at 12:32 PM, David Mayo pobo...@gmail.com wrote:
As a sort of side question, does anyone know of a halfway-decent Android
app for scanning UPC-style barcodes? QR scanners
the barcodes over wifi. But I think
that might be asking a bit too much.
- Dave
On Tue, Jan 31, 2012 at 12:44 PM, Simon Spero s...@unc.edu wrote:
Barcode Scanner?
https://market.android.com/details?id=com.google.zxing.client.androidhl=en
Simon
On Tue, Jan 31, 2012 at 12:32 PM, David
My partner isn't going to be able to make it to the conference next week,
so she is offering her paid registration to a deserving cause (students by
preference).
Simon
On Thu, Jan 5, 2012 at 9:32 AM, Joshua Gomez jngo...@library.gwu.eduwrote:
will be a quick summary of FRBR, and a possible solution for aggregates
which opens up a whole 'nother can of worms, but allows for some complex
relationships to be described.
Any summary of the discussion? I'm
You can get anything you want
At Brewster Kahle's restaurant.
http://openlibrary.org/data#bulk_download
Simon
On Wed, Jan 11, 2012 at 10:55 AM, LeVan,Ralph le...@oclc.org wrote:
http://staff.oclc.org/~levan/PearsTraining/scifi.usmarc has 10,000 marc
records in it. They are part of the old
On Wed, Jan 11, 2012 at 2:36 PM, Ethan Gruber ewg4x...@gmail.com wrote:
Hi all,
Suppose I have RDF describing an object, and I would like some
fairly free-form human generating description about the object (let's say
within dcterms:description). Is it semantically acceptable to have XHTML
There seems to be a character encoding error; #\( is rendering as #\{ and
#\) is rendering as #\}.
((believes ed.) (that (shurely (that (exists ?X (mistake ?X)))
Simon
On Dec 22, 2011 10:53 PM, Ann Lally ala...@uw.edu wrote:
Sean Hannan from Johns Hopkins University is the winner of the
ABBYY Finereader can do this. http://www.abbyy.com
Also, typing the company name can teleport you to different parts of the
dungeon.
Simon
On Wed, Dec 14, 2011 at 1:08 PM, Matt Amory matt.am...@gmail.com wrote:
Just looking to preserve column structure.
--
Matt Amory
(917) 771-4157
On Sun, Dec 11, 2011 at 10:33 AM, Karen Coyle li...@kcoyle.net wrote:
Quoting Simon Spero s...@unc.edu:
From a logical point of view, a bibliographic record can seen as a theory
-that is to say a consistent set of statements. There may be
records describing the same thing, but the theories
On Sun, Dec 11, 2011 at 3:25 PM, Lars Aronsson l...@aronsson.se wrote:
On 12/11/2011 08:52 PM, Simon Spero wrote:
The point I was trying to make is not related to any kind of display- it
is about how the meanings of the statements derived from a record are only
The reality that library
On Thu, Dec 8, 2011 at 12:16 PM, Richard Wallis richard.wal...@talis.comwrote:
*A record is a silo within a silo*
* *
A record within a catalogue duplicates the
publisher/author/subject/etc.information stored in adjacent records
describing items by the same
author/publisher/etc. This
I think this calls for an unwritten rule engine.
On Dec 1, 2011 10:22 PM, Ross Singer rossfsin...@gmail.com wrote:
I think the point of the hubbub today is trying to articulate the rule that
should be written.
Nobody is being excluded: we make things up as they go along and anybody is
ABBYY sdk does have a linux option which can be driven from the command
line. I'm not sure if the linux version is having as active development as
the windows one, as the linux sdk is one major behind the windows one, but
the linux ocr engine for that version had some improvements over the
ABBYY's engine is pretty good; though depending on whether you've already
scanned the text you might end up with higher thruput by having the OCR
performed at each scanning station.
I'm not sure if the non-server software is multi-core/multi-processor
aware; the version that is used in the
you've already paid for it
once... :) also am a fan of adobe acrobat's ocr and optimizer.
ap
On 11/4/11 2:25 PM, Simon Spero s...@unc.edu wrote:
ABBYY's engine is pretty good; though depending on whether you've already
scanned the text you might end up with higher thruput by having
Issues related to provenance and access control are of great practical
importance to the intelligence and defense communities, as well as to the
financial sector.
There are multiple regions in the problem space that have been explored and
for which COTS/GOTS solutions are available.
One
Project Greenlight having been long since canceled, the position has been
replaced by a drama about crooked deals to control corrupted metadata
called Crosswalk Empire.
There is still a development position for fedora with the title True Bugs.
On Oct 31, 2011 9:50 PM, Michael J. Giarlo
On Oct 3, 2011 9:19 AM, Ed Summers e...@pobox.com wrote:
On Sun, Oct 2, 2011 at 10:32 PM, Ken Irwin kir...@wittenberg.edu wrote:
1. respect robots.txt
Disclaimer: I am not a lawyer.
Remember that robots.txt applies only to recursive web crawlers, and not to
screen-scraping per se. In cases
On Tue, Aug 30, 2011 at 12:56 PM, Ken Irwin kir...@wittenberg.edu wrote:
I have a feeling it may be time for me to learn some grown-up programming
skills, and I hope someone here might be able to help.
I'm basically building a big associative array encoding the name of the
borrowing
Sender: CODE4LIB@LISTSERV.ND.EDU
On-Behalf-Of: sesunc...@gmail.com
Subject: Re: [CODE4LIB] Advice on a class
Message-Id: cfd505f2-8dec-47e8-b0d3-b0bd0a759...@gmail.com
Recipient: jcost...@trinity.edu.test-google-a.com, Forwarded:
jane.costa...@trinity.edu
Recipient:
On Jul 27, 2011, at 10:32 PM, Bill Janssen wrote:
If I'm hiring a programmer, I want them to know C and Python. [...]
Various flavors of C are acceptable: Objective-C is OK with me, and
C++ is a plus -- it's an order of magnitude more difficult than C to
use properly, and people who can
On Tue, Jul 19, 2011 at 10:55 AM, Nordstrom, Kurt kurt.nordst...@unt.eduwrote:
Unfortunately, in today's economic climate, the prospect of being able to
ship your developer team across the country to attend week-long seminars or
conferences or what-have-you is not quite as realistic as it
1 - 100 of 131 matches
Mail list logo