Hi, Helix84,

So, if I've got this right, importing data into DSpace, the first two items should look like this in the CSV file::

   id,collection,
   "+","123456789",

so that DSpace will auto-increment and the "handle" will be correct?

The AWK program works just fine to make the original data look right (double quotes around strings and comma field separators).

A side note to Stuart Yeates, the AWK I use is what you might want to call (or maybe not) "real" AWK built from source (not gawk which I find too fiddled-with for my taste) from Brian Kernighan's web page (he's the K in AWK). Unix systems distinguish original AWK as "oawk" and new AWK as "nawk," the version that redefined the language with the publication of "The AWK Programming Language" ISBN 0-201-07981-X in 1981. As it happens, nawk handles 8-bit characters just fine on my Slackware 64-bit Linux systems. I got used to nawk doing development on Solaris some years ago and, well, I just stick with the "standard" and don't have any problems going between Solaris and Slackware that way. Just in case you might be interested, Brian Kernighan's Home Page <http://www.cs.princeton.edu/%7Ebwk/> at Princeton; it's free download and builds properly on any Linux system with standard development libraries installed.

A riddle, wrapped in a mystery, inside an enigma but that's my story and I'm 
stickin' to it.

On 08/20/2013 04:51 PM, helix84 wrote:
This is just the value of the item_id column of the item table and is autoincremented using the item_seq sequence. Use "+" to insert new items.
     the collection is 123456789/7 (I was fiddling around trying to get
     it work, probably 6 times); is that number going to be something
     anything will care about later? Should I make it something else for
     an initial load? Can it be reset to 123456789/1 somehow (or should I
     just use some thing else)?
123456789 is the handle prefix, defined by "handle.prefix" in
dspace.cfg. You shouldn't change it unless you want to actually
register with Handle.net.

Handle.net is meant to be a globally unique identifier. 123456789 is
just the default value until you decide to set up the global part.

7 is the handle postfix and is maintained locally. What DSpace does is
treat it sequentially, so even if you delete one, it never gets
assigned again.

Check out the handle table and the handle_seq sequence. If you want to
start over, a handy shortcut is
dspace/etc/postgres/update-sequences.sql

------------------------------------------------------------------------------
Introducing Performance Central, a new site from SourceForge and 
AppDynamics. Performance Central is your source for news, insights, 
analysis and resources for efficient Application Performance Management. 
Visit us today!
http://pubads.g.doubleclick.net/gampad/clk?id=48897511&iu=/4140/ostg.clktrk
_______________________________________________
Dspace-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dspace-general

Reply via email to