Hi Alexander,
If you prefer to leave them uncompressed then fine, change the reference in the
ld_dir () procedure to "*.nt" in that case. The gz recommendation was more to
save space on the machine ...
BTW, their is rogue "i" character on the end of the graph name in the ld_dir
command below I just noticed, which should be:
"ld_dir ('/opt/lgdelement', *.gz', 'http://linkedgeodata.org#')"
Best Regards
Hugh Williams
Professional Services
OpenLink Software
Web: http://www.openlinksw.com
Support: http://support.openlinksw.com
Forums: http://boards.openlinksw.com/support
Twitter: http://twitter.com/OpenLink
On 16 Apr 2010, at 12:03, Alexander Sidorov wrote:
> Hi Hugh,
>
> Thank you for feedback. I am decompressing the dataset right now... But I
> didn't understand why I should compress the dataset again (using gzip) if I
> can load uncompressed .nt file.
>
> Regards,
> Alexander
>
> 2010/4/16 Hugh Williams <[email protected]>
> Hi Alexander,
>
> The Virtuoso Bulk loader scripts do not support the extraction of "bz2"
> compressed files, only gzip compressed or uncompressed files, so you will
> need to uncompress the LGD bz2 file(s) first. Also, as you have "*.*" in the
> ld_dir() command the loader is trying to load all file in the directory
> including the global.graph file as if it is a dataset, which it is not.
>
> Thus I would suggest you do the following:
>
> 1) Compress the LDG NT files you want to load with gzip
> 2) Remove all entries from the "DB.DBA.LOAD_LIST" table
> 3) Run "ld_dir ('/opt/lgdelement', '*.gz', 'http://linkedgeodata.org#i')"
> 4) Run "rdf_loader_run()"
>
> Let us know if this now works for you ...
>
> Best Regards
> Hugh Williams
> Professional Services
> OpenLink Software
> Web: http://www.openlinksw.com
> Support: http://support.openlinksw.com
> Forums: http://boards.openlinksw.com/support
> Twitter: http://twitter.com/OpenLink
>
> On 16 Apr 2010, at 08:11, Alexander Sidorov wrote:
>
>> Hello!
>>
>> I'm trying to load LGD Elements dataset to my EC2 instance but as result get
>> the following error at logs:
>>
>> 02:46:41 PL LOG: File /opt/lgdelements/global.graph error 37000 SP029:
>> TURTLE RDF loader, line 1: Undefined namespace prefix at
>> http://linkedgeodata.org#
>> 02:46:41 PL LOG: File /opt/lgdelements/lgd-elements.nt.bz2 error 37000
>> SP029: TURTLE RDF loader, line 8: syntax error
>>
>> Here is my ld_dir configuration:
>>
>> ld_dir ('/opt/lgdelements', '*.*', 'http://linkedgeodata.org#');
>>
>> Folder /opt/lgdelements contains file global.graph that contains
>> http://linkedgeodata.org string.
>>
>> Regards,
>> Alexander
>> ------------------------------------------------------------------------------
>> Download Intel® Parallel Studio Eval
>> Try the new software tools for yourself. Speed compiling, find bugs
>> proactively, and fine-tune applications for parallel performance.
>> See why Intel Parallel Studio got high marks during beta.
>> http://p.sf.net/sfu/intel-sw-dev_______________________________________________
>> Virtuoso-users mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/virtuoso-users
>
>