[Virtuoso-users] RDFa Support In Virtuoso Crawler

2015-10-01 Thread Haag, Jason
I'm curious if Virtuoso only works with XHTML doctype declarations?
I'm trying to use the crawler to automatically import this page:
http://xapi.vocab.pub/datasets/adl/verbs but it uses HTML5. Can anyone
confirm whether or not RDFa in HTML5 is supported in version 7.2.1?

Thank you.
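
One way to narrow this down outside of Virtuoso is to confirm that the page really does declare the HTML5 doctype and carries RDFa attributes. The sketch below uses only the Python standard library. The sample markup and the names RdfaProbe and probe are made up for illustration, and the check says nothing about what the Virtuoso crawler itself accepts.

```python
from html.parser import HTMLParser

# A subset of RDFa 1.1 attributes that rarely appear in plain HTML.
RDFA_ATTRS = {"about", "property", "typeof", "resource", "vocab", "prefix"}


class RdfaProbe(HTMLParser):
    """Records the doctype and any RDFa attributes seen in a page."""

    def __init__(self):
        super().__init__()
        self.doctype = None
        self.rdfa_attrs = set()

    def handle_decl(self, decl):
        # Called for <!DOCTYPE ...>; decl is the text between "<!" and ">".
        if decl.lower().startswith("doctype"):
            self.doctype = decl

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases attribute names before passing them in.
        self.rdfa_attrs.update(name for name, _ in attrs if name in RDFA_ATTRS)


def probe(html_text):
    """Return (is_html5_doctype, sorted list of RDFa attributes found)."""
    parser = RdfaProbe()
    parser.feed(html_text)
    parser.close()
    is_html5 = (parser.doctype or "").strip().lower() == "doctype html"
    return is_html5, sorted(parser.rdfa_attrs)


# Hypothetical sample page, not the real xapi.vocab.pub markup.
SAMPLE = """<!DOCTYPE html>
<html prefix="skos: http://www.w3.org/2004/02/skos/core#">
<body about="http://example.org/verbs/sample" typeof="skos:Concept">
<span property="skos:prefLabel">sample</span>
</body></html>"""

print(probe(SAMPLE))
```

If the real page passes this check but still yields no triples, the problem is more likely in the crawl job settings than in the markup.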

--
___
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users


[Virtuoso-users] virtuoso crash during nightly reload

2015-10-01 Thread Davis, Daniel (NIH/NLM) [C]
Virtuoso's load via TTLP of an N-Triples file failed for me last night. I am
running 7.2.1 + the memory leak patch I posted about earlier.
The back-trace doesn't have that many symbols in it (optimized, stripped).

So, I'm skipping a lot:

...
11:20:25 /usr/nlm/virtuoso/bin/virtuoso-t(table_source_input+0x677) [0x61ae47]
...
11:20:25 /usr/nlm/virtuoso/bin/virtuoso-t(sf_sql_execute_w+0x8f) [0x621a2f]
...
11:20:25 /lib64/libc.so.6(clone+0x6d) [0x7f99eb2969ad]
11:20:25 GPF: cetop.c:1388 not supposed to hit lt rl ce if looking for end of range
...

If it crashes in clone(), does that look familiar to anyone?
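
For anyone comparing traces, here is a small sketch that pulls the symbol names out of log lines in the format shown above. The regular expression and the name parse_frames are my own and assume the frames look exactly like the three quoted here; the elided lines are not reconstructed.

```python
import re

# Matches backtrace frames as they appear in the log excerpt:
#   HH:MM:SS /path/to/binary(symbol+0xOFFSET) [0xADDRESS]
FRAME_RE = re.compile(
    r"^(?P<time>\d{2}:\d{2}:\d{2}) (?P<module>\S+)"
    r"\((?P<symbol>[^+)]*)\+?(?P<offset>0x[0-9a-f]+)?\) "
    r"\[(?P<addr>0x[0-9a-f]+)\]$"
)


def parse_frames(log_text):
    """Return (module, symbol, address) for every line that is a stack frame."""
    frames = []
    for line in log_text.splitlines():
        m = FRAME_RE.match(line.strip())
        if m:
            frames.append((m["module"], m["symbol"], m["addr"]))
    return frames


# Only the lines quoted in this message; the GPF line is not a frame.
LOG = """\
11:20:25 /usr/nlm/virtuoso/bin/virtuoso-t(table_source_input+0x677) [0x61ae47]
11:20:25 /usr/nlm/virtuoso/bin/virtuoso-t(sf_sql_execute_w+0x8f) [0x621a2f]
11:20:25 /lib64/libc.so.6(clone+0x6d) [0x7f99eb2969ad]
11:20:25 GPF: cetop.c:1388 not supposed to hit lt rl ce if looking for end of range
"""

for module, symbol, addr in parse_frames(LOG):
    print(f"{symbol or '?':<22} {addr:<16} {module}")
```

Note that clone is normally the outermost frame of any thread's stack on Linux, so seeing it at the bottom of the trace is expected; the frames listed before it and the GPF line naming cetop.c:1388 are closer to the actual fault.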

Dan Davis, Systems/Applications Architect (Contractor),
Office of Computer and Communications Systems,
National Library of Medicine, NIH



Re: [Virtuoso-users] Virtuoso-users Digest, Vol 108, Issue 1

2015-10-01 Thread Jason Haag
Does anyone happen to have a sample RDFa/HTML file that works in
the crawler for Virtuoso? I have tried all of the options (including both
sitemaps and semantic sitemaps). I'm curious if my RDFa/HTML import
(URI is here: http://purl.org/xapi/adl/verbs) might not be working
with Virtuoso for some reason. The Conductor interface shows it being
fetched in the import queue and listed under retrieved sites, but there
is no data in the database.

I am able to import my data when I use a RDF/XML file (.rdf) as the
graph URI, but I would really like to demonstrate that it can process
RDFa/HTML as an option. Thanks in advance.
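
To check whether a crawl actually put anything into the quad store, one option is to count the triples in the target graph through the SPARQL endpoint. This is a minimal sketch, assuming a local instance with the endpoint at http://localhost:8890/sparql and assuming the graph IRI is whatever was set in the crawl job; the function names are made up for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: a local Virtuoso instance with the SPARQL endpoint on port 8890.
ENDPOINT = "http://localhost:8890/sparql"


def count_query_url(graph_iri, endpoint=ENDPOINT):
    """Build a GET URL that asks for the number of triples in one named graph."""
    query = f"SELECT (COUNT(*) AS ?n) WHERE {{ GRAPH <{graph_iri}> {{ ?s ?p ?o }} }}"
    params = {"query": query, "format": "application/sparql-results+json"}
    return f"{endpoint}?{urlencode(params)}"


def count_triples(graph_iri, endpoint=ENDPOINT):
    """Run the count query; needs a reachable server, so it is not called here."""
    with urlopen(count_query_url(graph_iri, endpoint), timeout=30) as resp:
        results = json.load(resp)
    return int(results["results"]["bindings"][0]["n"]["value"])


# Assumption: the graph IRI matches the one configured in the crawl job.
print(count_query_url("http://purl.org/xapi/adl/verbs"))
```

A count of zero after a finished crawl would confirm that the page was fetched but nothing was written to that graph.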

On Thu, Oct 1, 2015 at 3:29 PM, Jason Haag wrote:
> Hi Kingsley,
>
> Thank you for the reply. I entered a Graph URI for the import into the
> Target URL field. Is this the correct place to enter it?
>
> I also selected the "Do not store crawled documents" radio button
> option as I don't care if the data is stored locally in WEBDAV.
>
> However, when I chose this option I receive the warning message "Host,
> Target collection and Start path must be specified."
>
> So it seems that providing a webdav path is required no matter what?
>
> I also have installed the RDF sponger cartridges and have those
> checked as well as "Accept RDF."
>
> Alternatively, I tried entering my URI into the field where it states:
> "If Graph IRI is unassigned use this Data Source URL:"
>
> When I try using this alternative data source URL field I receive the
> warning message, "Incorrect target URL."
>
> Appreciate any tips or advice you have for using the crawler interface
> with RDFa. Will also take a look at the linked data folder setup you
> suggested.
>
> Kind Regards,
>
> J Haag
>
>
> Date: Wed, 30 Sep 2015 20:25:50 -0400
> From: Kingsley Idehen 
> Subject: Re: [Virtuoso-users] Automating RDF data imports in VIrtuoso
> To: virtuoso-users@lists.sourceforge.net
> Message-ID: <560c7d8e.7020...@openlinksw.com>
> Content-Type: text/plain; charset="windows-1252"
>
> On 9/30/15 6:10 PM, Haag, Jason wrote:
>> Thanks Daniel. I'm running Linux/Debian + Ubuntu. I am able to import
>> the HTML/RDFa file using the crawler. I'm able to find it in WEBDAV.
>> It just doesn't populate the database. If I import the same data in
>> RDF/XML format it does populate the database.
>
> Did you set a Named Graph IRI in the import/crawl job? That's how you
> get data into the quad store. Storing the ingest data to WebDAV is
> totally optional. Also note that you can invoke the sponger too, as part
> of this crawling functionality.
>
> You can also make Linked Data Folder Types [1] that are mapped to named
> graph IRIs as part of the folder config. Once in place, you can make said
> folder the target of RDF content that you want to import into the quad
> store.
>
> [1] https://www.pinterest.com/kidehen/virtuoso-universal-server-related/
> -- I've added 4 screenshots showcasing Linked Data Folder Type setup
>
>
> Kingsley



Re: [Virtuoso-users] Virtuoso-users Digest, Vol 108, Issue 1

2015-10-01 Thread Kingsley Idehen
On 10/1/15 6:52 PM, Jason Haag wrote:
> Does anyone happen to have a sample RDFa/HTML file that works in
> the crawler for Virtuoso? I have tried all of the options (including both
> sitemaps and semantic sitemaps). I'm curious if my RDFa/HTML import
> (URI is here: http://purl.org/xapi/adl/verbs) might not be working
> with Virtuoso for some reason. The Conductor interface shows it being
> fetched in the import queue and listed under retrieved sites, but there
> is no data in the database.

Here's an example using our URIBurner instance (a public instance of
Virtuoso with the Sponger and various Cartridges enabled):

[1]
http://linkeddata.uriburner.com/about/html/http/xapi.vocab.pub/datasets/adl/verbs/index.html
-- Using /about service
[2] http://linkeddata.uriburner.com/c/9DB7EM2T - Using /describe service .
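
The /about link in [1] appears to be the service prefix followed by the source URL's scheme, host, and path. The sketch below builds such a link for other pages. The pattern is inferred from that single example rather than taken from documentation, the function name is made up, and query strings are simply dropped.

```python
from urllib.parse import urlsplit

# Assumption: the public URIBurner host from link [1]; a local instance with
# the Sponger installed would use its own host and port instead.
ABOUT_BASE = "http://linkeddata.uriburner.com/about/html"


def about_url(source_url, base=ABOUT_BASE):
    """Rewrite an http(s) URL into the /about/html/<scheme>/<host><path> form."""
    parts = urlsplit(source_url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        raise ValueError(f"expected an absolute http(s) URL, got {source_url!r}")
    # Query string and fragment are ignored in this sketch.
    return f"{base}/{parts.scheme}/{parts.netloc}{parts.path}"


print(about_url("http://xapi.vocab.pub/datasets/adl/verbs/index.html"))
```

Run against the page in question, this reproduces link [1].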
>
> I am able to import my data when I use a RDF/XML file (.rdf) as the
> graph URI, but I would really like to demonstrate that it can process
> RDFa/HTML as an option. Thanks in advance.

You should be able to use the /about service, as I have, to achieve
similar results. Once that works, simply enable the crawler to use
the Sponger. If you have any issues, share a screenshot of your crawler
job configuration.


Ensure you check the following checkboxes:
[1] Run Sponger
[2] Accept RDF -- this adds transparent content negotiation to the
HTTP request
[3] Store Metadata -- instructs the crawler to persist the Sponger
transformation to the designated named graph.
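
To illustrate what content negotiation means for option [2]: the client states which formats it prefers in the Accept header and the server picks one. The sketch below only builds such a request in Python. The preference order is an assumption for illustration, since the header the Virtuoso crawler actually sends is not shown in this thread.

```python
from urllib.request import Request

# Assumption: an illustrative preference order, RDF serializations first and
# (X)HTML as a fallback so that RDFa pages are still acceptable.
RDF_ACCEPT = ", ".join([
    "text/turtle",
    "application/rdf+xml;q=0.9",
    "application/xhtml+xml;q=0.8",
    "text/html;q=0.7",
    "*/*;q=0.1",
])


def rdf_request(url):
    """Return a request object that asks for RDF first; nothing is sent yet."""
    return Request(url, headers={"Accept": RDF_ACCEPT})


req = rdf_request("http://purl.org/xapi/adl/verbs")
print(req.get_header("Accept"))
```

Passing the request to urllib.request.urlopen would send it; comparing the Content-Type of the response with and without the header shows whether the server negotiates at all.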


Kingsley


-- 
Regards,

Kingsley Idehen   
Founder & CEO 
OpenLink Software 
Company Web: http://www.openlinksw.com
Personal Weblog 1: http://kidehen.blogspot.com
Personal Weblog 2: http://www.openlinksw.com/blog/~kidehen
Twitter Profile: https://twitter.com/kidehen
Google+ Profile: https://plus.google.com/+KingsleyIdehen/about
LinkedIn Profile: http://www.linkedin.com/in/kidehen
Personal WebID: http://kingsley.idehen.net/dataspace/person/kidehen#this






Re: [Virtuoso-users] Virtuoso-users Digest, Vol 108, Issue 1

2015-10-01 Thread Kingsley Idehen
On 10/1/15 4:29 PM, Jason Haag wrote:
> Hi Kingsley,
>
> Thank you for the reply. I entered a Graph URI for the import into the
> Target URL field. Is this the correct place to enter it?
>
> I also selected the "Do not store crawled documents" radio button
> option as I don't care if the data is stored locally in WEBDAV.
>
> However, when I chose this option I receive the warning message "Host,
> Target collection and Start path must be specified."

Yes, that's a red herring: you must enter a path, but further down you
have an option not to store locally.

>
> So it seems that providing a webdav path is required no matter what?

Yes, as per comment above.

>
> I also have installed the RDF sponger cartridges and have those
> checked as well as "Accept RDF."
>
> Alternatively, I tried entering my URI into the field where it states:
> "If Graph IRI is unassigned use this Data Source URL:"
>
> When I try using this alternative data source URL field I receive the
> warning message, "Incorrect target URL."

Try something like urn:data:test for now.

Kingsley

