Hey Leigh,
I had this problem some months ago. At this time there where no
mechanism to "exclude" invalid triples. So i changed the Jena-Api and
included my own parsing proccess. But it was quite complicated and it
just works for my data. I think there are two not so complicated ways.
*first:*
- you could look for a ntriple-parser and do the checking before you
dump in tdb. I used the api of sesame some time ago. and it was quite good.
*second:*
- you could load triple per triple into tdb. and if it runs fail on one
triple you just load the next one. But if you do this you just can use
tdbloader and not tdbloader2, which slows down the loading proccess imense
Regards
Stefan
Am 21.08.2012 17:55, schrieb Leigh Dodds:
Hi,
I'm doing some testing of TDB for a client. They have data in an older
RDB database which accepted triples that TDB now rejects.
Is there a way I can run a data dump through riot to clean it (i.e.
leaving only acceptable triples) or getting TDB to reject triples but
continue to load the rest?
Apologies if this is an FAQ. I know others have hit this issue before,
but couldn't find a good solution.
Cheers,
L.