One thing that it would be great to do is to detect the ontology ID
*before* creating the TripleCollection in Clerezza, so any mappings
could be done before storing.
But I don't know how this can be done with not so much code.
Perhaps creating an IndexedGraph, exploring its content, then creating
the Graph in the TcManager with the same content and the right graph
name, then finally clearing the IndexedGraph could work.
But it still means having twice the resource usage (disk+memory) for a
period.
Alessandro
On 3/16/12 10:56 AM, Alessandro Adamou wrote:
Hi David,
well, I guess that depends pretty much on how heavy the usage of
OntoNet is in your Stanbol installation.
Those are graphs created when OntoNet has to load an ontology from its
content rather than from a Web URI, so it cannot know the ontology ID
earlier.
This happens e.g. by POSTing the ontology as the payload or by passing
a GraphContentInputSource to the Java API.
Now I do not know why these graphs are created (perhaps the refactor
engine could be loading some), but I do know that a Clerezza graph in
Jena TDB occupies a LOT of disk space.
Suffice it to say that my bundled had stored nine graphs of <100
triples each. Their disk space was about 1.8 GB, but when I tried to
make a zipfile out of it, it came out as about 2MB!
Alessandro
On 3/16/12 10:30 AM, David Riccitelli wrote:
Dears,
As I ran into disk issues, I found that this folder:
sling/felix/bundleXXX/data/tdb-data/mgraph
where XX is the bundle of:
Clerezza - SCB Jena TDB Storage Provider
org.apache.clerezza.rdf.jena.tdb.storage
took almost 70 gbytes of disk space (then the disk space has been
exhausted).
These are some of the files I found inside:
193M ./ontonet%3A%3Ainputstream%3Aontology889
193M ./ontonet%3A%3Ainputstream%3Aontology1041
193M ./ontonet%3A%3Ainputstream%3Aontology395
193M ./ontonet%3A%3Ainputstream%3Aontology363
193M ./ontonet%3A%3Ainputstream%3Aontology661
193M ./ontonet%3A%3Ainputstream%3Aontology786
193M ./ontonet%3A%3Ainputstream%3Aontology608
193M ./ontonet%3A%3Ainputstream%3Aontology213
193M ./ontonet%3A%3Ainputstream%3Aontology188
193M ./ontonet%3A%3Ainputstream%3Aontology602
Any clues?
Thanks,
David Riccitelli
********************************************************************************
InsideOut10 s.r.l.
P.IVA: IT-11381771002
Fax: +39 0110708239
---
LinkedIn: http://it.linkedin.com/in/riccitelli
Twitter: ziodave
---
Layar Partner
Network<http://www.layar.com/publishing/developers/list/?page=1&country=&city=&keyword=insideout10&lpn=1>
********************************************************************************
--
M.Sc. Alessandro Adamou
Alma Mater Studiorum - Università di Bologna
Department of Computer Science
Mura Anteo Zamboni 7, 40127 Bologna - Italy
Semantic Technology Laboratory (STLab)
Institute for Cognitive Science and Technology (ISTC)
National Research Council (CNR)
Via Nomentana 56, 00161 Rome - Italy
"I will give you everything, so long as you do not demand anything."
(Ettore Petrolini, 1930)
Not sent from my iSnobTechDevice