...sorry, sent the INDEX information tab instead of "SPACE" info tab :

Table   Index name      Rows    Pages   Row pages       Blob pages      Size
DB.DBA.RDF_OBJ  DB.DBA.RDF_OBJ  30443064        255445  255445  0       1833.25 
MB
DB.DBA.RDF_IRI  DB.DBA.RDF_IRI  59063426        206412  206412  0       1247.19 
MB
DB.DBA.RDF_IRI DB_DBA_RDF_IRI_UNQC_RI_ID 59063426 183541 183541 0 1282.95 MB DB.DBA.RDF_OBJ_RO_FLAGS_WORDS DB.DBA.RDF_OBJ_RO_FLAGS_WORDS 8341023 179875 179875 0 1332.22 MB
DB.DBA.RDF_OBJ  RO_VAL  30443064        117863  117863  0       773.81 MB
DB.DBA.RO_START         DB.DBA.RO_START         30443064        85319   85319   
0       475.31 MB


Le 20/03/2017 à 11:34, Thomas Michaux a écrit :



Le 19/03/2017 à 16:15, Hugh Williams a écrit :
Hi Thomas,
Hi,
Is the loading of the dataset now complete or it is still in progress as you 
opening statement is not clear ?
You should not need 40GB RAM for inserting and hosting 240 million triples, 
which should require less then 10GB depending on how well they can be 
compressed for storage in the database.
loading is complete, we finished at 243 188 427 triples , hosting now requires 25GB ram, 15Gb disk, details :

void:triples 243188427 ;
 void:classes 13 ;
 void:entities 58523487 ;
 void:distinctSubjects 58523514 ;
 void:properties 32 ;
 void:distinctObjects 73171603 .

Total pages     1925120
Free pages      607377
Buffers         2720000
Buffers used    244554
Dirty buffers   3
Wired down buffers      0

Table   Index name      Touches         Reads   Read %
DB.DBA.RDF_QUAD         RDF_QUAD        1562356553      36371   0
DB.DBA.RDF_QUAD         RDF_QUAD_POGS   609423455       16989   0
DB.DBA.RDF_QUAD         RDF_QUAD_SP     378769255       35822   0
DB.DBA.RDF_QUAD         RDF_QUAD_GS     340377017       1634    0

I assume you have set the swappiness as suggested previously ?
yes, done, $ sysctl vm.swappiness
vm.swappiness = 10

When you recompiled your Virtuoso was this done from the git stable/7 or 
develop/7 branch , as I latter has a number of memory consumption fixes that 
would not be in stable/7, thus I would suggest building from develop/7.
will investigate.

The two main problems we encountered while loading were :

- logs messages indicating "Flushing at 5.7 MB/s while application is making dirty pages at 1.7 MB/s." which we interpreted as not enough write speed while receiving lots of JDBC INSERTs (disk issue ? buffer issue ? ...)

- high memory consumption (40GB RAM), virtuoso process never releasing memory while loading, free RAM always going down...

Have you provided a copy of your INI file previously,  if not can you provide a 
copy ?
see attached (FYI QueryLog= was not active while loading)
Do ensure the following params are set to 1 in order to clean up unused 
threads/resources and reduce memory consumption of the Virtuoso server, which 
can otherwise be construed as memory leaks.:

ThreadCleanupInterval    = 1
ResourcesCleanupInterval = 1
we have theses settings right.

Thanks for your help,

Thomas

if needed we model ORCID 2016 dataset using :
c1      c2
http://xmlns.com/foaf/0.1/Person        
28021451
http://purl.org/ontology/bibo/Document  
14283692
http://purl.org/ontology/bibo/Journal   
9104659
http://xmlns.com/foaf/0.1/PersonalProfileDocument       
2527333
http://xmlns.com/foaf/0.1/Article       
974945
http://www.w3.org/ns/org#Membership     
807465
http://www.w3.org/2006/vcard/ns#Address         
807423
http://www.w3.org/ns/org#Organization   
807418
http://purl.org/ontology/bibo/Conference        
769451
http://www.w3.org/ns/org#OrganizationalUnit     
649291
http://www.w3.org/2004/02/skos/core#Concept     
371731
http://purl.org/ontology/bibo/Book      
205493
http://www.w3.org/ns/org#Role   
168423
http://www.w3.org/1999/02/22-rdf-syntax-ns#Property     
170
http://www.openlinksw.com/schemas/virtrdf#QuadMapFormat         
130
http://www.openlinksw.com/schemas/virtrdf#array-of-QuadMapFormat        
98
http://www.w3.org/2000/01/rdf-schema#Class      
56
http://www.openlinksw.com/schemas/virtrdf#QuadMapValue  
8
http://www.openlinksw.com/schemas/virtrdf#array-of-QuadMapColumn        
8
http://www.openlinksw.com/schemas/virtrdf#QuadMapColumn         
8




Best Regards
Hugh Williams
Professional Services
OpenLink Software, Inc.      //http://www.openlinksw.com/
Weblog   --http://www.openlinksw.com/blogs/
LinkedIn --http://www.linkedin.com/company/openlink-software/
Twitter  --http://twitter.com/OpenLink
Google+  --http://plus.google.com/100570109519069333827/
Facebook --http://www.facebook.com/OpenLinkSoftware
Universal Data Access, Integration, and Management Technology Providers



On 15 Mar 2017, at 17:08, Thomas Michaux<mich...@abes.fr>  wrote:

Hello,

FYI, virtuoso still loading but we needed to increase memory ressources,
now the process use almost 40GB of ram :

[devel@tulipe-test2 ~]$ ./memcheck-virtuoso.sh
2017-03-15T17:54 VmSize: 41273424kB 5883

stats for the graph<http://hub.abes.fr/referentiel/ORCID/2016>  (forget
to mention, it's the only graph in db) :

239 451 028 triples


this:Dataset a void:Dataset ;
rdfs:seeAlso<http://hub.abes.fr/referentiel/ORCID/2016>  ;
rdfs:label "" ;
void:sparqlEndpoint<http://idrefplus.v102.abes.fr:8890/sparql>  ;
void:triples 239451028 ;
void:classes 13 ;
void:entities 57692917 ;
void:distinctSubjects 57650847 ;
void:properties 32 ;
void:distinctObjects 72219514 .

this:sameAsLinks a void:Linkset ;
void:inDataset this:Dataset ;
void:triples 997389 ;
void:linkPredicate owl:sameAs .


Le 14/03/2017 à 10:05, Thomas Michaux a écrit :



------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot


_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Virtuoso-users mailing list
Virtuoso-users@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/virtuoso-users

Reply via email to