Re: [basex-talk] size on collection in the time (Marc)

2015-09-14 Thread Christian Grün
> I confirm the problem: on another database (same data, same script.xbs,
> different server), we see exactly the same behaviour: the database grows
> each time the script is run.
>
> I'm going to prepare a test case with small public data tonight.

Same here: Using the latest version of BaseX should solve the
problems. If not, please give us an update.

Christian





Re: [basex-talk] size on collection in the time

2015-09-03 Thread Christophe Marchand

James, you are right, OPTIMIZE ALL solves the problem.

Thanks a lot!

Best regards,
Christophe

On 03/09/2015 21:20, basex-talk-requ...@mailman.uni-konstanz.de wrote:



Today's Topics:

1. Xquery collections (michele.gre...@email.it)
2. Re: Xquery collections (Dirk Kirsten)
3. Re: size on collection in the time (Marc)
4. Re: size on collection in the time (Christophe Marchand)


--

Message: 1
Date: Thu, 3 Sep 2015 17:48:23 +0200
From: michele.gre...@email.it
To: basex-talk@mailman.uni-konstanz.de
Subject: [basex-talk] Xquery collections
Message-ID: <22e8f2f70298a35093e933ab65f26...@wm10.email.it>
Content-Type: text/plain; charset="iso-8859-1"


Hello, I am Michele and I'm new to BaseX.
I created a collection with several .xml documents.
I want to know how to query the entire collection and its documents in a
single XQuery.
I'm trying, but I can't.

Thanks.
MG






  
  


--

Message: 2
Date: Thu, 3 Sep 2015 17:55:49 +0200
From: Dirk Kirsten 
To: michele.gre...@email.it
Cc: basex-talk@mailman.uni-konstanz.de
Subject: Re: [basex-talk] Xquery collections
Message-ID: <55e86d85.2000...@basex.org>
Content-Type: text/plain; charset="utf-8"

Hi Michele,

welcome to the community :-)

Collections in BaseX are basically databases. You can simply open a
database, e.g. by issuing the XQuery db:open('mydatabase'). This will
use all documents in your database.

You can find some more examples at http://docs.basex.org/wiki/Databases

It might also be helpful for you to read some of the tutorials listed at
http://docs.basex.org/wiki/Getting_Started, like BaseX for Dummies, which
gives a concise introduction to BaseX.

I hope this helps
Dirk
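
For readers following along, here is a minimal XQuery sketch of what Dirk
describes. The database name 'mydatabase' comes from his example; the
element name 'title' is purely a placeholder, so substitute whatever your
documents actually contain. Note that db:open was renamed to db:get in
BaseX 10, so the call may need adjusting on recent versions.

```xquery
(: Iterate over every document in the database (collection).
   'title' is a hypothetical element name used only for illustration. :)
for $doc in db:open('mydatabase')
return
  <doc path="{ db:path($doc) }" titles="{ count($doc//title) }"/>
```

The standard function collection('mydatabase') should return the same
document nodes, so an expression such as collection('mydatabase')//title
queries all documents in one go.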







Re: [basex-talk] size on collection in the time

2015-09-03 Thread Marc

James,
We have UPDINDEX set to false.
I'll try running an OPTIMIZE command, but we already do that after all
the REPLACE commands, so I don't think it will be enough.

After that I will prepare a test case with public data.
Regards
Marc



Re: [basex-talk] size on collection in the time

2015-09-03 Thread Christophe Marchand

James, you are right, OPTIMIZE ALL solves the problem.
The txt.basex file was growing with each replace, while the other files
stayed almost identical.

Thanks a lot!

Best regards,
Christophe





Re: [basex-talk] size on collection in the time

2015-09-03 Thread Marc

Hello Dirk,
We use a BaseX script (.xbs) with a list of REPLACE commands.
We will investigate whether this is reproducible or not.
If I run into the problem again, I will take a copy of the collection
before deleting it.

Thank you for your answer.

Marc




Re: [basex-talk] size on collection in the time

2015-09-03 Thread Dirk Kirsten
Hello Marc,

sure, a replace should lead to a true replace, i.e. the old data will be
dropped. So this sounds like a bug, but it is difficult to replicate
from what you describe.

Do you think it would be possible to make the issue reproducible, maybe
also with a smaller data set? Also, how exactly do you add documents, and
do you also run other queries, maybe other updating queries?

Cheers
Dirk


-- 
Dirk Kirsten, BaseX GmbH, http://basexgmbh.de
|-- Firmensitz: Blarerstrasse 56, 78462 Konstanz
|-- Registergericht Freiburg, HRB: 708285, Geschäftsführer:
|   Dr. Christian Grün, Dr. Alexander Holupirek, Michael Seiferle
`-- Phone: 0049 7531 28 28 676, Fax: 0049 7531 20 05 22



Re: [basex-talk] size on collection in the time

2015-09-03 Thread James Ball
Marc,

Are you using the updatable index (UPDINDEX true)?

If so, look back through the list: Christian and I had an exchange on
this when I was experiencing an ever-growing index, because a replace always
appended to the end of the index. IIRC the logic is a bit better now: if on a
replace there is enough room in the index structure to reuse slots, it will
do so; if not, the data is appended. And the database will reuse empty slots
while it remains open. I can't remember which version first included this
change, sorry.

I had a DB the other day that had got to 14GB but was only around 600MB once I
did a full optimise. My suggestion is next time to try a full optimise instead
of dropping the collection and see if that helps.

Regards, James
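
The full optimise suggested above can be run as the command OPTIMIZE ALL
on an opened database, or from XQuery as sketched below. The database
name 'mydatabase' is a placeholder, not something taken from this thread.
A full optimise rebuilds the database structures, so it is probably best
run once the batch of REPLACE operations has finished.

```xquery
(: Rebuild all database structures and discard orphaned data.
   'mydatabase' is a hypothetical database name; passing true() as the
   second argument requests the full variant, as with OPTIMIZE ALL. :)
db:optimize('mydatabase', true())
```

Comparing the size of the database directory before and after the call
shows how much space the stale entries were occupying.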



Re: [basex-talk] size on collection in the time (Marc)

2015-09-03 Thread cmarchand
 

I confirm the problem: on another database (same data, same script.xbs,
different server), we see exactly the same behaviour: the database grows
each time the script is run.

I'm going to prepare a test case with small public data tonight.

Best regards, 

Christophe 


--

Message: 4
Date: Thu, 3 Sep 2015 10:01:05 +0200
From: Marc 
To: Dirk Kirsten 
Cc: BaseX 
Subject: Re: [basex-talk] (no subject)
Message-ID: <55e7fe41.5080...@free.fr>
Content-Type: text/plain; charset=iso-8859-15; format=flowed

I forgot to mention that we use version 7.9 on this server.

Marc

[basex-talk] size on collection in the time

2015-09-02 Thread Marc

Hi,
I have a question about the size of a collection's files on disk.
We do a lot of updates (REPLACE) with an attribute index but without a
text index.
I see that the files keep growing over time: they reached 30 GB, which
is the limit of the filesystem quota.
When I dropped the collection and recreated it from the latest version of
the files, its size was only 6 GB.

We will repeat the test over time, but I just want to know: does the
program free disk space when it does a REPLACE?


Marc