Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-20 Thread Jim Simmons
Martin,

For what it's worth, I've experienced this running the Sugar test
environment in both Fedora 10 and 11.  It would not seem to be
specific to .82.  The way to reproduce the problem is to run the Sugar
environment in Xephyr (sugar-emulator).  Do some stuff, then instead
of shutting down cleanly click on the "close" button on the Xephyr
window.  That will close your Sugar session and practically guarantee
that the next time you open it your Sugar Journal will be empty.  I
regret to say that I've done this a few times.

James Simmons


> Date: Thu, 20 Aug 2009 18:56:17 +0200
> From: Martin Langhoff 
> Subject: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1
>        users in Uy
> To: OLPC Devel ,        Sugar-dev
>        
> Message-ID:
>        <46a038f90908200956h436459f3j9fa7b29bb111d...@mail.gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1
>
> On olpc-sur we're seeing reports of "my journal contents disappeared
> after reboot". (Thread starts at
> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )
>
> This reminds me a lot of the issues we saw earlier in development of
> what became 8.2.x -- if for any reason Sugar doesn't like the Journal
> contents or indexes, it moves the 'datastore' directory aside, and
> starts anew.
>
> That's my hunch, and I am hoping to hear confirmation from the users
> reporting the problem.
>
> Two pleas for help
>
>  - A sugar activity that does some data-recovery and freeing up the space.
>
> The good news is that we have a json file with the metadata for each
> Journal entry, so we can either load it back into the Journal (mstone
> had published a cli script that exports/imports to the Journal?) or
> create Journal Entry Bundles and save them to a USB stick.
>
> I suspect the JEB path might help users more -- if the Journal is
> choking on some specific metadata, a script that reloads it all in one
> go will make it choke again. Having files as JEBs allows step-by-step
> attempts.
>
>  - Diagnosis.
>
> If it's something we can pin down, it'll be great to have a fix! This
> will surely involve getting in touch with teachers on olpc-sur.
>
> cheers,
>
>
>
> m
> --
>  martin.langh...@gmail.com
>  mar...@laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-20 Thread Martin Langhoff
On Fri, Aug 21, 2009 at 12:13 AM, Jim Simmons wrote:
> For what it's worth, I've experienced this running the Sugar test
> environment in both Fedora 10 and 11.  It would not seem to be
> specific to .82.  The way to reproduce the problem is to run the Sugar
> environment in Xephyr (sugar-emulator).  Do some stuff, then instead
> of shutting down cleanly click on the "close" button on the Xephyr
> window.  That will close your Sugar session and practically guarantee
> that the next time you open it your Sugar Journal will be empty.  I
> regret to say that I've done this a few times.

That's not very encouraging. I "switch off" my XO, running 8.2.x
(which is 0.82.x of Sugar) all the time by removing the battery, which
is a much harsher method than what you describe (which is merely a
SIGTERM).

Sugar 0.84 has seen a reimplementation of the Datastore (by Tomeu?)
and I am sure he'll be interested in this. Avoiding dataloss and
generally being atomic is always one of the most important aspects of
the DS.

I can't cast the first stone[1] but I do want to say: any
Journal-related dataloss is a major concern. Let's report it, and
diagnose it. I have only seen dataloss on SoaS, but related to vfat /
overlay corruption (widely discussed).

cheers,




m
1 - The very early XS images _repartition & format your hard drive
soon after booting, without asking for permission_. I fixed it as soon
as I could move to a better installer. Still, it managed to eat
someone's HD. Shame on me.
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-21 Thread Tomeu Vizoso
On Fri, Aug 21, 2009 at 08:23, Martin Langhoff wrote:
> On Fri, Aug 21, 2009 at 12:13 AM, Jim Simmons wrote:
>> For what it's worth, I've experienced this running the Sugar test
>> environment in both Fedora 10 and 11.  It would not seem to be
>> specific to .82.  The way to reproduce the problem is to run the Sugar
>> environment in Xephyr (sugar-emulator).  Do some stuff, then instead
>> of shutting down cleanly click on the "close" button on the Xephyr
>> window.  That will close your Sugar session and practically guarantee
>> that the next time you open it your Sugar Journal will be empty.  I
>> regret to say that I've done this a few times.
>
> That's not very encouraging. I "switch off" my XO, running 8.2.x
> (which is 0.82.x of Sugar) all the time by removing the battery, which
> is a much harsher method than what you describe (which is merely a
> SIGTERM).
>
> Sugar 0.84 has seen a reimplementation of the Datastore (by Tomeu?)
> and I am sure he'll be interested in this. Avoiding dataloss and
> generally being atomic is always one of the most important aspects of
> the DS.
>
> I can't cast the first stone[1] but I do want to say: any
> Journal-related dataloss is a major concern. Let's report it, and
> diagnose it. I have only seen dataloss on SoaS, but related to vfat /
> overlay corruption (widely discussed).

The new DS has a design that is supposed to be very robust to any
partial data losses. That said, any bug in the implementation could
still cause total data loss. I have fixed all known such bugs, but
myself alone cannot reproduce all possible situations.

So please, when anyone finds that the contents of their journal
disappears in a release of Sugar posterior to 0.84, enter a bug _and_
attach the shell.log and datastore.log files.

http://wiki.sugarlabs.org/go/BugSquad/Get_Logs

Thanks,

Tomeu

> cheers,
>
>
>
>
> m
> 1 - The very early XS images _repartition & format your hard drive
> soon after booting, without asking for permission_. I fixed it as soon
> as I could move to a better installer. Still, it managed to eat
> someone's HD. Shame on me.
> --
>  martin.langh...@gmail.com
>  mar...@laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
> ___
> Sugar-devel mailing list
> Sugar-devel@lists.sugarlabs.org
> http://lists.sugarlabs.org/listinfo/sugar-devel
>



-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-21 Thread Sascha Silbe

On Thu, Aug 20, 2009 at 05:13:42PM -0500, Jim Simmons wrote:


The way to reproduce the problem is to run the Sugar
environment in Xephyr (sugar-emulator).  Do some stuff, then instead
of shutting down cleanly click on the "close" button on the Xephyr
window.
This happened a lot to me as well. The reason is that with an unclean 
shutdown (but no power cycle) some stuff, especially the data store, 
will keep running, but be inaccesible to the new session. So DBus starts 
a new data store process which can't lock the index...
It will only happen if you do an unclean restart of X; a reboot / power 
cycle isn't a problem.


CU Sascha

--
http://sascha.silbe.org/
http://www.infra-silbe.de/

signature.asc
Description: Digital signature
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-21 Thread Tomeu Vizoso
On Fri, Aug 21, 2009 at 11:08, Sascha
Silbe wrote:
> On Thu, Aug 20, 2009 at 05:13:42PM -0500, Jim Simmons wrote:
>
>> The way to reproduce the problem is to run the Sugar
>> environment in Xephyr (sugar-emulator).  Do some stuff, then instead
>> of shutting down cleanly click on the "close" button on the Xephyr
>> window.
>
> This happened a lot to me as well. The reason is that with an unclean
> shutdown (but no power cycle) some stuff, especially the data store, will
> keep running, but be inaccesible to the new session. So DBus starts a new
> data store process which can't lock the index...
> It will only happen if you do an unclean restart of X; a reboot / power
> cycle isn't a problem.

That's interesting information, the dbus session daemon should finish
when sugar-session ends, and sugar-session should be terminated by the
lost X connection. The DS and the other DBus services like the PS,
telepathy-*, etc should terminate when the dbus daemon dies.

Would be good to have a ticket that entices someone to dig deeper and
see why things aren't like they are supposed to be.

Thanks,

Tomeu

> CU Sascha
>
> --
> http://sascha.silbe.org/
> http://www.infra-silbe.de/
> -BEGIN PGP SIGNATURE-
> Version: GnuPG v1.4.9 (GNU/Linux)
>
> iQEcBAEBAgAGBQJKjmP2AAoJELpz82VMF3DaYsgH/j/LYhJ7lOWrVreA3zImkh8w
> +BAT94io4D1Y/S9QE63CF8FgBJhQvpOeY42lc72TKU19365qIrcJ5YEekBGVkHYj
> /bi7E3kCUJebZIYHdFhpjGA86UdKdPQLPjmtlP6kRSvc8Ce2BOMH23ZS6vrAItib
> f71BIndBg7HmC/0d9jblGAr6I9NNzfp+TBSucGZ/2YNjVOXTsIC0O1CgUIkXPVAN
> jV/vS06lBHQPddNa/Xj1ZmZ1+/IP9jlVkPyadWWY+ssDupTGs62YGhZuu31DAJl2
> zm8vgG6D4KxQsXTA+Qlxj+sx3K56uA9s0gAhttYfmgH/dqeGmPjTUMPuuN97jh0=
> =MK24
> -END PGP SIGNATURE-
>
> ___
> Sugar-devel mailing list
> Sugar-devel@lists.sugarlabs.org
> http://lists.sugarlabs.org/listinfo/sugar-devel
>
>



-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-21 Thread Gary C Martin
On 21 Aug 2009, at 07:23, Martin Langhoff wrote:

> On Fri, Aug 21, 2009 at 12:13 AM, Jim Simmons  
> wrote:
>> For what it's worth, I've experienced this running the Sugar test
>> environment in both Fedora 10 and 11.  It would not seem to be
>> specific to .82.  The way to reproduce the problem is to run the  
>> Sugar
>> environment in Xephyr (sugar-emulator).  Do some stuff, then instead
>> of shutting down cleanly click on the "close" button on the Xephyr
>> window.  That will close your Sugar session and practically guarantee
>> that the next time you open it your Sugar Journal will be empty.  I
>> regret to say that I've done this a few times.
>
> That's not very encouraging. I "switch off" my XO, running 8.2.x
> (which is 0.82.x of Sugar) all the time by removing the battery, which
> is a much harsher method than what you describe (which is merely a
> SIGTERM).

FWIW, I'm having to close Xephyr quite frequently (5+ times a day)  
here – with no data loss. Under F11 sugar-jhbuild and the switch to  
Metacity (0.85.x) I'm getting quite regular 'black Xephyrs of death',  
usually (I think) just after stopping an activity – this leads me to  
pull up the F11 task bar and use it to close Xephyr so I can start  
again.

Regards,
--Gary

___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-22 Thread Walter Bender
On Thu, Aug 20, 2009 at 6:13 PM, Jim Simmons wrote:
> Martin,
>
> For what it's worth, I've experienced this running the Sugar test
> environment in both Fedora 10 and 11.  It would not seem to be
> specific to .82.  The way to reproduce the problem is to run the Sugar
> environment in Xephyr (sugar-emulator).  Do some stuff, then instead
> of shutting down cleanly click on the "close" button on the Xephyr
> window.  That will close your Sugar session and practically guarantee
> that the next time you open it your Sugar Journal will be empty.  I
> regret to say that I've done this a few times.

I do this all the time and have never had a Journal corruption error
in Xephyr. Curious.

-walter

> James Simmons
>
>
>> Date: Thu, 20 Aug 2009 18:56:17 +0200
>> From: Martin Langhoff 
>> Subject: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1
>>        users in Uy
>> To: OLPC Devel ,        Sugar-dev
>>        
>> Message-ID:
>>        <46a038f90908200956h436459f3j9fa7b29bb111d...@mail.gmail.com>
>> Content-Type: text/plain; charset=ISO-8859-1
>>
>> On olpc-sur we're seeing reports of "my journal contents disappeared
>> after reboot". (Thread starts at
>> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )
>>
>> This reminds me a lot of the issues we saw earlier in development of
>> what became 8.2.x -- if for any reason Sugar doesn't like the Journal
>> contents or indexes, it moves the 'datastore' directory aside, and
>> starts anew.
>>
>> That's my hunch, and I am hoping to hear confirmation from the users
>> reporting the problem.
>>
>> Two pleas for help
>>
>>  - A sugar activity that does some data-recovery and freeing up the space.
>>
>> The good news is that we have a json file with the metadata for each
>> Journal entry, so we can either load it back into the Journal (mstone
>> had published a cli script that exports/imports to the Journal?) or
>> create Journal Entry Bundles and save them to a USB stick.
>>
>> I suspect the JEB path might help users more -- if the Journal is
>> choking on some specific metadata, a script that reloads it all in one
>> go will make it choke again. Having files as JEBs allows step-by-step
>> attempts.
>>
>>  - Diagnosis.
>>
>> If it's something we can pin down, it'll be great to have a fix! This
>> will surely involve getting in touch with teachers on olpc-sur.
>>
>> cheers,
>>
>>
>>
>> m
>> --
>>  martin.langh...@gmail.com
>>  mar...@laptop.org -- School Server Architect
>>  - ask interesting questions
>>  - don't get distracted with shiny stuff  - working code first
>>  - http://wiki.laptop.org/go/User:Martinlanghoff
> ___
> Sugar-devel mailing list
> Sugar-devel@lists.sugarlabs.org
> http://lists.sugarlabs.org/listinfo/sugar-devel
>



-- 
Walter Bender
Sugar Labs
http://www.sugarlabs.org
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1

2009-08-22 Thread Art Hunkins
Probably unrelated, but FWIW:

One fast way to completely get rid of the Journal (SoaS) is to delete the 
stick's boot directory! (I did it once by accident.)

Strangely enough, the stick will still boot fine on non-XO's; it just lacks 
a Journal.

OTOH, it won't boot at all on the XO-1 (whereas, otherwise it boots fine).

Then I figured I might be able to recoup the basic Journal - either to 
retrieve the activity, or to get the stick to boot on the XO - by copying 
the boot directory from another stick - but *no*.

Art Hunkins

- Original Message - 
From: "Walter Bender" 
To: "Jim Simmons" 
Cc: 
Sent: Saturday, August 22, 2009 9:13 AM
Subject: Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1


On Thu, Aug 20, 2009 at 6:13 PM, Jim Simmons wrote:
> Martin,
>
> For what it's worth, I've experienced this running the Sugar test
> environment in both Fedora 10 and 11. It would not seem to be
> specific to .82. The way to reproduce the problem is to run the Sugar
> environment in Xephyr (sugar-emulator). Do some stuff, then instead
> of shutting down cleanly click on the "close" button on the Xephyr
> window. That will close your Sugar session and practically guarantee
> that the next time you open it your Sugar Journal will be empty. I
> regret to say that I've done this a few times.

I do this all the time and have never had a Journal corruption error
in Xephyr. Curious.

-walter

> James Simmons
>
>
>> Date: Thu, 20 Aug 2009 18:56:17 +0200
>> From: Martin Langhoff 
>> Subject: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1
>> users in Uy
>> To: OLPC Devel , Sugar-dev
>> 
>> Message-ID:
>> <46a038f90908200956h436459f3j9fa7b29bb111d...@mail.gmail.com>
>> Content-Type: text/plain; charset=ISO-8859-1
>>
>> On olpc-sur we're seeing reports of "my journal contents disappeared
>> after reboot". (Thread starts at
>> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )
>>
>> This reminds me a lot of the issues we saw earlier in development of
>> what became 8.2.x -- if for any reason Sugar doesn't like the Journal
>> contents or indexes, it moves the 'datastore' directory aside, and
>> starts anew.
>>
>> That's my hunch, and I am hoping to hear confirmation from the users
>> reporting the problem.
>>
>> Two pleas for help
>>
>> - A sugar activity that does some data-recovery and freeing up the space.
>>
>> The good news is that we have a json file with the metadata for each
>> Journal entry, so we can either load it back into the Journal (mstone
>> had published a cli script that exports/imports to the Journal?) or
>> create Journal Entry Bundles and save them to a USB stick.
>>
>> I suspect the JEB path might help users more -- if the Journal is
>> choking on some specific metadata, a script that reloads it all in one
>> go will make it choke again. Having files as JEBs allows step-by-step
>> attempts.
>>
>> - Diagnosis.
>>
>> If it's something we can pin down, it'll be great to have a fix! This
>> will surely involve getting in touch with teachers on olpc-sur.
>>
>> cheers,
>>
>>
>>
>> m
>> --
>> martin.langh...@gmail.com
>> mar...@laptop.org -- School Server Architect
>> - ask interesting questions
>> - don't get distracted with shiny stuff - working code first
>> - http://wiki.laptop.org/go/User:Martinlanghoff
> ___
> Sugar-devel mailing list
> Sugar-devel@lists.sugarlabs.org
> http://lists.sugarlabs.org/listinfo/sugar-devel
>



-- 
Walter Bender
Sugar Labs
http://www.sugarlabs.org
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel 

___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


[Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-20 Thread Martin Langhoff
On olpc-sur we're seeing reports of "my journal contents disappeared
after reboot". (Thread starts at
http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )

This reminds me a lot of the issues we saw earlier in development of
what became 8.2.x -- if for any reason Sugar doesn't like the Journal
contents or indexes, it moves the 'datastore' directory aside, and
starts anew.

That's my hunch, and I am hoping to hear confirmation from the users
reporting the problem.

Two pleas for help

 - A sugar activity that does some data-recovery and freeing up the space.

The good news is that we have a json file with the metadata for each
Journal entry, so we can either load it back into the Journal (mstone
had published a cli script that exports/imports to the Journal?) or
create Journal Entry Bundles and save them to a USB stick.

I suspect the JEB path might help users more -- if the Journal is
choking on some specific metadata, a script that reloads it all in one
go will make it choke again. Having files as JEBs allows step-by-step
attempts.

 - Diagnosis.

If it's something we can pin down, it'll be great to have a fix! This
will surely involve getting in touch with teachers on olpc-sur.

cheers,



m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-20 Thread Tomeu Vizoso
On Thu, Aug 20, 2009 at 18:56, Martin Langhoff wrote:
> On olpc-sur we're seeing reports of "my journal contents disappeared
> after reboot". (Thread starts at
> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )
>
> This reminds me a lot of the issues we saw earlier in development of
> what became 8.2.x -- if for any reason Sugar doesn't like the Journal
> contents or indexes, it moves the 'datastore' directory aside, and
> starts anew.
>
> That's my hunch, and I am hoping to hear confirmation from the users
> reporting the problem.
>
> Two pleas for help
>
>  - A sugar activity that does some data-recovery and freeing up the space.
>
> The good news is that we have a json file with the metadata for each
> Journal entry, so we can either load it back into the Journal (mstone
> had published a cli script that exports/imports to the Journal?) or
> create Journal Entry Bundles and save them to a USB stick.
>
> I suspect the JEB path might help users more -- if the Journal is
> choking on some specific metadata, a script that reloads it all in one
> go will make it choke again. Having files as JEBs allows step-by-step
> attempts.

How do you envision the UI of the tool that will recover the JEBs?

There's a good chance that the xapian index got corrupted because the
machine lost power while the index on disk was on an inconsistent
state. If that's the case, just rebuilding the whole index should work
fine.

Would be nice to have more details about the failure so we can both
fix it and propose the best tool for the job. Maybe a ceibal jam
volunteer with some linux knowledge would like to work with us on
this?

Regards,

Tomeu

>  - Diagnosis.
>
> If it's something we can pin down, it'll be great to have a fix! This
> will surely involve getting in touch with teachers on olpc-sur.
>
> cheers,
>
>
>
> m
> --
>  martin.langh...@gmail.com
>  mar...@laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
> ___
> Sugar-devel mailing list
> Sugar-devel@lists.sugarlabs.org
> http://lists.sugarlabs.org/listinfo/sugar-devel
>



-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-20 Thread Martin Langhoff
On Thu, Aug 20, 2009 at 7:44 PM, Tomeu Vizoso wrote:
> How do you envision the UI of the tool that will recover the JEBs?

We don't need many words there :-) all we need is

 - A listing of Journals available (by date - the numbers after
'datastore' are epoch seconds IIRC) - if there's only one, skip this
stage...

 - A view that looks a bit like the Journal view... we could show an
'action' that is 'inject into journal' (icon of the activity, arrow,
journal icon?).

> There's a good chance that the xapian index got corrupted because the
> machine lost power while the index on disk was on an inconsistent
> state. If that's the case, just rebuilding the whole index should work
> fine.

In that case, another option is to not have a UI, and just "merge" all
the datastores present, in one go, no more questions asked. As part of
it, files are moved (instead of cp'd).

Not sure how to make the above atomic & safe.

> Would be nice to have more details about the failure so we can both
> fix it and propose the best tool for the job. Maybe a ceibal jam
> volunteer with some linux knowledge would like to work with us on
> this?

Many are reading olpc-sur -- I hope someone might help.

cheers,



m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-20 Thread Andrés Ambrois
On Thursday 20 August 2009 06:58:48 pm Martin Langhoff wrote:
> On Thu, Aug 20, 2009 at 7:44 PM, Tomeu Vizoso wrote:
> > Would be nice to have more details about the failure so we can both
> > fix it and propose the best tool for the job. Maybe a ceibal jam
> > volunteer with some linux knowledge would like to work with us on
> > this?
>
> Many are reading olpc-sur -- I hope someone might help.

I'll help in any way I can. Would an upload webservice similar to Sacha's for 
uploading the necessary datastore bits help? We can then distribute a simple 
cli tool that uploads the data.

Will collecting the indexes be enough or do you need the whole datastore?

> cheers,
>
>
>
> m

-- 
  -Andrés
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-20 Thread James Cameron
On Thu, Aug 20, 2009 at 06:56:17PM +0200, Martin Langhoff wrote:
> On olpc-sur we're seeing reports of "my journal contents disappeared
> after reboot". (Thread starts at
> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )

Hmm.  I've two XOs with 8.2.1 being used by children of a friend, and
one of them reported a loss event recently ... what do I look for in the
filesystem to check for evidence of this?

-- 
James Cameron
http://quozl.linux.org.au/
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-20 Thread Martin Langhoff
On Fri, Aug 21, 2009 at 1:40 AM, James Cameron wrote:
> On Thu, Aug 20, 2009 at 06:56:17PM +0200, Martin Langhoff wrote:
>> On olpc-sur we're seeing reports of "my journal contents disappeared
>> after reboot". (Thread starts at
>> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )
>
> Hmm.  I've two XOs with 8.2.1 being used by children of a friend, and
> one of them reported a loss event recently ... what do I look for in the
> filesystem to check for evidence of this?

Look in .sugar/default/ there will be a datastoreN directory.
That's the "lost" datastore...



m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-20 Thread Martin Langhoff
On Fri, Aug 21, 2009 at 12:24 AM, Andrés Ambrois wrote:
> I'll help in any way I can. Would an upload webservice similar to Sacha's for
> uploading the necessary datastore bits help? We can then distribute a simple
> cli tool that uploads the data.

Thanks! If Uy was using School Servers, this wouldn't be an issue. The
DS data would be there :-/

For end users, I don't think a network-based solution is best. A local
activity will be a lot more effective.

For debugging we might need to get some sample 'broken' datastores
(with due care for user privacy, etc...).

> Will collecting the indexes be enough or do you need the whole datastore?

For debugging, better to get the whole thing I guess.


m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Tomeu Vizoso
On Fri, Aug 21, 2009 at 07:46, Martin Langhoff wrote:
> On Fri, Aug 21, 2009 at 12:24 AM, Andrés Ambrois 
> wrote:
>> I'll help in any way I can. Would an upload webservice similar to Sacha's for
>> uploading the necessary datastore bits help? We can then distribute a simple
>> cli tool that uploads the data.
>
> Thanks! If Uy was using School Servers, this wouldn't be an issue. The
> DS data would be there :-/
>
> For end users, I don't think a network-based solution is best. A local
> activity will be a lot more effective.
>
> For debugging we might need to get some sample 'broken' datastores
> (with due care for user privacy, etc...).
>
>> Will collecting the indexes be enough or do you need the whole datastore?
>
> For debugging, better to get the whole thing I guess.

Yes, though if we manage to get the logs dir of a machine just after
the DS was "lost", there's lots of chances we are going to be able to
determine the exact cause for the loss. Then we would be in a better
position to find the best fix.

Regards,

Tomeu

-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Martin Langhoff
2009/8/21 Tomeu Vizoso :
> Yes, though if we manage to get the logs dir of a machine just after
> the DS was "lost", there's lots of chances we are going to be able to

The ones on the ramdisk? :-(



m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Tomeu Vizoso
2009/8/21 Martin Langhoff :
> 2009/8/21 Tomeu Vizoso :
>> Yes, though if we manage to get the logs dir of a machine just after
>> the DS was "lost", there's lots of chances we are going to be able to
>
> The ones on the ramdisk? :-(

Is ~/.sugar/default/logs in the ramdisk? This would be the first time
I hear about it.

If it's like that, maybe Andres could modify a XO to have that dir in
the nand and wait for it to happen?

Regards,

Tomeu

> m
> --
>  martin.langh...@gmail.com
>  mar...@laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
>



-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Andrés Ambrois
On Friday 21 August 2009 05:08:28 am Martin Langhoff wrote:
> 2009/8/21 Tomeu Vizoso :
> > Yes, though if we manage to get the logs dir of a machine just after
> > the DS was "lost", there's lots of chances we are going to be able to
>
> The ones on the ramdisk? :-(

  I think ~/.sugar/default/logs

  We're holding a jam the 29th, so maybe someone affected could attend, 
otherwise it'll be very hard for teachers to upload large files (ADSL broadband 
has a 128 Kbps uplink here in .uy). 

  I'll post a mail in Sur and olpc-uruguay, after I get some sleep. 
>
>
> m

-- 
  -Andrés
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Andrés Ambrois
On Friday 21 August 2009 05:10:21 am Tomeu Vizoso wrote:
> 2009/8/21 Martin Langhoff :
> > 2009/8/21 Tomeu Vizoso :
> >> Yes, though if we manage to get the logs dir of a machine just after
> >> the DS was "lost", there's lots of chances we are going to be able to
> >
> > The ones on the ramdisk? :-(
>
> Is ~/.sugar/default/logs in the ramdisk? This would be the first time
> I hear about it.
>
> If it's like that, maybe Andres could modify a XO to have that dir in
> the nand and wait for it to happen?

Remember that Ceibal XOs have root access locked-down. And I recently found 
out that since the key-delegation stuff was implemented, we can't request 
developer keys. Not from OLPC at least, and LATU is not providing that service 
that I know...

I'll also try to flash an XO with 8.2.1 and try to reproduce. 

> Regards,
>
> Tomeu
>
> > m
> > --
> >  martin.langh...@gmail.com
> >  mar...@laptop.org -- School Server Architect
> >  - ask interesting questions
> >  - don't get distracted with shiny stuff  - working code first
> >  - http://wiki.laptop.org/go/User:Martinlanghoff

-- 
  -Andrés
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Martin Langhoff
2009/8/21 Tomeu Vizoso :
> 2009/8/21 Martin Langhoff :
>> 2009/8/21 Tomeu Vizoso :
>>> Yes, though if we manage to get the logs dir of a machine just after
>>> the DS was "lost", there's lots of chances we are going to be able to
>>
>> The ones on the ramdisk? :-(
>
> Is ~/.sugar/default/logs in the ramdisk? This would be the first time
> I hear about it.

OLPC's builds (at least the F7/F9 based series) I understood they were
on a ramdisk. Looking at it now, however, it seems that they aren't --
but a restart of X clears it out anyway.

So by the time the problem is _seen_, *poof*, logs are gone.

Maybe something in the sugar startup clears it out. Dunno. Doesn't
seem to be olpc-configure.



m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Martin Dengler
On Fri, Aug 21, 2009 at 10:36:44AM +0200, Martin Langhoff wrote:
> 2009/8/21 Tomeu Vizoso :
> > 2009/8/21 Martin Langhoff :
> >> 2009/8/21 Tomeu Vizoso :
> >>> Yes, though if we manage to get the logs dir of a machine just after
> >>> the DS was "lost", there's lots of chances we are going to be able to
> >>
> >> The ones on the ramdisk? :-(
> >
> > Is ~/.sugar/default/logs in the ramdisk? This would be the first time
> > I hear about it.
> 
> OLPC's builds (at least the F7/F9 based series) I understood they were
> on a ramdisk. Looking at it now, however, it seems that they aren't --
> but a restart of X clears it out anyway.
> 
> So by the time the problem is _seen_, *poof*, logs are gone.

You sure that ~/.sugar/default/logs doesn't just get renamed to
~/.sugar/default/logs/N?  That's what happens now, but perhaps not
on older builds?

> m

Martin


pgp5v7mSfw7cF.pgp
Description: PGP signature
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Tomeu Vizoso
2009/8/21 Martin Langhoff :
> 2009/8/21 Tomeu Vizoso :
>> 2009/8/21 Martin Langhoff :
>>> 2009/8/21 Tomeu Vizoso :
 Yes, though if we manage to get the logs dir of a machine just after
 the DS was "lost", there's lots of chances we are going to be able to
>>>
>>> The ones on the ramdisk? :-(
>>
>> Is ~/.sugar/default/logs in the ramdisk? This would be the first time
>> I hear about it.
>
> OLPC's builds (at least the F7/F9 based series) I understood they were
> on a ramdisk. Looking at it now, however, it seems that they aren't --
> but a restart of X clears it out anyway.
>
> So by the time the problem is _seen_, *poof*, logs are gone.
>
> Maybe something in the sugar startup clears it out. Dunno. Doesn't
> seem to be olpc-configure.

Should be kept in a subdir in there, from:

http://git.sugarlabs.org/projects/sugar/repos/mainline/blobs/cbd0509f4bcfe1b3e52f8898b1cb83d522337ab8/src/logsmanager.py#line25

But anyway, the failure to read the DS would be in the first boot
where the journal appears empty, not in the boot where the DS index
got corrupted.

Regards,

Tomeu

>
>
> m
> --
>  martin.langh...@gmail.com
>  mar...@laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
>



-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Martin Langhoff
2009/8/21 Tomeu Vizoso :
> Should be kept in a subdir in there, from:

Damn! I wish I had known about this! So 3 old dirs are kept - excellent.

> But anyway, the failure to read the DS would be in the first boot
> where the journal appears empty, not in the boot where the DS index
> got corrupted.

Well, if you can get your hands on the 'corrupt' datastore dir,
doesn't it also contain the xapian DB? Just renaming it into
'datastore' should re-produce the error...

Maybe not?



m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-21 Thread Tomeu Vizoso
2009/8/21 Martin Langhoff :
> 2009/8/21 Tomeu Vizoso :
>> Should be kept in a subdir in there, from:
>
> Damn! I wish I had known about this! So 3 old dirs are kept - excellent.
>
>> But anyway, the failure to read the DS would be in the first boot
>> where the journal appears empty, not in the boot where the DS index
>> got corrupted.
>
> Well, if you can get your hands on the 'corrupt' datastore dir,
> doesn't it also contain the xapian DB? Just renaming it into
> 'datastore' should re-produce the error...
>
> Maybe not?

I think it should if we know the exact build image on which to place
it. So I would say whatever is easier for people to provide.

Regards,

Tomeu

>
>
> m
> --
>  martin.langh...@gmail.com
>  mar...@laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
>



-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-25 Thread Martin Langhoff
On Thu, Aug 20, 2009 at 6:56 PM, Martin
Langhoff wrote:
> On olpc-sur we're seeing reports of "my journal contents disappeared
> after reboot". (Thread starts at
> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )

So, we have a couple of users who could send a sample datastore to a
developer. I think Tomeu is interested in diagnosing and fixing the
problem within Sugar (great!) -- if so, do you want me to ask for a
zipfile of the corrupt datastore on your behalf?

It would also be very important for these Uy users to get a "rescue my
lost journal" activity. Anyone interested in helping?

cheers,



m
-- 
 martin.langh...@gmail.com
 mar...@laptop.org -- School Server Architect
 - ask interesting questions
 - don't get distracted with shiny stuff  - working code first
 - http://wiki.laptop.org/go/User:Martinlanghoff
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel


Re: [Sugar-devel] RFH - Journal corruption reports fom 8.2.1 users in Uy

2009-08-25 Thread Tomeu Vizoso
On Tue, Aug 25, 2009 at 17:42, Martin Langhoff wrote:
> On Thu, Aug 20, 2009 at 6:56 PM, Martin
> Langhoff wrote:
>> On olpc-sur we're seeing reports of "my journal contents disappeared
>> after reboot". (Thread starts at
>> http://lists.laptop.org/pipermail/olpc-sur/2009-August/004177.html )
>
> So, we have a couple of users who could send a sample datastore to a
> developer. I think Tomeu is interested in diagnosing and fixing the
> problem within Sugar (great!) -- if so, do you want me to ask for a
> zipfile of the corrupt datastore on your behalf?

If this is easier than getting logs, then yes.

> It would also be very important for these Uy users to get a "rescue my
> lost journal" activity. Anyone interested in helping?

I will be happy to answer any questions about this.

Regards,

Tomeu

> cheers,
>
>
>
> m
> --
>  martin.langh...@gmail.com
>  mar...@laptop.org -- School Server Architect
>  - ask interesting questions
>  - don't get distracted with shiny stuff  - working code first
>  - http://wiki.laptop.org/go/User:Martinlanghoff
> ___
> Sugar-devel mailing list
> Sugar-devel@lists.sugarlabs.org
> http://lists.sugarlabs.org/listinfo/sugar-devel
>



-- 
«Sugar Labs is anyone who participates in improving and using Sugar.
What Sugar Labs does is determined by the participants.» - David
Farning
___
Sugar-devel mailing list
Sugar-devel@lists.sugarlabs.org
http://lists.sugarlabs.org/listinfo/sugar-devel