On Mon, Jan 4, 2021 at 11:32 AM Sreyan Chakravarty <sreya...@gmail.com> wrote:
>
> On Mon, Jan 4, 2021 at 10:14 PM Chris Murphy <li...@colorremedies.com> wrote:
> > transid errors like this indicate out of order writes due to drive
> > firmware not honoring file system write ordering and then getting a
> > badly timed crash/powerfail/shutdown.
>
> First of all thanks for your quick response.
>
> So would I be correct assuming that the problem is in my firmware ? Or
> is it too early to say anything like that ?

Too early. The usual cause of transid errors is a drive firmware bug
*and* an ill-timed shutdown. Since you didn't have an ill-timed
shutdown, a drive firmware bug is less likely, but it can't be ruled
out. In other words, I'm proposing there might be a software bug here
and we just need to figure it out. Bad memory usually shows up as bit
flips and doesn't result in damage like this, but we have to consider
whether a bit flip could have affected code.

It can also be a kernel bug - the storage stack has many layers, not
just Btrfs and dm-crypt. But no one wants to go blaming other people's
work without understanding the problem.

> Is my firmware so outdated that it can't handle BTRFS ?

No. It's a bit complicated.

Buggy drive firmware is common, but normally it doesn't matter, mainly
due to good luck. Several things have to go wrong together to cause a
problem: (a) a firmware bug exists, (b) the bug is triggered, and (c)
a crash/powerfail happens. If any one of those is not true, then it's
not a problem. There are also transient hardware defects, which can
act like a bug but are really just rotting the metadata or data. It's
not obvious, but it is possible to piece together what happened when
we have enough information.



> # btrfs-image -c9 -t4 /dev/mapper/dm_crypt /run/media/liveuser/Backup\
> Plus/btrfs_meta.img
>
> parent transid verify failed on 55640064 wanted 44146 found 44438
> parent transid verify failed on 55640064 wanted 44146 found 44438
> parent transid verify failed on 55640064 wanted 44146 found 44438
> Ignoring transid failure
> parent transid verify failed on 55902208 wanted 44170 found 44438
> Ignoring transid failure
> parent transid verify failed on 56410112 wanted 44170 found 44439
> Ignoring transid failure
> parent transid verify failed on 58621952 wanted 44170 found 44439
> Ignoring transid failure
> ERROR: child eb corrupted: parent bytenr=178081497088 item=246 parent
> level=1 child level=2
> ERROR: cannot go to next leaf -5
> ERROR: create failed: -5
>
> What do I do now ?

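Rats. Can you retry the same btrfs-image command with the -w option
added? It makes btrfs-image walk all the trees manually instead of
relying on the extent tree. In the meantime I'll report back to
upstream and see what they recommend next.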
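
While waiting, the transid messages above can be summarized with a small
script. This is only a sketch: the regular expression assumes the exact
message format shown in your output, and the names SAMPLE and
summarize_transid_errors are made up for illustration, not part of any
btrfs tool.

```python
import re

# Assumed message format, copied from the btrfs-image output quoted above.
TRANSID_RE = re.compile(
    r"parent transid verify failed on (\d+) wanted (\d+) found (\d+)"
)

# Sample input: the transid lines from the quoted output.
SAMPLE = """\
parent transid verify failed on 55640064 wanted 44146 found 44438
parent transid verify failed on 55640064 wanted 44146 found 44438
parent transid verify failed on 55640064 wanted 44146 found 44438
parent transid verify failed on 55902208 wanted 44170 found 44438
parent transid verify failed on 56410112 wanted 44170 found 44439
parent transid verify failed on 58621952 wanted 44170 found 44439
"""


def summarize_transid_errors(log_text):
    """Return one (bytenr, wanted, found, found - wanted) tuple per block.

    Repeated messages for the same bytenr are collapsed to the first one,
    and the order of first appearance is preserved.
    """
    seen = {}
    for match in TRANSID_RE.finditer(log_text):
        bytenr, wanted, found = (int(group) for group in match.groups())
        seen.setdefault(bytenr, (bytenr, wanted, found, found - wanted))
    return list(seen.values())


if __name__ == "__main__":
    for bytenr, wanted, found, gap in summarize_transid_errors(SAMPLE):
        print(f"block {bytenr}: wanted {wanted}, found {found}, gap {gap}")
```

For the lines you posted, this collapses to four distinct blocks, and in
each one the generation found is newer than the one wanted by a few
hundred transactions.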



> > I'm on irc.freenode.net as cmurf that's usually the easier way to get
> > help, on #fedora channel.
> >
>
> Do I need to have a bouncer ? I am in India, and I believe you are in
> the US, so when you are active, I am usually sleeping.

An alternative is Matrix. We have a Matrix-IRC bridge in #fedora and
I think the plan is to switch mainly to Matrix pretty soon. So if you
know about Matrix then you can join #fedora, but I don't know how to
explain it very well since I don't use Matrix yet. I think it keeps
the history for you, unlike IRC (I use a bouncer, so I will see your
messages later). I keep weird hours, so we might overlap at some point.




--
Chris Murphy