Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2021-02-04 Thread Aurélien COUDERC
Dear Nicholas,

Le jeudi 4 février 2021, 04:23:36 CET Nicholas D Steeves a écrit :
> Norbert Preining  writes:
> 
> > Control: tag -1 +unreproducible
> >
> > Hi Nicholas,
> >
> > to be clear, tagging this bug as "unreproducible" means that **we** the
> > developer cannot reproduce it. So please don't change it. Thanks.
> >
> > Also, moreinfo is still needed, and you didn't provide it.
> >
> 
> I did. The minimum dataset needed to trigger was with an extensive set
> of exclusion patterns that result in 436767 indexed files, with the
> following results returned for balooctl indexSize
> 
> Actual Size: 3.27 GiB
> Expected Size: 2.12 GiB
> 
>PostingDB: 424.52 MiB19.516 %
>   PositionDB: 845.84 MiB38.884 %
> DocTerms: 344.65 MiB15.844 %
> DocFilenameTerms:  47.45 MiB 2.181 %
>DocXattrTerms:0 B 0.000 %
>   IdTree:   7.64 MiB 0.351 %
>   IdFileName:  35.77 MiB 1.645 %
>  DocTime:  22.29 MiB 1.024 %
>  DocData:  23.21 MiB 1.067 %
>ContentIndexingDB:0 B 0.000 %
>  FailedIdsDB:0 B 0.000 %
>  MTimeDB:   2.59 MiB 0.119 %

I beat you although by a small margin ;)

Baloo File Indexer is not running
Total files indexed: 438,883
Files waiting for content indexing: 0
Files failed to index: 0
Current size of index is 4.26 GiB

File Size: 4,26 Gio
Used:  2,17 Gio

   PostingDB: 584,37 Mio26.301 %
  PositionDB: 946,16 Mio42.585 %
DocTerms: 499,82 Mio22.496 %
DocFilenameTerms:  57,64 Mio 2.594 %
   DocXattrTerms:0 o 0.000 %
  IdTree:   9,84 Mio 0.443 %
  IdFileName:  45,09 Mio 2.029 %
 DocTime:  28,65 Mio 1.290 %
 DocData:  43,68 Mio 1.966 %
   ContentIndexingDB:0 o 0.000 %
 FailedIdsDB:0 o 0.000 %
 MTimeDB:   6,57 Mio 0.296 %

… and I can’t reproduce the bug either… :(
I’m using it for searches on a nearly daily basis and it’s been working 
reliably for as long as I can remember. Both with file names and file contents.

[…]

> >> Has anyone on the team tried testing baloo with a large $HOME?  Mine is
> >> 131GB, in 142460 mixed files and directories.  At the moment my
> >> baloo_file process is consuming over a terabyte (!) of memory.  After
> >> resetting baloo and forcing a full reindex it takes a couple of weeks
> >> before it gets out of control again and crashes.
> >
> > Well, maybe you should disable the service since it obviously is not
> > able to cope with the amount of data/files/size you are asking it to
> > index.
> >
> 
> Right, that's why I filed this bug.  With a non-toy dataset, baloo is
> more often than not unusable.  Typical lore says to disable baloo and
> use the more robust recoll.  I'm willing to conceded that it's possible
> that there's something special about my data/files, and that they're
> causing baloo to fail in a corner-case.  If that's the case, what data
> do you need?  Other than all my files ;-)

As said above, this has not been my experience on a definitely non-toy dataset.
Maybe there’s something more specific to the crash you experienced that would 
need investigating before it can be fixed.

I had an issue on a a secondary machine where it would crash / stop due to 
starvation of inotify watches. This was clearly visible in .xsession-errors 
with the advices to increase fs.inotify.max_user_watches plainy written, and 
baloo was not the only piece of software having issues in this situation. 
Probably not your case but I’m mentioning it here just in case.

> >> Please don't sweep baloo bugs under the rug by tagging bugs containing
> >> crashes with obvious misbehaviour as severity "normal", eg:
> >
> > Again, since it is not reproducible and nobody else has either reported
> > this problem and none of us see this problem, it is normal, would you
> > please kindly trust the decision of the maintainers?
> > And thanks, you don't have to educate me about severities.
> >
> 
> Important: "a bug which has a major effect on the usability of a
> package, without rendering it completely unusable to everyone."  It is
> clear that this bug meets Debian criteria for "important".  If no one
> else on the team (I'm a team member too) is testing baloo with a medium
> size set of diverse file types than no one else one the team will be
> able to reproduce it.
> 
> A politic of "normal + unreproducible" is a contributing factor to
> why baloo is never fixed.  In other words, it's a politic of ignoring
> problems in baloo.

Given that we failed to reproduce the issue so far, with little indication as 
to what could actually help reproducing it, this looks really overkill to me.


Happy hacking !
--
Aurélien



Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2021-02-03 Thread Sandro Knauß
Hey,

> The crash is not specific to jack_capture_90.mp3; except for a period of
> time a couple of weeks long leading up to the original bug.

But do you see any pattern?  I cannot reproduce it, too.
 
> Has anyone on the team tried testing baloo with a large $HOME?  

Yes I think myself.  Baloo indexed 1.387.086 files (mixed content) ; 650GB 
source file =>10GB baloo database. According your description of the bug, that 
is bigger than your dataset and be enough to reproduce. Why do you think  we 
do not use the software they maintain? Baloo is working great for me, I 
haven't seen issues for a long long time.

Okay what can you do to tackle down the bug:
* debug symbols for libKF5BalooEngine.so.5 and liblmdb.so.0, so that we know 
what line triggers the segfault. because the interesting part is without debug 
symbols.
* can you reproducible trigger the crash with jack_capture_90.mp3, or any 
other file?
* did you checked upstream bug tracker - to find any matching bug? As this is 
an upstream issue, it needs to be fixed by upstream anyways, so we need to 
clarify if the issue is still at a current version. Otherwise create a 
bugreport at bugs.kde.org, they should also know better what infos they need 
to fix the issue.
* start a Debian unstable VM/ KDE neon VM and try to reproduce the issue

> It could take up to a month to reproduce this bug, and by then
the reply will be "ignore for bullseye".  This is a contributing factor
to why baloo is in perpetual beta (I'd argue alpha, but subjectivity ;-))

Yes I know the timeframe small between the freeze and release to tackle down 
complicated issues. But please don't forget the other side. What should 
maintainers do if, they cannot reproduce the issue and upstream does not have 
stable LTS branches and doing monthly releases?  The first thing is checking 
the current version:  If it is fixed, than one can search for the bugfix to 
backport it or file the bug upstream otherwise. But without checking against 
the current version upstream reply most of the time: please update first.

So I cannot reproduce it nor Norbert. Mark the bug as "unreproducible" is a 
valid approach, as we cannot tell if it is fixed or not. With more information 
it may be possible to decide. That's why the question is so important: Can you 
reproduce it somehow, so we are able to see the same issue?

hefee



signature.asc
Description: This is a digitally signed message part.


Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2021-02-03 Thread Nicholas D Steeves
Hi Norbert,

Norbert Preining  writes:

> Control: tag -1 +unreproducible
>
> Hi Nicholas,
>
> to be clear, tagging this bug as "unreproducible" means that **we** the
> developer cannot reproduce it. So please don't change it. Thanks.
>
> Also, moreinfo is still needed, and you didn't provide it.
>

I did. The minimum dataset needed to trigger was with an extensive set
of exclusion patterns that result in 436767 indexed files, with the
following results returned for balooctl indexSize

Actual Size: 3.27 GiB
Expected Size: 2.12 GiB

   PostingDB: 424.52 MiB19.516 %
  PositionDB: 845.84 MiB38.884 %
DocTerms: 344.65 MiB15.844 %
DocFilenameTerms:  47.45 MiB 2.181 %
   DocXattrTerms:0 B 0.000 %
  IdTree:   7.64 MiB 0.351 %
  IdFileName:  35.77 MiB 1.645 %
 DocTime:  22.29 MiB 1.024 %
 DocData:  23.21 MiB 1.067 %
   ContentIndexingDB:0 B 0.000 %
 FailedIdsDB:0 B 0.000 %
 MTimeDB:   2.59 MiB 0.119 %

>> > Can you reproduce this with current frameworks 5.77 in unstable?
>
> Since you haven't answered this questions, that BTW is alread at 5.78,
> we cannot really do anything.
>
> You don't expect us to try to triage bugs that we cannot reproduce on a
> system that is not running the current version?
>

I will upgrade this system in mid to late February, and hope to be
pleasantly surprised that this bug was fixed upstream :-)  At that time,
how would you like me run baloo to generate a higher quality debugging
info?  It could take up to a month to reproduce this bug, and by then
the reply will be "ignore for bullseye".  This is a contributing factor
to why baloo is in perpetual beta (I'd argue alpha, but subjectivity ;-))

>> Has anyone on the team tried testing baloo with a large $HOME?  Mine is
>> 131GB, in 142460 mixed files and directories.  At the moment my
>> baloo_file process is consuming over a terabyte (!) of memory.  After
>> resetting baloo and forcing a full reindex it takes a couple of weeks
>> before it gets out of control again and crashes.
>
> Well, maybe you should disable the service since it obviously is not
> able to cope with the amount of data/files/size you are asking it to
> index.
>

Right, that's why I filed this bug.  With a non-toy dataset, baloo is
more often than not unusable.  Typical lore says to disable baloo and
use the more robust recoll.  I'm willing to conceded that it's possible
that there's something special about my data/files, and that they're
causing baloo to fail in a corner-case.  If that's the case, what data
do you need?  Other than all my files ;-)

>> Please don't sweep baloo bugs under the rug by tagging bugs containing
>> crashes with obvious misbehaviour as severity "normal", eg:
>
> Again, since it is not reproducible and nobody else has either reported
> this problem and none of us see this problem, it is normal, would you
> please kindly trust the decision of the maintainers?
> And thanks, you don't have to educate me about severities.
>

Important: "a bug which has a major effect on the usability of a
package, without rendering it completely unusable to everyone."  It is
clear that this bug meets Debian criteria for "important".  If no one
else on the team (I'm a team member too) is testing baloo with a medium
size set of diverse file types than no one else one the team will be
able to reproduce it.

A politic of "normal + unreproducible" is a contributing factor to
why baloo is never fixed.  In other words, it's a politic of ignoring
problems in baloo.

Cheers,
Nicholas


signature.asc
Description: PGP signature


Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2021-02-03 Thread Norbert Preining
Control: tag -1 +unreproducible

Hi Nicholas,

to be clear, tagging this bug as "unreproducible" means that **we** the
developer cannot reproduce it. So please don't change it. Thanks.

Also, moreinfo is still needed, and you didn't provide it.

> > Can you reproduce this with current frameworks 5.77 in unstable?

Since you haven't answered this questions, that BTW is alread at 5.78,
we cannot really do anything.

You don't expect us to try to triage bugs that we cannot reproduce on a
system that is not running the current version?

> Has anyone on the team tried testing baloo with a large $HOME?  Mine is
> 131GB, in 142460 mixed files and directories.  At the moment my
> baloo_file process is consuming over a terabyte (!) of memory.  After
> resetting baloo and forcing a full reindex it takes a couple of weeks
> before it gets out of control again and crashes.

Well, maybe you should disable the service since it obviously is not
able to cope with the amount of data/files/size you are asking it to
index.

> Please don't sweep baloo bugs under the rug by tagging bugs containing
> crashes with obvious misbehaviour as severity "normal", eg:

Again, since it is not reproducible and nobody else has either reported
this problem and none of us see this problem, it is normal, would you
please kindly trust the decision of the maintainers?
And thanks, you don't have to educate me about severities.

Best

Norbert

--
PREINING Norbert  https://www.preining.info
Fujitsu Research Labs  +  IFMGA Guide + TU Wien + TeX Live + Debian Dev
GPG: 0x860CDC13   fp: F7D8 A928 26E3 16A1 9FA0 ACF0 6CAC A448 860C DC13



Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2021-02-03 Thread Nicholas D Steeves
Control: tag -1 -unreproducible -moreinfo
Control: severity -1 important

Hi Norbert,

Norbert Preining  writes:

> Hi Nicholas,
>
> On Wed, 09 Oct 2019, Nicholas D Steeves wrote:
>> Version: 5.54.0-1
>> Severity: serious
>> Justification: poor experience will cause user to give up on baloo; worse 
>> than GNOME
>
> Can you reproduce this with current frameworks 5.77 in unstable?
>
> Does it always crash at the same file (jack_capture_90.mp3) or always in
> different files?
>

The crash is not specific to jack_capture_90.mp3; except for a period of
time a couple of weeks long leading up to the original bug.

Thank you for following up in this bug (finally someone did!)  It's
still too early in the freeze for me to upgrade this system to bullseye.

Has anyone on the team tried testing baloo with a large $HOME?  Mine is
131GB, in 142460 mixed files and directories.  At the moment my
baloo_file process is consuming over a terabyte (!) of memory.  After
resetting baloo and forcing a full reindex it takes a couple of weeks
before it gets out of control again and crashes.

Please don't sweep baloo bugs under the rug by tagging bugs containing
crashes with obvious misbehaviour as severity "normal", eg:

> id seems to have changed. Perhaps baloo was
> not running, and this file was deleted + re-created
> mdb.c:3124: Assertion 'pglast <= env->me_pglast' failed in
> mdb_freelist_save()

which is somewhere between important and serious
https://www.debian.org/Bugs/Developer#severities

Regards,
Nicholas


signature.asc
Description: PGP signature


Processed: Re: Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2021-01-11 Thread Debian Bug Tracking System
Processing commands for cont...@bugs.debian.org:

> severity 942078 normal
Bug #942078 [baloo-kf5] [high quality bug] baloo crash, can't recover; forced 
reindex does not fix; unusable via dolphin C-f
Severity set to 'normal' from 'serious'
> tags 942078 + moreinfo unreproducible
Bug #942078 [baloo-kf5] [high quality bug] baloo crash, can't recover; forced 
reindex does not fix; unusable via dolphin C-f
Added tag(s) moreinfo and unreproducible.
> thanks
Stopping processing here.

Please contact me if you need assistance.
-- 
942078: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=942078
Debian Bug Tracking System
Contact ow...@bugs.debian.org with problems



Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2021-01-11 Thread Norbert Preining
severity 942078 normal
tags 942078 + moreinfo unreproducible
thanks

Hi Nicholas,

I cannot reproduce this, nor anyone else in the team, and you haven't
given an update to whether this is reproducible or not on your side.

I thus downgrade this bug and tag it appropriately.

Thanks for your understanding

Norbert

On Wed, 23 Dec 2020, Norbert Preining wrote:
> Can you reproduce this with current frameworks 5.77 in unstable?
> 
> Does it always crash at the same file (jack_capture_90.mp3) or always in
> different files?

--
PREINING Norbert  https://www.preining.info
Accelia Inc. + IFMGA ProGuide + TU Wien + JAIST + TeX Live + Debian Dev
GPG: 0x860CDC13   fp: F7D8 A928 26E3 16A1 9FA0 ACF0 6CAC A448 860C DC13



Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2020-12-22 Thread Norbert Preining
Hi Nicholas,

On Wed, 09 Oct 2019, Nicholas D Steeves wrote:
> Version: 5.54.0-1
> Severity: serious
> Justification: poor experience will cause user to give up on baloo; worse 
> than GNOME

Can you reproduce this with current frameworks 5.77 in unstable?

Does it always crash at the same file (jack_capture_90.mp3) or always in
different files?

Thanks

Norbert

--
PREINING Norbert  https://www.preining.info
Accelia Inc. + IFMGA ProGuide + TU Wien + JAIST + TeX Live + Debian Dev
GPG: 0x860CDC13   fp: F7D8 A928 26E3 16A1 9FA0 ACF0 6CAC A448 860C DC13



Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2020-12-22 Thread Paul Gevers
Hi all,

On Wed, 09 Oct 2019 18:53:25 -0400 Nicholas D Steeves
 wrote:
> Recently baloo has been crashing whenever I log in.  Today I decided it
> was a persistent and serious enough problem that a serious bug was
> warranted.  I've attached the backtrace produced by drkonqi.  Here is
> some additional info that I hope will help quickly resolve this bug.

Any progress on this? The bullseye freeze is approaching. baloo-kf5 is a
key package so isn't autoremoved. Hence it's on the Release Team's radar.

Paul



OpenPGP_signature
Description: OpenPGP digital signature


Bug#942078: [high quality bug] baloo crash, can't recover; forced reindex does not fix; unusable via dolphin C-f

2019-10-09 Thread Nicholas D Steeves
Package: baloo-kf5
Version: 5.54.0-1
Severity: serious
Justification: poor experience will cause user to give up on baloo; worse than 
GNOME

Hi,

Recently baloo has been crashing whenever I log in.  Today I decided it
was a persistent and serious enough problem that a serious bug was
warranted.  I've attached the backtrace produced by drkonqi.  Here is
some additional info that I hope will help quickly resolve this bug.

$ balooctl start
QCoreApplication::arguments: Please instantiate the QApplication object first
QCoreApplication::applicationDirPath: Please instantiate the QApplication 
object first
This process needs a QCoreApplication instance in order to use KCrash
"/home/sten/jack_capture_90.mp3" id seems to have changed. Perhaps baloo was 
not running, and this file was deleted + re-created
mdb.c:3124: Assertion 'pglast <= env->me_pglast' failed in mdb_freelist_save()
KCrash: crashing... crashRecursionCounter = 2
KCrash: Application Name = baloo_file_extractor path = /usr/bin pid = 15575
KCrash: Arguments: /usr/bin/baloo_file_extractor 
KCrash: Attempting to start /usr/lib/x86_64-linux-gnu/libexec/drkonqi from 
kdeinit
QSocketNotifier: Invalid socket 9 and type 'Read', disabling...
QSocketNotifier: Invalid socket 18 and type 'Read', disabling...
QSocketNotifier: Invalid socket 10 and type 'Read', disabling...

$ balooctl clear /home/sten/jack_capture_90.mp3
Skipping: /home/sten/jack_capture_90.mp3 Reason: Not yet indexed
File(s) cleared

$ balooctl index /home/sten/jack_capture_90.mp3
Indexing /home/sten/jack_capture_90.mp3
mdb.c:3124: Assertion 'pglast <= env->me_pglast' failed in mdb_freelist_save()
Aborted

$ balooctl status
Baloo File Indexer is running
Indexer state: Indexing file content
Indexed 57061 / 57062 files
Current size of index is 703.08 MiB

$ balooctl  indexSize
Actual Size: 703.08 MiB
Expected Size: 456.43 MiB

   PostingDB: 111.95 MiB24.527 %
  PositionDB: 189.31 MiB41.476 %
DocTerms:  58.71 MiB12.862 %
DocFilenameTerms:   6.20 MiB 1.357 %
   DocXattrTerms:0 B 0.000 %
  IdTree: 996.00 KiB 0.213 %
  IdFileName:   4.65 MiB 1.019 %
 DocTime:   2.57 MiB 0.563 %
 DocData:   3.51 MiB 0.769 %
   ContentIndexingDB:   4.00 KiB 0.001 %
 FailedIdsDB:0 B 0.000 %
 MTimeDB:   1.20 MiB 0.263 %

...wait a while, then

$ balooctl stop

...wait a while.  Baloo fails to stop.

$ ps xa | grep /usr/bin/baloo_file
$ gdb -p $THAT_PID
$ kill $THAT_PID

Result: no backtrace

$ gdb /usr/bin/baloo_file

Starting program: /usr/bin/baloo_file 
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
[New Thread 0x72c7e700 (LWP 17185)]
[New Thread 0x7215b700 (LWP 17186)]
[Detaching after fork from child process 17187]
QCoreApplication::arguments: Please instantiate the QApplication object first
QCoreApplication::applicationDirPath: Please instantiate the QApplication 
object first
This process needs a QCoreApplication instance in order to use KCrash
"/home/sten/jack_capture_90.mp3" id seems to have changed. Perhaps baloo was 
not running, and this file was deleted + re-created
mdb.c:3124: Assertion 'pglast <= env->me_pglast' failed in mdb_freelist_save()
KCrash: crashing... crashRecursionCounter = 2
KCrash: Application Name = baloo_file_extractor path = /usr/bin pid = 17187
KCrash: Arguments: /usr/bin/baloo_file_extractor 
KCrash: Attempting to start /usr/lib/x86_64-linux-gnu/libexec/drkonqi from 
kdeinit
QSocketNotifier: Invalid socket 9 and type 'Read', disabling...
QSocketNotifier: Invalid socket 18 and type 'Read', disabling...
QSocketNotifier: Invalid socket 10 and type 'Read', disabling...

Result: CRASH.  Drkonqi provides a bt for baloo_file_extractor.  I've attached 
that one too.


Regards,
Nicholas

-- System Information:
Debian Release: 10.1
  APT prefers stable-debug
  APT policy: (500, 'stable-debug'), (500, 'proposed-updates'), (500, 'stable')
Architecture: amd64 (x86_64)
Foreign Architectures: i386

Kernel: Linux 4.19.72-rt25 (SMP w/4 CPU cores; PREEMPT)
Locale: LANG=en_CA.UTF-8, LC_CTYPE=en_CA.UTF-8 (charmap=UTF-8), 
LANGUAGE=en_CA:en (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Init: systemd (via /run/systemd/system)

Versions of packages baloo-kf5 depends on:
ii  kio  5.54.1-1
ii  libc62.28-10
ii  libkf5baloo5 5.54.0-1
ii  libkf5balooengine5   5.54.0-1
ii  libkf5configcore55.54.0-1+deb10u1
ii  libkf5coreaddons55.54.0-1
ii  libkf5crash5 5.54.0-1
ii  libkf5dbusaddons55.54.0-1
ii  libkf5filemetadata3  5.54.0-1
ii  libkf5i18n5  5.54.0-1
ii  libkf5idletime5  5.54.0-1
ii  libkf5kiocore5   5.54.1-1
ii  libkf5solid5 5.54.0-1
ii  libqt5core5a 5.11.3+dfsg1-1
ii  libqt5dbus5  5.11.3+dfsg1-1
ii  libqt5gui5