https://bugs.kde.org/show_bug.cgi?id=485979

            Bug ID: 485979
           Summary: Expose Yolo Resizing Parameter to Improve Face
                    Detection Accuracy
    Classification: Applications
           Product: digikam
           Version: 8.3.0
          Platform: unspecified
                OS: Unspecified
            Status: REPORTED
          Severity: wishlist
          Priority: NOR
         Component: Faces-Detection
          Assignee: digikam-bugs-n...@kde.org
          Reporter: kde-bugzi...@benjamin-portner.de
  Target Milestone: ---

SUMMARY

Face detection in large images can be improved by increasing the size that
images are resized to in the detection pipeline. Because a larger size trades
performance for accuracy, the parameter shouldn't be fixed. I propose exposing
it to the user for manual adjustment.


STEPS TO REPRODUCE
1. Load a large image (many pixels) into digiKam
2. Run face detection on it

OBSERVED RESULT

Not all faces are found

EXPECTED RESULT

All faces are found

SOFTWARE/OS VERSIONS
Windows: Windows 11 Pro
KDE Frameworks: 5.248.0
Qt: 6.6.1

ADDITIONAL INFORMATION

Image rescaling is an important part of the face detection pipeline because it
speeds up computation. Currently, the image size after resizing is hard-coded
in the source code (e.g.
https://invent.kde.org/graphics/digikam/-/blob/master/core/libs/facesengine/detection/opencv-dnn/dnnfacedetectoryolo.cpp#L42
for the Yolo model). This is problematic: if an image is very large initially
(many pixels), radical downscaling to the fixed size can shrink faces so much
that they can no longer be detected. Making this parameter user-adjustable
could greatly improve detection accuracy in such cases.
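
To illustrate the effect, here is a rough back-of-the-envelope sketch (not
digiKam code). The function name, the default target size of 416 and the
example image dimensions are assumptions chosen for illustration only; the
actual hard-coded value and resizing strategy are in the source file linked
above.

```python
def scaled_face_size(image_w, image_h, face_px, target=416):
    """Approximate face width in pixels after downscaling the image so that
    its longer side equals `target` (aspect ratio preserved).

    `target=416` is a hypothetical default, not necessarily digiKam's value.
    """
    # Never upscale: images already smaller than the target keep their size.
    scale = min(1.0, target / max(image_w, image_h))
    return face_px * scale


# Hypothetical 6000x4000 photo containing a face that is 120 px wide.
small = scaled_face_size(6000, 4000, 120)               # assumed fixed size
large = scaled_face_size(6000, 4000, 120, target=1248)  # user-chosen size
print(f"face width at target 416:  {small:.1f} px")
print(f"face width at target 1248: {large:.1f} px")
```

Under these assumptions the face shrinks to roughly 8 px at the smaller size
but stays at about 25 px at the larger one, which is why an adjustable size
could help with very large images, at the cost of more computation.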
