Script 'mail_helper' called by obssrc
Hello community,

here is the log from the commit of package tesseract-ocr for openSUSE:Factory checked in at 2023-01-12 22:44:26
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Comparing /work/SRC/openSUSE:Factory/tesseract-ocr (Old)
 and      /work/SRC/openSUSE:Factory/.tesseract-ocr.new.32243 (New)
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Package is "tesseract-ocr"

Thu Jan 12 22:44:26 2023 rev:11 rq:1057918 version:5.3.0

Changes:
--------
--- /work/SRC/openSUSE:Factory/tesseract-ocr/tesseract-ocr.changes 2020-03-27 21:57:54.958814340 +0100
+++ /work/SRC/openSUSE:Factory/.tesseract-ocr.new.32243/tesseract-ocr.changes 2023-01-12 22:44:30.740938746 +0100
@@ -1,0 +2,101 @@
+Thu Jan 5 09:00:32 UTC 2023 - Martin Pluskal <mplus...@suse.com>
+
+- Move unversioned libraries to main package
+
+-------------------------------------------------------------------
+Mon Jan 02 16:13:51 UTC 2023 - Markus Ebner <i...@ebner-markus.de>
+
+- Update to version 5.3.0:
+ * Fix memory issues in ScrollView::MessageReceiver
+ * autotools: Add rule for svpaint executable
+ * Replace call of exit function by return statement in main function
+ * Fix the build on CodeQL/Analyze by @arseniy-sonar in #3888
+ * CI: Remove Ubuntu 18.04
+ * configure.ac: fix build on aarch64_be
+ * SW CI: Add paths filter
+ * Create .mailmap
+ * Fix tesseract.pc from cmake to match autotools
+ * Update README.md
+ * Fixed 2 errors
+ * fix issue #3940 - remove colormap before thresholding
+ * Update upload-artifact action
+ * Update checkout action to version 3
+ * Fix Markdownlint
+ * Fix broken links in CONTRIBUTING.md
+ * pdfrenderer.cpp: Ignore non-text blocks
+ * lstm.train: allow .box from .raw.png too
+ * Fix a number of performance issues (reported by Coverity Scan)
+ * Fix training tools for legacy engine (issue #3925)
+ * Fix function tesseract::WriteFeature (issue #3925)
+ * Modernize function ObjectCache::DeleteUnusedObjects (fix issue with s…
+ * More fixes for issue #3925
+- Fixed packaging to include missing shared libs:
+ * libcommon_training.so
+ * libunicharset_training.so
+
+-------------------------------------------------------------------
+Fri Sep 16 11:29:04 UTC 2022 - Markus Ebner <i...@ebner-markus.de>
+
+- Switched to new CMake buildsystem
+- Update to version 5.2.0:
+ * Improvements and fixes for continuous integration, autoconf and cmake builds
+ * Set /Os for some 32 bit MS compilers
+ * Improve comments and other documentation
+ * Add initial support for Intel AVX512F
+ * Fix for very large PDF files on 32 bit hosts
+ * Fix NEON detection on FreeBSD
+ * Fix regression with UZN files
+ * Fix calling delete[] for memory allocated by malloc in C API
+ * Add an API function to init tesseract with traineddata from memory
+ * Replace direct access to Leptonica internal data structures by function calls and
+ support latest releases of Leptonica.
+ * Replace std::regex by std::string functions.
+ * Use compiled-in TESSDATA_PREFIX also on Windows
+ * Add new parameter 'invert_threshold', change the default threshold from 0.5 to 0.7
+ and mark parameter 'tessedit_do_invert' as deprecated
+
+- Update to version 5.1.0:
+ * Handle image and line regions in output formats ALTO, hOCR and text.
+ * New parameter curl_timeout for curl_easy_setop.
+ * Build fixes and improvements.
+ * Catch nullptr in PageIterator::Orientation to improve robustness.
+ * Remove unused code.
+
+- Update to version 5.0.1:
+ * Add SPDX-License-Identifier to public include files.
+ * Support redirections when running OCR on a URL.
+ * Lots of fixes and improvements for cmake builds.
+ * Distributions should use the autoconf build.
+ * Fix broken msys2 build with gcc 11.
+ * Fix parameter certainty_scale (was duplicated).
+ * Fix some compiler warnings and clean code.
+ * Correctly detect amd64 and i386 on FreeBSD.
+ * Add libarchive and libcurl in continuous integration actions.
+ * Update submodule googletest to release v1.11.0.
+
+- Update to version 5.0.0:
+ * Enable fast float32 LSTM by default
+ * Switch to NFC normalisation everywhere
+ * Remove banner message
+ * Disable music staff detection and removal
+ * Add new command line option --loglevel
+ * Fix regression for OCR with more than one model file
+ * Optimizations
+ * Improve training messages
+ * Add RowAttributes getter to PageIterator
+ * Limit BCER to interval [0,1]
+ * Improved build process
+ * Cleaned code
+
+- Update to version 4.1.3:
+ * Fix broken autoconf build
+
+- Update to version 4.1.2:
+ * Allow line images with larger width for training
+ * Bugfixes
+ * Build updates and fixes
+
+- Removed tesseract-ocr-no-cpudetection.patch
+ Obsolete with the use of CMake build system instead of Makefiles
+
+-------------------------------------------------------------------

Old:
----
  tesseract-ocr-4.1.1.tar.gz
  tesseract-ocr-no-cpudetection.patch

New:
----
  tesseract-5.3.0.tar.gz

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Other differences:
------------------
++++++ tesseract-ocr.spec ++++++
--- /var/tmp/diff_new_pack.oRtIBD/_old 2023-01-12 22:44:31.276941755 +0100
+++ /var/tmp/diff_new_pack.oRtIBD/_new 2023-01-12 22:44:31.280941777 +0100
@@ -1,7 +1,7 @@
 #
 # spec file for package tesseract-ocr
 #
-# Copyright (c) 2020 SUSE LLC
+# Copyright (c) 2023 SUSE LLC
 #
 # All modifications and additions to the file contributed by third parties
 # remain the property of their copyright owners, unless otherwise agreed
@@ -16,19 +16,18 @@
 #
-%define so_ver 4
+%define lname libtesseract-5_3_0
 Name: tesseract-ocr
-Version: 4.1.1
+Version: 5.3.0
 Release: 0
 Summary: Open Source OCR Engine
 License: Apache-2.0 AND GPL-2.0-or-later
 URL: https://github.com/tesseract-ocr/tesseract
-Source0: https://github.com/tesseract-ocr/tesseract/archive/%{version}.tar.gz#/%{name}-%{version}.tar.gz
-# PATCH-FIX-OPENSUSE -- boo#1159231
-Patch0: tesseract-ocr-no-cpudetection.patch
+Source0: https://github.com/tesseract-ocr/tesseract/archive/refs/tags/%{version}.tar.gz#/tesseract-%{version}.tar.gz
 BuildRequires: asciidoc
-BuildRequires: autoconf
-BuildRequires: automake
+BuildRequires: chrpath
+BuildRequires: cmake
+BuildRequires: curl-devel
 BuildRequires: doxygen
 BuildRequires: fdupes
 BuildRequires: gcc-c++
@@ -36,6 +35,7 @@
 BuildRequires: libxslt-tools
 BuildRequires: opencl-headers
 BuildRequires: pkgconfig >= 0.9.0
+BuildRequires: plantuml
 BuildRequires: pkgconfig(OpenCL)
 BuildRequires: pkgconfig(cairo)
 BuildRequires: pkgconfig(fontconfig)
@@ -55,7 +55,7 @@
 %package devel
 Summary: Tesseract Open Source OCR Engine Development files
-Requires: libtesseract%{so_ver} = %{version}
+Requires: %{lname} = %{version}
 Requires: pkgconfig(lept) >= 1.74
 Requires: pkgconfig(libarchive)
@@ -63,10 +63,10 @@
 This package contains development files for the Tesseract Open Source OCR Engine.
-%package -n libtesseract%{so_ver}
+%package -n %{lname}
 Summary: Open Source OCR Engine
-%description -n libtesseract%{so_ver}
+%description -n %{lname}
 A commercial quality OCR engine originally developed at HP between 1985 and 1995. In 1995, this engine was among the top 3 evaluated by UNLV. It was open-sourced by HP and UNLV in 2005. From 2007 it is developed by Google.
@@ -75,37 +75,38 @@
 %autosetup -n tesseract-%{version} -p1
 %build
-autoreconf -fiv
-%configure \
- --enable-opencl \
- --disable-static
-%make_build all training doc
+%cmake -DCMAKE_INSTALL_LIBDIR=%{_lib} -DTESSDATA_PREFIX=%{_datadir}/%{name}
+%cmake_build
-%install
-%make_install all training-install
-
-# Remove libtool config files
-rm -f %{buildroot}%{_libdir}/libtesseract.la
+chrpath --delete src/training/libpango_training.so
-# Manually install the devel docs in order to fix rpmlint warnings "files-duplicate" and "doc-file-dependency"
-mkdir -p %{buildroot}%{_defaultdocdir}/%{name}-devel
-cp -a doc/html/ %{buildroot}%{_defaultdocdir}/%{name}-devel/
-# Fix rpmlint warning "doc-file-dependency"
-rm -f %{buildroot}%{_defaultdocdir}/%{name}-devel/html/installdox
+# Manually build manfiles, cmake does not build them
+cd ../doc
+sh generate_manpages.sh
+ls -alh
-# Fix rpmlint warning "non-executable-in-bin"
-chmod 0755 %{buildroot}%{_bindir}/tesstrain_utils.sh
+%install
+%cmake_install
+install -D build/src/training/libpango_training.so \
+ %{buildroot}%{_libdir}/libpango_training.so
+mkdir -p %{buildroot}%{_mandir}/{man1,man5}/
+cp -a doc/*.1 %{buildroot}%{_mandir}/man1/
+cp -a doc/*.5 %{buildroot}%{_mandir}/man5/
+cp -a tessdata/pdf.ttf %{buildroot}/%{_datadir}/tessdata/
 # Fix rpmlint warning "files-duplicate"
 %fdupes -s %{buildroot}
-%post -n libtesseract%{so_ver} -p /sbin/ldconfig
-%postun -n libtesseract%{so_ver} -p /sbin/ldconfig
+%post -n %{lname} -p /sbin/ldconfig
+%postun -n %{lname} -p /sbin/ldconfig
 %files
 %doc AUTHORS ChangeLog README.md
 %license LICENSE
 %{_bindir}/*
+%{_libdir}/libcommon_training.so
+%{_libdir}/libpango_training.so
+%{_libdir}/libunicharset_training.so
 %dir %{_datadir}/tessdata
 %{_datadir}/tessdata/configs/
 %{_datadir}/tessdata/tessconfigs/
@@ -114,12 +115,13 @@
 %{_mandir}/man5/*.5%{?ext_man}
 %files devel
-%doc %{_defaultdocdir}/tesseract-ocr-devel/
-%{_includedir}/tesseract/
-%{_libdir}/libtesseract*.so
+%{_includedir}/tesseract
+%{_libdir}/libtesseract.so
+%{_libdir}/cmake/tesseract/
 %{_libdir}/pkgconfig/*.pc
-%files -n libtesseract%{so_ver}
-%{_libdir}/libtesseract.so.%{so_ver}*
+%files -n %{lname}
+%license LICENSE
+%{_libdir}/libtesseract.so.*
 %changelog

++++++ tesseract-ocr-4.1.1.tar.gz -> tesseract-5.3.0.tar.gz ++++++
++++ 312705 lines of diff (skipped)
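
Note on the spec diff: the library subpackage name is now the literal `%define lname libtesseract-5_3_0`, written next to `Version: 5.3.0`. The two values appear to follow a "dots become underscores" pattern, so a later version bump would presumably need both lines edited by hand. The Python sketch below shows one way to check that the two stay in sync. It is not part of the package or of any openSUSE tooling: the function names and the `SPEC_EXCERPT` string are hypothetical, and the naming rule is inferred only from the two values shown in this diff.

```python
import re


def expected_lname(version: str, base: str = "libtesseract") -> str:
    """Build the name implied by the diff: '5.3.0' -> 'libtesseract-5_3_0'.

    Assumption: dots in the version are replaced by underscores, as
    suggested by the single example in the spec file.
    """
    return f"{base}-{version.replace('.', '_')}"


def lname_matches_version(spec_text: str) -> bool:
    """Return True if '%define lname' agrees with the 'Version:' tag."""
    version = re.search(r"^Version:\s*(\S+)", spec_text, re.MULTILINE)
    lname = re.search(r"^%define\s+lname\s+(\S+)", spec_text, re.MULTILINE)
    if version is None or lname is None:
        # One of the two lines is missing, so nothing can be confirmed.
        return False
    return lname.group(1) == expected_lname(version.group(1))


# Hypothetical excerpt reproducing the two relevant lines of the new spec.
SPEC_EXCERPT = """\
%define lname libtesseract-5_3_0
Name:           tesseract-ocr
Version:        5.3.0
"""

if __name__ == "__main__":
    print(lname_matches_version(SPEC_EXCERPT))
```

A check like this could be run against the spec before submitting a version update; it only compares the two strings and does not inspect the built library.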