29 Commits

Author SHA1 Message Date
Wang Mingyu
9742869636 tesseract: upgrade 5.5.1 -> 5.5.2
Changelog:
=============
- Simplify code for osdetect
- Fix and improve configuration for cmake builds
- Modernize some for loops and fix some signed/unsigned issues
- Cmake optimization with warp2
- Update checkout action
- chore: fix cyrillic typo
- Move version info. to appropriate ALTO element
- CI: Remove unneeded export statements from cmake workflow for macOS
- Bump actions/checkout from 5 to 6
- Bump github/codeql-action from 2 to 4
- Bump actions/upload-artifact from 4 to 5
- Bump mikepenz/action-junit-report from 4 to 6
- Bump actions/upload-artifact from 5 to 6

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2025-12-31 08:28:49 -08:00
Yoann Congal
ae0d7f632a leptonica,tesseract: add native variant
Since commit d6736b37c3f4 ("python-pytesseract: Update runtime
dependencies"), pytesseract rdepends on tesseract and has a native
variant. So, tesseract and its dependency leptonica also need the native
variant.

Fixes these warnings from AB[0]:
WARNING: Nothing RPROVIDES 'tesseract-native' (but virtual:native:/srv/pokybuild/yocto-worker/meta-oe-mirror/build/meta-openembedded/meta-python/recipes-extended/python-pytesseract/pytesseract_0.3.13.bb RDEPENDS on or otherwise requires it)
[0]: https://autobuilder.yoctoproject.org/valkyrie/#/builders/82/builds/377/steps/12/logs/warnings

Signed-off-by: Yoann Congal <yoann.congal@smile.fr>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2025-07-26 09:56:18 -07:00
Alexander Kanavin
fc78d37ff0 meta-openembedded/all: adapt to UNPACKDIR changes
Please see
https://git.yoctoproject.org/poky/commit/?id=4dd321f8b83afecd962393101b2a6861275b5265
for what changes are needed, and sed commands that can be used to make them en masse.

I've verified that bitbake -c patch world works with these, but did not run a world
build; the majority of recipes shouldn't need further fixups, but if there are
some that still fall out, they can be fixed in followups.

Signed-off-by: Alexander Kanavin <alex@linutronix.de>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2025-06-25 06:44:52 -07:00
Wang Mingyu
8782f1012d tesseract: upgrade 5.5.0 -> 5.5.1
Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2025-06-09 21:24:17 -07:00
Wang Mingyu
1c28d5d850 tesseract: upgrade 5.4.1 -> 5.5.0
Changelog:
============
* Set hOCR capabilities ocrp_dir and ocrp_lang unconditionally.
* Calculate row bounding box in single-word mode per (issue #4304).
* Reduce clock syscalls (#4303).
* Several small performance and other code fixes.
* Modernized code.
* Print time for tessedit_timing_debug in milliseconds.
* Print time for ErrorCounter::ComputeErrorRate in milliseconds.
* cmake: Correctly set the soversion based on SemVer properties.
* Do not export PDBs for static libraries (issue #4279).
* Several other small fixes and improvements for builds and CI.
* Modernize code for renderers and remove filename conversion for Windows (#4330).
* Add build rule for Windows installer.
* Support symbolic values for --oem and --psm options.
* Remove Tensorflow support.
* Add RISC-V V support (#4346).
* Remove broken GitHub action msys2-4.1.1.

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2024-11-19 16:59:15 -08:00
Wang Mingyu
848997b32a tesseract: upgrade 5.3.4 -> 5.4.1
Changelog:
============
* Avoid FP overflow in NormEvidenceOf
* Small build fixes and code improvements
* Fix setup of datadir on installations with Conda
* Fix FP exception in Wordrec::angle_change
* Build fixes, code refactoring and other smaller changes.
* Fix grey result of indexed PNG in pdfrenderer.
* Rename frk -> deu_latf (ISO 639-3, ISO 15924).
* Remove broken Dockerfile.
* Fixes for several issues reported by Coverity Scan.
* Remove unsupported OpenCL code and related API functions
* Facilitate vectorization for generic build
* Add PAGE XML renderer / export
* Support training without lstmf files.
* Improve CCUtil::main_setup (fixes issue #4230 related to Conda).
* Allow for text angle/gradient to be retrieved

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2024-06-27 09:19:05 -07:00
Wang Mingyu
cb0c748ebe tesseract: upgrade 5.3.3 -> 5.3.4
Changelog:
============
* Fixes for scrollview
* Fixes for autoconf, clang and sw builds
* Improve OCR for an image URL
  * Fail on curl download errors
  * New parameter curl_cookiefile
  * Set User-Agent: header field in HTTP request for curl downloads
* Output directory list from "combine_tessdata -d" to stdout
* Other small improvements for code and documentation.

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2024-01-23 11:55:39 -08:00
Wang Mingyu
f17c7b0b7e tesseract: upgrade 5.3.2 -> 5.3.3
Changelog:
===========
* Small code fixes and improvements to fix Coverity Scan issues.
* Disable -mfpu=neon for aarch64.
* Fix build without git clone in cloned directory (required for FreeBSD).
* Other build fixes for autotools, cmake and sw.
* Fix regression in layout detection which was introduced in release 5.0.0.
* Fix regression which prevented loading of submodels, introduced in release 5.0.0-rc2.
* Other small improvements for code and documentation.

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2023-10-23 17:00:49 -07:00
Wang Mingyu
bb50c6e1f0 tesseract: upgrade 5.3.1 -> 5.3.2
Changelog:
==========
* Updates for snap package building.
* Support for Sgaw and W Pwo Karen languages in the Myanmar validator (#4065).
* Improve format of logging from lstmtraining.
* Use fewer digits in filenames of checkpoints written by lstmtraining.
* Replace deprecated sprintf.
* Remove unused code in function fix_rep_char.
* Avoid 32 bit overflow in multiplication (fixes 3 CodeQL CI alerts).
* Avoid conversions from std::string to char* to std::string.
* Abort with error message if OSD is requested with LSTM-only model.
* cmake: allow to disable tiff (-DDISABLE_TIFF=ON).
* cmake: provide info about disabled LibArchive and CURL.
* cmake: check if leptonica was built with tiff support.
* Remove old broken GitHub action vcpkg-4.1.1 (fixes issue #4078).
* Create config.yml.
* Fix typos.

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2023-07-20 19:08:57 -07:00
Wang Mingyu
ff8375af85 tesseract: upgrade 5.3.0 -> 5.3.1
0001-Fix-build-with-gcc-13-by-including-cstdint.patch
removed since it's included in 5.3.1

Changelog:
==========
* Update README.md by @seupedro in #3992
* Fix FP division by zero (issue #3995)
* Fix linkage of icu and pango
* Fix build with gcc 13 by including <cstdint>
* msvc debug: fix wrong lib name in generated pkgconfig file
* Fix libdir in tesseract.pc from CMake
* Replace 'can not' by 'cannot'
* Readme: Link to list of supported languages
* Improve the DebugDump output by slightly adjusting the format.
* Fix issue #4010

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2023-04-10 09:39:38 -07:00
Khem Raj
6cdb98eed2 tesseract: Fix build with gcc13
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2023-02-01 09:11:49 -08:00
Wang Mingyu
72735574e2 tesseract: upgrade 5.2.0 -> 5.3.0
Changelog:
=========
 * Fix the training tools for the legacy OCR engine (fix issue #3925).
 * PDF renderer: Ignore non-text blocks (fix issue #3957).
 * Remove colormap before thresholding (fix issue #3940).
 * Fix a number of performance issues reported by Coverity Scan.
 * Training tools: Replace call of exit function by return statement in main function.
 * Fix double free in function vigorous_noise_removal (fix issue #3876).
 * Create to_win if needed in Textord::make_spline_rows (fix issue #3875).
 * Bug fixes for ScrollView viewer:
   * Fix memory issues in ScrollView::MessageReceiver.
   * Catch potential nullptr in SVNetwork::SVNetwork.
   * Move svpaint.cpp from src/viewer to src/.
   * Add rule for svpaint executable in Autotools.
 * Bug fixes and improvements for build tools:
   * Fix AMD64 detection with autobuild on FreeBSD (fix issue #3964).
   * Fix tesseract.pc generated from CMake to match Autotools.
   * Detect availability of AVX512-VNNI.
   * configure.ac: fix build on aarch64_be.
 * Drop CI for old versions of macOS and Ubuntu.

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2022-12-27 16:03:06 -08:00
Wang Mingyu
1e5473508a tesseract: upgrade 4.1.3 -> 5.2.0
0001-include-sys-time.h.patch
removed since it's not available in new version.

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2022-10-18 08:23:44 -07:00
Martin Jansa
c3d3cfcfe3 tesseract-lang: switch from master branch to main
Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2022-06-30 13:39:28 -04:00
Martin Jansa
856902b8c0 Fix DeprecationWarning about regexps
* fixes:
meta-oe/meta-networking/recipes-support/strongswan/strongswan_5.9.4.bb:125: DeprecationWarning: invalid escape sequence \.
meta-oe/meta-networking/recipes-support/strongswan/strongswan_5.9.4.bb:126: DeprecationWarning: invalid escape sequence \.
meta-oe/meta-networking/recipes-support/strongswan/strongswan_5.9.4.bb:128: DeprecationWarning: invalid escape sequence \.
meta-oe/meta-networking/recipes-support/strongswan/strongswan_5.9.4.bb:129: DeprecationWarning: invalid escape sequence \.
meta-oe/meta-networking/recipes-support/strongswan/strongswan_5.9.4.bb:130: DeprecationWarning: invalid escape sequence \.
meta-oe/meta-oe/recipes-graphics/ttf-fonts/ttf-mplus_027.bb:18: DeprecationWarning: invalid escape sequence \.
meta-oe/meta-oe/recipes-multimedia/libcdio/libcdio-paranoia_10.2+2.0.1.bb:21: DeprecationWarning: invalid escape sequence \.
meta-oe/meta-oe/recipes-multimedia/libcdio/libcdio_2.1.0.bb:28: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1342: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1343: DeprecationWarning: invalid escape sequence \-
oe-core/meta/classes/package.bbclass:1343: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1344: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1345: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1348: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1350: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1353: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1355: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1358: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1360: DeprecationWarning: invalid escape sequence \.
oe-core/meta/classes/package.bbclass:1365: DeprecationWarning: invalid escape sequence \.
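
The warnings above come from backslash sequences such as \. written inside
ordinary Python string literals. The commit message does not say how each
occurrence was fixed; the usual remedies are a raw-string prefix or a doubled
backslash. The sketch below is only an illustration under that assumption.
The helper name escape_warnings and the sample pattern are hypothetical and
not taken from any recipe. Newer Python versions report the same problem as a
SyntaxWarning rather than a DeprecationWarning.

```python
import warnings


def escape_warnings(source: str) -> list:
    """Compile `source` and return the categories of the warnings it triggers."""
    with warnings.catch_warnings(record=True) as caught:
        # Record every warning instead of letting the default filters hide it.
        warnings.simplefilter("always")
        compile(source, "<recipe-snippet>", "exec")
    return [w.category for w in caught]


# Hypothetical recipe line: '\.' inside a plain string literal.
before = "pattern = '^lib.*\\.so$'"
# Same line with a raw-string prefix; the resulting string value is unchanged.
after = "pattern = r'^lib.*\\.so$'"

print(escape_warnings(before))  # one warning category for the invalid escape
print(escape_warnings(after))   # no warnings
```

Because '\.' and r'\.' evaluate to the same two characters, a fix of this kind
silences the warning without changing what the regular expression matches.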

Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2022-02-21 18:12:04 -08:00
wangmy
4640900864 tesseract: upgrade 4.1.1 -> 4.1.3
Fix broken autoconf build
(https://github.com/tesseract-ocr/tesseract/issues/3642)

Signed-off-by: Wang Mingyu <wangmy@fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2021-11-22 10:15:00 -08:00
Richard Purdie
b402a3076f recipes: Update SRC_URI branch and protocols
This patch updates SRC_URIs using git to include branch=master if no branch is set
and also to use protocol=https for github urls as generated by the conversion script
in OE-Core.
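
As a rough illustration of the rule described above, the following sketch
rewrites a single SRC_URI entry. It is not the OE-Core conversion script: the
function name update_git_src_uri is hypothetical, and the handling is
deliberately simplified (it only looks at git:// entries and leaves entries
that set nobranch untouched with respect to the branch parameter).

```python
def update_git_src_uri(uri: str) -> str:
    """Add branch=master when no branch is set; use protocol=https for GitHub."""
    url, *params = uri.split(";")
    if not url.startswith("git://"):
        return uri  # only git fetcher entries are of interest here

    keys = {p.split("=", 1)[0] for p in params}
    # Assumption: entries using nobranch=1 should not get a branch added.
    if "branch" not in keys and "nobranch" not in keys:
        params.append("branch=master")

    if "github.com" in url:
        # Replace any existing protocol parameter with https.
        params = [p for p in params if not p.startswith("protocol=")]
        params.append("protocol=https")

    return ";".join([url, *params])


print(update_git_src_uri("git://github.com/tesseract-ocr/tesseract.git"))
print(update_git_src_uri("git://example.org/project.git;branch=main"))
```

The first call gains both parameters; the second already names a branch and is
not hosted on GitHub, so it is returned unchanged.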

Signed-off-by: Richard Purdie <richard.purdie@linuxfoundation.org>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2021-11-03 06:57:49 -07:00
Martin Jansa
c61dc077bb Convert to new override syntax
This is the result of running the automated conversion script (0.9.1):

oe-core/scripts/contrib/convert-overrides.py .

converting the metadata to use ":" as the override character instead of "_".

Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
2021-08-03 10:21:25 -07:00
Wang Mingyu
f8c29cd3be tesseract-lang: upgrade 4.0.0 -> 4.1.0
There is no file COPYING in version 4.1.0, so the file LICENSE is used instead.

Signed-off-by: Wang Mingyu <wangmy@cn.fujitsu.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2021-02-28 10:20:43 -08:00
Oleksandr Kravchuk
d916d59703 tesseract: update to 4.1.1
License-Update: file that mentioned licencing information was replaced
with an actual license file

Add patch to fix build with musl

Signed-off-by: Oleksandr Kravchuk <open.source@oleksandr-kravchuk.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2020-09-15 18:41:23 -07:00
Oleksandr Kravchuk
fedf039ff8 tesseract-lang: update to 4.0.0
Signed-off-by: Oleksandr Kravchuk <open.source@oleksandr-kravchuk.com>
Signed-off-by: Khem Raj <raj.khem@gmail.com>
2019-09-25 17:58:46 -07:00
Khem Raj
788b13604b tesseract: Upgrade to 3.05.01
LIC_FILES_CHKSUM changes are typo corrections in COPYING file
5a3ccf06bf (diff-7116ef0705885343c9e1b2171a06be0e)

Signed-off-by: Khem Raj <raj.khem@gmail.com>
Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
2017-09-13 11:13:21 +02:00
Joshua Lock
efd3696e70 remove True option to getVar calls
getVar() now expands by default, so remove the True
option from getVar() calls with a regex search and replace.

Search made with the following regex: getVar ?\(( ?[^,()]*), True\)
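
The search pattern above can be tried out in Python as follows. The pattern is
the one quoted in the message; the replacement string and the helper name
drop_true are assumptions for illustration, since the message does not record
the exact replacement that was used.

```python
import re

# Pattern quoted in the commit message.
GETVAR_TRUE = re.compile(r"getVar ?\(( ?[^,()]*), True\)")


def drop_true(source: str) -> str:
    """Rewrite getVar(X, True) as getVar(X); other calls are left alone."""
    # Assumed replacement: keep the captured first argument only.
    return GETVAR_TRUE.sub(r"getVar(\1)", source)


sample = "s = d.getVar('S', True)\nw = d.getVar('WORKDIR', False)\n"
converted = drop_true(sample)
print(converted)
```

Only the call with True is rewritten. Because the captured group excludes
commas and parentheses, calls whose first argument is itself a function call
are not matched and would need to be handled by hand.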

Signed-off-by: Joshua Lock <joshua.g.lock@intel.com>
2016-12-02 09:16:17 +01:00
Juhee Cho
37967410b3 tesseract: upgrade to 3.04
* use do_split_packages instead of many separate recipes
* fetch from github

Signed-off-by: Juhee Cho <juhee.cho@lge.com>
Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
2016-10-21 18:20:43 +02:00
Martin Jansa
ac21b4de8b tesseract: Blacklist
Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
2016-02-25 17:47:24 +01:00
Yi Zhao
41fe0f663e tesseract-lang.inc: set ${S} to fix the QA warning
${S} is not set in the recipe, which causes the QA warning:
WARNING: tesseract-lang-eng: the directory ${WORKDIR}/${BP}
(/buildarea2/build/tmp/work/all-poky-linux/tesseract-lang-eng/3.02-r0/tesseract-lang-eng-3.02)
pointed to by the S variable doesn't exist - please set S within the recipe to point to where the source has been unpacked to

Signed-off-by: Yi Zhao <yi.zhao@windriver.com>
2015-06-17 22:36:12 +02:00
Martin Jansa
b3f0a6c6ec tesseract-lang: Drop runtime dependency on TUNE_PKGARCH tesseract
* in order to keep allarch

Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
2014-06-21 19:22:25 +02:00
Mario Domenech Goulart
fccc8f1514 tesseract-lang: add recipes for all languages available
One recipe per language, all including the common tesseract-lang.inc
file.

Signed-off-by: Mario Domenech Goulart <mario@ossystems.com.br>
Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
2014-06-10 12:16:23 +02:00
Mario Domenech Goulart
9d655b7b72 tesseract: add recipe for version 3.02.02
Signed-off-by: Mario Domenech Goulart <mario@ossystems.com.br>
Signed-off-by: Martin Jansa <Martin.Jansa@gmail.com>
2014-06-10 12:16:23 +02:00