Hi,
I found tesseract4.0 crashed when I use Intel i7-8700 CPU with Ubuntu 18.04.
tesseract4.0 is a OCR open source. ( https://github.com/tesseract-ocr/tesseract GitHub - tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository) )
I used pyocr for wrapping lib for tesseract.
Here is the way I use, http://www.riptutorial.com/python/example/28811/pyocr PyOCR | python Tutorial
When I call below function several times, it is OK.
txt = tool.image_to_string(
lang=lang,
builder=pyocr.builders.TextBuilder()
)
But, when I call it over one hundred times, crash happens and PC is reboot.
By the way,
when I use i5-8400 CPU, sometimes error message is shown, but no crash happens.
And, one more thing,
Tesseract uses AVX and SSE, and when I disable them and re-test it, everything is fine. Never crash happens. No error message is shown.
But, you know, running speed is not good.
I think i7-8700 CPU has something with AVX and SSE function which is different from those of i5-8400.
Is there anybody who has a similar experience to me?
Is there any solution I can use tesseract with i7-8700K CPU turning on AVX and SSE.
Link Copied
Hello DrPRO,
Thank you for joining the Intel Community Support.
I understand that Tesseract 4.0 crashes in a system built with an Intel® Core™ i7-8700K Processor in Ubuntu 18.04. I would like to provide further recommendations.
To address this issue correctly, we recommend you to visit these websites as you may need additional documentation to fix this issue.
https://askubuntu.com/
https://ubuntuforums.org/
You can also contact the developers of this open source engine.
I hope this helps.
Wanner G.
For more complete information about compiler optimizations, see our Optimization Notice.