Note.. Step 5 above the last command should be "sudo make install-langs"
On Sunday, August 10, 2014 4:32:55 PM UTC-4, testing1234 wrote: > > I was building based on the guide at - [1] > https://code.google.com/p/tesseract-ocr/wiki/TesseractSvnInstallation > with no OpenCL > > *Update:* as I was able to fix this. I'll detail my whole process in case > other OSX users need it... I was able to get it to build on OSX 10.9.4 from > SVN and it is working with some warnings (detailed below). The only > difference I can think of is I started from scratch by removing everything > (ports and formulas) I had installed using either MacPorts or Homebrew, as > well as MacPorts and Homebrew themselves. (This was probably a terribly > inexperienced mistake having them both installed at the same time.) > > Everything built well and without errors this time (Note: I did have > warnings, but no errors.). > > I have tested Tesseract with TIFF (single and multiple pages) and it is > working well. It gives me the following error "Warning in pixReadMemTiff: > tiff page 25 not found" in which the page # is always the last page of the > file, but it doesn't seem to be a problem. > > PNG files do not seem to work (it outputs two identically named files: one > that can't be opened and one that only has the first page) > > PDF files provide the following error and I can't remember if Leptonica is > supposed to be able to input PDF files or not. > > > Error in fopenReadStream: file not found > Error in pixRead: image file not found: %PDF-1.2 > Image file %PDF-1.2 cannot be read! > Error during processing. > > > I can work on these if I find time, but since TIFF is working they aren't > a priority. > > *So here is the process that worked for me.* > > 1. Open Terminal > 2. Install, update, and verify Homebrew by entering the following one at a > time: > > ruby -e "$(curl -fsSL https://raw.github.com/Homebrew/homebrew/go/install > )" > brew update > brew doctor > > > 3. Make sure brew doctor comes back clean > > 4. Install the tesseract dependencies listed at [1] above again by > entering one at a time (Note: I did not need to install aclocal or > autoheader from Homebrew as they aren't formulas.). > > > brew install autoconf > brew install automake > brew install libtool > brew install leptonica --with-libtiff > > > 5 .Run the following command (still in Terminal entering one at a time) > (again based on the instructions in [1]): > > svn checkout http://tesseract-ocr.googlecode.com/svn/trunk/ tesseract-ocr > cd tesseract-ocr > ./autogen.sh > ./configure > make > sudo make install > sudo make install-pangs > > > 6. Assuming you don't get any failures or errors, you can then test using > the following commands in Terminal (the italics should be change to your > docs specific filenames and the filetype you want to output) (Note: > Tesseract defaults its output to .TXT files). > > tesseract *inputfilename*.tiff *outputfilename outputfiletype* > > > For example: "tesseract mytiff.tiff mysearchablepdf pdf" will make > "mytiff.tiff" a searchable pdf with the name "mysearchablepdf.pdf" and save > it into whatever location you run the tesseract command from. > > Hopefully this helps someone else and it may be useful to post it under a > different (more searchable post title). > > Best, > > Cory > > > > > On Sunday, August 10, 2014 12:23:04 PM UTC-4, zdenop wrote: >> >> How are you building tesseract? >> According issue tracker[1] there is problem only with OpenCL... >> >> [1] https://code.google.com/p/tesseract-ocr/issues/detail?id=1272 >> >> Zdenko >> >> >> On Sat, Aug 9, 2014 at 10:28 PM, testing1234 <[email protected]> wrote: >> >>> When compiling and running "make" I get the following error: >>> >>> scanutils.cpp:38:14: error: typedef redefinition with different types >>> ('long' vs '__darwin_off_t' >>> (aka 'long long')) >>> typedef long off_t; >>> ^ >>> /usr/include/sys/_types/_off_t.h:30:25: note: previous definition is here >>> typedef __darwin_off_t off_t; >>> ^ >>> 1 error generated. >>> make[2]: *** [scanutils.lo] Error 1 >>> make[1]: *** [install-recursive] Error 1 >>> make: *** [install-recursive] Error 1 >>> >>> >>> Can anyone help me resolve this? >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "tesseract-ocr" group. >>> To unsubscribe from this group and stop receiving emails from it, send >>> an email to [email protected]. >>> To post to this group, send email to [email protected]. >>> Visit this group at http://groups.google.com/group/tesseract-ocr. >>> To view this discussion on the web visit >>> https://groups.google.com/d/msgid/tesseract-ocr/4295af62-3fb5-412f-8d23-878707d33af7%40googlegroups.com >>> >>> <https://groups.google.com/d/msgid/tesseract-ocr/4295af62-3fb5-412f-8d23-878707d33af7%40googlegroups.com?utm_medium=email&utm_source=footer> >>> . >>> For more options, visit https://groups.google.com/d/optout. >>> >> >> -- You received this message because you are subscribed to the Google Groups "tesseract-ocr" group. To unsubscribe from this group and stop receiving emails from it, send an email to [email protected]. To post to this group, send email to [email protected]. Visit this group at http://groups.google.com/group/tesseract-ocr. To view this discussion on the web visit https://groups.google.com/d/msgid/tesseract-ocr/4fadf9d3-8039-4955-b9f3-a635254c8caa%40googlegroups.com. For more options, visit https://groups.google.com/d/optout.

