Re: Version 3.02 in alpha

2012-02-03 Thread Sriranga(78yrsold)
Zdenko, thanks for the valuable guidance. In fact I had followed http://code.google.com/p/tesseract-ocr/wiki/TesseractSvnInstallation, which leads to confusion. Now I have followed your guidance and downloaded all required items as per the readme http://code.google.com/p/tesseract-ocr/wiki/ReadM

Re: Latest version of Cowboxer is here

2012-02-03 Thread raakeshvara rao
Hi Wil, thanks for testing it. Glad to know that it runs on Win7 too. Did you have to recompile it? If anyone can upload a Win7 64-bit version too, that would be great. Being written in C++, it is really fast. For a cheat sheet you can just take a snapshot of the Help screen, I guess, and keep it ope

Help with user-words

2012-02-03 Thread KAH
I am running the Windows version 3.01 of Tesseract that I just downloaded. I am using the stock tessdata files and I have added a user-words file in an effort to understand how this works. The image has the word 6400025948 in it, and Tesseract 3 continually reads this as s4ooo2s94a. I have placed a file eng

Re: Latest version of Cowboxer is here

2012-02-03 Thread Wil Hadden
Got it working on Win7. I'm liking it so far; it's easy to use and to run through a load of text. It's probably me, but Tab does not seem to do anything. I have a small "would be nice": either a toolbar that has buttons for the keyboard actions or a cheat sheet window, like the help that stays perma

Re: Version 3.02 in alpha

2012-02-03 Thread zdenko podobny
I just uploaded some fixes to the VC2008 build. The target was to compile and run tesseract.exe ("tesseract.exe eurotext.tif eurotext" produced output :-) ). Please test it. Feel free to improve it. I still continue to support the current "vs2008 structure". When Tom finalizes his contribution[1] I

Re: Version 3.02 in alpha

2012-02-03 Thread Speedy
Getting packages into Ubuntu precise would be awesome! As someone involved in putting together Vinux, a distribution of Ubuntu for the blind and visually impaired, OCR is essential. We have several utilities people have built to simplify these tasks. Is tesseract version 3.02 backward compatibl

Re: Version 3.02 in alpha

2012-02-03 Thread Speedy
Another feature that sounds very promising is the bigrams. Is this a feature that works on a word level? Does this include a probability for the first word? I.e., is position 0 a valid context for a bigram? So for example, if I wanted to recognize license plates and I know that the first one or tw

Re: Version 3.02 in alpha

2012-02-03 Thread Speedy
I'd be very interested in this as well. How does it work? I mean, if I have a font in one language and another in the other language, does it make sure that no characters from different languages are intermingled in the same word? How about in the same line? Is there a way to influence this? Does

Re: Version 3.02 in alpha

2012-02-03 Thread zdenko podobny
On Fri, Feb 3, 2012 at 5:29 PM, Sriranga(78yrsold) wrote: > zdenko, > Tried in ubuntu 11.10 - failed to install even after following the > guidelines in wiki. No, you did not follow guidelines in wiki [1]. Try to read it first ;-) [1] http://code.google.com/p/tesseract-ocr/wiki/ReadMe#Linux >

Re: Version 3.02 in alpha

2012-02-03 Thread Ray Smith
Try using eng+hin as the language code... On Fri, Feb 3, 2012 at 4:56 AM, Derek Dohler wrote: > I'm excited by this: > >> Added simultaneous multi-language capability. > > > Can you provide any info on how this works? > > Cheers, > Derek > > On Fri, Feb 3, 2012 at 4:32 PM, Sriranga(78yrsold) <

Re: Funding for a student job or research project in OCR

2012-02-03 Thread Merve Temizer
Hello, Are the letters adjacent to each other in words? If not, Tesseract is successful at recognizing handwriting. There are small problems; for example, if you write "lt", Tesseract might read it as "H". Thus I looked for a way and found that, with the support of OpenCV, it is possible to have sing

Re: Tesseract with colored images

2012-02-03 Thread Lahiru Himash Madusanka
Tesseract will work with color images, but in my experience the accuracy will be low. On 2/3/12, emre wrote: > any ideas? > > On 2 February, 16:41, Yunus Emre Cavusoglu > wrote: >> I want to know whether I can use tesseract with colored images or not. If >> I can, should I do any configuration? And i w

Re: Version 3.02 in alpha

2012-02-03 Thread Derek Dohler
I'm excited by this: > Added simultaneous multi-language capability. Can you provide any info on how this works? Cheers, Derek On Fri, Feb 3, 2012 at 4:32 PM, Sriranga(78yrsold) wrote: > Attached release notes for 3.02. Download can be done from svn of the > project site.tesseract-ocr - Proje

tesseract in android

2012-02-03 Thread sam
Hello Everyone, I am interested in developing an Android application which uses an OCR method to recognize characters. I did thorough research and found Tesseract to be the most effective one. Can anyone help me with how to use Tesseract in an Android project?

Re: Version 3.02 in alpha

2012-02-03 Thread Sriranga(78yrsold)
Zdenko, I tried on Ubuntu 11.10 and it failed to install even after following the guidelines in the wiki. I have attached a typescript for your perusal and valuable guidance. Kindly let me know where I made a mistake. With Warmest Regards, -sriranga(79yrs) On Fri, Feb 3, 2012 at 6:14 PM, S

Re: Tesseract with colored images

2012-02-03 Thread emre
any ideas? On 2 February, 16:41, Yunus Emre Cavusoglu wrote: > I want to know whether I can use tesseract with colored images or not. If > I can, should I do any configuration? And I want to know how > the output is with colored images. > > Sorry for my bad English > > Thanks

Re: Version 3.02 in alpha

2012-02-03 Thread Sriranga(78yrsold)
Zdenko, thanks for the information. I don't have VS2008 on Linux, only on WinXP (SP3) :-). Actually I downloaded from svn onto Ubuntu 11.10 and then copied to WinXP. Since there was a file tesseract.sln in the folder "VS2008", I tried it; only 24 succeeded. Now I shall wait for patches for VS2008

Re: Version 3.02 in alpha

2012-02-03 Thread zdenko podobny
Do you have VS2008 for Linux ;-) (as Ray wrote, "currently Linux-only")? PS: I am working on patches for VS2008, but there are some problems... I need to make some additional tests... Zdenko On Fri, Feb 3, 2012 at 1:06 PM, Sriranga(78yrsold) wrote: > When tried to generate exe files using VS2008 but

Re: Version 3.02 in alpha

2012-02-03 Thread Sriranga(78yrsold)
Attached are the release notes for 3.02. It can be downloaded from svn on the project site (tesseract-ocr - Project Hosting on Google Code). Cheers, -sriranga(79yrs) On Fri, Feb 3, 2012 at 4:54 PM, Wil Hadden wrote: > Hi Ray, > > Any idea of timescales when there wi

Re: Is there any Java package to get coordinates of a string using OCR

2012-02-03 Thread Sriranga(78yrsold)
Harry, yes, there are Java (OCR.jar) and .NET programs, and also jboxeditor, in which the coordinates of characters can be obtained. Source code for these programs is also available for download at http://vietocr.sourceforge.net/. These programs can be used for any other language as well. I hope this serves your purpose.

Re: Version 3.02 in alpha

2012-02-03 Thread Sriranga(78yrsold)
I tried to generate exe files using VS2008 but failed. Where will the exe files be stored: in the bin, bin.dbg, or training folder? On Fri, Feb 3, 2012 at 4:54 PM, Wil Hadden wrote: > Hi Ray, > > Any idea of timescales when there will be a 3.02 package on the > downloads page of googlecode? > > Or

Re: Latest version of Cowboxer is here

2012-02-03 Thread Wil Hadden
Excellent, hopefully I'll get to play with it tonight. Wil On Jan 28, 9:24 pm, raakeshvara rao wrote: > Hi, > > Cowboxer could previously, given a word, break it into letters (Tab). > > Now, I have modified cowboxer to be able to cut lines in a big box > (Ctrl Tab). > > Even better, it can cut l

Re: Any one successfully build Tesseract 3.0.1 under MSYS+MinGW?

2012-02-03 Thread Wil Hadden
Sorry to get back late on this. I have got mingw scripts that build 3.01 using the android ndk. I can furnish you with the scripts if you are interested / still need them. Wil On Jan 21, 4:08 am, asmwarrior wrote: > Hi, all. > I'm trying to do this, but failed. > > The first thing I do is build

Re: Version 3.02 in alpha

2012-02-03 Thread Wil Hadden
Hi Ray, Any idea of timescales when there will be a 3.02 package on the downloads page of googlecode? Or are there any release notes between 3.01 and 3.02? I'm just a bit wary of being bleeding edge :) Wil On Feb 2, 6:55 pm, Ray Smith wrote: > Tesseract 3.02 is now available in svn for prelim

Re: Font Training Request for Proposal

2012-02-03 Thread M.N.S.Rao
We (2 in the group) are currently working on a Kannada language traineddata file. We would be interested in taking up the work if the fonts to be trained are sent to us. We also request that you state the terms so that we can decide on the contract - Original Message - From: "Mariner" To