hi all, If run script "./run-box-training" - It is noticed that script generated two folders viz. (1) boxdata.h5 and (2)boxdata.split (3) boxdata.cmodel Out of *curiosity*, I like to view the contents the said generated three folders - Is it possible and if so,how to do? thanks in advance. With regards, -sriranga(79yrs)
On Sat, Aug 25, 2012 at 8:14 AM, Sriranga(78yrsold) <[email protected] > wrote: > Brad, > forgot to add that regarding "On ubuntu, I'd suggest installing: > graphicsmagick > and > graphicsmagick-imagemagick- > compat > packages " > succeeded to install above softwares using "ubuntu Software centre". > thanks for all help. > -sriranga(79yrs) > > > On Sat, Aug 25, 2012 at 8:09 AM, Sriranga(78yrsold) < > [email protected]> wrote: > >> Brad, >> Extremely thankful to you for your valuable guidance. Succeeded to >> convert from .tif to .png - vide reproduced the extract of terminal >> below: >> With Warmest Regards, >> -sriranga(79yrs) >> dell@ubuntu:~/ocropus_6.0/fraktur-boxes$ ./run-box-training >> ================================================================ >> === This script illustrates training of a simple, initial >> === character recognizer from the kind of boxdata training >> === files used with Tesseract. >> ================================================================ >> >> >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_1.tif deu-f/fontfile_1.png >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_2.tif deu-f/fontfile_2.png >> >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_3.tif deu-f/fontfile_3.png >> >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_4.tif deu-f/fontfile_4.png >> >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_5.tif deu-f/fontfile_5.png >> >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_6.tif deu-f/fontfile_6.png >> >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_7.tif deu-f/fontfile_7.png >> >> + for image in 'deu-f/*.tif' >> + convert -depth 8 deu-f/fontfile_8.tif deu-f/fontfile_8.png >> + ocropus-db tess2h5 deu-f/fontfile_1.png deu-f/fontfile_2.png >> deu-f/fontfile_3.png deu-f/fontfile_4.png deu-f/fontfile_5.png >> deu-f/fontfile_6.png deu-f/fontfile_7.png deu-f/fontfile_8.png -o boxdata.h5 >> ['deu-f/fontfile_1.png', 'deu-f/fontfile_2.png', 'deu-f/fontfile_3.png', >> 'deu-f/fontfile_4.png', 'deu-f/fontfile_5.png', 'deu-f/fontfile_6.png', >> 'deu-f/fontfile_7.png', 'deu-f/fontfile_8.png'] >> === deu-f/fontfile_1.png >> === deu-f/fontfile_2.png >> === deu-f/fontfile_3.png >> === deu-f/fontfile_4.png >> deu-f/fontfile_4.box : 2734 : syntax error >> e 257 112 268 131P >> >> deu-f/fontfile_4.box : 2807 : syntax error >> e 609 75 621 94g >> >> === deu-f/fontfile_5.png >> deu-f/fontfile_5.box : 1102 : bad box dimensions >> si 750 3198 1783 3246 >> >> === deu-f/fontfile_6.png >> === deu-f/fontfile_7.png >> === deu-f/fontfile_8.png >> + ocropus-tsplit --pca 0.95 --vq 80 -d boxdata.h5 -o boxdata.split >> loading dataset >> got 11959 samples out of 11959 >> # classes 121 >> most common e 1827 / n 1051 / i 808 / r 769 / a 600 / t 587 / s 508 / d >> 492 / u 415 / l 386 / ... >> starting training >> pcakmeans 11959 k 80 d 0.95 >> predicting 11959 1024 >> writing >> + ocropus-tleaves -Q 4 -s boxdata.split -d boxdata.h5 -o boxdata.cmodel >> loading splitter >> got <ocrolib.patrec.HierarchicalSplitter instance at 0x2c704d0> >> #splits 80 >> excluding [ _\000-\037] >> sizemode perchar >> loading dataset >> sizemode (data) perchar >> splitting >> 0 >> 10000 >> cluster 0 len 180 d 176 / h 3 / c 1 >> cluster 5 len 111 b 104 / h 5 / d 1 / ö 1 >> cluster 10 len 197 n 196 / N 1 >> cluster 15 len 209 o 178 / v 12 / 0 9 / e 2 / V 2 >> cluster 6 len 145 ä 50 / ü 39 / ö 37 / 5 4 / 4 4 >> cluster 11 len 171 n 169 / h 1 / t 1 >> cluster 1 len 103 b 61 / h 41 / d 1 >> cluster 16 len 179 p 82 / v 55 / P 11 / D 7 / B 5 >> cluster 12 len 30 M 17 / W 10 / wi 1 / ap 1 / es 1 >> cluster 2 len 63 h 61 / H 2 >> cluster 7 len 178 g 176 / q 1 / 9 1 >> cluster 17 len 208 s 118 / J 18 / I 14 / H 13 / Z 13 >> cluster 13 len 227 a 227 >> cluster 3 len 101 d 100 / 6 1 >> cluster 8 len 178 ß 40 / ü 37 / st 21 / K 13 / si 11 >> cluster 18 len 89 o 87 / 0 1 / O 1 >> cluster 14 len 366 a 359 / A 5 / d 1 / O 1 >> cluster 4 len 209 d 208 / b 1 >> cluster 9 len 165 h 127 / ß 12 / tz 10 / y 5 / ö 3 >> cluster 19 len 67 D 30 / v 14 / O 8 / V 6 / H 3 >> cluster 20 len 35 A 35 >> cluster 25 len 147 n 147 >> cluster 30 len 192 n 176 / u 9 / A 3 / tt 2 / a 1 >> cluster 35 len 111 H 14 / q 14 / F 12 / C 12 / 6 12 >> cluster 21 len 69 B 14 / K 11 / * 6 / s 6 / N 4 >> cluster 26 len 87 n 80 / tz 6 / g 1 >> cluster 31 len 107 n 80 / R 14 / u 5 / K 4 / U 2 >> cluster 36 len 46 ? 20 / 7 14 / 2 9 / L 1 / w 1 >> cluster 27 len 158 u 148 / U 7 / a 2 / n 1 >> cluster 22 len 231 . 219 / « 2 / * 2 / a 1 / e 1 >> cluster 32 len 188 n 185 / y 1 / N 1 / u 1 >> cluster 37 len 141 T 31 / ck 30 / D 27 / E 12 / C 10 >> cluster 28 len 129 u 115 / U 9 / a 1 / h 1 / n 1 >> cluster 33 len 103 g 103 >> cluster 38 len 150 z 68 / F 20 / ’ 19 / J 11 / 4 9 >> cluster 23 len 114 s 81 / 8 5 / g 3 / « 3 / - 3 >> cluster 29 len 138 u 131 / n 4 / h 1 / U 1 / ü 1 >> cluster 34 len 86 Q 30 / » 21 / « 13 / O 8 / N 5 >> cluster 39 len 330 t 328 / i 2 >> cluster 40 len 127 l 66 / t 32 / i 11 / : 7 / 1 3 >> cluster 24 len 88 st 42 / si 25 / ll 9 / a 7 / K 1 >> cluster 45 len 52 — 35 / = 5 / ~ 3 / - 2 / tm 1 >> cluster 55 len 63 S 37 / G 18 / E 6 / s 1 / Ö 1 >> cluster 50 len 229 i 223 / j 2 / l 2 / s 1 / t 1 >> cluster 41 len 261 , 255 / y 3 / e 2 / - 1 >> cluster 46 len 102 ) 27 / ; 25 / : 21 / x 11 / - 5 >> cluster 56 len 84 G 37 / E 21 / S 18 / * 3 / O 3 >> cluster 51 len 78 ch 78 >> cluster 42 len 130 k 65 / f 55 / s 5 / 5 2 / b 1 >> cluster 47 len 205 l 132 / 1 30 / I 10 / ! 7 / i 6 >> cluster 57 len 63 B 15 / P 13 / V 13 / N 9 / R 8 >> cluster 52 len 59 ch 59 >> cluster 43 len 120 s 94 / f 10 / H 6 / L 3 / l 3 >> cluster 48 len 240 l 182 / ( 21 / ! 14 / i 10 / k 4 >> cluster 58 len 178 ch 175 / Ö 2 / f 1 >> cluster 49 len 199 i 193 / t 5 / 4 1 >> cluster 53 len 96 w 92 / W 4 >> cluster 44 len 264 s 185 / f 67 / k 5 / e 3 / i 2 >> cluster 60 len 160 m 138 / M 14 / n 3 / ch 1 / o 1 >> cluster 59 len 121 m 109 / M 7 / ru 1 / en 1 / la 1 >> cluster 54 len 72 w 55 / W 12 / sp 2 / tz 1 / m 1 >> cluster 65 len 103 i 75 / j 15 / t 5 / f 3 / s 3 >> cluster 61 len 81 i 81 >> cluster 70 len 344 e 337 / L 5 / 9 1 / c 1 >> cluster 75 len 108 t 88 / k 11 / e 8 / m 1 >> cluster 66 len 98 r 93 / x 4 / D 1 >> cluster 62 len 57 i 56 / z 1 >> cluster 71 len 180 e 179 / c 1 >> cluster 67 len 113 r 112 / y 1 >> cluster 76 len 118 t 116 / : 1 / r 1 >> cluster 63 len 61 i 61 >> cluster 77 len 134 r 132 / Y 1 / t 1 >> cluster 72 len 488 e 461 / c 15 / L 11 / s 1 >> cluster 68 len 187 r 187 >> cluster 64 len 87 i 86 / t 1 >> cluster 78 len 140 r 140 >> cluster 69 len 217 e 217 >> cluster 73 len 341 e 340 / h 1 >> cluster 79 len 101 r 100 / y 1 >> cluster 74 len 272 e 272 >> writing >> + ocropus-db predict -m boxdata.cmodel boxdata.h5 >> 19 11959 0.158876160214 >> + convert deu-f/fontfile_2.tif page.bin.png >> + ocropus-gpageseg page.bin.png >> page.bin.png >> computing segmentation >> computing column separators >> computing lines >> propagating labels >> spreading labels >> number of lines 27 >> finding reading order >> writing lines >> 26 page.bin.png 41.3 27 >> + ocropus-lattices -m boxdata.cmodel page/010001.bin.png >> page/010002.bin.png page/010003.bin.png page/010004.bin.png >> page/010005.bin.png page/010006.bin.png page/010007.bin.png >> page/010008.bin.png page/010009.bin.png page/01000a.bin.png >> page/01000b.bin.png page/01000c.bin.png page/01000d.bin.png >> page/01000e.bin.png page/01000f.bin.png page/010010.bin.png >> page/010011.bin.png page/010012.bin.png page/010013.bin.png >> page/010014.bin.png page/010015.bin.png page/010016.bin.png >> page/010017.bin.png page/010018.bin.png page/010019.bin.png >> page/01001a.bin.png page/01001b.bin.png >> loading boxdata.cmodel >> got <ocrolib.patrec.LocalCmodel instance at 0x3a1bea8> >> sizemode perchar >> loading /usr/local/share/ocropus/en-space.model >> got <ocrolib.wmodel.WhitespaceModel instance at 0x3a31518> >> loading /usr/local/share/ocropus/en-mixed.lineest >> got <ocrolib.lineest.TrainedLineGeometry instance at 0x3a315a8> >> segmenter lineseg.DPSegmentLine() >> got <ocrolib.lineseg.DPSegmentLine instance at 0x3a31680> >> recognizing 27 files >> page/010001.bin.png =RAW= Jizasser rinlien aus einer Quelle. Er trinkt, >> er wird frisch >> page/010002.bin.png =RAW= F)jste nach allen ~eiten, und die Äste gehen >> wieder ili so >> page/010003.bin.png =RAW= viele, viele kleine Zweige, aber alles endet in >> Pyraiiiideii- >> page/010004.bin.png =RAW= ist mein größtes 2;ergniigen (~Freude). >> page/010005.bin.png =RAW= ich war früherin New York oft sel)r nervös. >> Seitdem ich >> page/010006.bin.png =RAW= Columbus, so ging es Galilei, so ging es Johann >> Guten- >> page/010007.bin.png =RAW= laufeli, sä)wingen, fechten, boxen uiid >> tanzeli; kurz, >> page/010008.bin.png =RAW= halle llnd habe ())ymnasiik. O, wie ist das >> sc)ön, i1neili >> page/010009.bin.png =RAW= dem Tische sah ich Blumen, Blumensträuße (~Bou- >> page/01000a.bin.png =RAW= quets), Niedaillons, Früchte und noch viele, >> viele andere >> page/01000b.bin.png =RAW= soll ich beginnen? wo enden? ljber einen Tanz >> muß ich >> page/01000c.bin.png =RAW= bin. Vor einer halben (~tunde kam ich aus >> seinen=ßause. >> page/01000d.bin.png =RAW= Da war große Gesellschast. Viele interessante >> Personen >> page/01000e.bin.png =RAW= 1nich gerettet aus deii Händen der Räuber, du >> hast mich >> page/01000f.bin.png =RAW= Die kommen von Sn)raklls, llnd erhörte sie >> sagen: ,,~etzt >> page/010010.bin.png =RAW= böse Frau, sie hieß (~ihr Name war) Xantippe. >> page/010011.bin.png =RAW= Bella: Ach, das 2i;ort X antippe habe ich ost >> page/010012.bin.png =RAW= Donnerwetter muß ein Regen kommen,i, und ging >> (ich >> page/010013.bin.png =RAW= der PhiIosoph auf und ging aus dem Hause. >> Dieses >> page/010014.bin.png =RAW= machte Xantippe sehr böse. ~ie nahm eine Kanne >> mit >> page/010015.bin.png =RAW= Netzt’ ihm den nackten Fuß; >> page/010016.bin.png =RAW= von tHeine, von Goethe, von ~chiller, von >> Riickert. >> page/010017.bin.png =RAW= werde selbst siir mich sprechen. Und ich sage: >> Der >> page/010018.bin.png =RAW= *** FAILED (no bestpath) *** >> page/010019.bin.png =RAW= ser, als de=Herbst. Ich weiß, der Ci;inter l)at >> TheateL >> page/01001a.bin.png =RAW= Konzert und Ball. Das ist sehr schön, o, ja! >> und schön >> page/01001b.bin.png =RAW= Liszt setzte den Kranz auf den Kopf des >> glücklichen Niaw >> + set +x >> >> ================================================================ >> === You now have a simple Fraktur model, boxdata.cmodel. >> === >> === This is only an initial model. It isn't using any baseline >> === information. The next training step consists of retraining >> === the model by aligning text lines with ground truth (see the >> === example in uw3-500). >> === >> === In addition, you probably should construct a language model. >> === You can do that with ocropus-ngraphs. >> ================================================================ >> [ end ] >> >> >> >> On Sat, Aug 25, 2012 at 5:45 AM, Brad Hards <[email protected]> wrote: >> >>> On Saturday 25 August 2012 00:36:02 Sriranga(78yrsold) wrote: >>> > re-run script Fraktur-boxes. Error displayed as follow: >>> > >>> > + for image in 'deu-f/*.tif' >>> > + convert -depth 8 deu-f/fontfile_1.tif deu-f/fontfile_1.png >>> > ./run-box-training: line 23: convert:* command not found* >>> It looks like you don't have the "convert" executable, so when the script >>> tries to run it, it doesn't work. "convert" comes from imagemagick >>> package (or >>> something similar, like graphicsmagick with compatibility wrappers). >>> >>> On ubuntu, I'd suggest installing: >>> graphicsmagick >>> and >>> graphicsmagick-imagemagick-compat >>> packages >>> >>> Let us know if that helps at all. >>> >>> Brad >>> >>> -- >>> You received this message because you are subscribed to the Google >>> Groups "ocropus" group. >>> To post to this group, send email to [email protected]. >>> To unsubscribe from this group, send email to >>> [email protected]. >>> For more options, visit https://groups.google.com/groups/opt_out. >>> >>> >>> >> > -- You received this message because you are subscribed to the Google Groups "ocropus" group. To post to this group, send email to [email protected]. To unsubscribe from this group, send email to [email protected]. For more options, visit https://groups.google.com/groups/opt_out.
