Hi all,
When I run the script "./run-box-training", I notice that it generates
three folders, viz. (1) boxdata.h5, (2) boxdata.split and (3) boxdata.cmodel.
Out of *curiosity*, I would like to view the contents of these three
generated folders. Is that possible and, if so, how?
Thanks in advance.
With regards,
-sriranga(79yrs)
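
One possible first step, sketched under assumptions rather than taken from the OCRopus documentation: check whether each output is really a folder or a single file, and whether it starts with the standard 8-byte HDF5 signature. The helper name describe_output is hypothetical. If boxdata.h5 does turn out to be an HDF5 file, as its extension suggests, the stock HDF5 command-line tools h5ls or h5dump should be able to list what is inside. The on-disk format of boxdata.split and boxdata.cmodel is not stated in this thread, so the sketch only reports their type and size.

```python
import os

# Standard 8-byte signature at the start of an HDF5 file
# (assumes the file has no user block in front of it).
HDF5_SIGNATURE = b"\x89HDF\r\n\x1a\n"


def describe_output(path):
    """Return a short description of one training output (hypothetical helper)."""
    if not os.path.exists(path):
        return "missing"
    if os.path.isdir(path):
        return "folder with %d entries" % len(os.listdir(path))
    size = os.path.getsize(path)
    # Read only the first few bytes; the file itself is never modified.
    with open(path, "rb") as f:
        head = f.read(len(HDF5_SIGNATURE))
    kind = "HDF5 file" if head == HDF5_SIGNATURE else "file of unknown format"
    return "%s, %d bytes" % (kind, size)


if __name__ == "__main__":
    # Output names as produced by ./run-box-training in this thread.
    for name in ("boxdata.h5", "boxdata.split", "boxdata.cmodel"):
        print(name, "->", describe_output(name))
```

Run from the fraktur-boxes directory after training, this prints one line per output and only reads the first few bytes of each file.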

On Sat, Aug 25, 2012 at 8:14 AM, Sriranga(78yrsold) <[email protected]> wrote:

> Brad,
> I forgot to add, regarding your suggestion "On ubuntu, I'd suggest
> installing: graphicsmagick and graphicsmagick-imagemagick-compat
> packages":
> I succeeded in installing both packages using the "Ubuntu Software Centre".
> Thanks for all the help.
> -sriranga(79yrs)
>
>
> On Sat, Aug 25, 2012 at 8:09 AM, Sriranga(78yrsold) <
> [email protected]> wrote:
>
>>  Brad,
>> I am extremely thankful to you for your valuable guidance. I succeeded in
>> converting from .tif to .png; an extract of the terminal output is
>> reproduced below.
>> With Warmest Regards,
>> -sriranga(79yrs)
>> dell@ubuntu:~/ocropus_6.0/fraktur-boxes$ ./run-box-training
>> ================================================================
>> === This script illustrates training of a simple, initial
>> === character recognizer from the kind of boxdata training
>> === files used with Tesseract.
>> ================================================================
>>
>>
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_1.tif deu-f/fontfile_1.png
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_2.tif deu-f/fontfile_2.png
>>
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_3.tif deu-f/fontfile_3.png
>>
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_4.tif deu-f/fontfile_4.png
>>
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_5.tif deu-f/fontfile_5.png
>>
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_6.tif deu-f/fontfile_6.png
>>
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_7.tif deu-f/fontfile_7.png
>>
>> + for image in 'deu-f/*.tif'
>> + convert -depth 8 deu-f/fontfile_8.tif deu-f/fontfile_8.png
>> + ocropus-db tess2h5 deu-f/fontfile_1.png deu-f/fontfile_2.png
>> deu-f/fontfile_3.png deu-f/fontfile_4.png deu-f/fontfile_5.png
>> deu-f/fontfile_6.png deu-f/fontfile_7.png deu-f/fontfile_8.png -o boxdata.h5
>> ['deu-f/fontfile_1.png', 'deu-f/fontfile_2.png', 'deu-f/fontfile_3.png',
>> 'deu-f/fontfile_4.png', 'deu-f/fontfile_5.png', 'deu-f/fontfile_6.png',
>> 'deu-f/fontfile_7.png', 'deu-f/fontfile_8.png']
>> === deu-f/fontfile_1.png
>> === deu-f/fontfile_2.png
>> === deu-f/fontfile_3.png
>> === deu-f/fontfile_4.png
>> deu-f/fontfile_4.box : 2734 : syntax error
>>     e 257 112 268 131P
>>
>> deu-f/fontfile_4.box : 2807 : syntax error
>>     e 609 75 621 94g
>>
>> === deu-f/fontfile_5.png
>> deu-f/fontfile_5.box : 1102 : bad box dimensions
>>     si 750 3198 1783 3246
>>
>> === deu-f/fontfile_6.png
>> === deu-f/fontfile_7.png
>> === deu-f/fontfile_8.png
>> + ocropus-tsplit --pca 0.95 --vq 80 -d boxdata.h5 -o boxdata.split
>> loading dataset
>> got 11959 samples out of 11959
>> # classes 121
>> most common e 1827 / n 1051 / i 808 / r 769 / a 600 / t 587 / s 508 / d
>> 492 / u 415 / l 386 / ...
>> starting training
>>  pcakmeans 11959 k 80 d 0.95
>>  predicting 11959 1024
>> writing
>> + ocropus-tleaves -Q 4 -s boxdata.split -d boxdata.h5 -o boxdata.cmodel
>> loading splitter
>> got <ocrolib.patrec.HierarchicalSplitter instance at 0x2c704d0>
>> #splits 80
>> excluding [ _\000-\037]
>> sizemode perchar
>> loading dataset
>> sizemode (data) perchar
>> splitting
>> 0
>> 10000
>> cluster    0 len    180    d 176 / h 3 / c 1
>> cluster    5 len    111    b 104 / h 5 / d 1 / ö 1
>> cluster   10 len    197    n 196 / N 1
>> cluster   15 len    209    o 178 / v 12 / 0 9 / e 2 / V 2
>> cluster    6 len    145    ä 50 / ü 39 / ö 37 / 5 4 / 4 4
>> cluster   11 len    171    n 169 / h 1 / t 1
>> cluster    1 len    103    b 61 / h 41 / d 1
>> cluster   16 len    179    p 82 / v 55 / P 11 / D 7 / B 5
>> cluster   12 len     30    M 17 / W 10 / wi 1 / ap 1 / es 1
>> cluster    2 len     63    h 61 / H 2
>> cluster    7 len    178    g 176 / q 1 / 9 1
>> cluster   17 len    208    s 118 / J 18 / I 14 / H 13 / Z 13
>> cluster   13 len    227    a 227
>> cluster    3 len    101    d 100 / 6 1
>> cluster    8 len    178    ß 40 / ü 37 / st 21 / K 13 / si 11
>> cluster   18 len     89    o 87 / 0 1 / O 1
>> cluster   14 len    366    a 359 / A 5 / d 1 / O 1
>> cluster    4 len    209    d 208 / b 1
>> cluster    9 len    165    h 127 / ß 12 / tz 10 / y 5 / ö 3
>> cluster   19 len     67    D 30 / v 14 / O 8 / V 6 / H 3
>> cluster   20 len     35    A 35
>> cluster   25 len    147    n 147
>> cluster   30 len    192    n 176 / u 9 / A 3 / tt 2 / a 1
>> cluster   35 len    111    H 14 / q 14 / F 12 / C 12 / 6 12
>> cluster   21 len     69    B 14 / K 11 / * 6 / s 6 / N 4
>> cluster   26 len     87    n 80 / tz 6 / g 1
>> cluster   31 len    107    n 80 / R 14 / u 5 / K 4 / U 2
>> cluster   36 len     46    ? 20 / 7 14 / 2 9 / L 1 / w 1
>> cluster   27 len    158    u 148 / U 7 / a 2 / n 1
>> cluster   22 len    231    . 219 / « 2 / * 2 / a 1 / e 1
>> cluster   32 len    188    n 185 / y 1 / N 1 / u 1
>> cluster   37 len    141    T 31 / ck 30 / D 27 / E 12 / C 10
>> cluster   28 len    129    u 115 / U 9 / a 1 / h 1 / n 1
>> cluster   33 len    103    g 103
>> cluster   38 len    150    z 68 / F 20 / ’ 19 / J 11 / 4 9
>> cluster   23 len    114    s 81 / 8 5 / g 3 / « 3 / - 3
>> cluster   29 len    138    u 131 / n 4 / h 1 / U 1 / ü 1
>> cluster   34 len     86    Q 30 / » 21 / « 13 / O 8 / N 5
>> cluster   39 len    330    t 328 / i 2
>> cluster   40 len    127    l 66 / t 32 / i 11 / : 7 / 1 3
>> cluster   24 len     88    st 42 / si 25 / ll 9 / a 7 / K 1
>> cluster   45 len     52    — 35 / = 5 / ~ 3 / - 2 / tm 1
>> cluster   55 len     63    S 37 / G 18 / E 6 / s 1 / Ö 1
>> cluster   50 len    229    i 223 / j 2 / l 2 / s 1 / t 1
>> cluster   41 len    261    , 255 / y 3 / e 2 / - 1
>> cluster   46 len    102    ) 27 / ; 25 / : 21 / x 11 / - 5
>> cluster   56 len     84    G 37 / E 21 / S 18 / * 3 / O 3
>> cluster   51 len     78    ch 78
>> cluster   42 len    130    k 65 / f 55 / s 5 / 5 2 / b 1
>> cluster   47 len    205    l 132 / 1 30 / I 10 / ! 7 / i 6
>> cluster   57 len     63    B 15 / P 13 / V 13 / N 9 / R 8
>> cluster   52 len     59    ch 59
>> cluster   43 len    120    s 94 / f 10 / H 6 / L 3 / l 3
>> cluster   48 len    240    l 182 / ( 21 / ! 14 / i 10 / k 4
>> cluster   58 len    178    ch 175 / Ö 2 / f 1
>> cluster   49 len    199    i 193 / t 5 / 4 1
>> cluster   53 len     96    w 92 / W 4
>> cluster   44 len    264    s 185 / f 67 / k 5 / e 3 / i 2
>> cluster   60 len    160    m 138 / M 14 / n 3 / ch 1 / o 1
>> cluster   59 len    121    m 109 / M 7 / ru 1 / en 1 / la 1
>> cluster   54 len     72    w 55 / W 12 / sp 2 / tz 1 / m 1
>> cluster   65 len    103    i 75 / j 15 / t 5 / f 3 / s 3
>> cluster   61 len     81    i 81
>> cluster   70 len    344    e 337 / L 5 / 9 1 / c 1
>> cluster   75 len    108    t 88 / k 11 / e 8 / m 1
>> cluster   66 len     98    r 93 / x 4 / D 1
>> cluster   62 len     57    i 56 / z 1
>> cluster   71 len    180    e 179 / c 1
>> cluster   67 len    113    r 112 / y 1
>> cluster   76 len    118    t 116 / : 1 / r 1
>> cluster   63 len     61    i 61
>> cluster   77 len    134    r 132 / Y 1 / t 1
>> cluster   72 len    488    e 461 / c 15 / L 11 / s 1
>> cluster   68 len    187    r 187
>> cluster   64 len     87    i 86 / t 1
>> cluster   78 len    140    r 140
>> cluster   69 len    217    e 217
>> cluster   73 len    341    e 340 / h 1
>> cluster   79 len    101    r 100 / y 1
>> cluster   74 len    272    e 272
>> writing
>> + ocropus-db predict -m boxdata.cmodel boxdata.h5
>> 19 11959 0.158876160214
>> + convert deu-f/fontfile_2.tif page.bin.png
>> + ocropus-gpageseg page.bin.png
>> page.bin.png
>> computing segmentation
>> computing column separators
>> computing lines
>> propagating labels
>> spreading labels
>> number of lines 27
>> finding reading order
>> writing lines
>>     26 page.bin.png 41.3 27
>> + ocropus-lattices -m boxdata.cmodel page/010001.bin.png
>> page/010002.bin.png page/010003.bin.png page/010004.bin.png
>> page/010005.bin.png page/010006.bin.png page/010007.bin.png
>> page/010008.bin.png page/010009.bin.png page/01000a.bin.png
>> page/01000b.bin.png page/01000c.bin.png page/01000d.bin.png
>> page/01000e.bin.png page/01000f.bin.png page/010010.bin.png
>> page/010011.bin.png page/010012.bin.png page/010013.bin.png
>> page/010014.bin.png page/010015.bin.png page/010016.bin.png
>> page/010017.bin.png page/010018.bin.png page/010019.bin.png
>> page/01001a.bin.png page/01001b.bin.png
>> loading boxdata.cmodel
>> got <ocrolib.patrec.LocalCmodel instance at 0x3a1bea8>
>> sizemode perchar
>> loading /usr/local/share/ocropus/en-space.model
>> got <ocrolib.wmodel.WhitespaceModel instance at 0x3a31518>
>> loading /usr/local/share/ocropus/en-mixed.lineest
>> got <ocrolib.lineest.TrainedLineGeometry instance at 0x3a315a8>
>> segmenter lineseg.DPSegmentLine()
>> got <ocrolib.lineseg.DPSegmentLine instance at 0x3a31680>
>> recognizing 27 files
>> page/010001.bin.png =RAW= Jizasser rinlien aus einer Quelle. Er trinkt,
>> er wird frisch
>> page/010002.bin.png =RAW= F)jste nach allen ~eiten, und die Äste gehen
>> wieder ili so
>> page/010003.bin.png =RAW= viele, viele kleine Zweige, aber alles endet in
>> Pyraiiiideii-
>> page/010004.bin.png =RAW= ist mein größtes 2;ergniigen (~Freude).
>> page/010005.bin.png =RAW= ich war früherin New York oft sel)r nervös.
>> Seitdem ich
>> page/010006.bin.png =RAW= Columbus, so ging es Galilei, so ging es Johann
>> Guten-
>> page/010007.bin.png =RAW= laufeli, sä)wingen, fechten, boxen uiid
>> tanzeli; kurz,
>> page/010008.bin.png =RAW= halle llnd habe ())ymnasiik. O, wie ist das
>> sc)ön, i1neili
>> page/010009.bin.png =RAW= dem Tische sah ich Blumen, Blumensträuße (~Bou-
>> page/01000a.bin.png =RAW= quets), Niedaillons, Früchte und noch viele,
>> viele andere
>> page/01000b.bin.png =RAW= soll ich beginnen? wo enden? ljber einen Tanz
>> muß ich
>> page/01000c.bin.png =RAW= bin. Vor einer halben (~tunde kam ich aus
>> seinen=ßause.
>> page/01000d.bin.png =RAW= Da war große Gesellschast. Viele interessante
>> Personen
>> page/01000e.bin.png =RAW= 1nich gerettet aus deii Händen der Räuber, du
>> hast mich
>> page/01000f.bin.png =RAW= Die kommen von Sn)raklls, llnd erhörte sie
>> sagen: ,,~etzt
>> page/010010.bin.png =RAW= böse Frau, sie hieß (~ihr Name war) Xantippe.
>> page/010011.bin.png =RAW= Bella: Ach, das 2i;ort X antippe habe ich ost
>> page/010012.bin.png =RAW= Donnerwetter muß ein Regen kommen,i, und ging
>> (ich
>> page/010013.bin.png =RAW= der PhiIosoph auf und ging aus dem Hause.
>> Dieses
>> page/010014.bin.png =RAW= machte Xantippe sehr böse. ~ie nahm eine Kanne
>> mit
>> page/010015.bin.png =RAW= Netzt’ ihm den nackten Fuß;
>> page/010016.bin.png =RAW= von tHeine, von Goethe, von ~chiller, von
>> Riickert.
>> page/010017.bin.png =RAW= werde selbst siir mich sprechen. Und ich sage:
>> Der
>> page/010018.bin.png =RAW= *** FAILED (no bestpath) ***
>> page/010019.bin.png =RAW= ser, als de=Herbst. Ich weiß, der Ci;inter l)at
>> TheateL
>> page/01001a.bin.png =RAW= Konzert und Ball. Das ist sehr schön, o, ja!
>> und schön
>> page/01001b.bin.png =RAW= Liszt setzte den Kranz auf den Kopf des
>> glücklichen Niaw
>> + set +x
>>
>> ================================================================
>> === You now have a simple Fraktur model, boxdata.cmodel.
>> ===
>> === This is only an initial model.  It isn't using any baseline
>> === information.  The next training step consists of retraining
>> === the model by aligning text lines with ground truth (see the
>> === example in uw3-500).
>> ===
>> === In addition, you probably should construct a language model.
>> === You can do that with ocropus-ngraphs.
>> ================================================================
>>                                [ end ]
>>
>>
>>
>> On Sat, Aug 25, 2012 at 5:45 AM, Brad Hards <[email protected]> wrote:
>>
>>> On Saturday 25 August 2012 00:36:02 Sriranga(78yrsold) wrote:
>>> > I re-ran the script in fraktur-boxes. The error displayed was as follows:
>>> >
>>> > + for image in 'deu-f/*.tif'
>>> > + convert -depth 8 deu-f/fontfile_1.tif deu-f/fontfile_1.png
>>> > ./run-box-training: line 23: convert:* command not found*
>>> It looks like you don't have the "convert" executable, so when the script
>>> tries to run it, it doesn't work. "convert" comes from the imagemagick
>>> package (or something similar, like graphicsmagick with compatibility
>>> wrappers).
>>>
>>> On ubuntu, I'd suggest installing:
>>> graphicsmagick
>>> and
>>> graphicsmagick-imagemagick-compat
>>> packages
>>>
>>> Let us know if that helps at all.
>>>
>>> Brad
>>>
>>> --
>>> You received this message because you are subscribed to the Google
>>> Groups "ocropus" group.
>>> To post to this group, send email to [email protected].
>>> To unsubscribe from this group, send email to
>>> [email protected].
>>> For more options, visit https://groups.google.com/groups/opt_out.
>>>
>>>
>>>
>>
>


