Hello

Thanks for the support. Here are the results of my tests:


First try this morning, not working (rxmatches returns nothing):

   data =. 1!:1 < 'D:\\DLTTraceLog\\DLTTraceLog.dlt'

   smalData =. 400 {. data

   smalData

   DLT 8��U� ECU5─ECU┤�kX└ APPCON├DLT 8��U� ECU5 ECU┤�kX└ APPCON┬ remoDLT
8��U� ECU5 ECU┤�kX└ APPCON┐remoDLT 8��U� ECU5 ECU┤�kX└ APPCON...

   'DLT' rxmatches smalData







Joe's solution works, but I have to split the big data:



  substr =: (];.0~ ,.)~"1

   smalData=: 400 {. ' ' (I. 128 < a. i. (0 400 substr data)) } data

   smalData

   DLT 8 U ECU5─ECU┤ kX└ APPCON├DLT 8 U ECU5 ECU┤ kX└ APPCON┬ remoDLT 8 U
ECU5 ECU┤ kX└ APPCON┐remoDLT 8 U ECU5 ECU┤ kX└ APPCON DLT 8 U ECU5 ECU┤ kX└
APPCON ...



     'DLT' rxmatches smalData

     0 3
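
For reference, the replacement step above can be written as a small verb. This is only a sketch: the name toAscii is made up for illustration, and it uses 127 < rather than 128 < so that byte 128 itself is also replaced. Note that a. i. y builds an integer array, which is likely much larger than the character data, so on the full 3 GByte file it would probably have to be applied slice by slice.

```j
NB. Sketch only: toAscii is a hypothetical name, not part of any library.
NB. Replaces every byte with a code above 127 by a space.
toAscii =: 3 : 0
  idx =. I. 127 < a. i. y      NB. positions of bytes with code > 127
  ' ' idx } y                  NB. amend: put a space at each of those positions
)

toAscii 'DLT', (200 { a.), 'ECU'   NB. displays DLT ECU
```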





Raul's solution works without splitting the data:



      RX_OPTIONS_UTF8_jregex_=: 0



     400 {. data


DLT8��U�ECU5─ECU┤�kX└APPCON├DLT8��U�ECU5ECU┤�kX└APPCON┬remoDLT8��U�ECU5ECU┤�kX└APPCON┐remoDLT8��U�ECU5ECU┤�kX└APPCON...

       'DLT' rxmatches 400 {. data

       0 3

It still works with the full data:



    'DLT' rxmatches data

     0 3



I also tried working with the 3 GByte data using simple search commands such as

    +/  'DLT'  E. data

     21219936

It took a few seconds to finish, but it executed successfully.
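
To show what the E. search gives on a small case, here is a minimal sketch. The sample string is invented for illustration and is not taken from the real file.

```j
NB. Sketch on made-up sample data (not from the real DLT file).
sample =. 'DLT0xxDLT1yyyDLT2'
hits   =. 'DLT' E. sample   NB. boolean mask, 1 where a match starts
+/ hits                     NB. number of matches: 3
I. hits                     NB. start offsets of the matches: 0 6 13
```

E. returns one boolean per character, so summing it counts the matches and I. turns it into offsets, without going through the regex engine at all.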





Thank you



Paolo



On Fri, Sep 11, 2015 at 2:34 PM, Raul Miller <[email protected]> wrote:

> Does it work if you disable utf8 support in regex?
>
> RX_OPTIONS_UTF8_jregex_=: 0
>
> Thanks,
>
> --
> Raul
>
>
> On Fri, Sep 11, 2015 at 5:42 AM, Strale <[email protected]> wrote:
> > Hello
> >
> > I have a very big file (3 GBytes) and I need to run some searches on it :(
> > The data is 8-bit char data.
> >
> > J opens it without problems,
> > but I have trouble using "rxmatches" because of the non-7-bit ASCII chars
> > (I presume).
> >
> > I used a trick to delete the data above 7 bits, but it is very bad and
> > leads to out of memory.
> >
> > I take the chars with codes above 127
> > and look for them inside data with the command i.
> > Once found, I replace those positions with spaces using the command  ' '
> > (indexes) } data
> >
> >
> > data <-  is loaded with 3 G Bytes
> > remove =.( 128 + i. 128 ) { a. NB.  >7bit ASCII
> >
> > data =. ' ' ((128 > remove i. data) # i. $ data) } data NB. to 7bit ASCII
> >
> > |out of memory
> >
> >
> >
> > Is there a better way to do it  ?
> >
> >
> >
> > Thanks
> >
> > Paolo
> > ----------------------------------------------------------------------
> > For information about J forums see http://www.jsoftware.com/forums.htm
>



-- 
We are made of star dust
(Carl Sagan)
