Hello
Thanks for support, here is the results of the try :
First try this morning not working !
data =. 1!:1 < 'D:\\DLTTraceLog\\DLTTraceLog.dlt'
smalData =. 400 {. data
smalData
DLT 8��U� ECU5─ECU┤�kX└ APPCON├DLT 8��U� ECU5 ECU┤�kX└ APPCON┬ remoDLT
8��U� ECU5 ECU┤�kX└ APPCON┐remoDLT 8��U� ECU5 ECU┤�kX└ APPCON...
'DLT' rxmatches smalData
Joe´s solutionis working I have to split the big data Working
substr =: (];.0~ ,.)~"1
smalData=: 400 {. ' ' (I. 128 < a. i. (0 400 substr data)) } data
smalData
DLT 8 U ECU5─ECU┤ kX└ APPCON├DLT 8 U ECU5 ECU┤ kX└ APPCON┬ remoDLT 8 U
ECU5 ECU┤ kX└ APPCON┐remoDLT 8 U ECU5 ECU┤ kX└ APPCON DLT 8 U ECU5 ECU┤ kX└
APPCON ...
'DLT' rxmatches smalData
0 3
Raul ´s Working without split the data
RX_OPTIONS_UTF8_jregex_=: 0
400 {. data
DLT8��U�ECU5─ECU┤�kX└APPCON├DLT8��U�ECU5ECU┤�kX└APPCON┬remoDLT8��U�ECU5ECU┤�kX└APPCON┐remoDLT8��U�ECU5ECU┤�kX└APPCON...
'DLT' rxmatches 400 {. data
0 3
With the big data still working!
'DLT' rxmatches data
0 3
I tried to work with the 3 GByte data using simple search commands like
+/ 'DLT' E. data
21219936
It got some few seconds to finish the task but was succesfully executed
Thank you
Paolo
On Fri, Sep 11, 2015 at 2:34 PM, Raul Miller <[email protected]> wrote:
> Does it work if you disable utf8 support in regex?
>
> RX_OPTIONS_UTF8_jregex_=: 0
>
> Thanks,
>
> --
> Raul
>
>
> On Fri, Sep 11, 2015 at 5:42 AM, Strale <[email protected]> wrote:
> > Hello
> >
> > I have a very big file 3 G Bytes and I need to make some search on it :(
> > data is 8 bit char data
> >
> > J open it without problem
> > but I have some problem to use "rxmatches" due to the not 7 bit ASCII
> chars
> > (I presume)
> >
> > I have used a trik to delete data > of 7 bit but is very badd and lead
> to
> > out of memory
> >
> > I take the inddexes of ASCII chars > of 127
> > and then I look inside data for those indexes with command i.
> > once found I change with spaces the indexes found with the comand ' '
> > (indexes) } data
> >
> >
> > data <- is loaded with 3 G Bytes
> > remove =.( 128 + i. 128 ) { a. NB. >7bit ASCII
> >
> > data =. ' ' ((128 > remove i. data) # i. $ data) } data NB. to 7bit ASCII
> >
> > |out of memory
> >
> >
> >
> > Is there a better way to do it ?
> >
> >
> >
> > Thanks
> >
> > Paolo
> > ----------------------------------------------------------------------
> > For information about J forums see http://www.jsoftware.com/forums.htm
> ----------------------------------------------------------------------
> For information about J forums see http://www.jsoftware.com/forums.htm
>
--
We are made of star dust
(Carl Sagan)
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm