Re: Custom InputFormat/OutputFormat

2008-07-10 Thread Runping Qi
d to write your own >> OutputFormat to get the output file exactly how you want (perhaps >> something like LineRecordWriter which doesn't write the key out and >> outputs a separator of your choosing between each record). >> >> >> -Original Message- >> From: F

Re: Custom InputFormat/OutputFormat

2008-07-10 Thread Francesco Tamberi
key out and outputs a separator of your choosing between each record). -Original Message- From: Francesco Tamberi [mailto:[EMAIL PROTECTED] Sent: 10 July 2008 17:15 To: core-user@hadoop.apache.org Subject: Re: Custom InputFormat/OutputFormat Ok, I would not like to annoy you but I thin

RE: Custom InputFormat/OutputFormat

2008-07-10 Thread Jingkei Ly
l Message- > From: Francesco Tamberi [mailto:[EMAIL PROTECTED] > Sent: 10 July 2008 14:17 > To: core-user@hadoop.apache.org > Subject: Re: Custom InputFormat/OutputFormat > > Thank you so much. > The problem is that I need to operate on text as is, without > modification,

Re: Custom InputFormat/OutputFormat

2008-07-10 Thread Francesco Tamberi
which does not output the key, and does not use newline characters to separate records. I would suggest looking at TextOutputFormat to start. HTH, Jingkei -Original Message- From: Francesco Tamberi [mailto:[EMAIL PROTECTED] Sent: 10 July 2008 14:17 To: core-user@hadoop.apache.org Subj

RE: Custom InputFormat/OutputFormat

2008-07-10 Thread Jingkei Ly
From: Francesco Tamberi [mailto:[EMAIL PROTECTED] Sent: 10 July 2008 14:17 To: core-user@hadoop.apache.org Subject: Re: Custom InputFormat/OutputFormat Thank you so much. The problem is that I need to operate on text as is, without modification, and I don't want the filepos to be outputte

Re: Custom InputFormat/OutputFormat

2008-07-10 Thread Francesco Tamberi
apache.org Subject: Custom InputFormat/OutputFormat Hi all, I want to use hadoop for some streaming text processing on text documents like: text text text ... Just xml-like notation but not real xml files. I have to work on text included between tags, so I implemented an InputFormat

RE: Custom InputFormat/OutputFormat

2008-07-10 Thread Jingkei Ly
@hadoop.apache.org Subject: Custom InputFormat/OutputFormat Hi all, I want to use hadoop for some streaming text processing on text documents like: text text text ... Just xml-like notation but not real xml files. I have to work on text included between tags, so I implemented an InputFormat (extending

Re: Custom InputFormat/OutputFormat

2008-07-10 Thread Francesco Tamberi
Hi all, No one can give me some hint? Please apoligize me but I cannot understand if there's something wrong with my ask. Thak you, Francesco

Custom InputFormat/OutputFormat

2008-07-09 Thread Francesco Tamberi
Hi all, I want to use hadoop for some streaming text processing on text documents like: text text text ... Just xml-like notation but not real xml files. I have to work on text included between tags, so I implemented an InputFormat (extending FileInputFormat) with a RecordReader that ret