If your record is variably multi-line, then quite logically the newline
character cannot be its "record delimiter". Use the right character or
byte(s)/info that defines the real "record delimiter" and read based on
that.
The same logic as the one described at
http://wiki.apache.org/hadoop/HadoopMa
Hi folks,
I have a huge text file in TBs and it has multiline records. And we are not
given that each records takes how many lines. One records can be of size 5
lines, other may be of 6 lines another may be 4 lines. Its not sure. Line
size may vary for each record.
Since we cannot use default TextI
+1
On 5/17/13 4:10 PM, "Thomas Graves" wrote:
>Hello all,
>
>We've had a few critical issues come up in 0.23.7 that I think warrants a
>0.23.8 release. The main one is MAPREDUCE-5211. There are a couple of
>other issues that I want finished up and get in before we spin it. Those
>include HDFS-
[
https://issues.apache.org/jira/browse/MAPREDUCE-5095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Hitesh Shah resolved MAPREDUCE-5095.
Resolution: Fixed
Release Note: Thanks Arpit. Committed to branch-1.
> TestS
+1 (non-binding)
On Mon, May 20, 2013 at 11:13 AM, Siddharth Seth
wrote:
> +1.
>
>
> On Fri, May 17, 2013 at 2:10 PM, Thomas Graves >wrote:
>
> > Hello all,
> >
> > We've had a few critical issues come up in 0.23.7 that I think warrants a
> > 0.23.8 release. The main one is MAPREDUCE-5211. The
+1.
On Fri, May 17, 2013 at 2:10 PM, Thomas Graves wrote:
> Hello all,
>
> We've had a few critical issues come up in 0.23.7 that I think warrants a
> 0.23.8 release. The main one is MAPREDUCE-5211. There are a couple of
> other issues that I want finished up and get in before we spin it. Thos
Jason Lowe created MAPREDUCE-5261:
-
Summary: TestRMContainerAllocator is exiting and failing the build
Key: MAPREDUCE-5261
URL: https://issues.apache.org/jira/browse/MAPREDUCE-5261
Project: Hadoop Map/
+1
On Fri, May 17, 2013 at 4:10 PM, Thomas Graves wrote:
> Hello all,
>
> We've had a few critical issues come up in 0.23.7 that I think warrants a
> 0.23.8 release. The main one is MAPREDUCE-5211. There are a couple of
> other issues that I want finished up and get in before we spin it. Those
+1
--
Arpit Gupta
Hortonworks Inc.
http://hortonworks.com/
On May 17, 2013, at 2:10 PM, Thomas Graves wrote:
> Hello all,
>
> We've had a few critical issues come up in 0.23.7 that I think warrants a
> 0.23.8 release. The main one is MAPREDUCE-5211. There are a couple of
> other issues that I
Thanks Chris, I'll take a look at HDFS-3180.
Tom
On 5/19/13 12:54 PM, "Chris Nauroth" wrote:
>+1 (non-binding)
>
>BTW, I left a comment on HDFS-4835 suggesting that you include HDFS-3180
>for WebHDFS socket connect/read timeouts. It's up to you. (I'm voting +1
>for the release plan either way
See https://builds.apache.org/job/Hadoop-Mapreduce-trunk/1431/
###
## LAST 60 LINES OF THE CONSOLE
###
[...truncated 29305 lines...]
Running org.apache.hadoop.mapreduce.
+1 (binding)
On 17 May 2013 14:10, Thomas Graves wrote:
> Hello all,
>
> We've had a few critical issues come up in 0.23.7 that I think warrants a
> 0.23.8 release. The main one is MAPREDUCE-5211. There are a couple of
> other issues that I want finished up and get in before we spin it. Those
12 matches
Mail list logo