[jira] Created: (HADOOP-960) Incorrect number of map tasks when there are multiple input files

2007-01-29 Thread Andrew McNabb (JIRA)
Incorrect number of map tasks when there are multiple input files - Key: HADOOP-960 URL: https://issues.apache.org/jira/browse/HADOOP-960 Project: Hadoop Issue Type: Bug Aff

[jira] Updated: (HADOOP-959) TestCheckpoint fails on Windows

2007-01-29 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dhruba borthakur updated HADOOP-959: Attachment: renameToWindows.patch Here is a patch for review. > TestCheckpoint fails on Wi

[jira] Commented: (HADOOP-928) make checksums optional per FileSystem

2007-01-29 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468459 ] Hairong Kuang commented on HADOOP-928: -- Hi Doug! Yes, wrapping eliminates all the raw methods and thus makes the

[jira] Commented: (HADOOP-928) make checksums optional per FileSystem

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468456 ] Doug Cutting commented on HADOOP-928: - > keep all the raw methods in the FileSystem interface > DistributedFileSy

[jira] Commented: (HADOOP-928) make checksums optional per FileSystem

2007-01-29 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468453 ] Hairong Kuang commented on HADOOP-928: -- Yes, all you said makes sense. But I'd like to make two changes to your

[jira] Commented: (HADOOP-958) Building Hadoop results in a lot of warnings

2007-01-29 Thread Milind Bhandarkar (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468452 ] Milind Bhandarkar commented on HADOOP-958: -- Here is what I see when I build from command line: compile-core

[jira] Updated: (HADOOP-884) Create scripts to run Hadoop on Amazon EC2

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doug Cutting updated HADOOP-884: Resolution: Fixed Fix Version/s: 0.11.0 Status: Resolved (was: Patch Available)

[jira] Commented: (HADOOP-958) Building Hadoop results in a lot of warnings

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-958?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468450 ] Doug Cutting commented on HADOOP-958: - Warnings from what? Eclipse? When I build and/or run unit tests from the

[jira] Created: (HADOOP-959) TestCheckpoint fails on Windows

2007-01-29 Thread dhruba borthakur (JIRA)
TestCheckpoint fails on Windows --- Key: HADOOP-959 URL: https://issues.apache.org/jira/browse/HADOOP-959 Project: Hadoop Issue Type: Bug Components: dfs Reporter: dhruba borthakur A

[jira] Created: (HADOOP-958) Building Hadoop results in a lot of warnings

2007-01-29 Thread eric baldeschwieler (JIRA)
Building Hadoop results in a lot of warnings Key: HADOOP-958 URL: https://issues.apache.org/jira/browse/HADOOP-958 Project: Hadoop Issue Type: Improvement Reporter: eric baldeschwieler

[jira] Commented: (HADOOP-957) Map/Reduce job with SequenceFileOutputFormat should be able to add user specified metadata to the output file

2007-01-29 Thread Runping Qi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468449 ] Runping Qi commented on HADOOP-957: --- Certainly. That perhaps the most convenient way to do. > Map/Reduce job with

[jira] Commented: (HADOOP-957) Map/Reduce job with SequenceFileOutputFormat should be able to add user specified metadata to the output file

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-957?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468448 ] Doug Cutting commented on HADOOP-957: - Could this be done as a subclass of SequenceFileOutputFormat, RecordOutpu

[jira] Created: (HADOOP-957) Map/Reduce job with SequenceFileOutputFormat should be able to add user specified metadata to the output file

2007-01-29 Thread Runping Qi (JIRA)
Map/Reduce job with SequenceFileOutputFormat should be able to add user specified metadata to the output file - Key: HADOOP-957 URL: https://issues.apache.

[jira] Commented: (HADOOP-956) DFSShell/Wen UI should return the metadata of a file

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468447 ] Doug Cutting commented on HADOOP-956: - We could add a 'file' command, like the unix 'file' program, that uses var

[jira] Commented: (HADOOP-950) Job completes but command doesn't return

2007-01-29 Thread Andrew McNabb (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468446 ] Andrew McNabb commented on HADOOP-950: -- The hanging seems to have been caused by blocked writes to a pipe. I ha

[jira] Created: (HADOOP-956) DFSShell/Wen UI should return the metadata of a file

2007-01-29 Thread Runping Qi (JIRA)
DFSShell/Wen UI should return the metadata of a file Key: HADOOP-956 URL: https://issues.apache.org/jira/browse/HADOOP-956 Project: Hadoop Issue Type: New Feature Reporter: Run

[jira] Commented: (HADOOP-432) support undelete, snapshots, or other mechanism to recover lost files

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-432?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468438 ] Doug Cutting commented on HADOOP-432: - Here's a proposed design: - change the FS shell's 'rm' command to move fi

[jira] Created: (HADOOP-955) Metrics.report() metricValue parameter type should be float, not long

2007-01-29 Thread Nigel Daley (JIRA)
Metrics.report() metricValue parameter type should be float, not long - Key: HADOOP-955 URL: https://issues.apache.org/jira/browse/HADOOP-955 Project: Hadoop Issue Type: Imp

[jira] Commented: (HADOOP-951) java.util.ConcurrentModificationException in FSNamesystem.chooseTargets

2007-01-29 Thread Konstantin Shvachko (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-951?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468437 ] Konstantin Shvachko commented on HADOOP-951: chooseTargets() iterates over the heartbeats list. Looks lik

[jira] Created: (HADOOP-954) Metrics should offer complete set of static report methods or none at all

2007-01-29 Thread Nigel Daley (JIRA)
Metrics should offer complete set of static report methods or none at all - Key: HADOOP-954 URL: https://issues.apache.org/jira/browse/HADOOP-954 Project: Hadoop Issue T

Re: [jira] Commented: (HADOOP-939) No-sort optimization

2007-01-29 Thread Doug Cutting
Arkady Borkovsky wrote: So it is 27*D+13*M vs. 17*D+2*M . With D

[jira] Commented: (HADOOP-952) Create a public (shared) Hadoop EC2 AMI

2007-01-29 Thread Tom White (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468427 ] Tom White commented on HADOOP-952: -- I was planning on using my S3 storage - at least until the AMI got too popular

Re: [jira] Commented: (HADOOP-939) No-sort optimization

2007-01-29 Thread Arkady Borkovsky
How savings can be much higher than 1/3. (probably I'm already preaching to the quire) 1. The situation I have in mind looks like this: -- you have a data set M with 10^11 to 10^12 records that is produced once in a while and used hundreds of times (before its new version is generated). This

[jira] Commented: (HADOOP-952) Create a public (shared) Hadoop EC2 AMI

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468426 ] Doug Cutting commented on HADOOP-952: - This would be great to have! Someone would need to donate the S3 storage

[jira] Created: (HADOOP-953) huge log files

2007-01-29 Thread Andrew McNabb (JIRA)
huge log files -- Key: HADOOP-953 URL: https://issues.apache.org/jira/browse/HADOOP-953 Project: Hadoop Issue Type: Improvement Affects Versions: 0.10.1 Environment: N/A Reporter: Andrew McNabb On our s

[jira] Commented: (HADOOP-928) make checksums optional per FileSystem

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468425 ] Doug Cutting commented on HADOOP-928: - > It's hard to for me to see how to remove createRaw and openRaw from > F

[jira] Assigned: (HADOOP-432) support undelete, snapshots, or other mechanism to recover lost files

2007-01-29 Thread Sameer Paranjpye (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-432?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sameer Paranjpye reassigned HADOOP-432: --- Assignee: Doug Cutting (was: Wendy Chien) > support undelete, snapshots, or other me

[jira] Commented: (HADOOP-884) Create scripts to run Hadoop on Amazon EC2

2007-01-29 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468423 ] Hadoop QA commented on HADOOP-884: -- +1, because http://issues.apache.org/jira/secure/attachment/12349853/hadoop-884

[jira] Created: (HADOOP-952) Create a public (shared) Hadoop EC2 AMI

2007-01-29 Thread Tom White (JIRA)
Create a public (shared) Hadoop EC2 AMI --- Key: HADOOP-952 URL: https://issues.apache.org/jira/browse/HADOOP-952 Project: Hadoop Issue Type: Improvement Components: scripts Affects Versions:

[jira] Created: (HADOOP-951) java.util.ConcurrentModificationException in FSNamesystem.chooseTargets

2007-01-29 Thread Koji Noguchi (JIRA)
java.util.ConcurrentModificationException in FSNamesystem.chooseTargets - Key: HADOOP-951 URL: https://issues.apache.org/jira/browse/HADOOP-951 Project: Hadoop Issue T

[jira] Commented: (HADOOP-950) Job completes but command doesn't return

2007-01-29 Thread Andrew McNabb (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-950?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468419 ] Andrew McNabb commented on HADOOP-950: -- I'm doing a test right now. The output of hadoop-streaming is being sen

[jira] Commented: (HADOOP-926) control-c of the submitting program should kill the job

2007-01-29 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468418 ] Owen O'Malley commented on HADOOP-926: -- The problem with command line switches is that not all applications use

[jira] Commented: (HADOOP-928) make checksums optional per FileSystem

2007-01-29 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468417 ] Hairong Kuang commented on HADOOP-928: -- > The 'raw' FileSystem methods would be removed. It's hard to for me to

[jira] Updated: (HADOOP-884) Create scripts to run Hadoop on Amazon EC2

2007-01-29 Thread Tom White (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tom White updated HADOOP-884: - Attachment: hadoop-884.patch The attached patch includes the Hadoop EC2 scripts in contrib/ec2/bin. I thi

[jira] Updated: (HADOOP-884) Create scripts to run Hadoop on Amazon EC2

2007-01-29 Thread Tom White (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tom White updated HADOOP-884: - Status: Patch Available (was: In Progress) > Create scripts to run Hadoop on Amazon EC2 > --

[jira] Created: (HADOOP-950) Job completes but command doesn't return

2007-01-29 Thread Andrew McNabb (JIRA)
Job completes but command doesn't return Key: HADOOP-950 URL: https://issues.apache.org/jira/browse/HADOOP-950 Project: Hadoop Issue Type: Bug Components: contrib/streaming Affects Versi

[jira] Resolved: (HADOOP-949) reduce hangs after applying HADOOP-248 on 20 node cluster

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-949?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doug Cutting resolved HADOOP-949. - Resolution: Fixed I just reverted the patch from HADOOP-248. Thanks, Nigel! > reduce hangs afte

[jira] Reopened: (HADOOP-248) locating map outputs via random probing is inefficient

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doug Cutting reopened HADOOP-248: - I just reverted this, since it was causing things to hang. > locating map outputs via random probing

[jira] Commented: (HADOOP-946) Map directly to HDFS or reduce()

2007-01-29 Thread Doug Judd (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468405 ] Doug Judd commented on HADOOP-946: -- Ok, I see your reasoning. Go ahead and downgrade this one or remove it entirel

Re: [jira] Commented: (HADOOP-948) Coding style issues

2007-01-29 Thread Doug Cutting
Nigel Daley wrote: Currently, when I run checkstyle (with sun's style conventions, which we use for Hadoop) I get tons of warnings on almost each and every source file. ...which is perhaps one reason it is too late to use CheckStyle. We can't use CheckStyle, but not for that reason: it's LGP

[jira] Commented: (HADOOP-948) Coding style issues

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468402 ] Doug Cutting commented on HADOOP-948: - CheckStyle is LGPL, so cannot be included in an Apache project. I would n

[jira] Commented: (HADOOP-946) Map directly to HDFS or reduce()

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468399 ] Doug Cutting commented on HADOOP-946: - > Seems like you'd need an atomic record append API to handle this. Using

Re: [jira] Commented: (HADOOP-948) Coding style issues

2007-01-29 Thread Nigel Daley
On a related note, we should really use CheckStyle (http:// checkstyle.sourceforge.net) to enforce the coding styles. Currently, when I run checkstyle (with sun's style conventions, which we use for Hadoop) I get tons of warnings on almost each and every source file. ...which is perhaps on

[jira] Commented: (HADOOP-949) reduce hangs after applying HADOOP-248 on 20 node cluster

2007-01-29 Thread Nigel Daley (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-949?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468396 ] Nigel Daley commented on HADOOP-949: Task tracker log: ... [reduce task launches] 2007-01-27 04:35:49,483 INFO o

[jira] Created: (HADOOP-949) reduce hangs after applying HADOOP-248 on 20 node cluster

2007-01-29 Thread Nigel Daley (JIRA)
reduce hangs after applying HADOOP-248 on 20 node cluster - Key: HADOOP-949 URL: https://issues.apache.org/jira/browse/HADOOP-949 Project: Hadoop Issue Type: Bug Components: m

[jira] Commented: (HADOOP-941) Make Hadoop Record I/O Easier to use outside Hadoop

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468395 ] Doug Cutting commented on HADOOP-941: - > It is not clear that Hadoop considers the record io a stable interface t

[jira] Updated: (HADOOP-948) Coding style issues

2007-01-29 Thread David Bowen (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Bowen updated HADOOP-948: --- Attachment: (was: hadoop-948.patch) > Coding style issues > > >

[jira] Updated: (HADOOP-948) Coding style issues

2007-01-29 Thread David Bowen (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Bowen updated HADOOP-948: --- Attachment: hadoop-948.patch > Coding style issues > > > Key: H

[jira] Updated: (HADOOP-948) Coding style issues

2007-01-29 Thread David Bowen (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-948?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Bowen updated HADOOP-948: --- Attachment: hadoop-948.patch > Coding style issues > > > Key: H

[jira] Commented: (HADOOP-946) Map directly to HDFS or reduce()

2007-01-29 Thread Doug Judd (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468389 ] Doug Judd commented on HADOOP-946: -- > Actually, the ability to keep a large number of maps (one per block) and a >

[jira] Updated: (HADOOP-917) NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue

2007-01-29 Thread Sami Siren (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sami Siren updated HADOOP-917: -- Attachment: sequencefile-npe-test.patch Attached patch reproduces the NPE in trunk version of hadoop:

[jira] Commented: (HADOOP-948) Coding style issues

2007-01-29 Thread Milind Bhandarkar (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468385 ] Milind Bhandarkar commented on HADOOP-948: -- On a related note, we should really use CheckStyle (http://chec

[jira] Resolved: (HADOOP-942) dfsadmin -report returns deadnode as "In Service"

2007-01-29 Thread Koji Noguchi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-942?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Koji Noguchi resolved HADOOP-942. - Resolution: Duplicate Dupilcate of HADOOP-632. > dfsadmin -report returns deadnode as "In Servi

[jira] Commented: (HADOOP-926) control-c of the submitting program should kill the job

2007-01-29 Thread Milind Bhandarkar (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468380 ] Milind Bhandarkar commented on HADOOP-926: -- >I will likely leave a config option that retains the current be

[jira] Commented: (HADOOP-926) control-c of the submitting program should kill the job

2007-01-29 Thread Yoram Arnon (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468379 ] Yoram Arnon commented on HADOOP-926: wouldn't simply running the command in the background provide the current b

[jira] Created: (HADOOP-948) Coding style issues

2007-01-29 Thread David Bowen (JIRA)
Coding style issues Key: HADOOP-948 URL: https://issues.apache.org/jira/browse/HADOOP-948 Project: Hadoop Issue Type: Bug Components: metrics Reporter: David Bowen Priority: Minor I w

[jira] Commented: (HADOOP-937) data node re-registration

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468378 ] Doug Cutting commented on HADOOP-937: - Do you think this change to the protocol is entirely back-compatible, or s

[jira] Commented: (HADOOP-926) control-c of the submitting program should kill the job

2007-01-29 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468377 ] Owen O'Malley commented on HADOOP-926: -- To be honest, I've used it as a feature too, but it confuses users who d

[jira] Commented: (HADOOP-941) Make Hadoop Record I/O Easier to use outside Hadoop

2007-01-29 Thread Milind Bhandarkar (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468376 ] Milind Bhandarkar commented on HADOOP-941: -- >Personally, I'm fine with it either way. Ben, The way I see

[jira] Commented: (HADOOP-946) Map directly to HDFS or reduce()

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468375 ] Doug Cutting commented on HADOOP-946: - Note that there is a standard workaround for this: have your map function

[jira] Created: (HADOOP-947) isReplicationInProgress() is very heavyweight

2007-01-29 Thread dhruba borthakur (JIRA)
isReplicationInProgress() is very heavyweight - Key: HADOOP-947 URL: https://issues.apache.org/jira/browse/HADOOP-947 Project: Hadoop Issue Type: Bug Components: dfs Reporter:

[jira] Commented: (HADOOP-941) Make Hadoop Record I/O Easier to use outside Hadoop

2007-01-29 Thread Benjamin Reed (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468374 ] Benjamin Reed commented on HADOOP-941: -- Using the Hadoop jar would be an extreme cognitive burden. Plunking a m

[jira] Created: (HADOOP-946) Map directly to HDFS or reduce()

2007-01-29 Thread Doug Judd (JIRA)
Map directly to HDFS or reduce() Key: HADOOP-946 URL: https://issues.apache.org/jira/browse/HADOOP-946 Project: Hadoop Issue Type: New Feature Components: mapred Environment: all

[jira] Assigned: (HADOOP-928) make checksums optional per FileSystem

2007-01-29 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hairong Kuang reassigned HADOOP-928: Assignee: Hairong Kuang (was: Doug Cutting) > make checksums optional per FileSystem > ---

weird exceptions

2007-01-29 Thread Andrew McNabb
Hi. I'm getting some weird errors on the task trackers, which die as a result. I was wondering if this is a known problem. These are showing up in 0.10.0; I'm upgrading to 0.10.1 right now, but I thought I'd send the logs along in case it's not something that's fixed in 0.10.1. I hope this is h

[jira] Commented: (HADOOP-926) control-c of the submitting program should kill the job

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468364 ] Doug Cutting commented on HADOOP-926: - > I always thought this was a feature ... ;) Heh. That was my first thou

[jira] Commented: (HADOOP-941) Make Hadoop Record I/O Easier to use outside Hadoop

2007-01-29 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-941?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468361 ] Doug Cutting commented on HADOOP-941: - > I will admit to being one of the motivators of this bug. So how much of

[jira] Updated: (HADOOP-934) TaskTracker sends duplicate status when updating task metrics throws exception

2007-01-29 Thread Arun C Murthy (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun C Murthy updated HADOOP-934: - Attachment: HADOOP-934_20070129_1.patch Simple fix which catches the MetricsException and just lo

Re: [jira] Commented: (HADOOP-939) No-sort optimization

2007-01-29 Thread Doug Cutting
Arkady Borkovsky wrote: Does this model assume that the size of the output of reduce is similar to the size of the input? An important class of applications (mentioned in this thread before) uses two inputs: -- M ("master file") -- very large, presorted and not changing from run to run, -- D

[jira] Commented: (HADOOP-934) TaskTracker sends duplicate status when updating task metrics throws exception

2007-01-29 Thread Arun C Murthy (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468359 ] Arun C Murthy commented on HADOOP-934: -- Ok, this entails a simple fix; however what are everyone's view on how w

[jira] Assigned: (HADOOP-934) TaskTracker sends duplicate status when updating task metrics throws exception

2007-01-29 Thread Arun C Murthy (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun C Murthy reassigned HADOOP-934: Assignee: Arun C Murthy > TaskTracker sends duplicate status when updating task metrics thr

Re: [jira] Commented: (HADOOP-939) No-sort optimization

2007-01-29 Thread Arkady Borkovsky
Doug's calculation shows that the total gain can be only 1/3 (15 are unavoidable, and taking advantage of largely pre-sorted input reduces overhead from 12/27 to 3/18, so the maximum total gain is 27->18.) Does this model assume that the size of the output of reduce is similar to the size

[jira] Commented: (HADOOP-944) Redundant field: DatanodeDescriptor.isAlive

2007-01-29 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-944?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468351 ] dhruba borthakur commented on HADOOP-944: - I agree. We also should keep in mind that reducing the number of

[jira] Commented: (HADOOP-926) control-c of the submitting program should kill the job

2007-01-29 Thread Andrzej Bialecki (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12468247 ] Andrzej Bialecki commented on HADOOP-926: -- I alaways thought this was a feature ... ;) I would sometimes fo