[jira] Commented: (HADOOP-1459) FileSystem.getFileCacheHints returns IP addresses rather than hostnames, which breaks 'data-locality' in map-reduce

2007-06-07 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502699 ] dhruba borthakur commented on HADOOP-1459: -- This allows the WebUI to display machine names as hostNames rat

[jira] Commented: (HADOOP-1300) deletion of excess replicas does not take into account 'rack-locality'

2007-06-07 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502696 ] dhruba borthakur commented on HADOOP-1300: -- +1 Sounds good to me. Thanks. > deletion of excess replicas do

[jira] Assigned: (HADOOP-1459) FileSystem.getFileCacheHints returns IP addresses rather than hostnames, which breaks 'data-locality' in map-reduce

2007-06-07 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dhruba borthakur reassigned HADOOP-1459: Assignee: dhruba borthakur > FileSystem.getFileCacheHints returns IP addresses rat

[jira] Updated: (HADOOP-1459) FileSystem.getFileCacheHints returns IP addresses rather than hostnames, which breaks 'data-locality' in map-reduce

2007-06-07 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] dhruba borthakur updated HADOOP-1459: - Attachment: getHintsIpAddress.patch The getBlockLocations patch HADOOP-894 had a sideeff

[jira] Updated: (HADOOP-1477) Streaming should allow to re-start the command if it failed in the middle of input

2007-06-07 Thread arkady borkovsky (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] arkady borkovsky updated HADOOP-1477: - Description: Sometimes, we need to use imperfect programs to process data. Recently, I

[jira] Created: (HADOOP-1477) Stream should allow to re-start the command if it failed in the middle of input

2007-06-07 Thread arkady borkovsky (JIRA)
Stream should allow to re-start the command if it failed in the middle of input --- Key: HADOOP-1477 URL: https://issues.apache.org/jira/browse/HADOOP-1477 Project: Hadoop

RE: [jira] Updated: (HADOOP-1375) a simple parser for hbase.

2007-06-07 Thread 윤진석
Thank you for your advice, stack. I'll focus my efforts on enhancing the basic functions of issue 1375. > Date: Thu, 7 Jun 2007 09:33:23 -0700 > From: [EMAIL PROTECTED] > To: hadoop-dev@lucene.apache.org > Subject: Re: [jira] Updated: (HADOOP-1375) a simple parser for hbase. > > Edwa

[jira] Updated: (HADOOP-1476) Distributed version of 'Performance Evaluation' script

2007-06-07 Thread stack (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HADOOP-1476: -- Status: Patch Available (was: Open) > Distributed version of 'Performance Evaluation' script > --

[jira] Updated: (HADOOP-1476) Distributed version of 'Performance Evaluation' script

2007-06-07 Thread stack (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HADOOP-1476: -- Attachment: performance.patch Patch that adds a mapreduce task whose map runs a loading client. Commit messag

[jira] Created: (HADOOP-1476) Distributed version of 'Performance Evaluation' script

2007-06-07 Thread stack (JIRA)
Distributed version of 'Performance Evaluation' script -- Key: HADOOP-1476 URL: https://issues.apache.org/jira/browse/HADOOP-1476 Project: Hadoop Issue Type: Test Reporter: stac

[jira] Updated: (HADOOP-1465) Add cluster stop/start scripts for hbase

2007-06-07 Thread stack (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HADOOP-1465: -- Status: In Progress (was: Patch Available) > Add cluster stop/start scripts for hbase > -

[jira] Updated: (HADOOP-1465) Add cluster stop/start scripts for hbase

2007-06-07 Thread stack (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HADOOP-1465: -- Status: Patch Available (was: In Progress) > Add cluster stop/start scripts for hbase > -

[jira] Updated: (HADOOP-1465) Add cluster stop/start scripts for hbase

2007-06-07 Thread stack (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HADOOP-1465: -- Attachment: clusterscripts-v3.patch Version 3. Add in a hbase-env.sh. > Add cluster stop/start scripts for h

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502578 ] Hairong Kuang commented on HADOOP-1470: --- Chatting with Raghu and understand his concern. He does not like the

[jira] Updated: (HADOOP-1475) local filecache disappears

2007-06-07 Thread Christian Kunz (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christian Kunz updated HADOOP-1475: --- Component/s: mapred > local filecache disappears > -- > >

[jira] Created: (HADOOP-1475) local filecache disappears

2007-06-07 Thread Christian Kunz (JIRA)
local filecache disappears -- Key: HADOOP-1475 URL: https://issues.apache.org/jira/browse/HADOOP-1475 Project: Hadoop Issue Type: Bug Affects Versions: 0.13.0 Reporter: Christian Kunz Prior

[jira] Created: (HADOOP-1474) Submittable interface, for the ability to execute and monitor jobs from a java class

2007-06-07 Thread Srikanth Kakani (JIRA)
Submittable interface, for the ability to execute and monitor jobs from a java class Key: HADOOP-1474 URL: https://issues.apache.org/jira/browse/HADOOP-1474 Project:

[jira] Commented: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502549 ] Owen O'Malley commented on HADOOP-1400: --- I filed the unique job id bug over in HADOOP-1473. > JobClient rpc t

[jira] Commented: (HADOOP-1473) Make jobids unique across jobtracker restarts

2007-06-07 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1473?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502547 ] Doug Cutting commented on HADOOP-1473: -- This sounds reasonable to me. +1 > Make jobids unique across jobtrack

[jira] Created: (HADOOP-1473) Make jobids unique across jobtracker restarts

2007-06-07 Thread Owen O'Malley (JIRA)
Make jobids unique across jobtracker restarts - Key: HADOOP-1473 URL: https://issues.apache.org/jira/browse/HADOOP-1473 Project: Hadoop Issue Type: Improvement Components: mapred Affe

[jira] Commented: (HADOOP-1300) deletion of excess replicas does not take into account 'rack-locality'

2007-06-07 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502544 ] Hairong Kuang commented on HADOOP-1300: --- First of all I think removing 30 replicas is not a common case. Even

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Raghu Angadi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502538 ] Raghu Angadi commented on HADOOP-1470: -- > if you think that model is broken, propose another. I am more than h

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502533 ] Doug Cutting commented on HADOOP-1470: -- > I don't agree with 'it is good enough until it is extremely difficult

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502526 ] Hairong Kuang commented on HADOOP-1470: --- Raghu, the more I think about it, it makes sense to have generic clas

[jira] Commented: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502512 ] Doug Cutting commented on HADOOP-1400: -- So your contention is that only TimeoutException is retried, and that t

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Raghu Angadi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502511 ] Raghu Angadi commented on HADOOP-1470: -- > could you use the above? _can_ I? Yeah sure, my argument has never b

[jira] Commented: (HADOOP-71) The SequenceFileRecordReader uses the default FileSystem rather than the supplied one

2007-06-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-71?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502510 ] Hadoop QA commented on HADOOP-71: - +1 http://issues.apache.org/jira/secure/attachment/12359029/non-default-fs-input.p

[jira] Commented: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502509 ] Owen O'Malley commented on HADOOP-1400: --- I think that my patch addresses the problem at hand. If the JobTracke

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502508 ] Doug Cutting commented on HADOOP-1470: -- > Do you think attached patch is generic? Nothing is generic until it'

[jira] Commented: (HADOOP-1444) Block allocation method does not check pendingCreates for duplicate block ids

2007-06-07 Thread Konstantin Shvachko (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1444?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502506 ] Konstantin Shvachko commented on HADOOP-1444: - PendingCreates: - Redundant import: import org.apache.had

[jira] Commented: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Michael Bieniosek (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502504 ] Michael Bieniosek commented on HADOOP-1400: --- > An easy fix is to make job-ids more unique. +1 to that: I

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Raghu Angadi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502499 ] Raghu Angadi commented on HADOOP-1470: -- Doug, I personally like the condition that block size should be a multi

[jira] Updated: (HADOOP-1472) Timed-out tasks are marked as 'KILLED' rather than as 'FAILED' which means the framework doesn't fail a TIP with 4 or more timed-out attempts

2007-06-07 Thread Arun C Murthy (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun C Murthy updated HADOOP-1472: -- Attachment: HADOOP-1472_1_20070608.patch Patch for review while I continue testing, this one h

[jira] Commented: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502497 ] Doug Cutting commented on HADOOP-1400: -- I don't see how this handles the case I mentioned above, where: 1. Cli

[jira] Commented: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502496 ] Hadoop QA commented on HADOOP-1400: --- +1 http://issues.apache.org/jira/secure/attachment/12359198/job-client-retry

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Doug Cutting (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502492 ] Doug Cutting commented on HADOOP-1470: -- > changing block size during upgrade. that will surely not be a picnic.

[jira] Commented: (HADOOP-1472) Timed-out tasks are marked as 'KILLED' rather than as 'FAILED' which means the framework doesn't fail a TIP with 4 or more timed-out attempts

2007-06-07 Thread Arun C Murthy (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502493 ] Arun C Murthy commented on HADOOP-1472: --- Forgot to mention: Thanks to Alejandro and his team for pointing thi

[jira] Updated: (HADOOP-1472) Timed-out tasks are marked as 'KILLED' rather than as 'FAILED' which means the framework doesn't fail a TIP with 4 or more timed-out attempts

2007-06-07 Thread Arun C Murthy (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Arun C Murthy updated HADOOP-1472: -- Fix Version/s: (was: 0.13.0) 0.14.0 > Timed-out tasks are marked as 'KI

[jira] Created: (HADOOP-1472) Timed-out tasks are marked as 'KILLED' rather than as 'FAILED' which means the framework doesn't fail a TIP with 4 or more timed-out attempts

2007-06-07 Thread Arun C Murthy (JIRA)
Timed-out tasks are marked as 'KILLED' rather than as 'FAILED' which means the framework doesn't fail a TIP with 4 or more timed-out attempts -

[jira] Updated: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Hairong Kuang updated HADOOP-1470: -- Attachment: genericChecksum.patch This is an initial patch for review. It assumes that block s

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Raghu Angadi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502477 ] Raghu Angadi commented on HADOOP-1470: -- > For backward compatibility, could we enforce this during current dfs

[jira] Commented: (HADOOP-1283) Eliminate internal UTF8 to String and vice versa conversions in the name-node.

2007-06-07 Thread Konstantin Shvachko (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502469 ] Konstantin Shvachko commented on HADOOP-1283: - UTF8 elimination proposal. # Protocols (ClientProtocol, D

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Raghu Angadi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502452 ] Raghu Angadi commented on HADOOP-1470: -- This does not fix existing data. I have written quite a bit of code in

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Hairong Kuang (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502446 ] Hairong Kuang commented on HADOOP-1470: --- > Finally, I think it's okay to throw an exception in the client when

[jira] Updated: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated HADOOP-1400: -- Status: Patch Available (was: Open) > JobClient rpc times out getting job status > --

[jira] Updated: (HADOOP-1400) JobClient rpc times out getting job status

2007-06-07 Thread Owen O'Malley (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Owen O'Malley updated HADOOP-1400: -- Attachment: job-client-retry-4.patch Ok, this patch has: 1. Retry on timeouts for JobSubmiss

Build failed in Hudson: Hadoop-Nightly #114

2007-06-07 Thread hudson
See http://lucene.zones.apache.org:8080/hudson/job/Hadoop-Nightly/114/changes -- [...truncated 24660 lines...] [junit] 07/06/07 17:37:33 INFO ipc.Server: IPC Server handler 6 on 42276: starting [junit] 07/06/07 17:37:33 INFO ipc.Server: IPC Server h

[jira] Commented: (HADOOP-1300) deletion of excess replicas does not take into account 'rack-locality'

2007-06-07 Thread dhruba borthakur (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502431 ] dhruba borthakur commented on HADOOP-1300: -- +1 for the code. Looks good. I still think that the algorithm

Re: [jira] Updated: (HADOOP-1375) a simple parser for hbase.

2007-06-07 Thread Michael Stack
Edward: I reformatted your message below because the original runs together w/o new lines and so is hard to read. Matrix support in an hbaseshell looks like an interesting addition but I'm more interested in getting basic hbaseshell functionality, hadoop-1375, committed first (smile). I'd sugges

[jira] Commented: (HADOOP-1467) Remove redundant counters from WordCount example

2007-06-07 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1467?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502409 ] Hadoop QA commented on HADOOP-1467: --- +1 http://issues.apache.org/jira/secure/attachment/12359032/no-wc-counter.pa

[jira] Commented: (HADOOP-1470) Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs

2007-06-07 Thread Raghu Angadi (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502388 ] Raghu Angadi commented on HADOOP-1470: -- readBuffer.java is tested in my dev environment. Checksumming closest

RE: [jira] Updated: (HADOOP-1375) a simple parser for hbase.

2007-06-07 Thread edward yoon
HBase > CREATE m_temp COLUMNFAMILIES('A','B','C') LIMIT=3;
//Create Constant Matrix
HBase > A = Matrix(number_of_rows, number_of_columns, constant);
//multiply a matrix by a scalar in place, A = s*A
HBase > A = A.timesEquals(3);
HBase > B = Matrix(number_of_rows, number_of_columns, constant);
HB

[jira] Updated: (HADOOP-1375) a simple parser for hbase.

2007-06-07 Thread udanax (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] udanax updated HADOOP-1375: --- Description: http://wiki.apache.org/lucene-hadoop/HbaseShell (work in progress) HBase Shell is developed t

[jira] Assigned: (HADOOP-1457) Counters for monitoring task assignments

2007-06-07 Thread Devaraj Das (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Devaraj Das reassigned HADOOP-1457: --- Assignee: Arun C Murthy (was: Devaraj Das) > Counters for monitoring task assignments > ---

[jira] Commented: (HADOOP-1457) Counters for monitoring task assignments

2007-06-07 Thread Devaraj Das (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-1457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12502269 ] Devaraj Das commented on HADOOP-1457: - Looks good except some white space changes which can be removed. > Count