[jira] Closed: (PIG-1360) Pig API docs should include Piggybank

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1360. --- > Pig API docs should include Piggybank > - > > Key: PIG-1360 >

[jira] Closed: (PIG-1174) Creation of output path should be done by storage function

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1174. --- > Creation of output path should be done by storage function >

[jira] Closed: (PIG-1278) Type mismatch in key from map: expected org.apache.pig.impl.io.NullableFloatWritable, recieved org.apache.pig.impl.io.NullableText

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1278. --- > Type mismatch in key from map: expected > org.apache.pig.impl.io.NullableFloatWritable, recieved > org.apache.p

[jira] Closed: (PIG-1323) Communicate whether the call to LoadFunc.setLocation is being made in hadoop's front end or backend

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1323?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1323. --- > Communicate whether the call to LoadFunc.setLocation is being made in > hadoop's front end or backend >

[jira] Closed: (PIG-1031) PigStorage interpreting chararray/bytearray for a tuple element inside a bag as float or double

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1031. --- > PigStorage interpreting chararray/bytearray for a tuple element inside a bag > as float or double >

[jira] Closed: (PIG-1245) Remove the connection to namenode in HExecutionEngine.init()

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1245. --- > Remove the connection to namenode in HExecutionEngine.init() > -

[jira] Closed: (PIG-1274) Column pruning throws Null pointer exception

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1274. --- > Column pruning throws Null pointer exception > > > K

[jira] Closed: (PIG-1179) Consecutives ORDER BY on the same relation don't work

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1179. --- > Consecutives ORDER BY on the same relation don't work > - > >

[jira] Closed: (PIG-759) HBaseStorage scheme for Load/Slice function

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-759. -- > HBaseStorage scheme for Load/Slice function > --- > > Key: P

[jira] Closed: (PIG-1138) [zebra] Support of PIG's new Load/Store Interfaces

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1138. --- > [zebra] Support of PIG's new Load/Store Interfaces > --- > >

[jira] Closed: (PIG-1374) PushDownForeachFlatten shall not push ForEach below Join if the flattened fields is used in the next statement

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1374?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1374. --- > PushDownForeachFlatten shall not push ForEach below Join if the flattened > fields is used in the next statement

[jira] Closed: (PIG-1391) pig unit tests leave behind files in temp directory because MiniCluster files don't get deleted

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1391?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1391. --- > pig unit tests leave behind files in temp directory because MiniCluster files > don't get deleted >

[jira] Closed: (PIG-1417) Site changes for 0.7

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1417. --- > Site changes for 0.7 > > > Key: PIG-1417 > URL: https://issu

[jira] Closed: (PIG-1366) PigStorage's pushProjection implementation results in NPE under certain data conditions

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1366. --- > PigStorage's pushProjection implementation results in NPE under certain data > conditions >

[jira] Closed: (PIG-1365) WrappedIOException is missing from Pig.jar

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1365. --- > WrappedIOException is missing from Pig.jar > -- > > Key:

[jira] Closed: (PIG-1394) POCombinerPackage hold too much memory for InternalCachedBag

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1394. --- > POCombinerPackage hold too much memory for InternalCachedBag > --

[jira] Closed: (PIG-1369) POProject does not handle null tuples and non existent fields in some cases

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1369. --- > POProject does not handle null tuples and non existent fields in some cases > ---

[jira] Closed: (PIG-1372) Restore PigInputFormat.sJob for backward compatibility

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1372. --- > Restore PigInputFormat.sJob for backward compatibility > -- >

[jira] Closed: (PIG-1384) Adding contrib javadoc to main Pig javadoc

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1384?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1384. --- > Adding contrib javadoc to main Pig javadoc > -- > > Key:

[jira] Closed: (PIG-1361) [Zebra] Zebra TableLoader.getSchema() should return the projectionSchema specified in the constructor of TableLoader instead of pruned proejction by pig

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1361?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1361. --- > [Zebra] Zebra TableLoader.getSchema() should return the projectionSchema > specified in the constructor of Table

[jira] Closed: (PIG-1346) In unit tests Util.executeShellCommand relies on java commands being in the path and does not consider JAVA_HOME

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1346. --- > In unit tests Util.executeShellCommand relies on java commands being in the > path and does not consider JAVA_HO

[jira] Closed: (PIG-1349) [Zebra] Hubson test failure in test case TestBasicUnion

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1349. --- > [Zebra] Hubson test failure in test case TestBasicUnion > ---

[jira] Closed: (PIG-1348) PigStorage making unnecessary byte array copy when storing data

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1348. --- > PigStorage making unnecessary byte array copy when storing data > ---

[jira] Closed: (PIG-1352) piggybank UPPER udf throws exception if argument is null

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1352?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1352. --- > piggybank UPPER udf throws exception if argument is null > --

[jira] Closed: (PIG-1362) Provide udf context signature in ensureAllKeysInSameSplit() method of loader

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1362?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1362. --- > Provide udf context signature in ensureAllKeysInSameSplit() method of loader > --

[jira] Closed: (PIG-1364) Public javadoc on apache site still on 0.2, needs to be updated for each version release

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1364. --- > Public javadoc on apache site still on 0.2, needs to be updated for each > version release > ---

[jira] Closed: (PIG-1336) Optimize POStore serialized into JobConf

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1336?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1336. --- > Optimize POStore serialized into JobConf > > > Key: PIG-

[jira] Closed: (PIG-1330) Move pruned schema tracking logic from LoadFunc to core code

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1330. --- > Move pruned schema tracking logic from LoadFunc to core code > --

[jira] Closed: (PIG-1356) [zebra] TableLoader makes unnecessary calls to build a Job instance that create a new JobClient in the hadoop 0.20.9

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1356?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1356. --- > [zebra] TableLoader makes unnecessary calls to build a Job instance that > create a new JobClient in the hadoop

[jira] Closed: (PIG-1327) Incorrect column pruning after multiple JOIN operations

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1327. --- > Incorrect column pruning after multiple JOIN operations > ---

[jira] Closed: (PIG-1357) [zebra] Test cases of map-side GROUP-BY should be added.

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1357?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1357. --- > [zebra] Test cases of map-side GROUP-BY should be added. > --

[jira] Closed: (PIG-1335) UDFFinder should find LoadFunc used by POCast

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1335. --- > UDFFinder should find LoadFunc used by POCast > - > >

[jira] Closed: (PIG-1325) Provide a way to exclude a testcase when running "ant test"

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1325. --- > Provide a way to exclude a testcase when running "ant test" > ---

[jira] Closed: (PIG-1312) Make Pig work with hadoop security

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1312. --- > Make Pig work with hadoop security > -- > > Key: PIG-1312 >

[jira] Closed: (PIG-1315) [Zebra] Implementing OrderedLoadFunc interface for Zebra TableLoader

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1315?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1315. --- > [Zebra] Implementing OrderedLoadFunc interface for Zebra TableLoader > --

[jira] Closed: (PIG-1317) LOLoad should cache results of LoadMetadata.getSchema() for use in subsequent calls to LOLoad.getSchema() or LOLoad.determineSchema()

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1317. --- > LOLoad should cache results of LoadMetadata.getSchema() for use in subsequent > calls to LOLoad.getSchema() or L

[jira] Closed: (PIG-1318) [Zebra] Invalid type for source_table field when using order-preserving Sorted Table Union

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1318. --- > [Zebra] Invalid type for source_table field when using order-preserving > Sorted Table Union > -

[jira] Closed: (PIG-1308) Inifinite loop in JobClient when reading from BinStorage Message: [org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths to process : 2]

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1308?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1308. --- > Inifinite loop in JobClient when reading from BinStorage Message: > [org.apache.hadoop.mapreduce.lib.input.FileI

[jira] Closed: (PIG-1320) Pig/Zebra 0.7.0 Docs

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1320?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1320. --- > Pig/Zebra 0.7.0 Docs > > > Key: PIG-1320 > URL: https://issu

[jira] Closed: (PIG-1316) TextLoader should use Bzip2TextInputFormat for bzip files so that bzip files can be efficiently processed by splitting the files

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1316. --- > TextLoader should use Bzip2TextInputFormat for bzip files so that bzip files > can be efficiently processed by s

[jira] Closed: (PIG-1307) when we spill the DefaultDataBag we are not setting the sized changed flag to be true.

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1307. --- > when we spill the DefaultDataBag we are not setting the sized changed flag to > be true. > -

[jira] Closed: (PIG-1305) Document in Load statement syntax that Pig and underlying M/R does not handle concatenated bz2 and gz files correctly

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1305. --- > Document in Load statement syntax that Pig and underlying M/R does not > handle concatenated bz2 and gz files c

[jira] Closed: (PIG-1310) ISO Date UDFs: Conversion, Trucation and Date Math

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1310. --- > ISO Date UDFs: Conversion, Trucation and Date Math > -- > >

[jira] Closed: (PIG-1306) [zebra] Support of locally sorted input splits

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1306?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1306. --- > [zebra] Support of locally sorted input splits > -- > >

[jira] Closed: (PIG-1303) unable to set outgoing format for org.apache.pig.piggybank.evaluation.util.apachelogparser.DateExtractor

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1303. --- > unable to set outgoing format for > org.apache.pig.piggybank.evaluation.util.apachelogparser.DateExtractor > ---

[jira] Closed: (PIG-1290) WeightedRangePartitioner should not check if input is empty if quantile file is empty

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1290. --- > WeightedRangePartitioner should not check if input is empty if quantile file > is empty > --

[jira] Closed: (PIG-1301) Problem pruning columns with UDF

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1301. --- > Problem pruning columns with UDF > > > Key: PIG-1301 >

[jira] Closed: (PIG-1298) Restore file traversal behavior to Pig loaders

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1298. --- > Restore file traversal behavior to Pig loaders > -- > >

[jira] Closed: (PIG-1296) Skewed join fail due to negative partition index

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1296. --- > Skewed join fail due to negative partition index > > >

[jira] Closed: (PIG-1285) Allow SingleTupleBag to be serialized

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1285. --- > Allow SingleTupleBag to be serialized > - > > Key: PIG-1285 >

[jira] Closed: (PIG-1300) PigStorage does not load tuples with large #s.

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1300. --- > PigStorage does not load tuples with large #s. > -- > >

[jira] Closed: (PIG-1289) PIG Join fails while doing a filter on joined data

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1289. --- > PIG Join fails while doing a filter on joined data > -- > >

[jira] Closed: (PIG-1293) pig wrapper script tends to fail if pig is in the path and PIG_HOME isn't set

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1293. --- > pig wrapper script tends to fail if pig is in the path and PIG_HOME isn't set > -

[jira] Closed: (PIG-1284) pig UDF is lacking XMLLoader. Plan to add the XMLLoader

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1284?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1284. --- > pig UDF is lacking XMLLoader. Plan to add the XMLLoader > ---

[jira] Closed: (PIG-1291) [zebra] Zebra need to support the virtual column 'source_table' for the unsorted table unions also

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1291. --- > [zebra] Zebra need to support the virtual column 'source_table' for the > unsorted table unions also >

[jira] Closed: (PIG-1292) Interface Refinements

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1292?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1292. --- > Interface Refinements > - > > Key: PIG-1292 > URL: https://is

[jira] Closed: (PIG-1276) [Zebra] Changes requried for Zebra due to PIG-1259 changes

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1276. --- > [Zebra] Changes requried for Zebra due to PIG-1259 changes > ---

[jira] Closed: (PIG-1282) [zebra] make Zebra's pig test cases run on real cluster

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1282?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1282. --- > [zebra] make Zebra's pig test cases run on real cluster > ---

[jira] Closed: (PIG-1287) Use hadoop-0.20.2 with pig 0.7.0 release

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1287. --- > Use hadoop-0.20.2 with pig 0.7.0 release > > > Key: PIG-

[jira] Closed: (PIG-1272) Column pruner causes wrong results

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1272. --- > Column pruner causes wrong results > -- > > Key: PIG-1272 >

[jira] Closed: (PIG-1267) Problems with partition filter optimizer

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1267. --- > Problems with partition filter optimizer > > > Key: PIG-

[jira] Closed: (PIG-1263) Script producing varying number of records when COGROUPing value of map data type with and without types

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1263?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1263. --- > Script producing varying number of records when COGROUPing value of map data > type with and without types > ---

[jira] Closed: (PIG-1273) Skewed join throws error

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1273. --- > Skewed join throws error > - > > Key: PIG-1273 > URL: ht

[jira] Closed: (PIG-1275) empty bag in PigStorage read as null

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1275. --- > empty bag in PigStorage read as null > > > Key: PIG-1275 >

[jira] Closed: (PIG-1268) [Zebra] Need an ant target that runs all pig-related tests in Zebra

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1268?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1268. --- > [Zebra] Need an ant target that runs all pig-related tests in Zebra > ---

[jira] Closed: (PIG-1269) [Zebra] Restrict schema definition for collection

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1269. --- > [Zebra] Restrict schema definition for collection > - > >

[jira] Closed: (PIG-1262) Additional findbugs and javac warnings

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1262?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1262. --- > Additional findbugs and javac warnings > -- > > Key: PIG-1262

[jira] Closed: (PIG-1265) Change LoadMetadata and StoreMetadata to use Job instead of Configuraiton and add a cleanupOnFailure method to StoreFuncInterface

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1265?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1265. --- > Change LoadMetadata and StoreMetadata to use Job instead of Configuraiton and > add a cleanupOnFailure method to

[jira] Closed: (PIG-1260) Param Subsitution results in parser error if there is no EOL after last line in script

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1260?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1260. --- > Param Subsitution results in parser error if there is no EOL after last line > in script > -

[jira] Closed: (PIG-1259) ResourceFieldSchema.setSchema should not allow a bag field without a Tuple as its only sub field (the tuple itself can have a schema with > 1 subfields)

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1259. --- > ResourceFieldSchema.setSchema should not allow a bag field without a Tuple as > its only sub field (the tuple i

[jira] Closed: (PIG-1266) Show spill count on the pig console at the end of the job

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1266?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1266. --- > Show spill count on the pig console at the end of the job > -

[jira] Closed: (PIG-1264) Skewed join sampler misses out the key with the highest frequency

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1264. --- > Skewed join sampler misses out the key with the highest frequency > -

[jira] Closed: (PIG-1257) PigStorage per the new load-store redesign should support splitting of bzip files

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1257. --- > PigStorage per the new load-store redesign should support splitting of bzip > files > --

[jira] Closed: (PIG-1261) PigStorageSchema broke after changes to ResourceSchema

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1261. --- > PigStorageSchema broke after changes to ResourceSchema > -- >

[jira] Closed: (PIG-1255) Tiny code cleanup for serialization code for PigSplit

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1255?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1255. --- > Tiny code cleanup for serialization code for PigSplit > - > >

[jira] Closed: (PIG-1256) [Zebra] Bag field should always contain a tuple type as the field schema in ResourceSchema object converted from Zebra Schema

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1256. --- > [Zebra] Bag field should always contain a tuple type as the field schema in > ResourceSchema object converted fr

[jira] Closed: (PIG-1258) [zebra] Number of sorted input splits is unusually high

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1258. --- > [zebra] Number of sorted input splits is unusually high > ---

[jira] Closed: (PIG-1251) Move SortInfo calculation earlier in compilation

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1251. --- > Move SortInfo calculation earlier in compilation > - > >

[jira] Closed: (PIG-1252) Diamond splitter does not generate correct results when using Multi-query optimization

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1252?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1252. --- > Diamond splitter does not generate correct results when using Multi-query > optimization > -

[jira] Closed: (PIG-1253) [zebra] make map/reduce test cases run on real cluster

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1253. --- > [zebra] make map/reduce test cases run on real cluster > -- >

[jira] Closed: (PIG-1250) Make StoreFunc an abstract class and create a mirror interface called StoreFuncInterface

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1250. --- > Make StoreFunc an abstract class and create a mirror interface called > StoreFuncInterface > ---

[jira] Closed: (PIG-1238) Dump does not respect the schema

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1238. --- > Dump does not respect the schema > > > Key: PIG-1238 >

[jira] Closed: (PIG-1243) Passing Complex map types to and from streaming causes a problem

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1243. --- > Passing Complex map types to and from streaming causes a problem > --

[jira] Closed: (PIG-1240) [Zebra] suggestion to have zebra manifest file contain version and svn-revision etc.

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1240. --- > [Zebra] suggestion to have zebra manifest file contain version and > svn-revision etc. > --

[jira] Closed: (PIG-1226) Need to be able to register jars on the command line

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1226. --- > Need to be able to register jars on the command line > > >

[jira] Closed: (PIG-1248) [piggybank] useful String functions

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1248. --- > [piggybank] useful String functions > --- > > Key: PIG-1248 >

[jira] Closed: (PIG-1233) NullPointerException in AVG

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1233. --- > NullPointerException in AVG > > > Key: PIG-1233 > U

[jira] Closed: (PIG-1220) Document unknown keywords as missing or to do in future

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1220. --- > Document unknown keywords as missing or to do in future > ---

[jira] Closed: (PIG-1241) Accumulator is turned on when a map is used with a non-accumulative UDF

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1241. --- > Accumulator is turned on when a map is used with a non-accumulative UDF > ---

[jira] Closed: (PIG-1224) Collected group should change to use new (internal) bag

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1224. --- > Collected group should change to use new (internal) bag > ---

[jira] Closed: (PIG-1218) Use distributed cache to store samples

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1218. --- > Use distributed cache to store samples > -- > > Key: PIG-1218

[jira] Closed: (PIG-1234) Unable to create input slice for har:// files

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1234. --- > Unable to create input slice for har:// files > - > >

[jira] Closed: (PIG-1230) Streaming input in POJoinPackage should use nonspillable bag to collect tuples

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1230. --- > Streaming input in POJoinPackage should use nonspillable bag to collect tuples >

[jira] Closed: (PIG-1216) New load store design does not allow Pig to validate inputs and outputs up front

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1216. --- > New load store design does not allow Pig to validate inputs and outputs up > front > ---

[jira] Closed: (PIG-1217) [piggybank] evaluation.util.Top is broken

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1217. --- > [piggybank] evaluation.util.Top is broken > - > > Key: PI

[jira] Closed: (PIG-1215) Make Hadoop jobId more prominent in the client log

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1215. --- > Make Hadoop jobId more prominent in the client log > -- > >

[jira] Closed: (PIG-1207) [zebra] Data sanity check should be performed at the end of writing instead of later at query time

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1207. --- > [zebra] Data sanity check should be performed at the end of writing instead > of later at query time >

[jira] Closed: (PIG-1209) Port POJoinPackage to proactively spill

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1209?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1209. --- > Port POJoinPackage to proactively spill > --- > > Key: PIG-12

[jira] Closed: (PIG-1212) LogicalPlan.replaceAndAddSucessors produce wrong result when successors are null

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1212. --- > LogicalPlan.replaceAndAddSucessors produce wrong result when successors are > null > ---

[jira] Closed: (PIG-1204) Pig hangs when joining two streaming relations in local mode

2010-05-13 Thread Daniel Dai (JIRA)
[ https://issues.apache.org/jira/browse/PIG-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Daniel Dai closed PIG-1204. --- > Pig hangs when joining two streaming relations in local mode > --

  1   2   3   >