[2/3] BUILD: Updating CHANGES.txt for Spark 1.1

pwendell Wed, 27 Aug 2014 15:56:40 -0700

http://git-wip-us.apache.org/repos/asf/spark/blob/8597e9cf/CHANGES.txt
----------------------------------------------------------------------
diff --git a/CHANGES.txt b/CHANGES.txt
new file mode 100644
index 0000000..6efb022
--- /dev/null
+++ b/CHANGES.txt
@@ -0,0 +1,14470 @@
+Spark Change Log
+----------------
+
+Release 1.1.0
+
+  Add line continuation for script to work w/ py2.7.5
+  Matthew Farrellee <m...@redhat.com>
+  2014-08-27 15:50:30 -0700
+  Commit: d4cf7a0, github.com/apache/spark/pull/2139
+
+  [SPARK-3235][SQL] Ensure in-memory tables don't always broadcast.
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-27 15:14:08 -0700
+  Commit: 9a62cf3, github.com/apache/spark/pull/2147
+
+  [SPARK-3065][SQL] Add locale setting to fix results do not match for 
udf_unix_timestamp format "yyyy MMM dd h:mm:ss a" run with not 
"America/Los_Angeles" TimeZone in HiveCompatibilitySuite
+  luogankun <luogan...@gmail.com>
+  2014-08-27 15:08:22 -0700
+  Commit: 5ea260e, github.com/apache/spark/pull/1968
+
+  [SQL] [SPARK-3236] Reading Parquet tables from Metastore mangles location
+  Aaron Davidson <aa...@databricks.com>
+  2014-08-27 15:05:47 -0700
+  Commit: 7711687, github.com/apache/spark/pull/2150
+
+  [SPARK-3252][SQL] Add missing condition for test
+  viirya <vii...@gmail.com>
+  2014-08-27 14:55:05 -0700
+  Commit: b3d763b, github.com/apache/spark/pull/2159
+
+  [SPARK-3243] Don't use stale spark-driver.* system properties
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-27 14:46:56 -0700
+  Commit: c1ffa3e, github.com/apache/spark/pull/2154
+
+  Spark-3213 Fixes issue with spark-ec2 not detecting slaves created with 
"Launch More like this"
+  Vida Ha <v...@databricks.com>
+  2014-08-27 14:26:06 -0700
+  Commit: 3cb4e17, github.com/apache/spark/pull/2163
+
+  [SPARK-3138][SQL] sqlContext.parquetFile should be able to take a single 
file as parameter
+  chutium <teng....@gmail.com>
+  2014-08-27 13:13:04 -0700
+  Commit: 90f8f3e, github.com/apache/spark/pull/2044
+
+  [SPARK-3197] [SQL] Reduce the Expression tree object creations for 
aggregation function (min/max)
+  Cheng Hao <hao.ch...@intel.com>
+  2014-08-27 12:50:47 -0700
+  Commit: 4c7f082, github.com/apache/spark/pull/2113
+
+  [SPARK-3118][SQL]add "SHOW TBLPROPERTIES tblname;" and "SHOW COLUMNS 
(FROM|IN) table_name [(FROM|IN) db_name]" support
+  u0jing <u9j...@gmail.com>
+  2014-08-27 12:47:14 -0700
+  Commit: 19cda07, github.com/apache/spark/pull/2034
+
+  SPARK-3259 - User data should be given to the master
+  Allan Douglas R. de Oliveira <al...@chaordicsystems.com>
+  2014-08-27 12:43:22 -0700
+  Commit: 0c94a5b, github.com/apache/spark/pull/2162
+
+  [SPARK-2608][Core] Fixed command line option passing issue over Mesos via 
SPARK_EXECUTOR_OPTS
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-27 12:39:21 -0700
+  Commit: 935bffe, github.com/apache/spark/pull/2161
+
+  [SPARK-3239] [PySpark] randomize the dirs for each process
+  Davies Liu <davies....@gmail.com>
+  2014-08-27 10:40:35 -0700
+  Commit: 092121e, github.com/apache/spark/pull/2152
+
+  [SPARK-3170][CORE][BUG]:RDD info loss in "StorageTab" and "ExecutorTab"
+  uncleGen <husty...@gmail.com>
+  2014-08-27 10:32:13 -0700
+  Commit: 8f8e2a4, github.com/apache/spark/pull/2131
+
+  [SPARK-3154][STREAMING] Make FlumePollingInputDStream shutdown cleaner.
+  Hari Shreedharan <hshreedha...@apache.org>
+  2014-08-27 02:39:02 -0700
+  Commit: 1d468df, github.com/apache/spark/pull/2065
+
+  [SPARK-3227] [mllib] Added migration guide for v1.0 to v1.1
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-27 01:45:59 -0700
+  Commit: 7286d57, github.com/apache/spark/pull/2146
+
+  [SPARK-2830][MLLIB] doc update for 1.1
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-27 01:19:48 -0700
+  Commit: 7401247, github.com/apache/spark/pull/2151
+
+  [SPARK-3237][SQL] Fix parquet filters with UDFs
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-27 00:59:23 -0700
+  Commit: ca01de1, github.com/apache/spark/pull/2153
+
+  [SPARK-3139] Made ContextCleaner to not block on shuffles
+  Tathagata Das <tathagata.das1...@gmail.com>
+  2014-08-27 00:13:38 -0700
+  Commit: 5cf1e44, github.com/apache/spark/pull/2143
+
+  HOTFIX: Minor typo in conf template
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-26 23:40:50 -0700
+  Commit: 6f82a4b
+
+  [SPARK-3167] Handle special driver configs in Windows (Branch 1.1)
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-26 23:06:11 -0700
+  Commit: e7672f1, github.com/apache/spark/pull/2156
+
+  [SPARK-3224] FetchFailed reduce stages should only show up once in failed 
stages (in UI)
+  Reynold Xin <r...@apache.org>, Kay Ousterhout <kayousterh...@gmail.com>
+  2014-08-26 21:59:48 -0700
+  Commit: 2381e90, github.com/apache/spark/pull/2127
+
+  Fix unclosed HTML tag in Yarn docs.
+  Josh Rosen <joshro...@apache.org>
+  2014-08-26 18:55:00 -0700
+  Commit: 7726e56
+
+  [SPARK-3036][SPARK-3037][SQL] Add MapType/ArrayType containing null value 
support to Parquet.
+  Takuya UESHIN <ues...@happy-camper.st>
+  2014-08-26 18:28:41 -0700
+  Commit: 8b5af6f, github.com/apache/spark/pull/2032
+
+  [Docs] Run tests like in contributing guide
+  nchammas <nicholas.cham...@gmail.com>
+  2014-08-26 17:50:04 -0700
+  Commit: 0d97233, github.com/apache/spark/pull/2149
+
+  [SPARK-2964] [SQL] Remove duplicated code from spark-sql and 
start-thriftserver.sh
+  Cheng Lian <lian.cs....@gmail.com>, Kousuke Saruta 
<saru...@oss.nttdata.co.jp>
+  2014-08-26 17:33:40 -0700
+  Commit: c0e1f99, github.com/apache/spark/pull/1886
+
+  [SPARK-3194][SQL] Add AttributeSet to fix bugs with invalid comparisons of 
AttributeReferences
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-26 16:29:14 -0700
+  Commit: a308a16, github.com/apache/spark/pull/2109
+
+  [SPARK-2839][MLlib] Stats Toolkit documentation updated
+  Burak <brk...@gmail.com>
+  2014-08-26 15:18:42 -0700
+  Commit: 2715eb7, github.com/apache/spark/pull/2130
+
+  [SPARK-3226][MLLIB] doc update for native libraries
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-26 15:12:27 -0700
+  Commit: 5ff9000, github.com/apache/spark/pull/2128
+
+  [SPARK-3063][SQL] ExistingRdd should convert Map to catalyst Map.
+  Takuya UESHIN <ues...@happy-camper.st>
+  2014-08-26 15:04:08 -0700
+  Commit: 5d981a4, github.com/apache/spark/pull/1963
+
+  [SPARK-2969][SQL] Make ScalaReflection be able to handle 
ArrayType.containsNull and MapType.valueContainsNull.
+  Takuya UESHIN <ues...@happy-camper.st>
+  2014-08-26 13:22:55 -0700
+  Commit: 35a5853, github.com/apache/spark/pull/1889
+
+  [SPARK-2871] [PySpark] add histgram() API
+  Davies Liu <davies....@gmail.com>
+  2014-08-26 13:04:30 -0700
+  Commit: 83d2730, github.com/apache/spark/pull/2091
+
+  [SPARK-3131][SQL] Allow user to set parquet compression codec for writing 
ParquetFile in SQLContext
+  chutium <teng....@gmail.com>
+  2014-08-26 11:51:26 -0700
+  Commit: 3a9d874, github.com/apache/spark/pull/2039
+
+  [SPARK-2886] Use more specific actor system name than "spark"
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-25 23:36:09 -0700
+  Commit: 0f947f1, github.com/apache/spark/pull/1810
+
+  [Spark-3222] [SQL] Cross join support in HiveQL
+  Daoyuan Wang <daoyuan.w...@intel.com>, adrian-wang <daoyuanw...@gmail.com>
+  2014-08-25 22:56:35 -0700
+  Commit: 48a0749, github.com/apache/spark/pull/2124
+
+  SPARK-2481: The environment variables SPARK_HISTORY_OPTS is covered in 
spark-env.sh
+  witgo <wi...@qq.com>, GuoQiang Li <wi...@qq.com>
+  2014-08-25 19:22:27 -0700
+  Commit: 4d6a0e9, github.com/apache/spark/pull/1341
+
+  [SPARK-3011][SQL] _temporary directory should be filtered out by 
sqlContext.parquetFile
+  Chia-Yung Su <chiay...@appier.com>
+  2014-08-25 18:20:19 -0700
+  Commit: b5dc9b4, github.com/apache/spark/pull/1959
+
+  [SQL] logWarning should be logInfo in getResultSetSchema
+  wangfei <wangfei_he...@126.com>
+  2014-08-25 17:46:43 -0700
+  Commit: 957b356, github.com/apache/spark/pull/1939
+
+  [SPARK-3058] [SQL] Support EXTENDED for EXPLAIN
+  Cheng Hao <hao.ch...@intel.com>
+  2014-08-25 17:43:56 -0700
+  Commit: f8ac8ed, github.com/apache/spark/pull/1962
+
+  [SPARK-2929][SQL] Refactored Thrift server and CLI suites
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-25 16:29:59 -0700
+  Commit: 292f28d, github.com/apache/spark/pull/1856
+
+  [SPARK-3204][SQL] MaxOf would be foldable if both left and right are 
foldable.
+  Takuya UESHIN <ues...@happy-camper.st>
+  2014-08-25 16:27:00 -0700
+  Commit: 19b01d6, github.com/apache/spark/pull/2116
+
+  Fixed a typo in docs/running-on-mesos.md
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-25 14:56:51 -0700
+  Commit: 8d33a6d, github.com/apache/spark/pull/2119
+
+  [FIX] fix error message in sendMessageReliably
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-25 14:55:20 -0700
+  Commit: d892062, github.com/apache/spark/pull/2120
+
+  SPARK-2798 [BUILD] Correct several small errors in Flume module pom.xml files
+  Sean Owen <so...@cloudera.com>
+  2014-08-25 13:29:07 -0700
+  Commit: ff616fd, github.com/apache/spark/pull/1726
+
+  [SPARK-2495][MLLIB] make KMeans constructor public
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-25 12:30:02 -0700
+  Commit: 69a17f1, github.com/apache/spark/pull/2112
+
+  [SPARK-2871] [PySpark] add zipWithIndex() and zipWithUniqueId()
+  Davies Liu <davies....@gmail.com>
+  2014-08-24 21:16:05 -0700
+  Commit: b82da3d, github.com/apache/spark/pull/2092
+
+  [MLlib][SPARK-2997] Update SVD documentation to reflect roughly square
+  Reza Zadeh <riz...@gmail.com>
+  2014-08-24 17:35:54 -0700
+  Commit: 749bddc, github.com/apache/spark/pull/2070
+
+  [SPARK-2841][MLlib] Documentation for feature transformations
+  DB Tsai <dbt...@alpinenow.com>
+  2014-08-24 17:33:33 -0700
+  Commit: a4db81a, github.com/apache/spark/pull/2068
+
+  [SPARK-3192] Some scripts have 2 space indentation but other scripts have 4 
space indentation.
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-24 09:43:44 -0700
+  Commit: ce14cd1, github.com/apache/spark/pull/2104
+
+  [SPARK-2967][SQL]  Follow-up: Also copy hash expressions in sort based 
shuffle fix.
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-23 16:21:08 -0700
+  Commit: e23f0bc, github.com/apache/spark/pull/2072
+
+  [SPARK-2554][SQL] CountDistinct partial aggregation and object allocation 
improvements
+  Michael Armbrust <mich...@databricks.com>, Gregory Owen <greo...@gmail.com>
+  2014-08-23 16:19:10 -0700
+  Commit: 7112da8, github.com/apache/spark/pull/1935
+
+  [SQL] Make functionRegistry in HiveContext transient.
+  Yin Huai <huaiyin....@gmail.com>
+  2014-08-23 12:46:41 -0700
+  Commit: 9309786, github.com/apache/spark/pull/2074
+
+  [SPARK-2963] REGRESSION - The description about how to build for using CLI 
and Thrift JDBC server is absent in proper document  -
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-22 22:28:05 -0700
+  Commit: 5689660, github.com/apache/spark/pull/2080
+
+  [SPARK-3169] Removed dependency on spark streaming test from spark flume sink
+  Tathagata Das <tathagata.das1...@gmail.com>
+  2014-08-22 21:34:48 -0700
+  Commit: cd73631, github.com/apache/spark/pull/2101
+
+  Revert "HOTFIX:Temporarily removing flume sink test in 1.1 branch"
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-22 21:31:52 -0700
+  Commit: 385c4f2
+
+  [SPARK-2840] [mllib] DecisionTree doc update (Java, Python examples)
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-21 00:17:29 -0700
+  Commit: 1e5d9cb, github.com/apache/spark/pull/2063
+
+  BUILD: Bump Hadoop versions in the release build.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-20 12:18:41 -0700
+  Commit: da0a701
+
+  HOTFIX:Temporarily removing flume sink test in 1.1 branch
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-20 22:24:22 -0700
+  Commit: 1d5e84a
+
+  [HOTFIX][STREAMING] Allow the JVM/Netty to decide which port to bind to in 
Flume Polling Tests.
+  Hari Shreedharan <harishreedha...@gmail.com>
+  2014-08-17 19:50:31 -0700
+  Commit: 4485665, github.com/apache/spark/pull/1820
+
+  [HOTFIX][Streaming] Handle port collisions in flume polling test
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-06 16:34:53 -0700
+  Commit: 3f91e9d, github.com/apache/spark/pull/1803
+
+  [SPARK-2843][MLLIB] add a section about regularization parameter in ALS
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-20 17:47:39 -0700
+  Commit: eba399b, github.com/apache/spark/pull/2064
+
+  [SPARK-3143][MLLIB] add tf-idf user guide
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-20 17:41:36 -0700
+  Commit: 1af68ca, github.com/apache/spark/pull/2061
+
+  [SPARK-3140] Clarify confusing PySpark exception message
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-20 17:07:39 -0700
+  Commit: f8bcb12, github.com/apache/spark/pull/2067
+
+  [SPARK-2298] Encode stage attempt in SparkListener & UI.
+  Reynold Xin <r...@apache.org>
+  2014-08-20 15:37:27 -0700
+  Commit: dc05282, github.com/apache/spark/pull/1545
+
+  [SPARK-2169] Don't copy appName / basePath everywhere.
+  Marcelo Vanzin <van...@cloudera.com>
+  2014-08-18 13:25:30 -0700
+  Commit: 2c1683e, github.com/apache/spark/pull/1252
+
+  [SPARK-2846][SQL] Add configureInputJobPropertiesForStorageHandler to 
initialization of job conf
+  Alex Liu <alex_li...@yahoo.com>
+  2014-08-20 16:14:06 -0700
+  Commit: 64e136a, github.com/apache/spark/pull/1927
+
+  SPARK_LOGFILE and SPARK_ROOT_LOGGER no longer need in spark-daemon.sh
+  wangfei <wangfei_he...@126.com>
+  2014-08-20 16:00:46 -0700
+  Commit: 5f72d7b, github.com/apache/spark/pull/2057
+
+  [SPARK-2967][SQL] Fix sort based shuffle for spark sql.
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-20 15:51:14 -0700
+  Commit: 311831d, github.com/apache/spark/pull/2066
+
+  [SPARK-2849] Handle driver configs separately in client mode
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-20 15:01:47 -0700
+  Commit: beb705a, github.com/apache/spark/pull/1845
+
+  [SPARK-3149] Connection establishment information is not enough.
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-20 14:04:39 -0700
+  Commit: 25b01fd, github.com/apache/spark/pull/2060
+
+  [SPARK-3062] [SPARK-2970] [SQL] spark-sql script ends with IOException when 
EventLogging is enabled
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-20 13:26:11 -0700
+  Commit: 5095851, github.com/apache/spark/pull/1970
+
+  [SPARK-3126][SPARK-3127][SQL] Fixed HiveThriftServer2Suite
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-20 12:57:39 -0700
+  Commit: 99ca704, github.com/apache/spark/pull/2036
+
+  SPARK-3092 [SQL]: Always include the thriftserver when -Phive is enabled.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-20 12:13:31 -0700
+  Commit: ca7322d, github.com/apache/spark/pull/2006
+
+  [SPARK-3054][STREAMING] Add unit tests for Spark Sink.
+  Hari Shreedharan <hshreedha...@apache.org>, Hari Shreedharan 
<hshreedha...@cloudera.com>
+  2014-08-20 04:09:54 -0700
+  Commit: 9b29099, github.com/apache/spark/pull/1958
+
+  [SPARK-3141] [PySpark] fix sortByKey() with take()
+  Davies Liu <davies....@gmail.com>
+  2014-08-19 22:43:49 -0700
+  Commit: 5b22ebf, github.com/apache/spark/pull/2045
+
+  [DOCS] Fixed wrong links
+  Ken Takagiwa <ugw.gi.wo...@gmail.com>
+  2014-08-19 22:43:22 -0700
+  Commit: f8c908e, github.com/apache/spark/pull/2042
+
+  [SPARK-2974] [SPARK-2975] Fix two bugs related to spark.local.dirs
+  Josh Rosen <joshro...@apache.org>
+  2014-08-19 22:42:50 -0700
+  Commit: 5d1a878, github.com/apache/spark/pull/2002
+
+  [SPARK-3142][MLLIB] output shuffle data directly in Word2Vec
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-19 22:16:22 -0700
+  Commit: a5bc9c6, github.com/apache/spark/pull/2049
+
+  [SPARK-3119] Re-implementation of TorrentBroadcast.
+  Reynold Xin <r...@apache.org>
+  2014-08-19 22:11:13 -0700
+  Commit: 08c9973, github.com/apache/spark/pull/2030
+
+  [HOTFIX][Streaming][MLlib] use temp folder for checkpoint
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-19 22:05:29 -0700
+  Commit: d5db95b, github.com/apache/spark/pull/2046
+
+  [SPARK-3130][MLLIB] detect negative values in naive Bayes
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-19 21:01:23 -0700
+  Commit: 148e45b, github.com/apache/spark/pull/2038
+
+  [SQL] add note of use synchronizedMap in SQLConf
+  wangfei <wangfei_he...@126.com>, scwf <wangf...@huawei.com>
+  2014-08-19 19:37:02 -0700
+  Commit: 607735c, github.com/apache/spark/pull/1996
+
+  [SPARK-3112][MLLIB] Add documentation and example for StreamingLR
+  freeman <the.freeman....@gmail.com>
+  2014-08-19 18:07:42 -0700
+  Commit: d75464d, github.com/apache/spark/pull/2047
+
+  [MLLIB] minor update to word2vec
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-19 17:41:37 -0700
+  Commit: 023ed7c, github.com/apache/spark/pull/2043
+
+  [SPARK-2468] Netty based block server / client module
+  Reynold Xin <r...@apache.org>
+  2014-08-19 17:40:35 -0700
+  Commit: 66b4c81, github.com/apache/spark/pull/1971
+
+  [SPARK-3136][MLLIB] Create Java-friendly methods in RandomRDDs
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-19 16:06:48 -0700
+  Commit: d371c71, github.com/apache/spark/pull/2041
+
+  [SPARK-2790] [PySpark] fix zip with serializers which have different batch 
sizes.
+  Davies Liu <davies....@gmail.com>
+  2014-08-19 14:46:32 -0700
+  Commit: 3540d4b, github.com/apache/spark/pull/1894
+
+  Move a bracket in validateSettings of SparkConf
+  hzw19900416 <carlmartin...@gmail.com>
+  2014-08-19 14:04:49 -0700
+  Commit: f6b4ab8, github.com/apache/spark/pull/2012
+
+  SPARK-2333 - spark_ec2 script should allow option for existing security group
+  Vida Ha <v...@databricks.com>
+  2014-08-19 13:35:05 -0700
+  Commit: c3952b0, github.com/apache/spark/pull/1899
+
+  [SPARK-3128][MLLIB] Use streaming test suite for StreamingLR
+  freeman <the.freeman....@gmail.com>
+  2014-08-19 13:28:57 -0700
+  Commit: 04a3208, github.com/apache/spark/pull/2037
+
+  [SPARK-3089] Fix meaningless error message in ConnectionManager
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-19 10:15:11 -0700
+  Commit: 5d895ad, github.com/apache/spark/pull/2000
+
+  [SPARK-3072] YARN - Exit when reach max number failed executors
+  Thomas Graves <tgra...@apache.org>
+  2014-08-19 09:40:31 -0500
+  Commit: 1418893, github.com/apache/spark/pull/2022
+
+  Fix typo in decision tree docs
+  Matt Forbes <m...@tellapart.com>
+  2014-08-18 21:43:32 -0700
+  Commit: f3b0f34, github.com/apache/spark/pull/1837
+
+  [SPARK-3116] Remove the excessive lockings in TorrentBroadcast
+  Reynold Xin <r...@apache.org>
+  2014-08-18 20:51:41 -0700
+  Commit: b6d8e66, github.com/apache/spark/pull/2028
+
+  [SPARK-3114] [PySpark] Fix Python UDFs in Spark SQL.
+  Josh Rosen <joshro...@apache.org>, Davies Liu <davies....@gmail.com>
+  2014-08-18 20:42:19 -0700
+  Commit: 3a03259, github.com/apache/spark/pull/2026.
+
+  [SPARK-3108][MLLIB] add predictOnValues to StreamingLR and fix predictOn
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-18 18:20:54 -0700
+  Commit: 7d069bf, github.com/apache/spark/pull/2023
+
+  [SPARK-2850] [SPARK-2626] [mllib] MLlib stats examples + small fixes
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-18 18:01:39 -0700
+  Commit: e3f89e9, github.com/apache/spark/pull/1878
+
+  [mllib] DecisionTree: treeAggregate + Python example bug fix
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-18 14:40:05 -0700
+  Commit: 98778ff, github.com/apache/spark/pull/2015
+
+  [SPARK-2718] [yarn] Handle quotes and other characters in user args.
+  Marcelo Vanzin <van...@cloudera.com>
+  2014-08-18 14:10:10 -0700
+  Commit: 25cabd7, github.com/apache/spark/pull/1724
+
+  [SPARK-3103] [PySpark] fix saveAsTextFile() with utf-8
+  Davies Liu <davies....@gmail.com>
+  2014-08-18 13:58:35 -0700
+  Commit: e083334, github.com/apache/spark/pull/2018
+
+  [SPARK-2406][SQL] Initial support for using ParquetTableScan to read 
HiveMetaStore tables.
+  Michael Armbrust <mich...@databricks.com>, Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-18 13:17:10 -0700
+  Commit: cc4015d, github.com/apache/spark/pull/1819
+
+  [SPARK-3091] [SQL] Add support for caching metadata on Parquet files
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-18 11:00:10 -0700
+  Commit: 2ae2857, github.com/apache/spark/pull/2005
+
+  SPARK-3025 [SQL]: Allow JDBC clients to set a fair scheduler pool
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-18 10:52:20 -0700
+  Commit: 496f62d, github.com/apache/spark/pull/1937
+
+  [SPARK-3085] [SQL] Use compact data structures in SQL joins
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-18 10:45:24 -0700
+  Commit: 4da76fc, github.com/apache/spark/pull/1993
+
+  [SPARK-3084] [SQL] Collect broadcasted tables in parallel in joins
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-18 10:05:52 -0700
+  Commit: 55e9dd6, github.com/apache/spark/pull/1990
+
+  SPARK-3096: Include parquet hive serde by default in build
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-18 10:00:46 -0700
+  Commit: ec0b91e, github.com/apache/spark/pull/2009
+
+  [SPARK-2862] histogram method fails on some choices of bucketCount
+  Chandan Kumar <chandan.ku...@imaginea.com>
+  2014-08-18 09:52:25 -0700
+  Commit: 12f16ba, github.com/apache/spark/pull/1787
+
+  [MLlib] Remove transform(dataset: RDD[String]) from Word2Vec public API
+  Liquan Pei <liquan...@gmail.com>
+  2014-08-18 01:15:45 -0700
+  Commit: e0bc333, github.com/apache/spark/pull/2010
+
+  [SPARK-2842][MLlib]Word2Vec documentation
+  Liquan Pei <liquan...@gmail.com>
+  2014-08-17 23:30:47 -0700
+  Commit: 518258f, github.com/apache/spark/pull/2003
+
+  [SPARK-3097][MLlib] Word2Vec performance improvement
+  Liquan Pei <liquan...@gmail.com>
+  2014-08-17 23:29:44 -0700
+  Commit: 708cde9, github.com/apache/spark/pull/1932
+
+  SPARK-2900. aggregate inputBytes per stage
+  Sandy Ryza <sa...@cloudera.com>
+  2014-08-17 22:39:06 -0700
+  Commit: 0506539, github.com/apache/spark/pull/1826
+
+  SPARK-2884: Create binary builds in parallel with release script.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-17 22:29:58 -0700
+  Commit: a5ae720
+
+  [SPARK-3087][MLLIB] fix col indexing bug in chi-square and add a check for 
number of distinct values
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-17 20:53:18 -0700
+  Commit: 8438daf, github.com/apache/spark/pull/1997
+
+  [SPARK-1981] updated streaming-kinesis.md
+  Chris Fregly <ch...@fregly.com>
+  2014-08-17 19:33:15 -0700
+  Commit: 8263567, github.com/apache/spark/pull/1757
+
+  [SQL] Improve debug logging and toStrings.
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-17 19:00:38 -0700
+  Commit: 4f776df, github.com/apache/spark/pull/2004
+
+  Revert "[SPARK-2970] [SQL] spark-sql script ends with IOException when 
EventLogging is enabled"
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-17 18:10:45 -0700
+  Commit: c6a0091, github.com/apache/spark/pull/2007
+
+  SPARK-2881: Upgrade to Snappy 1.0.5.3 to avoid SPARK-2881.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-17 15:48:39 -0700
+  Commit: d411f41, github.com/apache/spark/pull/1999
+
+  [SPARK-3042] [mllib] DecisionTree Filter top-down instead of bottom-up
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-16 23:53:14 -0700
+  Commit: 91af120, github.com/apache/spark/pull/1975
+
+  [SPARK-3077][MLLIB] fix some chisq-test
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-16 21:16:27 -0700
+  Commit: 413a329, github.com/apache/spark/pull/1982
+
+  In the stop method of ConnectionManager to cancel the ackTimeoutMonitor
+  GuoQiang Li <wi...@qq.com>
+  2014-08-16 20:05:55 -0700
+  Commit: f02e327, github.com/apache/spark/pull/1989
+
+  [SPARK-1065] [PySpark] improve supporting for large broadcast
+  Davies Liu <davies....@gmail.com>
+  2014-08-16 16:59:34 -0700
+  Commit: 5dd571c, github.com/apache/spark/pull/1912
+
+  [SPARK-3035] Wrong example with SparkContext.addFile
+  iAmGhost <kdh7...@gmail.com>
+  2014-08-16 16:48:38 -0700
+  Commit: 721f2fd, github.com/apache/spark/pull/1942
+
+  [SPARK-3081][MLLIB] rename RandomRDDGenerators to RandomRDDs
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-16 15:14:43 -0700
+  Commit: a12d3ae, github.com/apache/spark/pull/1979
+
+  [SPARK-3048][MLLIB] add LabeledPoint.parse and remove 
loadStreamingLabeledPoints
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-16 15:13:34 -0700
+  Commit: 0b354be, github.com/apache/spark/pull/1952
+
+  [SPARK-2677] BasicBlockFetchIterator#next can wait forever
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-16 14:15:58 -0700
+  Commit: bd3ce2f, github.com/apache/spark/pull/1632
+
+  [SQL] Using safe floating-point numbers in doctest
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-16 11:26:51 -0700
+  Commit: 8c79574, github.com/apache/spark/pull/1925
+
+  [SPARK-2977] Ensure ShuffleManager is created before ShuffleBlockManager
+  Josh Rosen <joshro...@apache.org>
+  2014-08-16 00:04:55 -0700
+  Commit: 0e0ec2e, github.com/apache/spark/pull/1976
+
+  [SPARK-3045] Make Serializer interface Java friendly
+  Reynold Xin <r...@apache.org>
+  2014-08-15 23:12:34 -0700
+  Commit: fcf30cd, github.com/apache/spark/pull/1948
+
+  [SPARK-3015] Block on cleaning tasks to prevent Akka timeouts
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-15 22:55:32 -0700
+  Commit: 2541537, github.com/apache/spark/pull/1931
+
+  [SPARK-3001][MLLIB] Improve Spearman's correlation
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-15 21:07:55 -0700
+  Commit: ce06d7f, github.com/apache/spark/pull/1917
+
+  [SPARK-3078][MLLIB] Make LRWithLBFGS API consistent with others
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-15 21:04:29 -0700
+  Commit: c085011, github.com/apache/spark/pull/1973
+
+  [SPARK-3046] use executor's class loader as the default serializer 
classloader
+  Reynold Xin <r...@apache.org>
+  2014-08-15 17:04:15 -0700
+  Commit: 077213b, github.com/apache/spark/pull/1972
+
+  [SPARK-3022] [SPARK-3041] [mllib] Call findBins once per level + unordered 
feature bug fix
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-15 14:50:10 -0700
+  Commit: 407ea9f, github.com/apache/spark/pull/1950
+
+  SPARK-3028. sparkEventToJson should support SparkListenerExecutorMetrics...
+  Sandy Ryza <sa...@cloudera.com>
+  2014-08-15 11:35:08 -0700
+  Commit: 63376a0, github.com/apache/spark/pull/1961
+
+  Revert "[SPARK-2468] Netty based block server / client module"
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-15 09:01:35 -0700
+  Commit: b066af4
+
+  [SPARK-2924] remove default args to overloaded methods
+  Anand Avati <av...@redhat.com>
+  2014-08-15 08:53:52 -0700
+  Commit: debb3e3, github.com/apache/spark/pull/1704
+
+  [SPARK-2468] Netty based block server / client module
+  Reynold Xin <r...@apache.org>
+  2014-08-14 19:01:33 -0700
+  Commit: 3f23d2a, github.com/apache/spark/pull/1907
+
+  [SPARK-2936] Migrate Netty network module from Java to Scala
+  Reynold Xin <r...@apache.org>
+  2014-08-10 20:36:54 -0700
+  Commit: d3cce58, github.com/apache/spark/pull/1865
+
+  [SPARK-2736] PySpark converter and example script for reading Avro files
+  Kan Zhang <kzh...@apache.org>
+  2014-08-14 19:03:51 -0700
+  Commit: 72e730e, github.com/apache/spark/pull/1916
+
+  [SPARK-3027] TaskContext: tighten visibility and provide Java friendly 
callback API
+  Reynold Xin <r...@apache.org>
+  2014-08-14 18:37:02 -0700
+  Commit: f99e4fc, github.com/apache/spark/pull/1938
+
+  Make dev/mima runnable on Mac OS X.
+  Reynold Xin <r...@apache.org>
+  2014-08-14 16:27:11 -0700
+  Commit: 475a35b, github.com/apache/spark/pull/1953
+
+  SPARK-3009: Reverted readObject method in ApplicationInfo so that Applic...
+  Jacek Lewandowski <lewandowski.ja...@gmail.com>
+  2014-08-14 15:01:39 -0700
+  Commit: f5d9176, github.com/apache/spark/pull/1947
+
+  Revert  [SPARK-3011][SQL] _temporary directory should be filtered out by 
sqlContext.parquetFile
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-14 13:00:21 -0700
+  Commit: c39a3f3, github.com/apache/spark/pull/1949
+
+  [SPARK-2979][MLlib] Improve the convergence rate by minimizing the condition 
number
+  DB Tsai <dbt...@alpinenow.com>
+  2014-08-14 11:56:13 -0700
+  Commit: dc8ef93, github.com/apache/spark/pull/1897
+
+  Minor cleanup of metrics.Source
+  Reynold Xin <r...@apache.org>
+  2014-08-14 11:22:41 -0700
+  Commit: a3dc54f, github.com/apache/spark/pull/1943
+
+  [SPARK-2925] [sql]fix spark-sql and start-thriftserver shell bugs when set 
--driver-java-options
+  wangfei <wangfei_he...@126.com>, wangfei <wangf...@huawei.com>
+  2014-08-14 10:55:51 -0700
+  Commit: df25acd, github.com/apache/spark/pull/1851
+
+  [SQL] Python JsonRDD UTF8 Encoding Fix
+  Ahir Reddy <ahirre...@gmail.com>
+  2014-08-14 10:48:52 -0700
+  Commit: 850abaa, github.com/apache/spark/pull/1914
+
+  [SPARK-2927][SQL] Add a conf to configure if we always read Binary columns 
stored in Parquet as String columns
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-14 10:46:33 -0700
+  Commit: de501e1, github.com/apache/spark/pull/1855
+
+  [SPARK-3011][SQL] _temporary directory should be filtered out by 
sqlContext.parquetFile
+  Chia-Yung Su <chiay...@appier.com>
+  2014-08-14 10:43:08 -0700
+  Commit: 221c84e, github.com/apache/spark/pull/1924
+
+  SPARK-2893: Do not swallow Exceptions when running a custom kryo registrator
+  Graham Dennis <graham.den...@gmail.com>
+  2014-08-14 02:24:18 -0700
+  Commit: af809de, github.com/apache/spark/pull/1827
+
+  [SPARK-3029] Disable local execution of Spark jobs by default
+  Aaron Davidson <aa...@databricks.com>
+  2014-08-14 01:37:38 -0700
+  Commit: 0cb2b82, github.com/apache/spark/pull/1321
+
+  [SPARK-2995][MLLIB] add ALS.setIntermediateRDDStorageLevel
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-13 23:53:44 -0700
+  Commit: 1baf06f, github.com/apache/spark/pull/1913
+
+  [Docs] Add missing <code> tags (minor)
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-13 23:24:23 -0700
+  Commit: bf7c6e1, github.com/apache/spark/pull/1936
+
+  [SPARK-3006] Failed to execute spark-shell in Windows OS
+  Masayoshi TSUZUKI <tsudu...@oss.nttdata.co.jp>
+  2014-08-13 22:17:07 -0700
+  Commit: dcd99c3, github.com/apache/spark/pull/1918
+
+  SPARK-3020: Print completed indices rather than tasks in web UI
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-13 18:08:38 -0700
+  Commit: c6cb55a, github.com/apache/spark/pull/1933
+
+  [SPARK-2986] [SQL] fixed: setting properties does not effect
+  guowei <guo...@upyoo.com>
+  2014-08-13 17:45:24 -0700
+  Commit: a8d2649, github.com/apache/spark/pull/1904
+
+  [SPARK-2970] [SQL] spark-sql script ends with IOException when EventLogging 
is enabled
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-13 17:42:38 -0700
+  Commit: b5b632c, github.com/apache/spark/pull/1891
+
+  [SPARK-2935][SQL]Fix parquet predicate push down bug
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-13 17:40:59 -0700
+  Commit: e8e7f17, github.com/apache/spark/pull/1863
+
+  [SPARK-2650][SQL] More precise initial buffer size estimation for in-memory 
column buffer
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-13 17:37:55 -0700
+  Commit: ee7d2cc, github.com/apache/spark/pull/1901
+
+  [SPARK-2994][SQL] Support for udfs that take complex types
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-13 17:35:38 -0700
+  Commit: 71b8408, github.com/apache/spark/pull/1915
+
+  [SPARK-2817] [SQL] add "show create table" support
+  tianyi <tia...@asiainfo-linkage.com>, tianyi <tia...@asiainfo.com>, tianyi 
<tianyi.asiai...@gmail.com>
+  2014-08-13 16:50:02 -0700
+  Commit: 0fb1198, github.com/apache/spark/pull/1760
+
+  [SPARK-3004][SQL] Added null checking when retrieving row set
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-13 16:27:50 -0700
+  Commit: 8732375, github.com/apache/spark/pull/1920
+
+  [MLLIB] use Iterator.fill instead of Array.fill
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-13 16:20:49 -0700
+  Commit: e63bf87, github.com/apache/spark/pull/1930
+
+  [SPARK-2983] [PySpark] improve performance of sortByKey()
+  Davies Liu <davies....@gmail.com>
+  2014-08-13 14:57:12 -0700
+  Commit: a7bc21c, github.com/apache/spark/pull/1898
+
+  [SPARK-3013] [SQL] [PySpark] convert array into list
+  Davies Liu <davies....@gmail.com>
+  2014-08-13 14:56:11 -0700
+  Commit: 9936020, github.com/apache/spark/pull/1928
+
+  [SPARK-2963] [SQL] There no documentation about building to use HiveServer 
and CLI for SparkSQL
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-13 14:42:57 -0700
+  Commit: 78f2f99, github.com/apache/spark/pull/1885
+
+  [SPARK-2993] [MLLib] colStats (wrapper around 
MultivariateStatisticalSummary) in Statistics
+  Doris Xin <doris.s....@gmail.com>
+  2014-08-12 23:47:42 -0700
+  Commit: 5ebeb3f, github.com/apache/spark/pull/1911
+
+  [SPARK-1777 (partial)] bugfix: make size of requested memory correctly
+  Zhang, Liye <liye.zh...@intel.com>
+  2014-08-12 23:43:36 -0700
+  Commit: ec5e2b0, github.com/apache/spark/pull/1892
+
+  Use transferTo when copy merge files in ExternalSorter
+  Raymond Liu <raymond....@intel.com>
+  2014-08-12 23:19:35 -0700
+  Commit: be674b3, github.com/apache/spark/pull/1884
+
+  [SPARK-2953] Allow using short names for io compression codecs
+  Reynold Xin <r...@apache.org>
+  2014-08-12 22:50:29 -0700
+  Commit: 837bf60, github.com/apache/spark/pull/1873
+
+  SPARK-2830 [MLlib]: re-organize mllib documentation
+  Ameet Talwalkar <atalwal...@gmail.com>
+  2014-08-12 17:15:21 -0700
+  Commit: cffd9bb, github.com/apache/spark/pull/1908
+
+  fix flaky tests
+  Davies Liu <davies....@gmail.com>
+  2014-08-12 16:26:01 -0700
+  Commit: b5f8083, github.com/apache/spark/pull/1910
+
+  [MLlib] Correctly set vectorSize and alpha
+  Liquan Pei <liquan...@gmail.com>
+  2014-08-12 00:28:00 -0700
+  Commit: 2a8117a, github.com/apache/spark/pull/1900
+
+  [SPARK-2923][MLLIB] Implement some basic BLAS routines
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-11 22:33:45 -0700
+  Commit: 872c170, github.com/apache/spark/pull/1849
+
+  [SQL] [SPARK-2826] Reduce the memory copy while building the hashmap for 
HashOuterJoin
+  Cheng Hao <hao.ch...@intel.com>
+  2014-08-11 20:45:14 -0700
+  Commit: f66f260, github.com/apache/spark/pull/1765
+
+  [SPARK-2650][SQL] Build column buffers in smaller batches
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-11 20:21:56 -0700
+  Commit: 779d1eb, github.com/apache/spark/pull/1880
+
+  [SPARK-2968][SQL] Fix nullabilities of Explode.
+  Takuya UESHIN <ues...@happy-camper.st>
+  2014-08-11 20:18:03 -0700
+  Commit: 54b387f, github.com/apache/spark/pull/1888
+
+  [SPARK-2965][SQL] Fix HashOuterJoin output nullabilities.
+  Takuya UESHIN <ues...@happy-camper.st>
+  2014-08-11 20:15:01 -0700
+  Commit: dcbf079, github.com/apache/spark/pull/1887
+
+  [SQL] A tiny refactoring in HiveContext#analyze
+  Yin Huai <huaiyin....@gmail.com>
+  2014-08-11 20:11:29 -0700
+  Commit: fd8173f, github.com/apache/spark/pull/1881
+
+  [sql]use SparkSQLEnv.stop() in ShutdownHook
+  wangfei <wangf...@huawei.com>
+  2014-08-11 20:10:13 -0700
+  Commit: 6d0af52, github.com/apache/spark/pull/1852
+
+  [SPARK-2590][SQL] Added option to handle incremental collection, disabled by 
default
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-11 20:08:06 -0700
+  Commit: cf2f807, github.com/apache/spark/pull/1853
+
+  [SPARK-2844][SQL] Correctly set JVM HiveContext if it is passed into Python 
HiveContext constructor
+  Ahir Reddy <ahirre...@gmail.com>
+  2014-08-11 20:06:06 -0700
+  Commit: 8cb4e5b, github.com/apache/spark/pull/1768
+
+  [SPARK-2934][MLlib] Adding LogisticRegressionWithLBFGS Interface
+  DB Tsai <dbt...@alpinenow.com>
+  2014-08-11 19:49:29 -0700
+  Commit: 8f6e2e9, github.com/apache/spark/pull/1862
+
+  [SPARK-2515][mllib] Chi Squared test
+  Doris Xin <doris.s....@gmail.com>
+  2014-08-11 19:22:14 -0700
+  Commit: 7e31f7c, github.com/apache/spark/pull/1733
+
+  [SPARK-2931] In TaskSetManager, reset currentLocalityIndex after recomputing 
locality levels
+  Josh Rosen <joshro...@apache.org>
+  2014-08-11 19:15:01 -0700
+  Commit: 6c64d57, github.com/apache/spark/pull/1896
+
+  [SPARK-2952] Enable logging actor messages at DEBUG level
+  Reynold Xin <r...@apache.org>
+  2014-08-11 15:25:21 -0700
+  Commit: 6ec1374, github.com/apache/spark/pull/1870
+
+  [PySpark] [SPARK-2954] [SPARK-2948] [SPARK-2910] [SPARK-2101] Python 2.6 
Fixes
+  Josh Rosen <joshro...@apache.org>
+  2014-08-11 11:54:09 -0700
+  Commit: 09b8a3c, github.com/apache/spark/pull/1868.
+
+  [SPARK-2937] Separate out samplyByKeyExact as its own API in PairRDDFunction
+  Doris Xin <doris.s....@gmail.com>, Xiangrui Meng <m...@databricks.com>
+  2014-08-10 16:31:07 -0700
+  Commit: 3def842, github.com/apache/spark/pull/1866
+
+  [SPARK-2898] [PySpark] fix bugs in deamon.py
+  Davies Liu <davies....@gmail.com>
+  2014-08-10 13:00:38 -0700
+  Commit: 92daffe, github.com/apache/spark/pull/1842
+
+  Remove extra semicolon in Task.scala
+  GuoQiang Li <wi...@qq.com>
+  2014-08-10 12:12:22 -0700
+  Commit: bb23b11, github.com/apache/spark/pull/1876
+
+  Turn UpdateBlockInfo into case class.
+  Reynold Xin <r...@apache.org>
+  2014-08-09 23:06:54 -0700
+  Commit: 076ddda, github.com/apache/spark/pull/1872
+
+  Updated Spark SQL README to include the hive-thriftserver module
+  Reynold Xin <r...@apache.org>
+  2014-08-09 22:05:36 -0700
+  Commit: e8f8e5f, github.com/apache/spark/pull/1867
+
+  [SPARK-2894] spark-shell doesn't accept flags
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>, Cheng Lian 
<lian.cs....@gmail.com>
+  2014-08-09 21:10:43 -0700
+  Commit: ba223b8, github.com/apache/spark/pull/1715,
+
+  [SPARK-1766] sorted functions to meet pedantic requirements
+  Chris Cope <cc...@resilientscience.com>
+  2014-08-09 20:58:56 -0700
+  Commit: 4a7f3ef, github.com/apache/spark/pull/1859
+
+  [SPARK-2861] Fix Doc comment of histogram method
+  Chandan Kumar <chandan.ku...@imaginea.com>
+  2014-08-09 00:45:54 -0700
+  Commit: 71fcd2e, github.com/apache/spark/pull/1786
+
+  [SPARK-2635] Fix race condition at SchedulerBackend.isReady in standalone 
mode
+  li-zhihui <zhihui...@intel.com>, Li Zhihui <zhihui...@intel.com>
+  2014-08-08 22:52:56 -0700
+  Commit: 3311da2, github.com/apache/spark/pull/1525
+
+  [SPARK-2897][SPARK-2920]TorrentBroadcast does use the serializer class 
specified in the spark option "spark.serializer"
+  GuoQiang Li <wi...@qq.com>
+  2014-08-08 16:57:26 -0700
+  Commit: dd11e4e, github.com/apache/spark/pull/1836
+
+  [SPARK-1997][MLLIB] update breeze to 0.9
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-08 15:07:31 -0700
+  Commit: 8fba6de, github.com/apache/spark/pull/1749
+
+  [SPARK-2700] [SQL] Hidden files (such as .impala_insert_staging) should be 
filtered out by sqlContext.parquetFile
+  chutium <teng....@gmail.com>
+  2014-08-08 13:31:08 -0700
+  Commit: e264503, github.com/apache/spark/pull/1691
+
+  [SPARK-2919] [SQL] Basic support for analyze command in HiveQl
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-08 11:23:58 -0700
+  Commit: daa090f, github.com/apache/spark/pull/1848
+
+  [SPARK-2877] [SQL] MetastoreRelation should use SparkClassLoader when 
creating the tableDesc
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-08 11:15:16 -0700
+  Commit: 8b0188b, github.com/apache/spark/pull/1806
+
+  [SPARK-2908] [SQL] JsonRDD.nullTypeToStringType does not convert all 
NullType to StringType
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-08 11:10:11 -0700
+  Commit: 544a909, github.com/apache/spark/pull/1840
+
+  [SPARK-2888] [SQL] Fix addColumnMetadataToConf in HiveTableScan
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-08 11:01:51 -0700
+  Commit: 3eb5dd0, github.com/apache/spark/pull/1817
+
+  [SPARK-2904] Remove non-used local variable in SparkSubmitArguments
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-07 18:53:15 -0700
+  Commit: a54b5d9, github.com/apache/spark/pull/1834
+
+  SPARK-2565. Update ShuffleReadMetrics as blocks are fetched
+  Sandy Ryza <sa...@cloudera.com>
+  2014-08-07 18:09:03 -0700
+  Commit: aab7735, github.com/apache/spark/pull/1507
+
+  SPARK-2787: Make sort-based shuffle write files directly when there's no 
sorting/aggregation and # partitions is small
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-07 18:04:49 -0700
+  Commit: 0f2274f, github.com/apache/spark/pull/1799
+
+  SPARK-2899 Doc generation is back to working in new SBT Build.
+  Prashant Sharma <prashan...@imaginea.com>
+  2014-08-07 16:24:22 -0700
+  Commit: 30369b8, github.com/apache/spark/pull/1830
+
+  SPARK-2905 Fixed path sbin => bin
+  Oleg Danilov <oleg.dani...@wandisco.com>
+  2014-08-07 15:48:44 -0700
+  Commit: c65c810, github.com/apache/spark/pull/1835
+
+  [SPARK-2852][MLLIB] Separate model from IDF/StandardScaler algorithms
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-07 11:28:12 -0700
+  Commit: f705c1d, github.com/apache/spark/pull/1814
+
+  [mllib] DecisionTree Strategy parameter checks
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-07 00:20:38 -0700
+  Commit: c089429, github.com/apache/spark/pull/1821
+
+  SPARK-2879 part 2 [BUILD] Use HTTPS to access Maven Central and other repos
+  Sean Owen <sro...@gmail.com>
+  2014-08-07 00:04:18 -0700
+  Commit: d6cd6fd0, github.com/apache/spark/pull/1828
+
+  [SPARK-2851] [mllib] DecisionTree Python consistency update
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-06 22:58:59 -0700
+  Commit: c9f0944, github.com/apache/spark/pull/1798
+
+  [SPARK-2887] fix bug of countApproxDistinct() when have more than one 
partition
+  Davies Liu <davies....@gmail.com>
+  2014-08-06 21:22:13 -0700
+  Commit: cc8a7e9, github.com/apache/spark/pull/1812
+
+  Updating versions for Spark 1.1.0
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-06 19:11:39 -0700
+  Commit: cf35b56
+
+  HOTFIX: Support custom Java 7 location
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-06 18:45:03 -0700
+  Commit: 53fa048
+
+  SPARK-2879 [BUILD] Use HTTPS to access Maven Central and other repos
+  Sean Owen <sro...@gmail.com>
+  2014-08-06 18:13:35 -0700
+  Commit: 40284a9, github.com/apache/spark/pull/1805
+
+  [SPARK-2583] ConnectionManager error reporting
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>, Josh Rosen <joshro...@apache.org>
+  2014-08-06 17:27:55 -0700
+  Commit: 3f92ce4, github.com/apache/spark/pull/1758
+
+  SPARK-2882: Spark build now checks local maven cache for dependencies
+  Gregory Owen <greo...@gmail.com>
+  2014-08-06 16:52:00 -0700
+  Commit: c2ae0b0, github.com/apache/spark/pull/1818
+
+  [PySpark] Add blanklines to Python docstrings so example code renders 
correctly
+  RJ Nowling <rnowl...@gmail.com>
+  2014-08-06 14:12:21 -0700
+  Commit: a314e29, github.com/apache/spark/pull/1808
+
+  [SPARK-2852][MLLIB] API consistency for `mllib.feature`
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-06 14:07:51 -0700
+  Commit: e654cfd, github.com/apache/spark/pull/1807
+
+  SPARK-2566. Update ShuffleWriteMetrics incrementally
+  Sandy Ryza <sa...@cloudera.com>
+  2014-08-06 13:10:33 -0700
+  Commit: a65c9ac, github.com/apache/spark/pull/1481
+
+  [SPARK-2627] [PySpark] have the build enforce PEP 8 automatically
+  Nicholas Chammas <nicholas.cham...@gmail.com>, nchammas 
<nicholas.cham...@gmail.com>
+  2014-08-06 12:58:24 -0700
+  Commit: 4c19614, github.com/apache/spark/pull/1744
+
+  [SPARK-2678][Core][SQL] A workaround for SPARK-2678
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-06 12:28:35 -0700
+  Commit: cf8e7fd, github.com/apache/spark/pull/1801
+
+  [SPARK-2875] [PySpark] [SQL] handle null in schemaRDD()
+  Davies Liu <davies....@gmail.com>
+  2014-08-06 11:08:12 -0700
+  Commit: 27a8d4c, github.com/apache/spark/pull/1802
+
+  [SPARK-2157] Enable tight firewall rules for Spark
+  Andrew Or <andrewo...@gmail.com>, Andrew Ash <and...@andrewash.com>
+  2014-08-06 00:07:40 -0700
+  Commit: 31090e4, github.com/apache/spark/pull/1777
+
+  [SPARK-1022][Streaming][HOTFIX] Fixed zookeeper dependency of Kafka
+  Tathagata Das <tathagata.das1...@gmail.com>
+  2014-08-05 23:41:34 -0700
+  Commit: 5b4bc84, github.com/apache/spark/pull/1797
+
+  [MLlib] Use this.type as return type in k-means' builder pattern
+  DB Tsai <dbt...@alpinenow.com>
+  2014-08-05 23:32:29 -0700
+  Commit: aec217a, github.com/apache/spark/pull/1796
+
+  SPARK-2294: fix locality inversion bug in TaskManager
+  CodingCat <zhunans...@gmail.com>
+  2014-08-05 23:02:58 -0700
+  Commit: 1da2fdf, github.com/apache/spark/pull/1313
+
+  [SQL] Fix logging warn -> debug
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-05 22:30:32 -0700
+  Commit: 0482055, github.com/apache/spark/pull/1800
+
+  [SQL] Tighten the visibility of various SQLConf methods and renamed 
setter/getters
+  Reynold Xin <r...@apache.org>
+  2014-08-05 22:29:19 -0700
+  Commit: 4f0b4f4, github.com/apache/spark/pull/1794
+
+  [SPARK-2806] core - upgrade to json4s-jackson 3.2.10
+  Anand Avati <av...@redhat.com>
+  2014-08-05 21:59:10 -0700
+  Commit: 6da8f41, github.com/apache/spark/pull/1702
+
+  [SPARK-2866][SQL] Support attributes in ORDER BY that aren't in SELECT
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-05 20:55:02 -0700
+  Commit: 936f61e, github.com/apache/spark/pull/1795
+
+  [SPARK-2854][SQL] Finalize _acceptable_types in pyspark.sql
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-05 18:56:10 -0700
+  Commit: a10e1b0, github.com/apache/spark/pull/1793
+
+  [SPARK-2650][SQL] Try to partially fix SPARK-2650 by adjusting initial 
buffer size and reducing memory allocation
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-05 18:50:37 -0700
+  Commit: 4233b02, github.com/apache/spark/pull/1769
+
+  [sql] rename project name in pom.xml of hive-thriftserver module
+  wangfei <wangf...@huawei.com>
+  2014-08-05 18:30:02 -0700
+  Commit: 152e36c, github.com/apache/spark/pull/1789
+
+  SPARK-2869 - Fix tiny bug in JdbcRdd for closing jdbc connection
+  Stephen Boesch <javadba>, Stephen Boesch <java...@gmail.com>
+  2014-08-05 18:18:08 -0700
+  Commit: 58247a8, github.com/apache/spark/pull/1792
+
+  [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in 
pyspark's linear methods
+  Michael Giannakopoulos <miccagi...@gmail.com>
+  2014-08-05 16:30:32 -0700
+  Commit: 672904e, github.com/apache/spark/pull/1775
+
+  [SPARK-2503] Lower shuffle output buffer (spark.shuffle.file.buffer.kb) to 
32KB.
+  Reynold Xin <r...@apache.org>
+  2014-08-05 16:24:50 -0700
+  Commit: 0172277, github.com/apache/spark/pull/1781
+
+  [SPARK-2856] Decrease initial buffer size for Kryo to 64KB.
+  Reynold Xin <r...@apache.org>
+  2014-08-05 01:30:46 -0700
+  Commit: 5081b0d, github.com/apache/spark/pull/1780
+
+  [SPARK-2864][MLLIB] fix random seed in word2vec; move model to local
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-05 16:22:41 -0700
+  Commit: e77fa81, github.com/apache/spark/pull/1790
+
+  SPARK-1680: use configs for specifying environment variables on YARN
+  Thomas Graves <tgra...@apache.org>
+  2014-08-05 15:57:32 -0500
+  Commit: 7b798e1, github.com/apache/spark/pull/1512
+
+  SPARK-2380: Support displaying accumulator values in the web UI
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-05 13:08:23 -0700
+  Commit: 46b6983, github.com/apache/spark/pull/1309
+
+  [SPARK-2859] Update url of Kryo project in related docs
+  Guancheng (G.C.) Chen <chenguanch...@gmail.com>
+  2014-08-05 11:50:08 -0700
+  Commit: 0f541ab, github.com/apache/spark/pull/1782
+
+  [SPARK-2860][SQL] Fix coercion of CASE WHEN.
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-05 11:17:50 -0700
+  Commit: 388ab53, github.com/apache/spark/pull/1785
+
+  SPARK-1890 and SPARK-1891- add admin and modify acls
+  Thomas Graves <tgra...@apache.org>
+  2014-08-05 12:52:52 -0500
+  Commit: e3fe657, github.com/apache/spark/pull/1196
+
+  SPARK-1528 - spark on yarn, add support for accessing remote HDFS
+  Thomas Graves <tgra...@apache.org>
+  2014-08-05 12:48:26 -0500
+  Commit: 6c0c65f, github.com/apache/spark/pull/1159
+
+  [SPARK-1022][Streaming] Add Kafka real unit test
+  jerryshao <saisai.s...@intel.com>
+  2014-08-05 10:40:28 -0700
+  Commit: b92a450, github.com/apache/spark/pull/1751
+
+  [SPARK-1779] Throw an exception if memory fractions are not between 0 and 1
+  wangfei <scn...@yeah.net>, wangfei <wangf...@huawei.com>
+  2014-08-05 00:51:07 -0700
+  Commit: 075ba67, github.com/apache/spark/pull/714
+
+  [SPARK-2857] Correct properties to set Master / Worker ports
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-05 00:39:07 -0700
+  Commit: 12f99cf, github.com/apache/spark/pull/1779
+
+  SPARK-2711. Create a ShuffleMemoryManager to track memory for all spilling 
collections
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-04 23:41:03 -0700
+  Commit: d13d253, github.com/apache/spark/pull/1707
+
+  SPARK-2685. Update ExternalAppendOnlyMap to avoid buffer.remove()
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-04 23:27:53 -0700
+  Commit: a092285, github.com/apache/spark/pull/1773
+
+  [SPARK-2323] Exception in accumulator update should not crash DAGScheduler & 
SparkContext
+  Reynold Xin <r...@apache.org>
+  2014-08-04 20:39:18 -0700
+  Commit: 4ed7b5a, github.com/apache/spark/pull/1772
+
+  [SPARK-1687] [PySpark] fix unit tests related to pickable namedtuple
+  Davies Liu <davies....@gmail.com>
+  2014-08-04 15:54:52 -0700
+  Commit: 2225d18, github.com/apache/spark/pull/1771
+
+  SPARK-2792. Fix reading too much or too little data from each stream in 
ExternalMap / Sorter
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-04 12:59:18 -0700
+  Commit: aa7a48e, github.com/apache/spark/pull/1722
+
+  [SPARK-1687] [PySpark] pickable namedtuple
+  Davies Liu <davies....@gmail.com>
+  2014-08-04 12:13:41 -0700
+  Commit: bfd2f39, github.com/apache/spark/pull/1623
+
+  [MLlib] [SPARK-2510]Word2Vec: Distributed Representation of Words
+  Liquan Pei <l...@gopivotal.com>, Xiangrui Meng <m...@databricks.com>, Liquan 
Pei <liquan...@gmail.com>
+  2014-08-03 23:55:58 -0700
+  Commit: 3823f6d, github.com/apache/spark/pull/1719
+
+  SPARK-2272 [MLlib] Feature scaling which standardizes the range of 
independent variables or features of data
+  DB Tsai <dbt...@alpinenow.com>
+  2014-08-03 21:39:21 -0700
+  Commit: 9aa1459, github.com/apache/spark/pull/1207
+
+  Fix some bugs with spaces in directory name.
+  Sarah Gerweck <sarah.a...@gmail.com>
+  2014-08-03 19:47:05 -0700
+  Commit: 2152e24, github.com/apache/spark/pull/1756
+
+  [SPARK-2810] upgrade to scala-maven-plugin 3.2.0
+  Anand Avati <av...@redhat.com>
+  2014-08-03 17:47:49 -0700
+  Commit: 4784d24, github.com/apache/spark/pull/1711
+
+  [SPARK-1740] [PySpark] kill the python worker
+  Davies Liu <davies....@gmail.com>
+  2014-08-03 15:52:00 -0700
+  Commit: a4cdb77, github.com/apache/spark/pull/1643
+
+  [SPARK-2783][SQL] Basic support for analyze in HiveContext
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-03 14:54:41 -0700
+  Commit: 7c6afda, github.com/apache/spark/pull/1741
+
+  [SPARK-2814][SQL] HiveThriftServer2 throws NPE when executing native commands
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-03 12:34:46 -0700
+  Commit: 6ffdcc6, github.com/apache/spark/pull/1753
+
+  [SPARK-2784][SQL] Deprecate hql() method in favor of a config option, 
'spark.sql.dialect'
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-03 12:28:29 -0700
+  Commit: c5ed1de, github.com/apache/spark/pull/1746
+
+  [SPARK-2197] [mllib] Java DecisionTree bug fix and easy-of-use
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-03 10:36:52 -0700
+  Commit: eaa9355, github.com/apache/spark/pull/1740
+
+  SPARK-2246: Add user-data option to EC2 scripts
+  Allan Douglas R. de Oliveira <al...@chaordicsystems.com>
+  2014-08-03 10:25:59 -0700
+  Commit: 162fc95, github.com/apache/spark/pull/1186
+
+  SPARK-2712 - Add a small note to maven doc that mvn package must happen ...
+  Stephen Boesch <java...@gmail.com>
+  2014-08-03 10:19:04 -0700
+  Commit: 1992175, github.com/apache/spark/pull/1615
+
+  [Minor] Fixes on top of #1679
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-02 22:00:46 -0700
+  Commit: fb2a207, github.com/apache/spark/pull/1736
+
+  SPARK-2414 [BUILD] Add LICENSE entry for jquery
+  Sean Owen <sro...@gmail.com>
+  2014-08-02 21:55:56 -0700
+  Commit: c137928, github.com/apache/spark/pull/1748
+
+  SPARK-2602 [BUILD] Tests steal focus under Java 6
+  Sean Owen <sro...@gmail.com>
+  2014-08-02 21:44:19 -0700
+  Commit: 0d47bb6, github.com/apache/spark/pull/1747
+
+  [SPARK-2739][SQL] Rename registerAsTable to registerTempTable
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-02 18:27:04 -0700
+  Commit: 5b30e00, github.com/apache/spark/pull/1743
+
+  [SPARK-2797] [SQL] SchemaRDDs don't support unpersist()
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-02 17:55:22 -0700
+  Commit: 5ef8282, github.com/apache/spark/pull/1745
+
+  [SPARK-2729][SQL] Added test case for SPARK-2729
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-08-02 17:12:49 -0700
+  Commit: 460fad8, github.com/apache/spark/pull/1738
+
+  [SPARK-2785][SQL] Remove assertions that throw when users try unsupported 
Hive commands.
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-02 16:48:07 -0700
+  Commit: 4230df4, github.com/apache/spark/pull/1742
+
+  [SPARK-2097][SQL] UDF Support
+  Michael Armbrust <mich...@databricks.com>
+  2014-08-02 16:33:48 -0700
+  Commit: 3b9f25f, github.com/apache/spark/pull/1063
+
+  SPARK-2804: Remove scalalogging-slf4j dependency
+  GuoQiang Li <wi...@qq.com>
+  2014-08-02 13:55:28 -0700
+  Commit: 7924d72, github.com/apache/spark/pull/also
+
+  [SPARK-1981] Add AWS Kinesis streaming support
+  Chris Fregly <ch...@fregly.com>
+  2014-08-02 13:35:35 -0700
+  Commit: bb0ac6d, github.com/apache/spark/pull/1434
+
+  [SQL] Set outputPartitioning of BroadcastHashJoin correctly.
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-02 13:16:41 -0700
+  Commit: 91de0dc, github.com/apache/spark/pull/1735
+
+  [SPARK-2478] [mllib] DecisionTree Python API
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-02 13:07:17 -0700
+  Commit: 8d6ac2b, github.com/apache/spark/pull/1727
+
+  [HOTFIX] Do not throw NPE if spark.test.home is not set
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-02 12:11:50 -0700
+  Commit: e221108, github.com/apache/spark/pull/1739
+
+  MAINTENANCE: Automated closing of pull requests.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-02 01:26:16 -0700
+  Commit: 87738bf, github.com/apache/spark/pull/706
+
+  HOTFIX: Fix concurrency issue in FlumePollingStreamSuite.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-02 01:11:03 -0700
+  Commit: 44460ba
+
+  HOTFIX: Fixing test error in maven for flume-sink.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-02 00:57:47 -0700
+  Commit: 25cad6a
+
+  [SPARK-1812]  sql/catalyst - Provide explicit type information
+  Anand Avati <av...@redhat.com>
+  2014-08-02 00:48:17 -0700
+  Commit: 08c095b, github.com/apache/spark/pull/1709
+
+  [SPARK-2454] Do not ship spark home to Workers
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-02 00:45:38 -0700
+  Commit: 148af60, github.com/apache/spark/pull/1734
+
+  [SPARK-2316] Avoid O(blocks) operations in listeners
+  Andrew Or <andrewo...@gmail.com>
+  2014-08-01 23:56:24 -0700
+  Commit: d934801, github.com/apache/spark/pull/1679
+
+  Revert "[SPARK-1470][SPARK-1842] Use the scala-logging wrapper instead of 
the directly sfl4j api"
+  Patrick Wendell <pwend...@gmail.com>
+  2014-08-01 23:55:30 -0700
+  Commit: dab3796
+
+  [SPARK-1470][SPARK-1842] Use the scala-logging wrapper instead of the 
directly sfl4j api
+  GuoQiang Li <wi...@qq.com>
+  2014-08-01 23:55:11 -0700
+  Commit: adc8303, github.com/apache/spark/pull/1369
+
+  StatCounter on NumPy arrays [PYSPARK][SPARK-2012]
+  Jeremy Freeman <the.freeman....@gmail.com>
+  2014-08-01 22:33:25 -0700
+  Commit: 4bc3bb2, github.com/apache/spark/pull/1725
+
+  [SPARK-2801][MLlib]: DistributionGenerator renamed to RandomDataGenerator. 
RandomRDD is now of generic type
+  Burak <brk...@gmail.com>
+  2014-08-01 22:32:12 -0700
+  Commit: fda4759, github.com/apache/spark/pull/1732
+
+  [SPARK-1580][MLLIB] Estimate ALS communication and computation costs.
+  Tor Myklebust <tmykl...@gmail.com>, Xiangrui Meng <m...@databricks.com>
+  2014-08-01 21:25:02 -0700
+  Commit: e25ec06, github.com/apache/spark/pull/493
+
+  [SPARK-2550][MLLIB][APACHE SPARK] Support regularization and intercept in 
pyspark's linear methods.
+  Michael Giannakopoulos <miccagi...@gmail.com>
+  2014-08-01 21:00:31 -0700
+  Commit: c281189, github.com/apache/spark/pull/1624
+
+  Streaming mllib [SPARK-2438][MLLIB]
+  Jeremy Freeman <the.freeman....@gmail.com>, freeman 
<the.freeman....@gmail.com>
+  2014-08-01 20:10:26 -0700
+  Commit: f6a1899, github.com/apache/spark/pull/1361
+
+  [SPARK-2764] Simplify daemon.py process structure
+  Josh Rosen <joshro...@apache.org>
+  2014-08-01 19:38:21 -0700
+  Commit: e8e0fd6, github.com/apache/spark/pull/1680
+
+  [SPARK-2800]: Exclude scalastyle-output.xml Apache RAT checks
+  GuoQiang Li <wi...@qq.com>
+  2014-08-01 19:35:16 -0700
+  Commit: a38d3c9, github.com/apache/spark/pull/1729
+
+  [SPARK-2116] Load spark-defaults.conf from SPARK_CONF_DIR if set
+  Albert Chu <ch...@llnl.gov>
+  2014-08-01 19:00:38 -0700
+  Commit: 0da07da, github.com/apache/spark/pull/1059
+
+  [SPARK-2212][SQL] Hash Outer Join (follow-up bug fix).
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-01 18:52:01 -0700
+  Commit: 3822f33, github.com/apache/spark/pull/1721
+
+  [SPARK-2010] [PySpark] [SQL] support nested structure in SchemaRDD
+  Davies Liu <davies....@gmail.com>
+  2014-08-01 18:47:41 -0700
+  Commit: 880eabe, github.com/apache/spark/pull/1598
+
+  [SPARK-2796] [mllib] DecisionTree bug fix: ordered categorical features
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-08-01 15:52:21 -0700
+  Commit: 7058a53, github.com/apache/spark/pull/1720
+
+  [SPARK-2786][mllib] Python correlations
+  Doris Xin <doris.s....@gmail.com>
+  2014-08-01 15:02:17 -0700
+  Commit: d88e695, github.com/apache/spark/pull/1713
+
+  SPARK-2791: Fix committing, reverting and state tracking in shuffle file 
consolidation
+  Aaron Davidson <aa...@databricks.com>
+  2014-08-01 13:57:19 -0700
+  Commit: 78f2af5, github.com/apache/spark/pull/1678
+
+  [SPARK-2379] Fix the bug that streaming's receiver may fall into a dead loop
+  joyyoj <suns...@gmail.com>
+  2014-08-01 13:41:55 -0700
+  Commit: b270309, github.com/apache/spark/pull/1694
+
+  SPARK-1612: Fix potential resource leaks
+  zsxwing <zsxw...@gmail.com>
+  2014-08-01 13:25:04 -0700
+  Commit: f5d9bea, github.com/apache/spark/pull/535
+
+  [SPARK-2490] Change recursive visiting on RDD dependencies to iterative 
approach
+  Liang-Chi Hsieh <vii...@gmail.com>
+  2014-08-01 12:12:30 -0700
+  Commit: baf9ce1, github.com/apache/spark/pull/1418
+
+  [SPARK-695] In DAGScheduler's getPreferredLocs, track set of visited 
partitions.
+  Aaron Staple <aaron.sta...@gmail.com>
+  2014-08-01 12:04:04 -0700
+  Commit: eb5bdca, github.com/apache/spark/pull/1362
+
+  [SQL] Documentation: Explain cacheTable command
+  CrazyJvm <crazy...@gmail.com>
+  2014-08-01 11:46:13 -0700
+  Commit: c82fe47, github.com/apache/spark/pull/1681
+
+  [SPARK-2767] [SQL] SparkSQL CLI doens't output error message if query failed.
+  Cheng Hao <hao.ch...@intel.com>
+  2014-08-01 11:42:05 -0700
+  Commit: c0b47ba, github.com/apache/spark/pull/1686
+
+  [SPARK-2729] [SQL] Forgot to match Timestamp type in ColumnBuilder
+  chutium <teng....@gmail.com>
+  2014-08-01 11:31:44 -0700
+  Commit: 580c701, github.com/apache/spark/pull/1636
+
+  [SQL][SPARK-2212]Hash Outer Join
+  Cheng Hao <hao.ch...@intel.com>
+  2014-08-01 11:27:12 -0700
+  Commit: 4415722, github.com/apache/spark/pull/1147
+
+  [SPARK-2179][SQL] A minor refactoring Java data type APIs (2179 follow-up).
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-08-01 11:14:53 -0700
+  Commit: c41fdf0, github.com/apache/spark/pull/1712
+
+  SPARK-2099. Report progress while task is running.
+  Sandy Ryza <sa...@cloudera.com>
+  2014-08-01 11:08:39 -0700
+  Commit: 8d338f6, github.com/apache/spark/pull/1056
+
+  [HOTFIX] downgrade breeze version to 0.7
+  Xiangrui Meng <m...@databricks.com>
+  2014-08-01 10:00:46 -0700
+  Commit: 5328c0a, github.com/apache/spark/pull/1718
+
+  [SPARK-1997] update breeze to version 0.8.1
+  witgo <wi...@qq.com>
+  2014-08-01 07:47:44 -0700
+  Commit: 0dacb1a, github.com/apache/spark/pull/940
+
+  SPARK-2768 [MLLIB] Add product, user recommend method to 
MatrixFactorizationModel
+  Sean Owen <sro...@gmail.com>
+  2014-08-01 07:32:53 -0700
+  Commit: 82d209d, github.com/apache/spark/pull/1687
+
+  [SPARK-2103][Streaming] Change to ClassTag for KafkaInputDStream and fix 
reflection issue
+  jerryshao <saisai.s...@intel.com>
+  2014-08-01 04:32:46 -0700
+  Commit: a32f0fb, github.com/apache/spark/pull/1508
+
+  [Spark 2557] fix LOCAL_N_REGEX in createTaskScheduler and make local-n and 
local-n-failures consistent
+  Ye Xianjin <advance...@gmail.com>
+  2014-08-01 00:34:39 -0700
+  Commit: 284771e, github.com/apache/spark/pull/1464
+
+  SPARK-2134: Report metrics before application finishes
+  Rahul Singhal <rahul.sing...@guavus.com>
+  2014-08-01 00:33:15 -0700
+  Commit: f1957e1, github.com/apache/spark/pull/1076
+
+  SPARK-983. Support external sorting in sortByKey()
+  Matei Zaharia <ma...@databricks.com>
+  2014-08-01 00:16:18 -0700
+  Commit: 72e3369, github.com/apache/spark/pull/931
+
+  [SPARK-2670] FetchFailedException should be thrown when local fetch has 
failed
+  Kousuke Saruta <saru...@oss.nttdata.co.jp>
+  2014-08-01 00:01:30 -0700
+  Commit: 8ff4417, github.com/apache/spark/pull/1578
+
+  SPARK-2738. Remove redundant imports in BlockManagerSuite
+  Sandy Ryza <sa...@cloudera.com>
+  2014-07-31 23:12:38 -0700
+  Commit: cb9e7d5, github.com/apache/spark/pull/1642
+
+  SPARK-2632, SPARK-2576. Fixed by only importing what is necessary during 
class definition.
+  Prashant Sharma <scrapco...@gmail.com>, Yin Huai <h...@cse.ohio-state.edu>, 
Prashant Sharma <prashan...@imaginea.com>
+  2014-07-31 22:57:13 -0700
+  Commit: 1499101, github.com/apache/spark/pull/1635
+
+  [SPARK-2702][Core] Upgrade Tachyon dependency to 0.5.0
+  Haoyuan Li <haoy...@cs.berkeley.edu>
+  2014-07-31 22:53:42 -0700
+  Commit: 2cdc3e5, github.com/apache/spark/pull/1651
+
+  [SPARK-2782][mllib] Bug fix for getRanks in SpearmanCorrelation
+  Doris Xin <doris.s....@gmail.com>
+  2014-07-31 21:23:35 -0700
+  Commit: c475540, github.com/apache/spark/pull/1710
+
+  [SPARK-2777][MLLIB] change ALS factors storage level to MEMORY_AND_DISK
+  Xiangrui Meng <m...@databricks.com>
+  2014-07-31 21:14:08 -0700
+  Commit: b190083, github.com/apache/spark/pull/1700
+
+  SPARK-2766:  ScalaReflectionSuite  throw an llegalArgumentException in JDK 6
+  GuoQiang Li <wi...@qq.com>
+  2014-07-31 21:06:57 -0700
+  Commit: 9998efa, github.com/apache/spark/pull/1683
+
+  [SPARK-2779] [SQL] asInstanceOf[Map[...]] should use scala.collection.Map 
instead of scala.collection.immutable.Map
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-07-31 21:02:11 -0700
+  Commit: 9632719, github.com/apache/spark/pull/1705
+
+  [SPARK-2756] [mllib] Decision tree bug fixes
+  Joseph K. Bradley <joseph.kurata.brad...@gmail.com>
+  2014-07-31 20:51:48 -0700
+  Commit: b124de5, github.com/apache/spark/pull/1673
+
+  [SPARK-2724] Python version of RandomRDDGenerators
+  Doris Xin <doris.s....@gmail.com>
+  2014-07-31 20:32:57 -0700
+  Commit: d843014, github.com/apache/spark/pull/1628
+
+  [SPARK-2531 & SPARK-2436] [SQL] Optimize the BuildSide when planning 
BroadcastNestedLoopJoin.
+  Zongheng Yang <zonghen...@gmail.com>
+  2014-07-31 19:32:16 -0700
+  Commit: 8f51491, github.com/apache/spark/pull/1448
+
+  SPARK-2282: Reuse Socket for sending accumulator updates to Pyspark
+  Aaron Davidson <aa...@databricks.com>
+  2014-07-31 15:31:53 -0700
+  Commit: ef4ff00, github.com/apache/spark/pull/1503
+
+  SPARK-2740: allow user to specify ascending and numPartitions for sortBy...
+  Rui Li <rui...@intel.com>
+  2014-07-31 15:07:26 -0700
+  Commit: 492a195, github.com/apache/spark/pull/1645
+
+  Docs: monitoring, streaming programming guide
+  kballou <kbal...@devnulllabs.io>
+  2014-07-31 14:58:52 -0700
+  Commit: cc82050, github.com/apache/spark/pull/1662
+
+  Improvements to merge_spark_pr.py
+  Josh Rosen <joshro...@apache.org>
+  2014-07-31 14:35:09 -0700
+  Commit: e021362, github.com/apache/spark/pull/1668
+
+  [SPARK-2523] [SQL] Hadoop table scan bug fixing (fix failing Jenkins maven 
test)
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-07-31 13:05:24 -0700
+  Commit: 49b3612, github.com/apache/spark/pull/1669
+
+  [SPARK-2511][MLLIB] add HashingTF and IDF
+  Xiangrui Meng <m...@databricks.com>
+  2014-07-31 12:55:00 -0700
+  Commit: dc0865b, github.com/apache/spark/pull/1671
+
+  SPARK-2646. log4j initialization not quite compatible with log4j 2.x
+  Sean Owen <sro...@gmail.com>
+  2014-07-31 12:26:36 -0700
+  Commit: e5749a1, github.com/apache/spark/pull/1547
+
+  SPARK-2749 [BUILD] Part 2. Fix a follow-on scalastyle error
+  Sean Owen <sro...@gmail.com>
+  2014-07-31 12:18:40 -0700
+  Commit: 4dbabb3, github.com/apache/spark/pull/1690
+
+  SPARK-2664. Deal with `--conf` options in spark-submit that relate to fl...
+  Sandy Ryza <sa...@cloudera.com>
+  2014-07-31 11:51:20 -0700
+  Commit: f68105d, github.com/apache/spark/pull/1665
+
+  SPARK-2028: Expose mapPartitionsWithInputSplit in HadoopRDD
+  Aaron Davidson <aa...@databricks.com>
+  2014-07-31 11:35:38 -0700
+  Commit: f193312, github.com/apache/spark/pull/973
+
+  [SPARK-2397][SQL] Deprecate LocalHiveContext
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-31 11:26:43 -0700
+  Commit: 72cfb13, github.com/apache/spark/pull/1641
+
+  [SPARK-2743][SQL] Resolve original attributes in ParquetTableScan
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-31 11:15:25 -0700
+  Commit: 3072b96, github.com/apache/spark/pull/1647
+
+  [SPARK-2762] SparkILoop leaks memory in multi-repl configurations
+  Timothy Hunter <timhun...@databricks.com>
+  2014-07-31 10:25:40 -0700
+  Commit: 92ca910, github.com/apache/spark/pull/1674
+
+  automatically set master according to `spark.master` in `spark-defaults....
+  CrazyJvm <crazy...@gmail.com>
+  2014-07-30 23:37:25 -0700
+  Commit: 669e3f0, github.com/apache/spark/pull/1644
+
+  [SPARK-2497] Included checks for module symbols too.
+  Prashant Sharma <prashan...@imaginea.com>
+  2014-07-30 22:46:30 -0700
+  Commit: 5a110da, github.com/apache/spark/pull/1463
+
+  [SPARK-2737] Add retag() method for changing RDDs' ClassTags.
+  Josh Rosen <joshro...@apache.org>
+  2014-07-30 22:40:57 -0700
+  Commit: 4fb2593, github.com/apache/spark/pull/1639
+
+  [SPARK-2340] Resolve event logging and History Server paths properly
+  Andrew Or <andrewo...@gmail.com>
+  2014-07-30 21:57:32 -0700
+  Commit: a7c305b, github.com/apache/spark/pull/1280
+
+  Required AM memory is "amMem", not "args.amMemory"
+  derek ma <ma...@asiainfo-linkage.com>
+  2014-07-30 21:37:59 -0700
+  Commit: 118c1c4, github.com/apache/spark/pull/1494
+
+  [SPARK-2758] UnionRDD's UnionPartition should not reference parent RDDs
+  Reynold Xin <r...@apache.org>
+  2014-07-30 21:30:13 -0700
+  Commit: 894d48f, github.com/apache/spark/pull/1675
+
+  SPARK-2045 Sort-based shuffle
+  Matei Zaharia <ma...@databricks.com>
+  2014-07-30 18:07:59 -0700
+  Commit: e966284, github.com/apache/spark/pull/1499
+
+  Update DecisionTreeRunner.scala
+  strat0sphere <stratos.dimopou...@gmail.com>
+  2014-07-30 17:57:50 -0700
+  Commit: da50176, github.com/apache/spark/pull/1676
+
+  SPARK-2341 [MLLIB] loadLibSVMFile doesn't handle regression datasets
+  Sean Owen <sro...@gmail.com>
+  2014-07-30 17:34:32 -0700
+  Commit: e9b275b, github.com/apache/spark/pull/1663
+
+  [SPARK-2734][SQL] Remove tables from cache when DROP TABLE is run.
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-30 17:30:51 -0700
+  Commit: 88a519d, github.com/apache/spark/pull/1650
+
+  SPARK-2741 - Publish version of spark assembly which does not contain Hive
+  Brock Noland <br...@apache.org>
+  2014-07-30 17:04:30 -0700
+  Commit: 2ac37db, github.com/apache/spark/pull/1667
+
+  SPARK-2749 [BUILD]. Spark SQL Java tests aren't compiling in Jenkins' Maven 
builds; missing junit:junit dep
+  Sean Owen <sro...@gmail.com>
+  2014-07-30 15:04:33 -0700
+  Commit: 6ab96a6, github.com/apache/spark/pull/1660
+
+  Properly pass SBT_MAVEN_PROFILES into sbt.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 14:31:20 -0700
+  Commit: 2f4b170
+
+  Set AMPLAB_JENKINS_BUILD_PROFILE.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 14:08:24 -0700
+  Commit: 1097327
+
+  Wrap JAR_DL in dev/check-license.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 13:42:43 -0700
+  Commit: 7c7ce54
+
+  [SPARK-2024] Add saveAsSequenceFile to PySpark
+  Kan Zhang <kzh...@apache.org>
+  2014-07-30 13:19:05 -0700
+  Commit: 94d1f46, github.com/apache/spark/pull/1338
+
+  dev/check-license wrap folders in quotes.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 13:17:14 -0700
+  Commit: 437dc8c
+
+  [SQL] Fix compiling of catalyst docs.
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-30 13:11:09 -0700
+  Commit: 2248891, github.com/apache/spark/pull/1653
+
+  More wrapping FWDIR in quotes.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 13:04:20 -0700
+  Commit: 0feb349
+
+  Wrap FWDIR in quotes in dev/check-license.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 12:33:42 -0700
+  Commit: 95cf203
+
+  Wrap FWDIR in quotes.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 12:24:35 -0700
+  Commit: f2eb84f
+
+  [SPARK-2746] Set SBT_MAVEN_PROFILES only when it is not set explicitly by 
the user.
+  Reynold Xin <r...@apache.org>
+  2014-07-30 11:45:24 -0700
+  Commit: ff511ba, github.com/apache/spark/pull/1655
+
+  [SPARK-2544][MLLIB] Improve ALS algorithm resource usage
+  GuoQiang Li <wi...@qq.com>, witgo <wi...@qq.com>
+  2014-07-30 11:00:11 -0700
+  Commit: fc47bb6, github.com/apache/spark/pull/929
+
+  Avoid numerical instability
+  Naftali Harris <naftalihar...@gmail.com>
+  2014-07-30 09:56:59 -0700
+  Commit: e3d85b7, github.com/apache/spark/pull/1652
+
+  [SPARK-2747] git diff --dirstat can miss sql changes and not run Hive tests
+  Reynold Xin <r...@apache.org>
+  2014-07-30 09:28:53 -0700
+  Commit: 3bc3f18, github.com/apache/spark/pull/1656
+
+  [SPARK-2521] Broadcast RDD object (instead of sending it along with every 
task)
+  Reynold Xin <r...@apache.org>
+  2014-07-30 09:27:43 -0700
+  Commit: 774142f, github.com/apache/spark/pull/1498
+
+  SPARK-2748 [MLLIB] [GRAPHX] Loss of precision for small arguments to 
Math.exp, Math.log
+  Sean Owen <sro...@gmail.com>
+  2014-07-30 08:55:15 -0700
+  Commit: ee07541, github.com/apache/spark/pull/1659
+
+  SPARK-2543: Allow user to set maximum Kryo buffer size
+  Koert Kuipers <ko...@tresata.com>
+  2014-07-30 00:18:59 -0700
+  Commit: 7c5fc28, github.com/apache/spark/pull/735
+
+  [SPARK-2179][SQL] Public API for DataTypes and Schema
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-07-30 00:15:31 -0700
+  Commit: 7003c16, github.com/apache/spark/pull/1346
+
+  [SPARK-2260] Fix standalone-cluster mode, which was broken
+  Andrew Or <andrewo...@gmail.com>
+  2014-07-29 23:52:09 -0700
+  Commit: 4ce92cc, github.com/apache/spark/pull/1538
+
+  [SQL] Handle null values in debug()
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-29 22:42:54 -0700
+  Commit: 077f633, github.com/apache/spark/pull/1646
+
+  [SPARK-2568] RangePartitioner should run only one job if data is balanced
+  Xiangrui Meng <m...@databricks.com>, Reynold Xin <r...@apache.org>
+  2014-07-29 22:16:20 -0700
+  Commit: 2e6efca, github.com/apache/spark/pull/1562
+
+  [SPARK-2054][SQL] Code Generation for Expression Evaluation
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-29 20:58:05 -0700
+  Commit: 8446746, github.com/apache/spark/pull/993
+
+  [SPARK-2305] [PySpark] Update Py4J to version 0.8.2.1
+  Josh Rosen <joshro...@apache.org>
+  2014-07-29 19:02:06 -0700
+  Commit: 22649b6, github.com/apache/spark/pull/1626
+
+  [SPARK-2631][SQL] Use SQLConf to configure in-memory columnar caching
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-29 18:20:51 -0700
+  Commit: 86534d0, github.com/apache/spark/pull/1638
+
+  [SPARK-2716][SQL] Don't check resolved for having filters.
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-29 18:14:20 -0700
+  Commit: 39b8193, github.com/apache/spark/pull/1640
+
+  MAINTENANCE: Automated closing of pull requests.
+  Patrick Wendell <pwend...@gmail.com>
+  2014-07-29 17:52:48 -0700
+  Commit: 2c35666, github.com/apache/spark/pull/740
+
+  [SPARK-2393][SQL] Cost estimation optimization framework for Catalyst 
logical plans & sample usage.
+  Zongheng Yang <zonghen...@gmail.com>
+  2014-07-29 15:32:50 -0700
+  Commit: c7db274, github.com/apache/spark/pull/1238
+
+  [SPARK-2082] stratified sampling in PairRDDFunctions that guarantees exact 
sample size
+  Doris Xin <doris.s....@gmail.com>, Xiangrui Meng <m...@databricks.com>
+  2014-07-29 12:49:44 -0700
+  Commit: dc96536, github.com/apache/spark/pull/1025
+
+  [SPARK-2674] [SQL] [PySpark] support datetime type for SchemaRDD
+  Davies Liu <davies....@gmail.com>
+  2014-07-29 12:31:39 -0700
+  Commit: f0d880e, github.com/apache/spark/pull/1601
+
+  [SPARK-2730][SQL] When retrieving a value from a Map, GetItem evaluates key 
twice
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-07-29 12:23:34 -0700
+  Commit: e364348, github.com/apache/spark/pull/1637
+
+  [SQL]change some test lists
+  Daoyuan <daoyuan.w...@intel.com>
+  2014-07-29 12:22:48 -0700
+  Commit: 0c5c6a6, github.com/apache/spark/pull/1634
+
+  [STREAMING] SPARK-1729. Make Flume pull data from source, rather than the 
current pu...
+  Hari Shreedharan <harishreedha...@gmail.com>, Hari Shreedharan 
<hshreedha...@apache.org>, Tathagata Das <tathagata.das1...@gmail.com>, 
harishreedharan <hshreedha...@cloudera.com>
+  2014-07-29 11:11:29 -0700
+  Commit: 800ecff, github.com/apache/spark/pull/807
+
+  Minor indentation and comment typo fixes.
+  Aaron Staple <asta...@gmail.com>
+  2014-07-29 01:35:26 -0700
+  Commit: fc4d057, github.com/apache/spark/pull/1630
+
+  [SPARK-2174][MLLIB] treeReduce and treeAggregate
+  Xiangrui Meng <m...@databricks.com>
+  2014-07-29 01:16:41 -0700
+  Commit: 20424da, github.com/apache/spark/pull/1110
+
+  [SPARK-2726] and [SPARK-2727] Remove SortOrder and do in-place sort.
+  Reynold Xin <r...@apache.org>
+  2014-07-29 01:12:44 -0700
+  Commit: 96ba04b, github.com/apache/spark/pull/1631
+
+  [SPARK-791] [PySpark] fix pickle itemgetter with cloudpickle
+  Davies Liu <davies....@gmail.com>
+  2014-07-29 01:02:18 -0700
+  Commit: 92ef026, github.com/apache/spark/pull/1627
+
+  [SPARK-2580] [PySpark] keep silent in worker if JVM close the socket
+  Davies Liu <davies....@gmail.com>
+  2014-07-29 00:15:45 -0700
+  Commit: ccd5ab5, github.com/apache/spark/pull/1625
+
+  Excess judgment
+  Yadong Qi <qiyadong2...@gmail.com>
+  2014-07-28 21:39:02 -0700
+  Commit: 16ef4d1, github.com/apache/spark/pull/1629
+
+  Use commons-lang3 in SignalLogger rather than commons-lang
+  Aaron Davidson <aa...@databricks.com>
+  2014-07-28 13:37:44 -0700
+  Commit: 39ab87b, github.com/apache/spark/pull/1621
+
+  [SPARK-2410][SQL] Merging Hive Thrift/JDBC server (with Maven profile fix)
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-07-28 12:07:30 -0700
+  Commit: a7a9d14, github.com/apache/spark/pull/1620
+
+  [SPARK-2479][MLlib] Comparing floating-point numbers using relative error in 
UnitTests
+  DB Tsai <dbt...@alpinenow.com>
+  2014-07-28 11:34:19 -0700
+  Commit: 255b56f, github.com/apache/spark/pull/1425
+
+  [SPARK-2523] [SQL] Hadoop table scan bug fixing
+  Cheng Hao <hao.ch...@intel.com>
+  2014-07-28 10:59:53 -0700
+  Commit: 2b8d89e, github.com/apache/spark/pull/1439
+
+  [SPARK-1550] [PySpark] Allow SparkContext creation after failed attempts
+  Josh Rosen <joshro...@apache.org>
+  2014-07-27 22:54:43 -0700
+  Commit: a7d145e, github.com/apache/spark/pull/1606
+
+  SPARK-2651: Add maven scalastyle plugin
+  Rahul Singhal <rahul.sing...@guavus.com>
+  2014-07-27 18:50:32 -0700
+  Commit: d7eac4c, github.com/apache/spark/pull/1550
+
+  Revert "[SPARK-2410][SQL] Merging Hive Thrift/JDBC server"
+  Patrick Wendell <pwend...@gmail.com>
+  2014-07-27 18:46:58 -0700
+  Commit: e5bbce9
+
+  [SPARK-2514] [mllib] Random RDD generator
+  Doris Xin <doris.s....@gmail.com>
+  2014-07-27 16:16:39 -0700
+  Commit: 81fcdd2, github.com/apache/spark/pull/1520
+
+  [SPARK-1777] Prevent OOMs from single partitions
+  Andrew Or <andrewo...@gmail.com>
+  2014-07-27 16:08:16 -0700
+  Commit: ecf30ee, github.com/apache/spark/pull/1165
+
+  [SPARK-2410][SQL] Merging Hive Thrift/JDBC server
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-07-27 13:03:38 -0700
+  Commit: f6ff2a6, github.com/apache/spark/pull/1600
+
+  [SPARK-2705][CORE] Fixed stage description in stage info page
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-07-27 12:35:21 -0700
+  Commit: 2bbf235, github.com/apache/spark/pull/1524
+
+  SPARK-2684: Update ExternalAppendOnlyMap to take an iterator as input
+  Matei Zaharia <ma...@databricks.com>
+  2014-07-27 11:20:20 -0700
+  Commit: 9857053, github.com/apache/spark/pull/1607
+
+  [SPARK-2679] [MLLib] Ser/De for Double
+  Doris Xin <doris.s....@gmail.com>
+  2014-07-27 07:21:07 -0700
+  Commit: 3a69c72, github.com/apache/spark/pull/1581
+
+  [SPARK-2361][MLLIB] Use broadcast instead of serializing data directly into 
task closure
+  Xiangrui Meng <m...@databricks.com>
+  2014-07-26 22:56:07 -0700
+  Commit: aaf2b73, github.com/apache/spark/pull/1427
+
+  SPARK-2680: Lower spark.shuffle.memoryFraction to 0.2 by default
+  Matei Zaharia <ma...@databricks.com>
+  2014-07-26 22:44:17 -0700
+  Commit: b547f69, github.com/apache/spark/pull/1593
+
+  [SPARK-2601] [PySpark] Fix Py4J error when transforming pickleFiles
+  Josh Rosen <joshro...@apache.org>
+  2014-07-26 17:37:05 -0700
+  Commit: ba46bbe, github.com/apache/spark/pull/1605
+
+  [SPARK-2704] Name threads in ConnectionManager and mark them as daemon.
+  Reynold Xin <r...@apache.org>
+  2014-07-26 15:00:32 -0700
+  Commit: 1290164, github.com/apache/spark/pull/1604
+
+  [SPARK-2279] Added emptyRDD method to Java API
+  bpaulin <b...@bobpaulin.com>
+  2014-07-26 10:27:09 -0700
+  Commit: c183b92, github.com/apache/spark/pull/1597
+
+  [SPARK-2652] [PySpark] Turning some default configs for PySpark
+  Davies Liu <davies....@gmail.com>
+  2014-07-26 01:07:08 -0700
+  Commit: 75663b5, github.com/apache/spark/pull/1568
+
+  [SPARK-2696] Reduce default value of spark.serializer.objectStreamReset
+  Hossein <hoss...@databricks.com>
+  2014-07-26 01:04:56 -0700
+  Commit: 66f26a4, github.com/apache/spark/pull/1595
+
+  [SPARK-1458] [PySpark] Expose sc.version in Java and PySpark
+  Josh Rosen <joshro...@apache.org>
+  2014-07-26 00:54:05 -0700
+  Commit: cf3e9fd, github.com/apache/spark/pull/1596
+
+  [SPARK-2659][SQL] Fix division semantics for hive
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-25 19:17:49 -0700
+  Commit: 8904791, github.com/apache/spark/pull/1557
+
+  Part of [SPARK-2456] Removed some HashMaps from DAGScheduler by storing 
information in Stage.
+  Reynold Xin <r...@apache.org>
+  2014-07-25 18:45:02 -0700
+  Commit: 9d8666c, github.com/apache/spark/pull/1561
+
+  Revert "[SPARK-2410][SQL] Merging Hive Thrift/JDBC server"
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-25 15:36:57 -0700
+  Commit: afd757a, github.com/apache/spark/pull/1594
+
+  [SPARK-1726] [SPARK-2567] Eliminate zombie stages in UI.
+  Kay Ousterhout <kayousterh...@gmail.com>
+  2014-07-25 15:14:13 -0700
+  Commit: 37ad3b7, github.com/apache/spark/pull/1566
+
+  [SPARK-2125] Add sort flag and move sort into shuffle implementations
+  jerryshao <saisai.s...@intel.com>
+  2014-07-25 14:34:38 -0700
+  Commit: 47b6b38, github.com/apache/spark/pull/1210
+
+  [SQL]Update HiveMetastoreCatalog.scala
+  baishuo(ç½ç¡) <vc_j...@hotmail.com>
+  2014-07-25 13:59:45 -0700
+  Commit: ab3c6a4, github.com/apache/spark/pull/1569
+
+  [SPARK-2682] Javadoc generated from Scala source code is not in javadoc's 
index
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-07-25 13:00:13 -0700
+  Commit: a19d8c8, github.com/apache/spark/pull/1584
+
+  [SPARK-2410][SQL] Merging Hive Thrift/JDBC server
+  Cheng Lian <lian.cs....@gmail.com>
+  2014-07-25 12:20:49 -0700
+  Commit: 06dc0d2, github.com/apache/spark/pull/1399
+
+  [SPARK-2683] unidoc failed because org.apache.spark.util.CallSite uses Java 
keywords as value names
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-07-25 11:14:51 -0700
+  Commit: 32bcf9a, github.com/apache/spark/pull/1585
+
+  replace println to log4j
+  fireflyc <firef...@126.com>
+  2014-07-25 10:47:52 -0700
+  Commit: a2715cc, github.com/apache/spark/pull/1372
+
+  [SPARK-2665] [SQL] Add EqualNS & Unit Tests
+  Cheng Hao <hao.ch...@intel.com>
+  2014-07-25 01:30:22 -0700
+  Commit: 184aa1c, github.com/apache/spark/pull/1570
+
+  [SPARK-2529] Clean closures in foreach and foreachPartition.
+  Reynold Xin <r...@apache.org>
+  2014-07-25 01:10:05 -0700
+  Commit: eb82abd, github.com/apache/spark/pull/1583
+
+  SPARK-2657 Use more compact data structures than ArrayBuffer in groupBy & 
cogroup
+  Matei Zaharia <ma...@databricks.com>
+  2014-07-25 00:32:32 -0700
+  Commit: 8529ced, github.com/apache/spark/pull/1555
+
+  [SPARK-2656] Python version of stratified sampling
+  Doris Xin <doris.s....@gmail.com>
+  2014-07-24 23:42:08 -0700
+  Commit: 2f75a4a, github.com/apache/spark/pull/1554
+
+  [SPARK-2538] [PySpark] Hash based disk spilling aggregation
+  Davies Liu <davies....@gmail.com>
+  2014-07-24 22:53:47 -0700
+  Commit: 14174ab, github.com/apache/spark/pull/1460
+
+  [SPARK-2014] Make PySpark store RDDs in MEMORY_ONLY_SER with compression by 
default
+  Prashant Sharma <prashan...@imaginea.com>
+  2014-07-24 18:15:37 -0700
+  Commit: eff9714, github.com/apache/spark/pull/1051
+
+  [SPARK-2464][Streaming] Fixed Twitter stream stopping bug
+  Tathagata Das <tathagata.das1...@gmail.com>
+  2014-07-24 15:59:09 -0700
+  Commit: a45d548, github.com/apache/spark/pull/1577
+
+  SPARK-2250: show stage RDDs in UI
+  Neville Li <nevi...@spotify.com>
+  2014-07-24 14:13:00 -0700
+  Commit: fec641b, github.com/apache/spark/pull/1188
+
+  [SPARK-2037]: yarn client mode doesn't support 
spark.yarn.max.executor.failures
+  GuoQiang Li <wi...@qq.com>
+  2014-07-24 14:46:10 -0500
+  Commit: 323a83c, github.com/apache/spark/pull/1180
+
+  [SPARK-2479 (partial)][MLLIB] fix binary metrics unit tests
+  Xiangrui Meng <m...@databricks.com>
+  2014-07-24 12:37:02 -0700
+  Commit: c960b50, github.com/apache/spark/pull/1576
+
+  [SPARK-2603][SQL] Remove unnecessary toMap and toList in converting Java 
collections to Scala collections JsonRDD.scala
+  Yin Huai <h...@cse.ohio-state.edu>
+  2014-07-24 11:19:19 -0700
+  Commit: b352ef1, github.com/apache/spark/pull/1504
+
+  [Build] SPARK-2619: Configurable filemode for the spark/bin folder in debian 
package
+  tzolov <christian.tzo...@gmail.com>
+  2014-07-24 11:12:25 -0700
+  Commit: 9fd1414, github.com/apache/spark/pull/1531
+
+  SPARK-2150: Provide direct link to finished application UI in yarn resou...
+  Rahul Singhal <rahul.sing...@guavus.com>
+  2014-07-24 09:31:04 -0500
+  Commit: 46e224a, github.com/apache/spark/pull/1094
+
+  [SPARK-2661][bagel]unpersist old processed rdd
+  Daoyuan <daoyuan.w...@intel.com>
+  2014-07-24 00:09:36 -0700
+  Commit: 42dfab7, github.com/apache/spark/pull/1519
+
+  SPARK-2310. Support arbitrary Spark properties on the command line with ...
+  Sandy Ryza <sa...@cloudera.com>
+  2014-07-23 23:09:25 -0700
+  Commit: e34922a, github.com/apache/spark/pull/1253
+
+  [SPARK-2658][SQL] Add rule for true = 1.
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-23 22:52:49 -0700
+  Commit: 78d18fd, github.com/apache/spark/pull/1556
+
+  SPARK-2662: Fix NPE for JsonProtocol
+  GuoQiang Li <wi...@qq.com>
+  2014-07-23 22:50:39 -0700
+  Commit: 9e7725c, github.com/apache/spark/pull/1511
+
+  Replace RoutingTableMessage with pair
+  Ankur Dave <ankurd...@gmail.com>
+  2014-07-23 20:11:28 -0700
+  Commit: 2d25e34, github.com/apache/spark/pull/1553
+
+  [SPARK-2484][SQL] Build should not run hivecompatibility tests by default.
+  witgo <wi...@qq.com>
+  2014-07-23 18:17:05 -0700
+  Commit: 60f0ae3, github.com/apache/spark/pull/1403
+
+  [SPARK-2549] Functions defined inside of other functions trigger failures
+  Prashant Sharma <prashan...@imaginea.com>
+  2014-07-23 17:12:28 -0700
+  Commit: 9b76332, github.com/apache/spark/pull/1510
+
+  [SPARK-2102][SQL][CORE] Add option for kryo registration required and use a 
resource pool in Spark SQL for Kryo instances.
+  Ian O Connell <ioconn...@twitter.com>
+  2014-07-23 16:30:06 -0700
+  Commit: efdaeb1, github.com/apache/spark/pull/1377
+
+  [SPARK-2569][SQL] Fix shipping of TEMPORARY hive UDFs.
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-23 16:26:55 -0700
+  Commit: 1871574, github.com/apache/spark/pull/1552
+
+  SPARK-2226:  [SQL] transform HAVING clauses with aggregate expressions that 
aren't in the aggregation list
+  William Benton <wi...@redhat.com>
+  2014-07-23 16:25:32 -0700
+  Commit: e060d3e, github.com/apache/spark/pull/1497
+
+  SPARK-2277: clear host->rack info properly
+  Rui Li <rui...@intel.com>
+  2014-07-23 16:23:24 -0700
+  Commit: 91903e0, github.com/apache/spark/pull/1454
+
+  [SPARK-2588][SQL] Add some more DSLs.
+  Takuya UESHIN <ues...@happy-camper.st>
+  2014-07-23 14:47:23 -0700
+  Commit: 1b790cf, github.com/apache/spark/pull/1491
+
+  [CORE] SPARK-2640: In "local[N]", free cores of the only executor should be 
touched by "spark.task.cpus" for every finish/start-up of tasks.
+  woshilaiceshide <woshilaicesh...@qq.com>
+  2014-07-23 11:05:41 -0700
+  Commit: f776bc9, github.com/apache/spark/pull/1544
+
+  [SPARK-2609] Log thread ID when spilling ExternalAppendOnlyMap
+  Andrew Or <andrewo...@gmail.com>
+  2014-07-23 10:31:45 -0700
+  Commit: 2592111, github.com/apache/spark/pull/1517
+
+  [SPARK-2617] Correct doc and usages of preservesPartitioning
+  Xiangrui Meng <m...@databricks.com>
+  2014-07-23 00:58:55 -0700
+  Commit: 4c7243e, github.com/apache/spark/pull/1526
+
+  Remove GraphX MessageToPartition for compatibility with sort-based shuffle
+  Ankur Dave <ankurd...@gmail.com>
+  2014-07-22 22:18:30 -0700
+  Commit: 6c2be93, github.com/apache/spark/pull/1537
+
+  [YARN] SPARK-2577: File upload to viewfs is broken due to mount point re...
+  Gera Shegalov <g...@twitter.com>
+  2014-07-22 21:05:12 -0500
+  Commit: 02e4572, github.com/apache/spark/pull/1483
+
+  [YARN][SPARK-2606]:In some cases,the spark UI pages display incorrect
+  GuoQiang Li <wi...@qq.com>
+  2014-07-22 20:34:40 -0500
+  Commit: ddadf1b, github.com/apache/spark/pull/1501
+
+  Graphx example
+  CrazyJvm <crazy...@gmail.com>
+  2014-07-22 18:14:44 -0700
+  Commit: 5f7b991, github.com/apache/spark/pull/1523
+
+  [SPARK-2615] [SQL] Add Equal Sign "==" Support for HiveQl
+  Cheng Hao <hao.ch...@intel.com>
+  2014-07-22 18:13:28 -0700
+  Commit: 79fe763, github.com/apache/spark/pull/1522
+
+  SPARK-2047: Introduce an in-mem Sorter, and use it to reduce mem usage
+  Aaron Davidson <aa...@databricks.com>
+  2014-07-22 11:58:53 -0700
+  Commit: 85d3596, github.com/apache/spark/pull/1502
+
+  [MLLIB] make Mima ignore updateFeatures (private) in ALS
+  Xiangrui Meng <m...@databricks.com>
+  2014-07-22 11:45:37 -0700
+  Commit: 1407871, github.com/apache/spark/pull/1533
+
+  [SPARK-2612] [mllib] Fix data skew in ALS
+  peng.zhang <peng.zh...@xiaomi.com>
+  2014-07-22 02:39:07 -0700
+  Commit: 75db174, github.com/apache/spark/pull/1521
+
+  [SPARK-2452] Create a new valid for each  instead of using lineId.
+  Prashant Sharma <prash...@apache.org>
+  2014-07-22 00:38:26 -0700
+  Commit: 81fec99, github.com/apache/spark/pull/1441
+
+  [SPARK-2470] PEP8 fixes to PySpark
+  Nicholas Chammas <nicholas.cham...@gmail.com>, nchammas 
<nicholas.cham...@gmail.com>
+  2014-07-21 22:30:53 -0700
+  Commit: 5d16d5b, github.com/apache/spark/pull/1505
+
+  [SPARK-2086] Improve output of toDebugString to make shuffle boundaries more 
clear
+  Gregory Owen <greo...@gmail.com>
+  2014-07-21 18:55:01 -0700
+  Commit: c3462c6, github.com/apache/spark/pull/1364
+
+  [SPARK-2561][SQL] Fix apply schema
+  Michael Armbrust <mich...@databricks.com>
+  2014-07-21 18:18:17 -0700
+  Commit: 511a731, github.com/apache/spark/pull/1470
+
+  [SPARK-2434][MLlib]: Warning messages that point users to original MLlib 
implementations added to Examples
+  Burak <brk...@gmail.com>
+  2014-07-21 17:03:40 -0700
+  Commit: a4d6020, github.com/apache/spark/pull/1515
+
+  Fix flakey HiveQuerySuite test
+  Aaron Davidson <aa...@databricks.com>
+  2014-07-21 14:35:15 -0700
+  Commit: abeacff, github.com/apache/spark/pull/1514
+
+  [SPARK-2494] [PySpark] make hash of None consistant cross machines
+  Davies Liu <davies....@gmail.com>
+  2014-07-21 11:59:54 -0700
+  Commit: 872538c, github.com/apache/spark/pull/1371
+
+  SPARK-1707. Remove unnecessary 3 second sleep in YarnClusterScheduler
+  Sandy Ryza <sa...@cloudera.com>
+  2014-07-21 13:15:46 -0500
+  Commit: f89cf65, githu


<TRUNCATED>

---------------------------------------------------------------------
To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org
For additional commands, e-mail: commits-h...@spark.apache.org

[2/3] BUILD: Updating CHANGES.txt for Spark 1.1

Reply via email to