spark git commit: [SPARK-12986][DOC] Fix pydoc warnings in mllib/regression.py

2016-02-08 Thread meng
Repository: spark Updated Branches: refs/heads/master 140ddef37 -> edf4a0e62 [SPARK-12986][DOC] Fix pydoc warnings in mllib/regression.py I have fixed the warnings by running "make html" under "python/docs/". They are caused by not having blank lines around indented paragraphs. Author: Nam P

spark git commit: [SPARK-8964] [SQL] Use Exchange to perform shuffle in Limit

2016-02-08 Thread davies
Repository: spark Updated Branches: refs/heads/master edf4a0e62 -> 06f0df6df [SPARK-8964] [SQL] Use Exchange to perform shuffle in Limit This patch changes the implementation of the physical `Limit` operator so that it relies on the `Exchange` operator to perform data movement rather than di

spark git commit: [SPARK-13101][SQL] nullability of array type element should not fail analysis of encoder

2016-02-08 Thread marmbrus
Repository: spark Updated Branches: refs/heads/master 06f0df6df -> 8e4d15f70 [SPARK-13101][SQL] nullability of array type element should not fail analysis of encoder nullability should only be considered as an optimization rather than part of the type system, so instead of failing analysis f

spark git commit: [SPARK-13210][SQL] catch OOM when allocate memory and expand array

2016-02-08 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master 8e4d15f70 -> 37bc203c8 [SPARK-13210][SQL] catch OOM when allocate memory and expand array There is a bug when we try to grow the buffer, OOM is ignore wrongly (the assert also skipped by JVM), then we try grow the array again, this one wil

spark git commit: [SPARK-13210][SQL] catch OOM when allocate memory and expand array

2016-02-08 Thread joshrosen
Repository: spark Updated Branches: refs/heads/branch-1.6 3ca5dc307 -> 9b3009622 [SPARK-13210][SQL] catch OOM when allocate memory and expand array There is a bug when we try to grow the buffer, OOM is ignore wrongly (the assert also skipped by JVM), then we try grow the array again, this one

spark git commit: [SPARK-13095] [SQL] improve performance for broadcast join with dimension table

2016-02-08 Thread davies
Repository: spark Updated Branches: refs/heads/master 37bc203c8 -> ff0af0ddf [SPARK-13095] [SQL] improve performance for broadcast join with dimension table This PR improve the performance for Broadcast join with dimension tables, which is common in data warehouse. If the join key can fit in

spark git commit: [SPARK-10620][SPARK-13054] Minor addendum to #10835

2016-02-08 Thread joshrosen
Repository: spark Updated Branches: refs/heads/master ff0af0ddf -> eeaf45b92 [SPARK-10620][SPARK-13054] Minor addendum to #10835 Additional changes to #10835, mainly related to style and visibility. This patch also adds back a few deprecated methods for backward compatibility. Author: Andrew

spark git commit: [SPARK-12992] [SQL] Support vectorized decoding in UnsafeRowParquetRecordReader.

2016-02-08 Thread davies
Repository: spark Updated Branches: refs/heads/master eeaf45b92 -> 3708d13f1 [SPARK-12992] [SQL] Support vectorized decoding in UnsafeRowParquetRecordReader. WIP: running tests. Code needs a bit of clean up. This patch completes the vectorized decoding with the goal of passing the existing t