[jira] [Commented] (SPARK-9162) Implement code generation for ScalaUDF

2015-07-21 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-9162?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14634615#comment-14634615
 ] 

Tarek Auel commented on SPARK-9162:
---

I can take it.

 Implement code generation for ScalaUDF
 --

 Key: SPARK-9162
 URL: https://issues.apache.org/jira/browse/SPARK-9162
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-9163) Implement code generation for Conv

2015-07-20 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14634070#comment-14634070
 ] 

Tarek Auel commented on SPARK-9163:
---

This might be a duplicate of https://issues.apache.org/jira/browse/SPARK-9132

 Implement code generation for Conv
 --

 Key: SPARK-9163
 URL: https://issues.apache.org/jira/browse/SPARK-9163
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-9178) UTF8String empty string method

2015-07-19 Thread Tarek Auel (JIRA)
Tarek Auel created SPARK-9178:
-

 Summary: UTF8String empty string method
 Key: SPARK-9178
 URL: https://issues.apache.org/jira/browse/SPARK-9178
 Project: Spark
  Issue Type: New Feature
  Components: SQL
Reporter: Tarek Auel


Create a method in UTF8String that returns an empty string, in order to avoid 
calls of UTF8String.fromString()



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-9177) Reuse Calendar instance in WeekOfYear

2015-07-19 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-9177?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14632823#comment-14632823
 ] 

Tarek Auel commented on SPARK-9177:
---

Sure.

 Reuse Calendar instance in WeekOfYear
 -

 Key: SPARK-9177
 URL: https://issues.apache.org/jira/browse/SPARK-9177
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin

 Right now WeekOfYear creates a new Calendar instance for every record, both 
 in code gen and in interpreted mode. We should just reuse the same Calendar 
 instance (i.e. initialize it as a member variable in interpreted mode, and 
 use mutable state in codegen mode).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-9115) date/time function: dayInYear

2015-07-16 Thread Tarek Auel (JIRA)
Tarek Auel created SPARK-9115:
-

 Summary: date/time function: dayInYear
 Key: SPARK-9115
 URL: https://issues.apache.org/jira/browse/SPARK-9115
 Project: Spark
  Issue Type: New Feature
  Components: SQL
Affects Versions: 1.5.0
Reporter: Tarek Auel


dayInyear(date): Int

Returns the number of the day in the year of a given date.
e.g. dayInYear(2015-01-01) == 1, dayInYear(2015-12-31) == 365



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-8995) Cast date strings with date, date and time and just time information to DateType and TimestampTzpe

2015-07-10 Thread Tarek Auel (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-8995?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tarek Auel updated SPARK-8995:
--
Description: Tests of https://github.com/apache/spark/pull/6981 fail, 
because we can not cast strings like '13:18:08' to a valid date and extract the 
hours later. It's not possible to parse strings that contains date and time 
information to date, like '2015-03-18 12:25:49'  (was: Tests of 
https://github.com/apache/spark/pull/6981 fails, because we can not cast 
strings like '13:18:08' to a valid date and extract the hours later. It's not 
possible to parse strings that contains date and time information to date, like 
'2015-03-18 12:25:49')

 Cast date strings with date, date and time and just time information to 
 DateType and TimestampTzpe
 --

 Key: SPARK-8995
 URL: https://issues.apache.org/jira/browse/SPARK-8995
 Project: Spark
  Issue Type: Improvement
  Components: SQL
Reporter: Tarek Auel

 Tests of https://github.com/apache/spark/pull/6981 fail, because we can not 
 cast strings like '13:18:08' to a valid date and extract the hours later. 
 It's not possible to parse strings that contains date and time information to 
 date, like '2015-03-18 12:25:49'



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8864) Date/time function and data type design

2015-07-08 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14619838#comment-14619838
 ] 

Tarek Auel commented on SPARK-8864:
---

Thanks Reynold. 

The function date_format(date, format) has been removed?

 Date/time function and data type design
 ---

 Key: SPARK-8864
 URL: https://issues.apache.org/jira/browse/SPARK-8864
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin
Assignee: Reynold Xin
 Fix For: 1.5.0

 Attachments: SparkSQLdatetimeudfs (1).pdf


 Please see the attached design doc.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8830) levenshtein directly on top of UTF8String

2015-07-06 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8830?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14614590#comment-14614590
 ] 

Tarek Auel commented on SPARK-8830:
---

I am working on this

 levenshtein directly on top of UTF8String
 -

 Key: SPARK-8830
 URL: https://issues.apache.org/jira/browse/SPARK-8830
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin

 We currently rely on commons-lang's levenshtein implementation. Ideally, we 
 should have our own implementation to:
 1. Reduce external dependency
 2. Work directly against UTF8String so we don't need to convert to/from 
 java.lang.String back and forth.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-8727) Add missing python api

2015-06-30 Thread Tarek Auel (JIRA)
Tarek Auel created SPARK-8727:
-

 Summary: Add missing python api
 Key: SPARK-8727
 URL: https://issues.apache.org/jira/browse/SPARK-8727
 Project: Spark
  Issue Type: Improvement
  Components: SQL
Reporter: Tarek Auel


Add the python api that is missing for

https://issues.apache.org/jira/browse/SPARK-8248
https://issues.apache.org/jira/browse/SPARK-8234
https://issues.apache.org/jira/browse/SPARK-8217
https://issues.apache.org/jira/browse/SPARK-8215
https://issues.apache.org/jira/browse/SPARK-8212



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8212) math function: e

2015-06-30 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607589#comment-14607589
 ] 

Tarek Auel commented on SPARK-8212:
---

I added the python api:
https://issues.apache.org/jira/browse/SPARK-8727

 math function: e
 

 Key: SPARK-8212
 URL: https://issues.apache.org/jira/browse/SPARK-8212
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin
Assignee: Adrian Wang
  Labels: missing-python
 Fix For: 1.5.0


 e(): double
 Returns the value of e.
 We should make this foldable so it gets folded by the optimizer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8234) misc function: md5

2015-06-30 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607586#comment-14607586
 ] 

Tarek Auel commented on SPARK-8234:
---

I added the python api:
https://issues.apache.org/jira/browse/SPARK-8727

 misc function: md5
 --

 Key: SPARK-8234
 URL: https://issues.apache.org/jira/browse/SPARK-8234
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin
Assignee: Qian, Shilei
  Labels: missing-python
 Fix For: 1.5.0


 md5(string/binary): string
 Calculates an MD5 128-bit checksum for the string or binary (as of Hive 
 1.3.0). The value is returned as a string of 32 hex digits, or NULL if the 
 argument was NULL. Example: md5('ABC') = '902fbdd2b1df0c4f70b4a5d23525e932'.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8248) string function: length

2015-06-30 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607585#comment-14607585
 ] 

Tarek Auel commented on SPARK-8248:
---

I added the python api:
https://issues.apache.org/jira/browse/SPARK-8727

 string function: length
 ---

 Key: SPARK-8248
 URL: https://issues.apache.org/jira/browse/SPARK-8248
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin
Assignee: Cheng Hao
  Labels: missing-python
 Fix For: 1.5.0


 length(string A): int
 Returns the length of the string.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8217) math function: log2

2015-06-30 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607587#comment-14607587
 ] 

Tarek Auel commented on SPARK-8217:
---

I added the python api:
https://issues.apache.org/jira/browse/SPARK-8727

 math function: log2
 ---

 Key: SPARK-8217
 URL: https://issues.apache.org/jira/browse/SPARK-8217
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin
Assignee: Adrian Wang
  Labels: missing-python
 Fix For: 1.5.0


 log2(double a): double
 Returns the base-2 logarithm of the argument a.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8215) math function: pi

2015-06-30 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607588#comment-14607588
 ] 

Tarek Auel commented on SPARK-8215:
---

I added the python api:
https://issues.apache.org/jira/browse/SPARK-8727

 math function: pi
 -

 Key: SPARK-8215
 URL: https://issues.apache.org/jira/browse/SPARK-8215
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin
Assignee: Adrian Wang
  Labels: missing-python
 Fix For: 1.5.0


 pi(): double
 Returns the value of pi. We should make sure foldable = true so it gets 
 folded by the optimizer.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8363) Move sqrt into math

2015-06-30 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14607592#comment-14607592
 ] 

Tarek Auel commented on SPARK-8363:
---

[~rxin] I guess the python api does exist:
https://github.com/apache/spark/blob/master/python/pyspark/sql/functions.py#L99

 Move sqrt into math
 ---

 Key: SPARK-8363
 URL: https://issues.apache.org/jira/browse/SPARK-8363
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin
Assignee: Liang-Chi Hsieh
  Labels: missing-python
 Fix For: 1.5.0


 It doesn't really belong in Arithmetic. It should also extend 
 UnaryMathExpression.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8668) expr function to convert SQL expression into a Column

2015-06-29 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606067#comment-14606067
 ] 

Tarek Auel commented on SPARK-8668:
---

Hi,

just to get it right:

selectExpr of the dataframe api takes at the moment varargs as arguments. This 
should be enhanced in order to parse ONE string argument that contains multiple 
expressions, shouldn't it? Or do I get it wrong?

 expr function to convert SQL expression into a Column
 -

 Key: SPARK-8668
 URL: https://issues.apache.org/jira/browse/SPARK-8668
 Project: Spark
  Issue Type: Sub-task
  Components: SQL
Reporter: Reynold Xin

 selectExpr uses the expression parser to parse a string expressions. would be 
 great to create an expr function in functions.scala/functions.py that 
 converts a string into an expression (or a list of expressions separated by 
 comma).



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Comment Edited] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582691#comment-14582691
 ] 

Tarek Auel edited comment on SPARK-8301 at 6/11/15 11:45 PM:
-

Another approach could be:

(0 until b.length).forall(( i ) = b( i ) == bytes( i ))

In theory this could be parallelised,


was (Author: tarekauel):
Another approach could be:

(0 until b.length).forall((i) = b(i) == bytes(i))

In theory this could be parallelised,

 Improve UTF8String substring/startsWith/endsWith/contains performance
 -

 Key: SPARK-8301
 URL: https://issues.apache.org/jira/browse/SPARK-8301
 Project: Spark
  Issue Type: Improvement
  Components: SQL
Reporter: Reynold Xin
Priority: Critical

 Many functions in UTF8String are unnecessarily expensive.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582691#comment-14582691
 ] 

Tarek Auel commented on SPARK-8301:
---

Another approach could be:

(0 until b.length).forall((i) = b(i) == bytes(i))

In theory this could be parallelised,

 Improve UTF8String substring/startsWith/endsWith/contains performance
 -

 Key: SPARK-8301
 URL: https://issues.apache.org/jira/browse/SPARK-8301
 Project: Spark
  Issue Type: Improvement
  Components: SQL
Reporter: Reynold Xin
Priority: Critical

 Many functions in UTF8String are unnecessarily expensive.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582609#comment-14582609
 ] 

Tarek Auel commented on SPARK-8301:
---

Hi,

do you have concrete ideas how the implementation should be improved?

I would like to take this item. One suggestion for startWith could be:

 val b = prefix.getBytes

if (b.length  bytes.length) {
  return false
}

var i = 0
while (i  b.length  (bytes(i) == b(i))) {
  i += 1
}
i == b.length

What do you think? Do you have already ideas in your mind how the performance 
should be improved?

 Improve UTF8String substring/startsWith/endsWith/contains performance
 -

 Key: SPARK-8301
 URL: https://issues.apache.org/jira/browse/SPARK-8301
 Project: Spark
  Issue Type: Improvement
  Components: SQL
Reporter: Reynold Xin
Priority: Critical

 Many functions in UTF8String are unnecessarily expensive.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Comment Edited] (SPARK-8301) Improve UTF8String substring/startsWith/endsWith/contains performance

2015-06-11 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14582609#comment-14582609
 ] 

Tarek Auel edited comment on SPARK-8301 at 6/11/15 10:25 PM:
-

Hi,

do you have concrete ideas how the implementation should be improved?

I would like to take this item. One suggestion for startWith could be:

 val b = prefix.getBytes

if (b.length  bytes.length) {
  return false
}

var i = 0
while (i  b.length  (bytes ( i ) == b ( i ))) {
  i += 1
}
i == b.length

What do you think? Do you have already ideas in your mind how the performance 
should be improved?


was (Author: tarekauel):
Hi,

do you have concrete ideas how the implementation should be improved?

I would like to take this item. One suggestion for startWith could be:

 val b = prefix.getBytes

if (b.length  bytes.length) {
  return false
}

var i = 0
while (i  b.length  (bytes(i) == b(i))) {
  i += 1
}
i == b.length

What do you think? Do you have already ideas in your mind how the performance 
should be improved?

 Improve UTF8String substring/startsWith/endsWith/contains performance
 -

 Key: SPARK-8301
 URL: https://issues.apache.org/jira/browse/SPARK-8301
 Project: Spark
  Issue Type: Improvement
  Components: SQL
Reporter: Reynold Xin
Priority: Critical

 Many functions in UTF8String are unnecessarily expensive.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-7999) Graph complement

2015-06-01 Thread Tarek Auel (JIRA)
Tarek Auel created SPARK-7999:
-

 Summary: Graph complement
 Key: SPARK-7999
 URL: https://issues.apache.org/jira/browse/SPARK-7999
 Project: Spark
  Issue Type: Improvement
Reporter: Tarek Auel
Priority: Minor






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-7999) Graph complement

2015-06-01 Thread Tarek Auel (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tarek Auel updated SPARK-7999:
--
Issue Type: Sub-task  (was: Improvement)
Parent: SPARK-7893

 Graph complement
 

 Key: SPARK-7999
 URL: https://issues.apache.org/jira/browse/SPARK-7999
 Project: Spark
  Issue Type: Sub-task
Reporter: Tarek Auel
Priority: Minor





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-7999) Graph complement

2015-06-01 Thread Tarek Auel (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tarek Auel updated SPARK-7999:
--
Description: 
This task is for implementing the complement operation (compare to parent task).

http://techieme.in/complex-graph-operations/

 Graph complement
 

 Key: SPARK-7999
 URL: https://issues.apache.org/jira/browse/SPARK-7999
 Project: Spark
  Issue Type: Sub-task
Reporter: Tarek Auel
Priority: Minor

 This task is for implementing the complement operation (compare to parent 
 task).
 http://techieme.in/complex-graph-operations/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-7999) Graph complement

2015-06-01 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14566999#comment-14566999
 ] 

Tarek Auel commented on SPARK-7999:
---

I would propose

def complement(attr: ED): Graph[VD, ED]

as interface

 Graph complement
 

 Key: SPARK-7999
 URL: https://issues.apache.org/jira/browse/SPARK-7999
 Project: Spark
  Issue Type: Sub-task
Reporter: Tarek Auel
Priority: Minor

 This task is for implementing the complement operation (compare to parent 
 task).
 http://techieme.in/complex-graph-operations/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Comment Edited] (SPARK-7999) Graph complement

2015-06-01 Thread Tarek Auel (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14566999#comment-14566999
 ] 

Tarek Auel edited comment on SPARK-7999 at 6/1/15 7:04 AM:
---

I would propose

def complement(attr: ED, selfLoops: Boolean = false): Graph[VD, ED]

as interface. The self-loop parameter defines whether self loops (A--A) should 
be created or not.


was (Author: tarekauel):
I would propose

def complement(attr: ED): Graph[VD, ED]

as interface

 Graph complement
 

 Key: SPARK-7999
 URL: https://issues.apache.org/jira/browse/SPARK-7999
 Project: Spark
  Issue Type: Sub-task
Reporter: Tarek Auel
Priority: Minor

 This task is for implementing the complement operation (compare to parent 
 task).
 http://techieme.in/complex-graph-operations/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Updated] (SPARK-7999) Graph complement function in GraphX

2015-06-01 Thread Tarek Auel (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-7999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tarek Auel updated SPARK-7999:
--
Summary: Graph complement function in GraphX  (was: Graph complement)

 Graph complement function in GraphX
 ---

 Key: SPARK-7999
 URL: https://issues.apache.org/jira/browse/SPARK-7999
 Project: Spark
  Issue Type: Sub-task
  Components: GraphX
Reporter: Tarek Auel
Priority: Minor

 This task is for implementing the complement operation (compare to parent 
 task).
 http://techieme.in/complex-graph-operations/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Created] (SPARK-7966) add Spreading Activation algorithm to GraphX

2015-05-30 Thread Tarek Auel (JIRA)
Tarek Auel created SPARK-7966:
-

 Summary: add Spreading Activation algorithm to GraphX
 Key: SPARK-7966
 URL: https://issues.apache.org/jira/browse/SPARK-7966
 Project: Spark
  Issue Type: Improvement
  Components: GraphX
Reporter: Tarek Auel
Priority: Minor


I'm wondering if you would like to add the Spreading Activation algorithm to 
GraphX. I have implemented it, using the Pregel-API and would love to share it 
with the community.

Spreading activation is a algorithm that was invented to search in associative 
networks. The basic idea is, that you have one (or multiple) starting nodes. 
The activation spreads out from these nodes to the neighbours and the 
neighbours of the neighbours. The activation decreases after every hop. Nodes 
that were reached by many activations will have a higher total activation level.

Spreading Activation is for many use cases useful. Imagine you have the social 
network of two people. If you apply the spreading activation to this social 
graph with the two people as starting nodes, you will get the nodes that are 
most important for both.

Some resources:

http://www.websci11.org/fileadmin/websci/posters/105_paper.pdf
https://webfiles.uci.edu/eloftus/CollinsLoftus_PsychReview_75.pdf?uniq=20ou4w



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org