[
https://issues.apache.org/jira/browse/SPARK-24768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16590441#comment-16590441
]
Antonio Murgia commented on SPARK-24768:
Will this support UDT to the extent the parquet
[
https://issues.apache.org/jira/browse/SPARK-24772?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Antonio Murgia updated SPARK-24772:
---
Comment: was deleted
(was: Will this support UDT to the extent the parquet reader/writer
[
https://issues.apache.org/jira/browse/SPARK-24772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16590439#comment-16590439
]
Antonio Murgia commented on SPARK-24772:
Will this support UDT to the extent the parquet
[
https://issues.apache.org/jira/browse/SPARK-24862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16551582#comment-16551582
]
Antonio Murgia commented on SPARK-24862:
We can check if {{y}} is also synthesized as a field
[
https://issues.apache.org/jira/browse/SPARK-24862?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16551312#comment-16551312
]
Antonio Murgia commented on SPARK-24862:
Yeah, they are definitely not supported. Therefore I
Antonio Murgia created SPARK-24862:
--
Summary: Spark Encoder is not consistent to scala case class
semantic for multiple argument lists
Key: SPARK-24862
URL: https://issues.apache.org/jira/browse/SPARK-24862
[
https://issues.apache.org/jira/browse/SPARK-24673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16529479#comment-16529479
]
Antonio Murgia commented on SPARK-24673:
I have created a PR, I have added the overload to both
[
https://issues.apache.org/jira/browse/SPARK-24673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16526307#comment-16526307
]
Antonio Murgia commented on SPARK-24673:
Looks doable. Should I go with a method overload,
Antonio Murgia created SPARK-24673:
--
Summary: scala sql function from_utc_timestamp second argument
could be Column instead of String
Key: SPARK-24673
URL: https://issues.apache.org/jira/browse/SPARK-24673
[
https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15315394#comment-15315394
]
Antonio Murgia commented on SPARK-15740:
As of now the memory requirement would be something like
[
https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313830#comment-15313830
]
Antonio Murgia commented on SPARK-15740:
In order to set a small partition size w/r/t
[
https://issues.apache.org/jira/browse/SPARK-15740?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15313754#comment-15313754
]
Antonio Murgia commented on SPARK-15740:
Looking into it right now.
> Word2VecSuite "big model
Antonio Murgia created SPARK-11993:
--
Summary:
https://github.com/streamatica/TrafficAnalytics/issues/393#issuecomment-159685855
Key: SPARK-11993
URL: https://issues.apache.org/jira/browse/SPARK-11993
[
https://issues.apache.org/jira/browse/SPARK-11994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15028016#comment-15028016
]
Antonio Murgia commented on SPARK-11994:
Since `spark.kryoserializer.buffer.max` defaults to
Antonio Murgia created SPARK-11994:
--
Summary: Word2VecModel load and save cause SparkException when
model is bigger than spark.kryoserializer.buffer.max
Key: SPARK-11994
URL:
[
https://issues.apache.org/jira/browse/SPARK-11993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Antonio Murgia closed SPARK-11993.
--
Resolution: Invalid
[
https://issues.apache.org/jira/browse/SPARK-11994?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15027648#comment-15027648
]
Antonio Murgia commented on SPARK-11994:
Sure.
> Word2VecModel load and save cause
Antonio Murgia created SPARK-11350:
--
Summary: There is no best practice to handle warnings or messages
produced by Executors in a distributed manner
Key: SPARK-11350
URL:
[
https://issues.apache.org/jira/browse/SPARK-10105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Antonio Murgia updated SPARK-10105:
---
Description:
When training Word2Vec on a really big dataset, it's really hard to evaluate
Antonio Murgia created SPARK-10105:
--
Summary: Adding most k frequent words parameter to Word2Vec
implementation
Key: SPARK-10105
URL: https://issues.apache.org/jira/browse/SPARK-10105
Project: Spark
Antonio Murgia created SPARK-10046:
--
Summary: Hive warehouse dir not set in current directory when not
providing hive-site.xml
Key: SPARK-10046
URL: https://issues.apache.org/jira/browse/SPARK-10046