[
https://issues.apache.org/jira/browse/FLINK-2094?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15623730#comment-15623730
]
ASF GitHub Bot commented on FLINK-2094:
---------------------------------------
GitHub user kalmanchapman opened a pull request:
https://github.com/apache/flink/pull/2735
[FLINK-2094] implements Word2Vec for FlinkML
This pr implements Word2Vec for FlinkML - addressing Jira Issue
[Flink-2094](https://issues.apache.org/jira/browse/FLINK-2094)
Word2Vec is a word embedding algorithm that generates vectors
to reflect the contextual and semantic values of words in a text.
find out more detail about word2vec here:
https://arxiv.org/pdf/1411.2738v4.pdf
This implementation uses an abstracted embedding algorithm
which I've called a ContextEmbedder - based on the original Word2Vec
algorithms -
to allow users to extend embedding to reflect problems outside of words.
Word2Vec is an implementation of the ContextEmbedder
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/kalmanchapman/flink FLINK-2094
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/2735.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #2735
----
commit 82f4d4fcc34872c79dcb4c83b5f4bf7ec94c7da7
Author: kalman <[email protected]>
Date: 2016-09-14T19:36:32Z
[FLINK-2094] implements Word2Vec for FlinkML
Word2Vec is a word embedding algorithm that generates vectors
to reflect the contextual and semantic values of those words in a text
This implementation uses an abstracted embedding algorithm
- based on the original Word2Vec algorithms - to allow users
to extend embedding to reflect problems outside of words
----
> Implement Word2Vec
> ------------------
>
> Key: FLINK-2094
> URL: https://issues.apache.org/jira/browse/FLINK-2094
> Project: Flink
> Issue Type: Improvement
> Components: Machine Learning Library
> Reporter: Nikolaas Steenbergen
> Assignee: Nikolaas Steenbergen
> Priority: Minor
> Labels: ML
>
> implement Word2Vec
> http://arxiv.org/pdf/1402.3722v1.pdf
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)