[04/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/supervised_learning/prediction.html
--
diff --git a/userguide/supervised_learning/prediction.html b/userguide/supervised_learning/prediction.html
new file mode 100644
index 000..a4719e5
--- /dev/null
+++ b/userguide/supervised_learning/prediction.html
@@ -0,0 +1,2604 @@
+How Prediction Works · Hivemall User Manual
+Home (http://hivemall.incubator.apache.org/)
+TABLE OF CONTENTS
+1.1. Introduction
+1.2. Getting Started
+1.2.1. Installation
+1.2.2. Install as permanent functions
+1.2.3. Input Format
+1.3. List of Functions
+1.4. Tips for Effective Hivemall
+1.4.1. Explicit add_bias() for better prediction
+1.4.2. Use rand_amplify() to better prediction results
+1.4.3. Real-time prediction on RDBMS
+1.4.4. Ensemble learning for stable prediction
[18/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/binaryclass/criteo_dataset.html
--
diff --git a/userguide/binaryclass/criteo_dataset.html b/userguide/binaryclass/criteo_dataset.html
new file mode 100644
index 000..cdc58d0
--- /dev/null
+++ b/userguide/binaryclass/criteo_dataset.html
@@ -0,0 +1,2474 @@
+Data preparation · Hivemall User Manual
[03/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/supervised_learning/tutorial.html
--
diff --git a/userguide/supervised_learning/tutorial.html b/userguide/supervised_learning/tutorial.html
new file mode 100644
index 000..0863296
--- /dev/null
+++ b/userguide/supervised_learning/tutorial.html
@@ -0,0 +1,2982 @@
+Step-by-Step Tutorial on Supervised Learning · Hivemall User Manual
[11/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/misc/generic_funcs.html
--
diff --git a/userguide/misc/generic_funcs.html b/userguide/misc/generic_funcs.html
index d7972f6..7b596ca 100644
--- a/userguide/misc/generic_funcs.html
+++ b/userguide/misc/generic_funcs.html
@@ -861,9 +861,9 @@
-
+
-
+
 5.1.
@@ -876,6 +876,21 @@
+5.2. Step-by-Step Tutorial on Supervised Learning
@@ -1228,6 +1243,57 @@
+6.8. Criteo Tutorial
+6.8.1. Data preparation
+6.8.2. Field-Aware Factorization Machines
@@ -2862,7 +2928,7 @@ Apache Hivemall is an effort undergoing incubation at The Apache Software Founda
 var gitbook = gitbook || [];
 gitbook.push(function() {
-gitbook.page.hasChanged({"page":{"title":"List of Generic Hivemall Functions","level":"2.1","depth":1,"next":{"title":"Efficient Top-K Query Processing","level":"2.2","depth":1,"path":"misc/topk.md","ref":"misc/topk.md","articles":[]},"previous":{"title":"Map-side join causes ClassCastException on Tez","level":"1.6.5","depth":2,"path":"troubleshooting/mapjoin_classcastex.md","ref":"troubleshooting/mapjoin_classcastex.md","articles":[]},"dir":"ltr"},"config":{"plugins":["theme-api","edit-link","github","splitter","sitemap","etoc","callouts","toggle-chapters","anchorjs","codeblock-filename","expandable-chapters","multipart","codeblock-filename","katex","emphasize","localized-footer"],"styles":{"website":"styles/website.css","pdf":"styles/pdf.css","epub":"styles/epub.css","mobi":"styles/mobi.css","ebook":"styles/ebook.css","print":"styles/print.css"},"pluginsConfig":{"emphasize":{},"callouts":{},"etoc":{"h2lb":3,"header":1,"maxdepth":3,"mindepth":1,"notoc":true},"github":{"url":"
[06/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/sitemap.xml
--
diff --git a/userguide/sitemap.xml b/userguide/sitemap.xml
index 64eeb77..e79c582 100644
--- a/userguide/sitemap.xml
+++ b/userguide/sitemap.xml
@@ -43,7 +43,8 @@
 http://hivemall.incubator.apache.org/eval/rank.html weekly 0.5
 http://hivemall.incubator.apache.org/eval/datagen.html weekly 0.5
 http://hivemall.incubator.apache.org/eval/lr_datagen.html weekly 0.5
-http://hivemall.incubator.apache.org/misc/prediction.html weekly 0.5
+http://hivemall.incubator.apache.org/supervised_learning/prediction.html weekly 0.5
+http://hivemall.incubator.apache.org/supervised_learning/tutorial.html weekly 0.5
 http://hivemall.incubator.apache.org/binaryclass/general.html weekly 0.5
 http://hivemall.incubator.apache.org/binaryclass/a9a.html weekly 0.5
 http://hivemall.incubator.apache.org/binaryclass/a9a_dataset.html weekly 0.5
@@ -65,6 +66,9 @@
 http://hivemall.incubator.apache.org/binaryclass/webspam_dataset.html weekly 0.5
 http://hivemall.incubator.apache.org/binaryclass/webspam_scw.html weekly 0.5
 http://hivemall.incubator.apache.org/binaryclass/titanic_rf.html weekly 0.5
+http://hivemall.incubator.apache.org/binaryclass/criteo.html weekly 0.5
+http://hivemall.incubator.apache.org/binaryclass/criteo_dataset.html weekly 0.5
+http://hivemall.incubator.apache.org/binaryclass/criteo_ffm.html weekly 0.5
 http://hivemall.incubator.apache.org/multiclass/news20.html weekly 0.5
 http://hivemall.incubator.apache.org/multiclass/news20_dataset.html weekly 0.5
 http://hivemall.incubator.apache.org/multiclass/news20_one-vs-the-rest_dataset.html weekly 0.5

http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/spark/binaryclass/a9a_df.html
--
diff --git a/userguide/spark/binaryclass/a9a_df.html b/userguide/spark/binaryclass/a9a_df.html
index 91d789b..1678115 100644
--- a/userguide/spark/binaryclass/a9a_df.html
+++ b/userguide/spark/binaryclass/a9a_df.html
@@ -861,9 +861,9 @@
-
+
-
+
 5.1.
@@ -876,6 +876,21 @@
+5.2. Step-by-Step Tutorial on Supervised Learning
@@ -1228,6 +1243,57 @@
+6.8. Criteo Tutorial
+6.8.1. Data preparation
+6.8.2. Field-Aware Factorization Machines
@@ -2335,7 +2401,7 @@ Apache Hivemall is an effort undergoing incubation at The Apache Software Founda
 var gitbook = gitbook || [];
 gitbook.push(function() {
-gitbook.page.hasChanged({"page":{"title":"a9a tutorial for DataFrame","level":"13.2.1","depth":2,"next":{"title":"a9a tutorial for SQL","level":"13.2.2","depth":2,"path":"spark/binaryclass/a9a_sql.md","ref":"spark/binaryclass/a9a_sql.md","articles":[]},"previous":{"title":"Binary Classification","level":"13.2","depth":1,"path":"spark/binaryclass/index.md","ref":"spark/binaryclass/index.md","articles":[{"title":"a9a tutorial for DataFrame","level":"13.2.1","depth":2,"path":"spark/binaryclass/a9a_df.md","ref":"spark/binaryclass/a9a_df.md","articles":[]},{"title":"a9a tutorial for SQL","level":"13.2.2","depth":2,"path":"spark/binaryclass/a9a_sql.md","ref":"spark/binaryclass/a9a_sql.md","articles":[]}]},"dir":"ltr"},"config":{"plugins":["theme-api","edit-link","github","splitter","sitemap","etoc","callouts","toggle-chapters","anchorjs","codeblock-filename","expandable-chapters","multipart","codeblock-filename","katex","emphasize","localized-footer"],"styles":{"website":"styl
[19/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/binaryclass/criteo.html
--
diff --git a/userguide/binaryclass/criteo.html b/userguide/binaryclass/criteo.html
new file mode 100644
index 000..d8839fd
--- /dev/null
+++ b/userguide/binaryclass/criteo.html
@@ -0,0 +1,2412 @@
+Criteo Tutorial · Hivemall User Manual
[12/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/ft_engineering/vectorization.html
--
diff --git a/userguide/ft_engineering/vectorization.html b/userguide/ft_engineering/vectorization.html
index f5cf711..e65f2d9 100644
--- a/userguide/ft_engineering/vectorization.html
+++ b/userguide/ft_engineering/vectorization.html
@@ -861,9 +861,9 @@
-
+
-
+
 5.1.
@@ -876,6 +876,21 @@
+5.2. Step-by-Step Tutorial on Supervised Learning
@@ -1228,6 +1243,57 @@
+6.8. Criteo Tutorial
+6.8.1. Data preparation
+6.8.2. Field-Aware Factorization Machines
@@ -2308,7 +2374,7 @@ Apache Hivemall is an effort undergoing incubation at The Apache Software Founda
 var gitbook = gitbook || [];
 gitbook.push(function() {
-gitbook.page.hasChanged({"page":{"title":"Feature vectorization","level":"3.6.1","depth":2,"next":{"title":"Quantify non-number features","level":"3.6.2","depth":2,"path":"ft_engineering/quantify.md","ref":"ft_engineering/quantify.md","articles":[]},"previous":{"title":"Feature Transformation","level":"3.6","depth":1,"path":"ft_engineering/ft_trans.md","ref":"ft_engineering/ft_trans.md","articles":[{"title":"Feature vectorization","level":"3.6.1","depth":2,"path":"ft_engineering/vectorization.md","ref":"ft_engineering/vectorization.md","articles":[]},{"title":"Quantify non-number features","level":"3.6.2","depth":2,"path":"ft_engineering/quantify.md","ref":"ft_engineering/quantify.md","articles":[]}]},"dir":"ltr"},"config":{"plugins":["theme-api","edit-link","github","splitter","sitemap","etoc","callouts","toggle-chapters","anchorjs","codeblock-filename","expandable-chapters","multipart","codeblock-filename","katex","emphasize","localized-footer"],"styles":{"website":"styles/website.css","pdf":"styles/pdf.css","epub":"styles/epub.css","mobi":"styles/mobi.css","ebook":"styles/ebook.css","print":"styles/print.css"},"pluginsConfig":{"emphasize":{},"callouts":{},"etoc":{"h2lb":3,"header":1,"maxdepth":3,"mindepth":1,"notoc":true},"github":{"url":"https://github.com/apache/incubator-hivemall/"},"splitter":{},"search":{},"downloadpdf":{"base":"https://github.com/apache/incubator-hivemall/docs/gitbook","label":"PDF","multilingual":false},"multipart":{},"localized-footer":{"filename":"FOOTER.md","hline":"true"},"lunr":{"maxIndexSize":100,"ignoreSpecialCharacters":false},"katex":{},"fontsettings":{"theme":"white","family":"sans","size":2,"font":"sans"},"highlight":{},"codeblock-filename":{},"sitemap":{"hostname":"http://hivemall.incubator.apache.org/"},"theme-api":{"languages":[],"split":false,"theme":"dark"},"sharing":{"facebook":true,"twitter":true,"google":false,"weibo":false,"instapaper":false,"vk":false,"all":["facebook","google","twitter","weibo","instapaper"]},"edit-link":{"label":"Edit","base":"
[13/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/ft_engineering/binning.html
--
diff --git a/userguide/ft_engineering/binning.html b/userguide/ft_engineering/binning.html
index 44e09ac..7253292 100644
--- a/userguide/ft_engineering/binning.html
+++ b/userguide/ft_engineering/binning.html
@@ -861,9 +861,9 @@
-
+
-
+
 5.1.
@@ -876,6 +876,21 @@
+5.2. Step-by-Step Tutorial on Supervised Learning
@@ -1228,6 +1243,57 @@
+6.8. Criteo Tutorial
+6.8.1. Data preparation
+6.8.2. Field-Aware Factorization Machines
@@ -2512,7 +2578,7 @@ Apache Hivemall is an effort undergoing incubation at The Apache Software Founda
 var gitbook = gitbook || [];
 gitbook.push(function() {
-gitbook.page.hasChanged({"page":{"title":"Feature Binning","level":"3.4","depth":1,"next":{"title":"Feature Paring","level":"3.5","depth":1,"path":"ft_engineering/pairing.md","ref":"ft_engineering/pairing.md","articles":[{"title":"Polynomial features","level":"3.5.1","depth":2,"path":"ft_engineering/polynomial.md","ref":"ft_engineering/polynomial.md","articles":[]}]},"previous":{"title":"Feature Selection","level":"3.3","depth":1,"path":"ft_engineering/selection.md","ref":"ft_engineering/selection.md","articles":[]},"dir":"ltr"},"config":{"plugins":["theme-api","edit-link","github","splitter","sitemap","etoc","callouts","toggle-chapters","anchorjs","codeblock-filename","expandable-chapters","multipart","codeblock-filename","katex","emphasize","localized-footer"],"styles":{"website":"styles/website.css","pdf":"styles/pdf.css","epub":"styles/epub.css","mobi":"styles/mobi.css","ebook":"styles/ebook.css","print":"styles/print.css"},"pluginsConfig":{"emphasize":{},"callouts":{},"etoc":{"h2lb":3,"header":1,"maxdepth":3,"mindepth":1,"notoc":true},"github":{"url":"https://github.com/apache/incubator-hivemall/"},"splitter":{},"search":{},"downloadpdf":{"base":"https://github.com/apache/incubator-hivemall/docs/gitbook","label":"PDF","multilingual":false},"multipart":{},"localized-footer":{"filename":"FOOTER.md","hline":"true"},"lunr":{"maxIndexSize":100,"ignoreSpecialCharacters":false},"katex":{},"fontsettings":{"theme":"white","family":"sans","size":2,"font":"sans"},"highlight":{},"codeblock-filename":{},"sitemap":{"hostname":"http://hivemall.incubator.apache.org/"},"theme-api":{"languages":[],"split":false,"theme":"dark"},"sharing":{"facebook":true,"twitter":true,"google":false,"weibo":false,"instapaper":false,"vk":false,"all":["facebook","google","twitter","weibo","instapaper"]},"edit-link":{"label":"Edit","base":"https://github.com/apache/incubator-hivemall/tree/master/docs/gitbook"},"theme-default":{"styles":{"website":"styles/website.css","pdf":"st
[08/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/regression/e2006.html
--
diff --git a/userguide/regression/e2006.html b/userguide/regression/e2006.html
index ef98e5e..a4cc962 100644
--- a/userguide/regression/e2006.html
+++ b/userguide/regression/e2006.html
@@ -861,9 +861,9 @@
-
+
-
+
 5.1.
@@ -876,6 +876,21 @@
+5.2. Step-by-Step Tutorial on Supervised Learning
@@ -1228,6 +1243,57 @@
+6.8. Criteo Tutorial
+6.8.1. Data preparation
+6.8.2. Field-Aware Factorization Machines
@@ -2270,7 +2336,7 @@ Apache Hivemall is an effort undergoing incubation at The Apache Software Founda
 var gitbook = gitbook || [];
 gitbook.push(function() {
-gitbook.page.hasChanged({"page":{"title":"E2006-tfidf Regression Tutorial","level":"8.2","depth":1,"next":{"title":"Data preparation","level":"8.2.1","depth":2,"path":"regression/e2006_dataset.md","ref":"regression/e2006_dataset.md","articles":[]},"previous":{"title":"Regression","level":"8.1","depth":1,"path":"regression/general.md","ref":"regression/general.md","articles":[]},"dir":"ltr"},"config":{"plugins":["theme-api","edit-link","github","splitter","sitemap","etoc","callouts","toggle-chapters","anchorjs","codeblock-filename","expandable-chapters","multipart","codeblock-filename","katex","emphasize","localized-footer"],"styles":{"website":"styles/website.css","pdf":"styles/pdf.css","epub":"styles/epub.css","mobi":"styles/mobi.css","ebook":"styles/ebook.css","print":"styles/print.css"},"pluginsConfig":{"emphasize":{},"callouts":{},"etoc":{"h2lb":3,"header":1,"maxdepth":3,"mindepth":1,"notoc":true},"github":{"url":"https://github.com/apache/incubator-hivemall/"},"splitter":{},"search":{},"downloadpdf":{"base":"
[09/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/recommend/movielens_cv.html
--
diff --git a/userguide/recommend/movielens_cv.html b/userguide/recommend/movielens_cv.html
index 8303b48..7d0394d 100644
--- a/userguide/recommend/movielens_cv.html
+++ b/userguide/recommend/movielens_cv.html
@@ -861,9 +861,9 @@
-
+
-
+
 5.1.
@@ -876,6 +876,21 @@
+5.2. Step-by-Step Tutorial on Supervised Learning
@@ -1228,6 +1243,57 @@
+6.8. Criteo Tutorial
+6.8.1. Data preparation
+6.8.2. Field-Aware Factorization Machines
@@ -2321,7 +2387,7 @@ Apache Hivemall is an effort undergoing incubation at The Apache Software Founda
 var gitbook = gitbook || [];
 gitbook.push(function() {
-gitbook.page.hasChanged({"page":{"title":"10-fold cross validation (Matrix Factorization)","level":"9.3.6","depth":2,"next":{"title":"Outlier Detection using Local Outlier Factor (LOF)","level":"10.1","depth":1,"path":"anomaly/lof.md","ref":"anomaly/lof.md","articles":[]},"previous":{"title":"SLIM for fast top-k recommendation","level":"9.3.5","depth":2,"path":"recommend/movielens_slim.md","ref":"recommend/movielens_slim.md","articles":[]},"dir":"ltr"},"config":{"plugins":["theme-api","edit-link","github","splitter","sitemap","etoc","callouts","toggle-chapters","anchorjs","codeblock-filename","expandable-chapters","multipart","codeblock-filename","katex","emphasize","localized-footer"],"styles":{"website":"styles/website.css","pdf":"styles/pdf.css","epub":"styles/epub.css","mobi":"styles/mobi.css","ebook":"styles/ebook.css","print":"styles/print.css"},"pluginsConfig":{"emphasize":{},"callouts":{},"etoc":{"h2lb":3,"header":1,"maxdepth":3,"mindepth":1,"notoc":true},"github":{"url":"
[17/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/binaryclass/criteo_ffm.html
--
diff --git a/userguide/binaryclass/criteo_ffm.html b/userguide/binaryclass/criteo_ffm.html
new file mode 100644
index 000..acaec48
--- /dev/null
+++ b/userguide/binaryclass/criteo_ffm.html
@@ -0,0 +1,2708 @@
+Field-Aware Factorization Machines · Hivemall User Manual
[07/20] incubator-hivemall-site git commit: Added criteo example and step-by-step guide for general classifier/regressor
http://git-wip-us.apache.org/repos/asf/incubator-hivemall-site/blob/6c6a1b42/userguide/search_index.json
--
diff --git a/userguide/search_index.json b/userguide/search_index.json
index e5fc83d..8e69da9 100644
--- a/userguide/search_index.json
+++ b/userguide/search_index.json
@@ -1 +1 @@
incubator-hivemall git commit: [HIVEMALL-215] Add step-by-step tutorial on Supervised Learning
Repository: incubator-hivemall
Updated Branches:
  refs/heads/master 533c15404 -> 07eb707b7

[HIVEMALL-215] Add step-by-step tutorial on Supervised Learning

## What changes were proposed in this pull request?

In this PR, step by step tutorial is going to be introduced.

## What type of PR is it?

Documentation

## What is the Jira issue?

https://issues.apache.org/jira/browse/HIVEMALL-215

Author: Aki Ariga

Closes #158 from chezou/tutorial.

Project: http://git-wip-us.apache.org/repos/asf/incubator-hivemall/repo
Commit: http://git-wip-us.apache.org/repos/asf/incubator-hivemall/commit/07eb707b
Tree: http://git-wip-us.apache.org/repos/asf/incubator-hivemall/tree/07eb707b
Diff: http://git-wip-us.apache.org/repos/asf/incubator-hivemall/diff/07eb707b

Branch: refs/heads/master
Commit: 07eb707b7c939776c7e096d581874d06a237dde5
Parents: 533c154
Author: Aki Ariga
Authored: Fri Aug 31 15:01:48 2018 +0900
Committer: Makoto Yui
Committed: Fri Aug 31 15:01:48 2018 +0900

--
 docs/gitbook/SUMMARY.md                        |   3 +-
 docs/gitbook/misc/prediction.md                | 163 ---
 docs/gitbook/supervised_learning/prediction.md | 163 +++
 docs/gitbook/supervised_learning/tutorial.md   | 461
 4 files changed, 626 insertions(+), 164 deletions(-)
--

http://git-wip-us.apache.org/repos/asf/incubator-hivemall/blob/07eb707b/docs/gitbook/SUMMARY.md
--
diff --git a/docs/gitbook/SUMMARY.md b/docs/gitbook/SUMMARY.md
index 155a221..6c69848 100644
--- a/docs/gitbook/SUMMARY.md
+++ b/docs/gitbook/SUMMARY.md
@@ -79,7 +79,8 @@
 ## Part V - Supervised Learning
-* [How Prediction Works](misc/prediction.md)
+* [How Prediction Works](supervised_learning/prediction.md)
+* [Step-by-Step Tutorial on Supervised Learning](supervised_learning/tutorial.md)
 ## Part VI - Binary Classification

http://git-wip-us.apache.org/repos/asf/incubator-hivemall/blob/07eb707b/docs/gitbook/misc/prediction.md
--
diff --git a/docs/gitbook/misc/prediction.md b/docs/gitbook/misc/prediction.md
deleted file mode 100644
index 53d0cea..000
--- a/docs/gitbook/misc/prediction.md
+++ /dev/null
@@ -1,163 +0,0 @@
-
-
-
-
-# What is "prediction problem"?
-
-In a context of machine learning, numerous tasks can be seen as **prediction problem**. For example, this user guide provides solutions for:
-
-- [spam detection](../binaryclass/webspam.md)
-- [news article classification](../multiclass/news20.md)
-- [click-through-rate estimation](../regression/kddcup12tr2.md)
-
-For any kinds of prediction problems, we generally provide a set of input-output pairs as:
-
-- **Input:** Set of features
-    - e.g., `["1:0.001","4:0.23","35:0.0035",...]`
-- **Output:** Target value
-    - e.g., 1, 0, 0.54, 42.195, ...
-
-Once a prediction model has been constructed based on the samples, the model can make prediction for unforeseen inputs.
-
-In order to train prediction models, an algorithm so-called ***stochastic gradient descent*** (SGD) is normally applied. You can learn more about this from the following external resources:
-
-- [scikit-learn documentation](http://scikit-learn.org/stable/modules/sgd.html)
-- [Spark MLlib documentation](http://spark.apache.org/docs/latest/mllib-optimization.html)
-
-Importantly, depending on types of output value, prediction problem can be categorized into **regression** and **classification** problem.
-
-# Regression
-
-The goal of regression is to predict **real values** as shown below:
-
-| features (input) | target real value (output) |
-|:---|:---:|
-|["1:0.001","4:0.23","35:0.0035",...] | 21.3 |
-|["1:0.2","3:0.1","13:0.005",...] | 6.2 |
-|["5:1.3","22:0.0.089","77:0.0001",...] | 17.1 |
-| ... | ... |
-
-In practice, target values could be any of small/large float/int negative/positive values. [Our CTR prediction tutorial](../regression/kddcup12tr2.md) solves regression problem with small floating point target values in a 0-1 range, for example.
-
-While there are several ways to realize regression by using Hivemall, `train_regressor()` is one of the most flexible functions. This feature is explained in [this page](../regression/general.md).
-
-# Classification
-
-In contrast to regression, output for classification problems should be (integer) **labels**:
-
-| features (input) | label (output) |
-|:---|:---:|
-|["1:0.001","4:0.23","35:0.0035",...] | 0 |
-|["1:0.2","3:0.1","13:0.005",...] | 1 |
-|["5:1.3","22:0.0.089","77:0.0001",...] | 1 |
-| ... | ... |
-
-In case the number of possible labels is 2 (0/1 or -1/1), the problem is **binary classification**, and Hivemall's `train_classifier()` function enables you to build binary classifiers. [Binary Classification](../binaryclass/general.md)