This is an automated email from the ASF dual-hosted git repository.
zkaoudi pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/incubator-wayang-website.git
The following commit(s) were added to refs/heads/main by this push:
new 1e1fcd5 rephrasing and updating information
1e1fcd5 is described below
commit 1e1fcd594d0a2bacdc8526b1e4312f5c19ea5f5e
Author: Zoi <[email protected]>
AuthorDate: Wed Dec 27 14:49:33 2023 +0200
rephrasing and updating information
---
_publications/2022-09-08-vldb_tutorial.md | 46 ++++++++++++++++++++++++++++++
assets/pdf/paper/p3778-kaoudi.pdf | Bin 0 -> 538029 bytes
2 files changed, 46 insertions(+)
diff --git a/_publications/2022-09-08-vldb_tutorial.md b/_publications/2022-09-08-vldb_tutorial.md
new file mode 100644
index 0000000..84533e2
--- /dev/null
+++ b/_publications/2022-09-08-vldb_tutorial.md
@@ -0,0 +1,46 @@
+---
+license: |
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE-2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+layout: publication
+title: Publication
+subtitle: >
+ Unified Data Analytics: State-of-the-art and Open Problems
+link-name: Tutorial paper at VLDB 2022
+img-thumb: assets/img/screenshot/apache-wayang.png
+authors: Zoi Kaoudi and Jorge-Arnulfo Quiané-Ruiz
+year: 2022
+month: 08
+day: 01
+link-paper: assets/pdf/paper/p3778-kaoudi.pdf
+link-external: false
+---
+
+There is an urgent need to unify data analytics as application tasks
+become increasingly complex: nowadays, it is common to
+see tasks performing data preparation, analytical processing, and
+machine learning operations in a single pipeline. Despite this need,
+achieving such unification is still a dreadful process where developers have to
+get familiar with many data processing platforms and write ad hoc
+scripts for integrating them. This tutorial is motivated by this need
+from both academia and industry. We will discuss the importance
+of unifying data processing as well as the current efforts to achieve
+it. In particular, we will introduce a classification of the different
+cases where an application needs or benefits from data analytics
+unification and discuss the challenges in each case. Along with this
+classification, we will also present the efforts known to date
+that aim at unifying data processing, such as Apache Beam and
+Apache Wayang, and emphasize their differences. We will conclude
+with open problems and their challenges.
diff --git a/assets/pdf/paper/p3778-kaoudi.pdf b/assets/pdf/paper/p3778-kaoudi.pdf
new file mode 100644
index 0000000..37d898c
Binary files /dev/null and b/assets/pdf/paper/p3778-kaoudi.pdf differ