YeonSu Han created TAJO-1388:
--------------------------------

             Summary: [Umbrella] Kafka Storage Integration.
                 Key: TAJO-1388
                 URL: https://issues.apache.org/jira/browse/TAJO-1388
             Project: Tajo
          Issue Type: New Feature
          Components: storage
            Reporter: YeonSu Han
            Assignee: YeonSu Han


Apache Kafka is one of the widely used message queueing system. If we can use 
the Kafka as Tajo storage, analysis area of Tajo user is be broaden. For 
example, as realtime analysis. 
For this, I propose 'Kafka storage'. Please review my proposal and give your 
opinion.

* Table Creation
{code:sql}
CREATE [EXTERNAL] TABLE [IF NOT EXISTS] <table_name> [(<column_name>
<data_type>, ... )]
using kafka with 
(‘kafka.topic’=’<kafka_topic_name>’,‘kafka.zk’=’<kafka_zookeeper_info>’,[other 
options])
{code}
** Use “kafka” keyword in “using” clause for creating kafka table in Tajo.
** kafka table name is mapped to a Tajo table name with , 'kafka.topic' 
property.

* Column mapping of kafka message
** Delimited line mapping (default)
** json mapping
** ...

* Concept
** The topic of kafka correspond to table.
** The partition of kafka correspond to file.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to