Re: Improving show segment info

2020-02-19 Thread haomarch
1. show partition enhanced I think it's more useful to see the data volume of each partiton, futhermore, the data volume of every day will an important metric which cann't be count now. Sum up, if the partition column is "day, hour" Shall we focus on these two problem: Q1: How can we find the data

Re: Improving show segment info

2020-02-19 Thread haomarch
The problem users concerned is more likely: What partitions do not have completely loaded? If wer just add partition-segment list in "show partitions", or "min/max(timecolumn)" in "show segemnts", the above question can't be answered clearly. The recommendation is, add a new cammand "show partit

Re: Improving show segment info

2020-02-17 Thread Jacky Li
> 2020年2月17日 下午2:00,akashrn5 写道: > > Hi, > >>> *1. How about creating a "tableName.segmentInfo" child table for each main >>> table?* user can query this table and easy to support filter, group by. we >>> just have to finalize the schema of this table. > We already have many things like index

Re: Improving show segment info

2020-02-17 Thread Jacky Li
> 2020年2月17日 下午2:03,akashrn5 写道: > > Hi Ajantha, > > I think event time comes into picture when the user has the timestamp > column, like in timeseries. So only in that case, this column makes sense. > > Else it won't be there. > > @Likun, correct me if my understanding is wrong. > Yes, y

Re: Improving show segment info

2020-02-16 Thread akashrn5
Hi Ajantha, I think event time comes into picture when the user has the timestamp column, like in timeseries. So only in that case, this column makes sense. Else it won't be there. @Likun, correct me if my understanding is wrong. Regards, Akash R Nilugal -- Sent from: http://apache-carbond

Re: Improving show segment info

2020-02-16 Thread akashrn5
Hi, >>*1. How about creating a "tableName.segmentInfo" child table for each main >>table?* user can query this table and easy to support filter, group by. we >>just have to finalize the schema of this table. We already have many things like index tables, datamap tables, just to store this metadata

Re: Improving show segment info

2020-02-16 Thread Ajantha Bhat
3. And about event time. I don't think we need to keep it for every row. It is a waste of storage size. can we keep in loadMetadetails or file level ? On Mon, Feb 17, 2020 at 11:10 AM Ajantha Bhat wrote: > Hi Likun, > > I think this display command is hard to maintain if we provide all these > o

Re: Improving show segment info

2020-02-16 Thread Ajantha Bhat
Hi Likun, I think this display command is hard to maintain if we provide all these options manually. *1. How about creating a "tableName.segmentInfo" child table for each main table?* user can query this table and easy to support filter, group by. we just have to finalize the schema of this table

Re: Improving show segment info

2020-02-16 Thread akashrn5
Hi, >I got your point, but given the partition column by user does not help reducing the information. If we want to reduce the >amount of the information, we should ask user to give the filter on partition column like example 3 in my original mail. 1. my concern was if there are more partition co

Re: Improving show segment info

2020-02-16 Thread Jacky Li
> 2020年2月16日 下午4:58,akashrn5 写道: > > Hi likun, > > Thanks for proposing this > > +1, its a good way and its better to provide user more info about segment. > > I have following doubts and suggestions. > > 1. You have mentioned DDL as Show segments On table, but currently it is > show segm

Re: Improving show segment info

2020-02-16 Thread akashrn5
Hi likun, Thanks for proposing this +1, its a good way and its better to provide user more info about segment. I have following doubts and suggestions. 1. You have mentioned DDL as Show segments On table, but currently it is show segments for table, i suggest not to change the current one,we c

Improving show segment info

2020-02-16 Thread Jacky Li
Hi community, Currently for SHOW SEGMENT command, carbon will print: +---+-+---+---+-+---+-+--++ |SegmentSequenceId |Status |Load Start Time|Load End Time |Merged To|File Format|Data Size|Ind