[ https://issues.apache.org/jira/browse/DRILL-7077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16806450#comment-16806450 ]
Volodymyr Vysotskyi edited comment on DRILL-7077 at 4/1/19 6:39 AM:
--------------------------------------------------------------------

Merged with commit id 3c798d338f4f4dab6713956e0a94d18e6e5c72bd


was (Author: vvysotskyi):
    Merged with commit 3c798d338f4f4dab6713956e0a94d18e6e5c72bd

> Add Function to Facilitate Time Series Analysis
> -----------------------------------------------
>
>                 Key: DRILL-7077
>                 URL: https://issues.apache.org/jira/browse/DRILL-7077
>             Project: Apache Drill
>          Issue Type: New Feature
>            Reporter: Charles Givre
>            Assignee: Charles Givre
>            Priority: Major
>              Labels: doc-impacting, ready-to-commit
>             Fix For: 1.16.0
>
>
> When analyzing time-based data, you will often have to aggregate by time
> grains. While some time grains are easy to calculate, others, such as
> quarter, can be quite difficult. These functions enable a user to quickly and
> easily aggregate data by various units of time. Usage is as follows:
> {code:sql}
> SELECT <fields>
> FROM <data>
> GROUP BY nearestDate(<timestamp_column>, <time increment>){code}
> Suppose a user wants to count the number of hits on a web server per
> 15-minute interval. The query might look like this:
> {code:sql}
> SELECT nearestDate(`eventDate`, '15MINUTE') AS eventDate,
> COUNT(*) AS hitCount
> FROM dfs.`log.httpd`
> GROUP BY nearestDate(`eventDate`, '15MINUTE'){code}
> Currently supports the following time units:
> * YEAR
> * QUARTER
> * MONTH
> * WEEK_SUNDAY
> * WEEK_MONDAY
> * DAY
> * HOUR
> * HALF_HOUR / 30MIN
> * QUARTER_HOUR / 15MIN
> * MINUTE
> * 30SECOND
> * 15SECOND
> * SECOND
>
>

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
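
For readers wondering what grouping by a time grain amounts to, here is a minimal Java sketch of the idea. It is not Drill's implementation. It assumes each timestamp is floored to the start of its bucket, which the description above does not state explicitly. The class name TimeGrainSketch and the helper floorTo are hypothetical, and only a few of the listed units are covered.

```java
import java.time.DayOfWeek;
import java.time.LocalDateTime;
import java.time.temporal.ChronoUnit;
import java.time.temporal.TemporalAdjusters;

class TimeGrainSketch {

    // Hypothetical helper (not part of Drill): floors a timestamp to the
    // start of the bucket named by `unit`. Assumes floor semantics.
    static LocalDateTime floorTo(LocalDateTime ts, String unit) {
        LocalDateTime day = ts.truncatedTo(ChronoUnit.DAYS);
        switch (unit) {
            case "YEAR":
                return day.withDayOfYear(1);
            case "QUARTER": {
                // Quarters start in months 1, 4, 7 and 10.
                int firstMonth = ((ts.getMonthValue() - 1) / 3) * 3 + 1;
                return day.withDayOfMonth(1).withMonth(firstMonth);
            }
            case "MONTH":
                return day.withDayOfMonth(1);
            case "WEEK_MONDAY":
                return day.with(TemporalAdjusters.previousOrSame(DayOfWeek.MONDAY));
            case "DAY":
                return day;
            case "HOUR":
                return ts.truncatedTo(ChronoUnit.HOURS);
            case "QUARTER_HOUR":
                // Integer division drops the remainder within the 15-minute bucket.
                return ts.truncatedTo(ChronoUnit.HOURS)
                         .withMinute((ts.getMinute() / 15) * 15);
            case "MINUTE":
                return ts.truncatedTo(ChronoUnit.MINUTES);
            default:
                throw new IllegalArgumentException("Unsupported unit in this sketch: " + unit);
        }
    }

    public static void main(String[] args) {
        LocalDateTime ts = LocalDateTime.of(2019, 5, 31, 14, 52, 7);
        System.out.println(floorTo(ts, "QUARTER_HOUR"));
        System.out.println(floorTo(ts, "QUARTER"));
    }
}
```

Every row whose timestamp falls in the same bucket maps to the same value, which is why the expression can serve as a GROUP BY key in the queries above.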