[ https://issues.apache.org/jira/browse/DRILL-7077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16813895#comment-16813895 ]
Bridget Bevens commented on DRILL-7077: --------------------------------------- Hi [~cgivre], I'm trying this function and may be doing something wrong, but 15SECOND and 30SECOND are not working for me: select nearestdate(CAST(COLUMNS[2] as timestamp), '30SECOND') as nearest_second from dfs.samples.`/bee/time.csv`; Error: SYSTEM ERROR: DrillRuntimeException: [30SECOND] is not a valid time statement. Expecting: [YEAR, QUARTER, MONTH, WEEK_SUNDAY, WEEK_MONDAY, DAY, HOUR, HALF_HOUR, QUARTER_HOUR, MINUTE, HALF_MINUTE, QUARTER_MINUTE, SECOND] Fragment 0:0 Please, refer to logs for more information. [Error Id: f119202e-ec24-4670-83c2-14b4a7f83ebf on doc23.lab:31010] (state=,code=0) apache drill> select nearestdate(CAST(COLUMNS[2] as timestamp), 'SECOND') as nearest_second from dfs.samples.`/bee/time.csv`; +-----------------------+ | nearest_second | +-----------------------+ | 2018-01-01 05:10:15.0 | | 2017-02-02 01:02:03.0 | | 2003-04-06 07:11:11.0 | +-----------------------+ 3 rows selected (0.191 seconds) Are 15SECOND and 30SECOND supported? Thanks, Bridget > Add Function to Facilitate Time Series Analysis > ----------------------------------------------- > > Key: DRILL-7077 > URL: https://issues.apache.org/jira/browse/DRILL-7077 > Project: Apache Drill > Issue Type: New Feature > Reporter: Charles Givre > Assignee: Charles Givre > Priority: Major > Labels: doc-impacting, ready-to-commit > Fix For: 1.16.0 > > > When analyzing time based data, you will often have to aggregate by time > grains. While some time grains will be easy to calculate, others, such as > quarter, can be quite difficult. These functions enable a user to quickly and > easily aggregate data by various units of time. Usage is as follows: > {code:java} > SELECT <fields> > FROM <data> > GROUP BY nearestDate(<timestamp_column>, <time increment>{code} > So let's say that a user wanted to count the number of hits on a web server > per 15 minute, the query might look like this: > {code:java} > SELECT nearestDate(`eventDate`, '15MINUTE' ) AS eventDate, > COUNT(*) AS hitCount > FROM dfs.`log.httpd` > GROUP BY nearestDate(`eventDate`, '15MINUTE'){code} > Currently supports the following time units: > * YEAR > * QUARTER > * MONTH > * WEEK_SUNDAY > * WEEK_MONDAY > * DAY > * HOUR > * HALF_HOUR / 30MIN > * QUARTER_HOUR / 15MIN > * MINUTE > * 30SECOND > * 15SECOND > * SECOND > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)