[ 
https://issues.apache.org/jira/browse/FALCON-580?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14119339#comment-14119339
 ] 

Srikanth Sundarrajan commented on FALCON-580:
---------------------------------------------

{quote}
How will eviction ever work? It will be broken or at best will not evict 
anything.
{quote}
When data is not time based, eviction based on time isn't supposed to do 
anything. If and when we implement age-based eviction (based on the directory 
creation/modification timestamp), it would evict as appropriate. 

{quote}
DistCp becomes very inefficient as time and data grows since it now needs to 
check every file on each side for copy.
{quote}
True, but that can't be a reason for disallowing non-time-based patterns on 
feed paths. What if this is a snapshot that needs to be constantly synced up 
(which is essentially non-time based)? In either case, this is a feature 
ideally used for reasonably sized snapshots that are constantly mutated. 

A flag would help to formalize this, but the flag can be inferred with a schema 
change simply by evaluating whether a time pattern is present in the path. I 
don't know what the equivalent would be for an HCat table-based feed, though.
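
As a rough sketch of that inference (not Falcon code; the function name and 
the list of variables are assumptions taken from the example path in this 
issue), checking a feed path for a time pattern could look like this:

```python
import re

# Time variables as they appear in the example path in this issue.
# Assumption: a real check would use whatever variable set Falcon supports.
TIME_VARS = ("YEAR", "MONTH", "DAY", "HOUR", "MINUTE")
_TIME_PATTERN = re.compile(r"\$\{(" + "|".join(TIME_VARS) + r")\}")


def is_time_partitioned(path: str) -> bool:
    """Return True if the feed path contains at least one time variable.

    Hypothetical helper: a False result would mean "treat the feed as a
    non-time-based snapshot".
    """
    return _TIME_PATTERN.search(path) is not None
```

This only covers file system paths; it says nothing about HCat table-based 
feeds.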



> Mandate date pattern for the feed path in the xsd
> -------------------------------------------------
>
>                 Key: FALCON-580
>                 URL: https://issues.apache.org/jira/browse/FALCON-580
>             Project: Falcon
>          Issue Type: Bug
>            Reporter: Sowmya Ramesh
>
>  The granularity of the date pattern in the feed path should be at least that 
> of the frequency of the feed. This should be mandated in the feed xsd.
> e.g.: 
> {noformat}
> Valid format: <location type="data" 
> path="/hdfsDataLocation/${YEAR}/${MONTH}/${DAY}/${HOUR}/${MINUTE}"/>
> Invalid format: <location type="data" path="/hdfsDataLocation"/>
> {noformat}
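
For illustration only, the granularity rule in the description above could be 
checked along these lines. This is a sketch under assumptions, not the xsd 
validation being proposed: the helper names and the mapping from frequency 
unit to required variable are hypothetical.

```python
import re

# Time variables from the example path, ordered coarsest to finest.
TIME_VARS = ["YEAR", "MONTH", "DAY", "HOUR", "MINUTE"]

# Hypothetical mapping: the finest variable a path must contain for a feed
# whose frequency is expressed in the given unit.
REQUIRED_VAR = {
    "months": "MONTH",
    "days": "DAY",
    "hours": "HOUR",
    "minutes": "MINUTE",
}


def finest_time_var(path):
    """Return the finest time variable present in the path, or None."""
    found = set(re.findall(r"\$\{([A-Z]+)\}", path))
    present = [v for v in TIME_VARS if v in found]
    return present[-1] if present else None


def granularity_ok(path, frequency_unit):
    """True if the path's date pattern is at least as fine as the frequency."""
    finest = finest_time_var(path)
    if finest is None:
        return False  # no date pattern at all, e.g. "/hdfsDataLocation"
    required = REQUIRED_VAR[frequency_unit]
    return TIME_VARS.index(finest) >= TIME_VARS.index(required)
```

The sketch looks only at the finest variable present; it does not verify that 
the coarser ones (for example ${YEAR} above ${MONTH}) are also in the path.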



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
