[jira] [Resolved] (KYLIN-1961) Project name is always constant instead of real project name in email notification

2016-08-16 Thread Dong Li (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dong Li resolved KYLIN-1961.

   Resolution: Fixed
Fix Version/s: v1.5.4

> Project name is always constant instead of real project name in email 
> notification
> --
>
> Key: KYLIN-1961
> URL: https://issues.apache.org/jira/browse/KYLIN-1961
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v1.5.3
>Reporter: Dong Li
>Assignee: Dong Li
> Fix For: v1.5.4
>
>
> Build Result of Job kylin_sales_cube - 20160930235500_20161006105000 - BUILD 
> - PDT 2016-08-17 05:56:41
> Build Result: SUCCEED
> Job Engine: sandbox.hortonworks.com
> Env: envName
> Project: projectName
> Cube Name: kylin_sales_cube
> Source Records Count: 0
> Start Time: Wed Aug 17 05:57:17 PDT 2016
> Duration: 5mins
> MR Waiting: 1mins
> Last Update Time: Wed Aug 17 06:02:19 PDT 2016
> Submitter: ADMIN
> Error Log: job has succeeded



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KYLIN-1961) Project name is always constant instead of real project name in email notification

2016-08-16 Thread Dong Li (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dong Li updated KYLIN-1961:
---
Description: 
Build Result of Job kylin_sales_cube - 20160930235500_20161006105000 - BUILD - 
PDT 2016-08-17 05:56:41
Build Result: SUCCEED
Job Engine: sandbox.hortonworks.com
Env: envName
Project: projectName
Cube Name: kylin_sales_cube
Source Records Count: 0
Start Time: Wed Aug 17 05:57:17 PDT 2016
Duration: 5mins
MR Waiting: 1mins
Last Update Time: Wed Aug 17 06:02:19 PDT 2016
Submitter: ADMIN
Error Log: job has succeeded

  was:Project name is always constant 'projectName' in email notification, so 
is deployEnv.


> Project name is always constant instead of real project name in email 
> notification
> --
>
> Key: KYLIN-1961
> URL: https://issues.apache.org/jira/browse/KYLIN-1961
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v1.5.3
>Reporter: Dong Li
>Assignee: Dong Li
>
> Build Result of Job kylin_sales_cube - 20160930235500_20161006105000 - BUILD 
> - PDT 2016-08-17 05:56:41
> Build Result: SUCCEED
> Job Engine: sandbox.hortonworks.com
> Env: envName
> Project: projectName
> Cube Name: kylin_sales_cube
> Source Records Count: 0
> Start Time: Wed Aug 17 05:57:17 PDT 2016
> Duration: 5mins
> MR Waiting: 1mins
> Last Update Time: Wed Aug 17 06:02:19 PDT 2016
> Submitter: ADMIN
> Error Log: job has succeeded



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KYLIN-1612) Cube is ready but insight tables not result

2016-08-16 Thread liyang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15423624#comment-15423624
 ] 

liyang commented on KYLIN-1612:
---

pls git log | grep

> Cube is ready but insight tables not result
> ---
>
> Key: KYLIN-1612
> URL: https://issues.apache.org/jira/browse/KYLIN-1612
> Project: Kylin
>  Issue Type: Bug
>Reporter: hongbin ma
>Assignee: liyang
> Fix For: v1.5.2
>
>
> As reported by Roy ( aqinnxuk...@163.com)
> Hi All,
> i use kylin v1.5.1.  add Cube and bulid successful.Cube status is Ready. but 
> in Insight tabs, tables display no result.
> has anyone encountered a similar situation, how to resovle it?
> Best Regards
> Roy He



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (KYLIN-1944) UTF 8

2016-08-16 Thread liyang (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

liyang resolved KYLIN-1944.
---
Resolution: Invalid

Not about Kylin.

> UTF 8 
> --
>
> Key: KYLIN-1944
> URL: https://issues.apache.org/jira/browse/KYLIN-1944
> Project: Kylin
>  Issue Type: Bug
>  Components: Tools, Build and Test
>Reporter: nandan priyadarshi
>Assignee: Zhong,Jason
>   Original Estimate: 34h
>  Remaining Estimate: 34h
>
> I am using Datastax Cassandra 4.8 version. Using Solr for Search activity. In 
> my table , I have Chinese character as well as English character. Table 
> retrieval is working smooth ,but some time I am not able to retrive all 
> records for particular filter. 
> For example :- 
> select * from yf_product_books where solr_query = 'author:杰奎琳'; 
> gives me 1 record . 
> but 
> select * from yf_product_books where solr_query = 'author:杰奎*';
> or
> select * from yf_product_books where solr_query = 'author:*奎*';
> I am not getting zero rows. 
> Please suggest me where I am  doing wrong.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KYLIN-1834) java.lang.IllegalArgumentException: Value not exists! - in Step 4 - Build Dimension Dictionary

2016-08-16 Thread liyang (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15423571#comment-15423571
 ] 

liyang commented on KYLIN-1834:
---

KYLIN-1948 is not related, it is not about dictionary.

> java.lang.IllegalArgumentException: Value not exists! - in Step 4 - Build 
> Dimension Dictionary
> --
>
> Key: KYLIN-1834
> URL: https://issues.apache.org/jira/browse/KYLIN-1834
> Project: Kylin
>  Issue Type: Bug
>Affects Versions: v1.5.2, v1.5.2.1
>Reporter: Richard Calaba
>Priority: Blocker
> Attachments: job_2016_06_28_09_59_12-value-not-found.zip
>
>
> Getting exception in Step 4 - Build Dimension Dictionary:
> java.lang.IllegalArgumentException: Value not exists!
>   at 
> org.apache.kylin.dimension.Dictionary.getIdFromValueBytes(Dictionary.java:160)
>   at 
> org.apache.kylin.dict.TrieDictionary.getIdFromValueImpl(TrieDictionary.java:158)
>   at 
> org.apache.kylin.dimension.Dictionary.getIdFromValue(Dictionary.java:96)
>   at 
> org.apache.kylin.dimension.Dictionary.getIdFromValue(Dictionary.java:76)
>   at 
> org.apache.kylin.dict.lookup.SnapshotTable.takeSnapshot(SnapshotTable.java:96)
>   at 
> org.apache.kylin.dict.lookup.SnapshotManager.buildSnapshot(SnapshotManager.java:106)
>   at 
> org.apache.kylin.cube.CubeManager.buildSnapshotTable(CubeManager.java:215)
>   at 
> org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:59)
>   at 
> org.apache.kylin.cube.cli.DictionaryGeneratorCLI.processSegment(DictionaryGeneratorCLI.java:42)
>   at 
> org.apache.kylin.engine.mr.steps.CreateDictionaryJob.run(CreateDictionaryJob.java:56)
>   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:70)
>   at org.apache.hadoop.util.ToolRunner.run(ToolRunner.java:84)
>   at 
> org.apache.kylin.engine.mr.common.HadoopShellExecutable.doWork(HadoopShellExecutable.java:60)
>   at 
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:114)
>   at 
> org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:50)
>   at 
> org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:114)
>   at 
> org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:124)
>   at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>   at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>   at java.lang.Thread.run(Thread.java:745)
> result code:2
> The code which generates the exception is:
> org.apache.kylin.dimension.Dictionary.java:
>  /**
>  * A lower level API, return ID integer from raw value bytes. In case of 
> not found 
>  * 
>  * - if roundingFlag=0, throw IllegalArgumentException; 
>  * - if roundingFlag<0, the closest smaller ID integer if exist; 
>  * - if roundingFlag>0, the closest bigger ID integer if exist. 
>  * 
>  * Bypassing the cache layer, this could be significantly slower than 
> getIdFromValue(T value).
>  * 
>  * @throws IllegalArgumentException
>  * if value is not found in dictionary and rounding is off;
>  * or if rounding cannot find a smaller or bigger ID
>  */
> final public int getIdFromValueBytes(byte[] value, int offset, int len, 
> int roundingFlag) throws IllegalArgumentException {
> if (isNullByteForm(value, offset, len))
> return nullId();
> else {
> int id = getIdFromValueBytesImpl(value, offset, len, 
> roundingFlag);
> if (id < 0)
> throw new IllegalArgumentException("Value not exists!");
> return id;
> }
> } 
> ==
> The Cube is big - fact 110 mio rows, the largest dimension (customer) has 10 
> mio rows. I have increased the JVM -Xmx to 16gb and set the 
> kylin.table.snapshot.max_mb=2048 in kylin.properties to make sure the Cube 
> build doesn't fail (previously we were getting exception complaining about 
> the 300MB limit for Dimension dictionary size (req. approx 700MB)).
> ==
> Before that we were getting exception complaining about the Dictionary 
> encoding problem - "Too high cardinality is not suitable for dictionary -- 
> cardinality: 10873977" - this we resolved by changing the affected 
> dimension/row key Encoding from "dict" to "int; length=8" on the Advanced 
> Settings of the Cube.
> ==
> We have 2 high-cardinality fields (one from fact table and one from the big 
> dimension (customer - see 

[jira] [Updated] (KYLIN-1849) add basic search capability at model UI

2016-08-16 Thread kangkaisen (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

kangkaisen updated KYLIN-1849:
--
Attachment: KYLIN-1849.patch

This patch supports to search cube by name in web.

> add basic search capability at model UI
> ---
>
> Key: KYLIN-1849
> URL: https://issues.apache.org/jira/browse/KYLIN-1849
> Project: Kylin
>  Issue Type: New Feature
>  Components: Web 
>Affects Versions: v1.5.2
>Reporter: Dayue Gao
>Assignee: kangkaisen
> Attachments: KYLIN-1849.patch
>
>
> In order to work with dozens of cubes, could we add a search box at "Model" 
> page? Just like the one at "Monitor" page.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (KYLIN-1849) add basic search capability at model UI

2016-08-16 Thread kangkaisen (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

kangkaisen reassigned KYLIN-1849:
-

Assignee: kangkaisen  (was: Zhong,Jason)

> add basic search capability at model UI
> ---
>
> Key: KYLIN-1849
> URL: https://issues.apache.org/jira/browse/KYLIN-1849
> Project: Kylin
>  Issue Type: New Feature
>  Components: Web 
>Affects Versions: v1.5.2
>Reporter: Dayue Gao
>Assignee: kangkaisen
>
> In order to work with dozens of cubes, could we add a search box at "Model" 
> page? Just like the one at "Monitor" page.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (KYLIN-1960) Provide a new build type called COVER

2016-08-16 Thread Zhong Yanghong (JIRA)

[ 
https://issues.apache.org/jira/browse/KYLIN-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15422471#comment-15422471
 ] 

Zhong Yanghong commented on KYLIN-1960:
---

[~Shaofengshi] and [~liyang.g...@gmail.com], could you help review this patch? 
Thanks.

> Provide a new build type called COVER
> -
>
> Key: KYLIN-1960
> URL: https://issues.apache.org/jira/browse/KYLIN-1960
> Project: Kylin
>  Issue Type: New Feature
>Reporter: Zhong Yanghong
>Assignee: Zhong Yanghong
> Attachments: provide_a_new_build_type_COVER.patch
>
>
> The current three build types are not good at dealing with incremental 
> building with refreshing old data. 
> For example, there are [S1, S2, S3] segments in a cube. S1 with time range 
> [t1, t2); S2 with time range [t2, t3); S3 with time range [t3, t4). 
> Now users want to refresh the old data within [t2, t4). The current strategy 
> is to merge S2 and S3 firstly. Then refresh the bigger segment with time 
> range [t2, t4). The first step is meaningless and wasteful. 
> What if users want to build a new segment with time range [t2, t5), where t5 
> is larger than t4. The current strategy is clumsy.
> How about providing a new build type? Here, it's called COVER. For this type, 
> users also should provide start time and end time as parameters. The start 
> time should match the boundaries of segments. In this case, it should be in 
> set {t1, t2, t3, t4}. While the end time should be larger than t4, or match 
> one of the end times of existing segments. In this case, it should be in set 
> {t2, t3, t4}. Of course, the end time should be larger than the start time. 
> What job engine will do with this build type is as follows:
> 1. first build a new segment with the time range;
> 2. then the covered segments will be deleted.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KYLIN-1960) Provide a new build type called COVER

2016-08-16 Thread Zhong Yanghong (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhong Yanghong updated KYLIN-1960:
--
Attachment: (was: provide_a_new_build_type_COVER.patch)

> Provide a new build type called COVER
> -
>
> Key: KYLIN-1960
> URL: https://issues.apache.org/jira/browse/KYLIN-1960
> Project: Kylin
>  Issue Type: New Feature
>Reporter: Zhong Yanghong
>Assignee: Zhong Yanghong
> Attachments: provide_a_new_build_type_COVER.patch
>
>
> The current three build types are not good at dealing with incremental 
> building with refreshing old data. 
> For example, there are [S1, S2, S3] segments in a cube. S1 with time range 
> [t1, t2); S2 with time range [t2, t3); S3 with time range [t3, t4). 
> Now users want to refresh the old data within [t2, t4). The current strategy 
> is to merge S2 and S3 firstly. Then refresh the bigger segment with time 
> range [t2, t4). The first step is meaningless and wasteful. 
> What if users want to build a new segment with time range [t2, t5), where t5 
> is larger than t4. The current strategy is clumsy.
> How about providing a new build type? Here, it's called COVER. For this type, 
> users also should provide start time and end time as parameters. The start 
> time should match the boundaries of segments. In this case, it should be in 
> set {t1, t2, t3, t4}. While the end time should be larger than t4, or match 
> one of the end times of existing segments. In this case, it should be in set 
> {t2, t3, t4}. Of course, the end time should be larger than the start time. 
> What job engine will do with this build type is as follows:
> 1. first build a new segment with the time range;
> 2. then the covered segments will be deleted.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KYLIN-1960) Provide a new build type called COVER

2016-08-16 Thread Zhong Yanghong (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhong Yanghong updated KYLIN-1960:
--
Attachment: provide_a_new_build_type_COVER.patch

> Provide a new build type called COVER
> -
>
> Key: KYLIN-1960
> URL: https://issues.apache.org/jira/browse/KYLIN-1960
> Project: Kylin
>  Issue Type: New Feature
>Reporter: Zhong Yanghong
>Assignee: Zhong Yanghong
> Attachments: provide_a_new_build_type_COVER.patch
>
>
> The current three build types are not good at dealing with incremental 
> building with refreshing old data. 
> For example, there are [S1, S2, S3] segments in a cube. S1 with time range 
> [t1, t2); S2 with time range [t2, t3); S3 with time range [t3, t4). 
> Now users want to refresh the old data within [t2, t4). The current strategy 
> is to merge S2 and S3 firstly. Then refresh the bigger segment with time 
> range [t2, t4). The first step is meaningless and wasteful. 
> What if users want to build a new segment with time range [t2, t5), where t5 
> is larger than t4. The current strategy is clumsy.
> How about providing a new build type? Here, it's called COVER. For this type, 
> users also should provide start time and end time as parameters. The start 
> time should match the boundaries of segments. In this case, it should be in 
> set {t1, t2, t3, t4}. While the end time should be larger than t4, or match 
> one of the end times of existing segments. In this case, it should be in set 
> {t2, t3, t4}. Of course, the end time should be larger than the start time. 
> What job engine will do with this build type is as follows:
> 1. first build a new segment with the time range;
> 2. then the covered segments will be deleted.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KYLIN-1960) Provide a new build type called COVER

2016-08-16 Thread Zhong Yanghong (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhong Yanghong updated KYLIN-1960:
--
Attachment: provide_a_new_build_type_COVER.patch

> Provide a new build type called COVER
> -
>
> Key: KYLIN-1960
> URL: https://issues.apache.org/jira/browse/KYLIN-1960
> Project: Kylin
>  Issue Type: New Feature
>Reporter: Zhong Yanghong
>Assignee: Zhong Yanghong
> Attachments: provide_a_new_build_type_COVER.patch
>
>
> The current three build types are not good at dealing with incremental 
> building with refreshing old data. 
> For example, there are [S1, S2, S3] segments in a cube. S1 with time range 
> [t1, t2); S2 with time range [t2, t3); S3 with time range [t3, t4). 
> Now users want to refresh the old data within [t2, t4). The current strategy 
> is to merge S2 and S3 firstly. Then refresh the bigger segment with time 
> range [t2, t4). The first step is meaningless and wasteful. 
> What if users want to build a new segment with time range [t2, t5), where t5 
> is larger than t4. The current strategy is clumsy.
> How about providing a new build type? Here, it's called COVER. For this type, 
> users also should provide start time and end time as parameters. The start 
> time should match the boundaries of segments. In this case, it should be in 
> set {t1, t2, t3, t4}. While the end time should be larger than t4, or match 
> one of the end times of existing segments. In this case, it should be in set 
> {t2, t3, t4}. Of course, the end time should be larger than the start time. 
> What job engine will do with this build type is as follows:
> 1. first build a new segment with the time range;
> 2. then the covered segments will be deleted.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KYLIN-1702) The Key of the Snapshot to the related lookup table may be not informative

2016-08-16 Thread Zhong Yanghong (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhong Yanghong updated KYLIN-1702:
--
Attachment: (was: 
change_layout_path_of_table_snapshot_with_tableName.patch)

> The Key of the Snapshot to the related lookup table may be not informative
> --
>
> Key: KYLIN-1702
> URL: https://issues.apache.org/jira/browse/KYLIN-1702
> Project: Kylin
>  Issue Type: Improvement
>Reporter: Zhong Yanghong
>Assignee: Zhong Yanghong
> Attachments: change_layout_path_of_table_snapshot_with_tableName.patch
>
>
> Currently the key for the snapshot stored in hbase metadata file is as 
> follows:
> ResourceStore.SNAPSHOT_RESOURCE_ROOT + "/" + new 
> File(signature.getPath()).getName() + "/" + uuid + ".snapshot"
> However, some hive tables stored in hive may organized like 
> dirName/tableName/00, dirName/tableName/01.
> Based on current setting, the key will be 
> ResourceStore.SNAPSHOT_RESOURCE_ROOT + "/" + 00 + "/" + uuid + ".snapshot", 
> which is lack of the table name information.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (KYLIN-1702) The Key of the Snapshot to the related lookup table may be not informative

2016-08-16 Thread Zhong Yanghong (JIRA)

 [ 
https://issues.apache.org/jira/browse/KYLIN-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Zhong Yanghong updated KYLIN-1702:
--
Attachment: change_layout_path_of_table_snapshot_with_tableName.patch

A small patch for changing the layout path of snapshot table is attached. 
[~Shaofengshi], could you help review this patch? Thanks.

> The Key of the Snapshot to the related lookup table may be not informative
> --
>
> Key: KYLIN-1702
> URL: https://issues.apache.org/jira/browse/KYLIN-1702
> Project: Kylin
>  Issue Type: Improvement
>Reporter: Zhong Yanghong
>Assignee: Zhong Yanghong
> Attachments: change_layout_path_of_table_snapshot_with_tableName.patch
>
>
> Currently the key for the snapshot stored in hbase metadata file is as 
> follows:
> ResourceStore.SNAPSHOT_RESOURCE_ROOT + "/" + new 
> File(signature.getPath()).getName() + "/" + uuid + ".snapshot"
> However, some hive tables stored in hive may organized like 
> dirName/tableName/00, dirName/tableName/01.
> Based on current setting, the key will be 
> ResourceStore.SNAPSHOT_RESOURCE_ROOT + "/" + 00 + "/" + uuid + ".snapshot", 
> which is lack of the table name information.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)