[jira] [Created] (KYLIN-4121) Cleanup hive view intermediate tables after job be finished

2019-07-31 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-4121:
---

 Summary: Cleanup hive view intermediate tables after job be 
finished
 Key: KYLIN-4121
 URL: https://issues.apache.org/jira/browse/KYLIN-4121
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI


Reported by a community user:
I have a cube whose fact table joins a lookup table in Hive, and both are Hive 
views. I submit a job once per hour.

Kylin drops the intermediate fact table, but doesn't drop the intermediate 
lookup table.

I checked the source code and found that, at '13 step: Hive Cleanup', the 
relevant code has been commented out.


This is a legacy issue. KYLIN-3515 has since fixed it, so the cleanup can be 
enabled.



--
This message was sent by Atlassian JIRA
(v7.6.14#76016)


[jira] [Created] (KYLIN-4063) Avoid repeatedly calling "string.toLowerCase" in TimedJsonStreamParser#parse

2019-07-01 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-4063:
---

 Summary: Avoid repeatedly calling "string.toLowerCase" in 
TimedJsonStreamParser#parse
 Key: KYLIN-4063
 URL: https://issues.apache.org/jira/browse/KYLIN-4063
 Project: Kylin
  Issue Type: Improvement
  Components: NRT Streaming
Reporter: Shaofeng SHI


TimedJsonStreamParser#parse contains this loop:

 
{code:java}
for (TblColRef column : allColumns) {
    final String columnName = column.getName().toLowerCase(Locale.ROOT);
    if (populateDerivedTimeColumns(columnName, result, t) == false) {
        result.add(getValueByKey(column, root));
    }
}
{code}
 

This method is invoked for every message, and for every column it calls 
"toLowerCase(Locale.ROOT)". That is unnecessary, because "allColumns" 
doesn't change.
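One way to avoid the repeated call could be to lower-case the column names once, when the parser is created, and reuse them for every message. The sketch below only illustrates that idea: the class LowerCasedColumns and its methods are hypothetical stand-ins, not Kylin's actual code, and plain strings are used in place of TblColRef.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Locale;

// Hypothetical stand-in for the parser state; names are illustrative only
// and are not part of Kylin's API.
class LowerCasedColumns {
    private final List<String> lowerCasedNames;

    // Lower-case every column name once, at construction time.
    LowerCasedColumns(List<String> columnNames) {
        List<String> names = new ArrayList<>(columnNames.size());
        for (String name : columnNames) {
            names.add(name.toLowerCase(Locale.ROOT));
        }
        this.lowerCasedNames = Collections.unmodifiableList(names);
    }

    // Called for each message: an index lookup, with no toLowerCase call.
    String lowerCasedNameAt(int columnIndex) {
        return lowerCasedNames.get(columnIndex);
    }

    int size() {
        return lowerCasedNames.size();
    }
}
```

The per-message loop would then iterate by index and pass lowerCasedNameAt(i) where it currently computes columnName.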

 





[jira] [Created] (KYLIN-4062) Too many "if else" clause in PushDownRunnerJdbcImpl#toSqlType

2019-06-30 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-4062:
---

 Summary: Too many "if else" clause in 
PushDownRunnerJdbcImpl#toSqlType
 Key: KYLIN-4062
 URL: https://issues.apache.org/jira/browse/KYLIN-4062
 Project: Kylin
  Issue Type: Improvement
  Components: Query Engine
Reporter: Shaofeng SHI


This method has 30 "if else" branches, which is inefficient. It should use a 
static HashMap, so that only one lookup is needed.

 
{code:java}
if ("string".equalsIgnoreCase(type)) {
    return Types.VARCHAR;
} else if ("varchar".equalsIgnoreCase(type)) {
    return Types.VARCHAR;
} else if ("char".equalsIgnoreCase(type)) {
    return Types.CHAR;
} else if
...{code}
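
A minimal sketch of the map-based lookup, assuming only the three mappings visible in the snippet above. The class name SqlTypeLookup and the caller-supplied fallback value are assumptions for illustration; the remaining type names and the real default are in the original method and are not reproduced here.

```java
import java.sql.Types;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical helper; not the actual PushDownRunnerJdbcImpl code.
class SqlTypeLookup {
    private static final Map<String, Integer> TYPES = new HashMap<>();

    static {
        // Keys are stored lower-cased so one normalized lookup suffices.
        TYPES.put("string", Types.VARCHAR);
        TYPES.put("varchar", Types.VARCHAR);
        TYPES.put("char", Types.CHAR);
        // The remaining mappings of the original method would be added here.
    }

    // Returns the mapped java.sql.Types constant, or the caller-supplied
    // fallback (an assumption of this sketch) when the name is unknown.
    static int toSqlType(String type, int fallback) {
        if (type == null) {
            return fallback;
        }
        Integer sqlType = TYPES.get(type.toLowerCase(Locale.ROOT));
        return sqlType != null ? sqlType : fallback;
    }
}
```

Lower-casing with Locale.ROOT before the lookup keeps the matching case-insensitive for ASCII type names, as "equalsIgnoreCase" does in the chain above.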





[jira] [Created] (KYLIN-3999) Enable dynamic column by default

2019-05-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3999:
---

 Summary: Enable dynamic column by default
 Key: KYLIN-3999
 URL: https://issues.apache.org/jira/browse/KYLIN-3999
 Project: Kylin
  Issue Type: Improvement
  Components: Query Engine
Reporter: Shaofeng SHI


More and more users expect to use the "SUM(CASE WHEN)" feature and get an 
error. The reason is that the dynamic column is disabled by default. We should 
consider enabling it by default:

 

kylin.query.enable-dynamic-column=true





[jira] [Created] (KYLIN-3878) NPE to run sonar analysis

2019-03-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3878:
---

 Summary: NPE to run sonar analysis
 Key: KYLIN-3878
 URL: https://issues.apache.org/jira/browse/KYLIN-3878
 Project: Kylin
  Issue Type: Test
  Components: Tools, Build and Test
Reporter: Shaofeng SHI


mvn sonar:sonar -Dsonar.host.url=https://sonarcloud.io 
-Dsonar.organization=kylin -e

[INFO] 
[INFO] BUILD FAILURE
[INFO] 
[INFO] Total time: 03:13 min
[INFO] Finished at: 2019-03-15T14:42:16Z
[INFO] 
[ERROR] Failed to execute goal 
org.sonarsource.scanner.maven:sonar-maven-plugin:3.6.0.1398:sonar (default-cli) 
on project kylin: null: MojoExecutionException: NullPointerException -> [Help 1]
org.apache.maven.lifecycle.LifecycleExecutionException: Failed to execute goal 
org.sonarsource.scanner.maven:sonar-maven-plugin:3.6.0.1398:sonar (default-cli) 
on project kylin: null
 at org.apache.maven.lifecycle.internal.MojoExecutor.execute 
(MojoExecutor.java:213)
 at org.apache.maven.lifecycle.internal.MojoExecutor.execute 
(MojoExecutor.java:154)
 at org.apache.maven.lifecycle.internal.MojoExecutor.execute 
(MojoExecutor.java:146)
 at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject 
(LifecycleModuleBuilder.java:117)
 at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject 
(LifecycleModuleBuilder.java:81)
 at 
org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build
 (SingleThreadedBuilder.java:56)
 at org.apache.maven.lifecycle.internal.LifecycleStarter.execute 
(LifecycleStarter.java:128)
 at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
 at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
 at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
 at org.apache.maven.cli.MavenCli.execute (MavenCli.java:956)
 at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:290)
 at org.apache.maven.cli.MavenCli.main (MavenCli.java:194)
 at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
 at sun.reflect.NativeMethodAccessorImpl.invoke 
(NativeMethodAccessorImpl.java:62)
 at sun.reflect.DelegatingMethodAccessorImpl.invoke 
(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke (Method.java:498)
 at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced 
(Launcher.java:289)
 at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:229)
 at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode 
(Launcher.java:415)
 at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:356)
Caused by: org.apache.maven.plugin.MojoExecutionException
 at org.sonarsource.scanner.maven.bootstrap.ScannerBootstrapper.execute 
(ScannerBootstrapper.java:67)
 at org.sonarsource.scanner.maven.SonarQubeMojo.execute (SonarQubeMojo.java:104)
 at org.apache.maven.plugin.DefaultBuildPluginManager.executeMojo 
(DefaultBuildPluginManager.java:137)
 at org.apache.maven.lifecycle.internal.MojoExecutor.execute 
(MojoExecutor.java:208)
 at org.apache.maven.lifecycle.internal.MojoExecutor.execute 
(MojoExecutor.java:154)
 at org.apache.maven.lifecycle.internal.MojoExecutor.execute 
(MojoExecutor.java:146)
 at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject 
(LifecycleModuleBuilder.java:117)
 at org.apache.maven.lifecycle.internal.LifecycleModuleBuilder.buildProject 
(LifecycleModuleBuilder.java:81)
 at 
org.apache.maven.lifecycle.internal.builder.singlethreaded.SingleThreadedBuilder.build
 (SingleThreadedBuilder.java:56)
 at org.apache.maven.lifecycle.internal.LifecycleStarter.execute 
(LifecycleStarter.java:128)
 at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:305)
 at org.apache.maven.DefaultMaven.doExecute (DefaultMaven.java:192)
 at org.apache.maven.DefaultMaven.execute (DefaultMaven.java:105)
 at org.apache.maven.cli.MavenCli.execute (MavenCli.java:956)
 at org.apache.maven.cli.MavenCli.doMain (MavenCli.java:290)
 at org.apache.maven.cli.MavenCli.main (MavenCli.java:194)
 at sun.reflect.NativeMethodAccessorImpl.invoke0 (Native Method)
 at sun.reflect.NativeMethodAccessorImpl.invoke 
(NativeMethodAccessorImpl.java:62)
 at sun.reflect.DelegatingMethodAccessorImpl.invoke 
(DelegatingMethodAccessorImpl.java:43)
 at java.lang.reflect.Method.invoke (Method.java:498)
 at org.codehaus.plexus.classworlds.launcher.Launcher.launchEnhanced 
(Launcher.java:289)
 at org.codehaus.plexus.classworlds.launcher.Launcher.launch (Launcher.java:229)
 at org.codehaus.plexus.classworlds.launcher.Launcher.mainWithExitCode 
(Launcher.java:415)
 at org.codehaus.plexus.classworlds.launcher.Launcher.main (Launcher.java:356)
Caused by: java.lang.NullPointerException
 at org.A.E.get (Un

[jira] [Created] (KYLIN-3862) Check the binary packages

2019-03-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3862:
---

 Summary: Check the binary packages
 Key: KYLIN-3862
 URL: https://issues.apache.org/jira/browse/KYLIN-3862
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI


Regarding the approval of binary packages:

In general it is not possible to check the exact contents of a binary; however, 
there are some checks that should be made:
- signatures and hashes must be OK
- the package must contain the correct NOTICE and LICENSE files for the 
included content
- the package must not contain any content not derived from the source
- in the case of bundled binaries, reviewers must check that all contents are 
represented in the LICENSE (and NOTICE file if required). The bundle must not 
contain any files that are prohibited from distribution (category X).





[jira] [Created] (KYLIN-3863) Check the binary packages

2019-03-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3863:
---

 Summary: Check the binary packages
 Key: KYLIN-3863
 URL: https://issues.apache.org/jira/browse/KYLIN-3863
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI


Regarding the approval of binary packages:

In general it is not possible to check the exact contents of a binary; however, 
there are some checks that should be made:
- signatures and hashes must be OK
- the package must contain the correct NOTICE and LICENSE files for the 
included content
- the package must not contain any content not derived from the source
- in the case of bundled binaries, reviewers must check that all contents are 
represented in the LICENSE (and NOTICE file if required). The bundle must not 
contain any files that are prohibited from distribution (category X).





[jira] [Created] (KYLIN-3826) MergeCuboidJob only uploads necessary segment's dictionary

2019-02-23 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3826:
---

 Summary: MergeCuboidJob only uploads necessary segment's dictionary
 Key: KYLIN-3826
 URL: https://issues.apache.org/jira/browse/KYLIN-3826
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI


At yesterday's Kylin meetup, Zhang Wei mentioned that "MergeCuboidJob" uploads 
the metadata of all segments, which takes a very long time when the number of 
segments is large. This is unnecessary.





[jira] [Created] (KYLIN-3825) Add ACL Rest APIs to document

2019-02-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3825:
---

 Summary: Add ACL Rest APIs to document
 Key: KYLIN-3825
 URL: https://issues.apache.org/jira/browse/KYLIN-3825
 Project: Kylin
  Issue Type: Improvement
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3823) Release v2.6.1

2019-02-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3823:
---

 Summary: Release v2.6.1
 Key: KYLIN-3823
 URL: https://issues.apache.org/jira/browse/KYLIN-3823
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3795) Submit Spark jobs via Apache Livy

2019-01-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3795:
---

 Summary: Submit Spark jobs via Apache Livy
 Key: KYLIN-3795
 URL: https://issues.apache.org/jira/browse/KYLIN-3795
 Project: Kylin
  Issue Type: New Feature
  Components: Spark Engine
Reporter: Shaofeng SHI


Livy is a REST service for Spark. Some users use Livy as their interface to 
Spark. Kylin could add the capability to submit Spark jobs via Livy.

https://livy.incubator.apache.org/





[jira] [Created] (KYLIN-3796) MongoDB as data source

2019-01-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3796:
---

 Summary: MongoDB as data source
 Key: KYLIN-3796
 URL: https://issues.apache.org/jira/browse/KYLIN-3796
 Project: Kylin
  Issue Type: New Feature
  Components: Others
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3793) org.apache.kylin.source.kafka.util.KafkaSampleProducer exit after generating 1 message

2019-01-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3793:
---

 Summary: org.apache.kylin.source.kafka.util.KafkaSampleProducer 
exit after generating 1 message
 Key: KYLIN-3793
 URL: https://issues.apache.org/jira/browse/KYLIN-3793
 Project: Kylin
  Issue Type: Bug
  Components: NRT Streaming
Affects Versions: v2.6.0
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3753) Delete old hbase table when refresh the segment

2019-01-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3753:
---

 Summary: Delete old hbase table when refresh the segment
 Key: KYLIN-3753
 URL: https://issues.apache.org/jira/browse/KYLIN-3753
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3702) Release 2.5.2

2018-11-25 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3702:
---

 Summary: Release 2.5.2
 Key: KYLIN-3702
 URL: https://issues.apache.org/jira/browse/KYLIN-3702
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3693) TopN, Count distinct incorrect in Spark engine

2018-11-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3693:
---

 Summary: TopN, Count distinct incorrect in Spark engine
 Key: KYLIN-3693
 URL: https://issues.apache.org/jira/browse/KYLIN-3693
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Affects Versions: v2.5.0, v2.5.1
Reporter: Shaofeng SHI


 
{code:java}
[ERROR] Failures: 
[ERROR]   
ITCombinationTest>ITKylinQueryTest.testPreciselyDistinctCountQuery:284->KylinTestBase.execAndCompQuery:544->KylinTestBase.execAndCompQuery:647->KylinTestBase.assertTableEquals:691
 value (table=query result of query01, row=0, col=USER_COUNT) expected:<8[46]> 
but was:<8[52]>
[ERROR]   
ITCombinationTest>ITKylinQueryTest.testTopNQuery:277->KylinTestBase.execAndCompQuery:544->KylinTestBase.execAndCompQuery:647->KylinTestBase.assertTableEquals:691
 value (table=query result of query82, row=0, col=CAL_DT) 
expected:<2012-01-[28]> but was:<2012-01-[05]>
[INFO] 
{code}
 





[jira] [Created] (KYLIN-3679) Fetch Kafka topic with Spark streaming

2018-11-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3679:
---

 Summary: Fetch Kafka topic with Spark streaming
 Key: KYLIN-3679
 URL: https://issues.apache.org/jira/browse/KYLIN-3679
 Project: Kylin
  Issue Type: New Feature
  Components: Spark Engine
Reporter: Shaofeng SHI


Currently Kylin uses an MR job to fetch Kafka messages in parallel and then 
persists them to HDFS for subsequent processing. If the user selects the Spark 
engine, we can use the Spark Streaming API to do this. Spark Streaming can read 
the Kafka messages in a given offset range as an RDD, which is then easy to 
process:

https://spark.apache.org/docs/latest/streaming-kafka-0-10-integration.html 

With Spark Streaming, Kylin can also easily connect to other data sources such 
as Kinesis, Flume, etc.





[jira] [Created] (KYLIN-3678) CacheStateChecker may remove a cache file that under building

2018-11-08 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3678:
---

 Summary: CacheStateChecker may remove a cache file that under 
building
 Key: KYLIN-3678
 URL: https://issues.apache.org/jira/browse/KYLIN-3678
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v2.5.0, v2.4.1, v2.4.0, v2.5.1
Reporter: Shaofeng SHI


A Kylin test failed with the following error:
{code:java}
2018-11-09 02:15:24,379 DEBUG [main] cachesync.CachedCrudAssist:127 : Loaded 1 
ExternalFilterDesc(s) out of 1 resource
2018-11-09 02:15:24,380 WARN  [main] common.KylinConfigBase:77 : KYLIN_HOME was 
not set
2018-11-09 02:15:24,380 INFO  [main] cache.RocksDBLookupBuilder:66 : create new 
rocksdb 
folder:lookup_cache/rocksdb/DEFAULT.TEST_COUNTRY/f19bc17c-d41d-a4be-b561-f6bd275f4c90/db
 for table cache:DEFAULT.TEST_COUNTRY
2018-11-09 02:15:24,380 INFO  [main] cache.RocksDBLookupBuilder:69 : start to 
build lookup table:DEFAULT.TEST_COUNTRY to rocks 
db:lookup_cache/rocksdb/DEFAULT.TEST_COUNTRY/f19bc17c-d41d-a4be-b561-f6bd275f4c90/db
2018-11-09 02:15:26,814 WARN  [lookup-cache-state-checker-1] 
common.KylinConfigBase:77 : KYLIN_HOME was not set
2018-11-09 02:15:26,814 INFO  [lookup-cache-state-checker-1] 
cache.RocksDBLookupTableCache:334 : check snapshot local cache state, local 
path:lookup_cache/rocksdb
2018-11-09 02:15:26,814 INFO  [lookup-cache-state-checker-1] 
cache.RocksDBLookupTableCache:361 : removed cache 
file:/var/jenkins/workspace/kylin-manual-ci/core-dictionary/lookup_cache/rocksdb/DEFAULT.TEST_COUNTRY/f19bc17c-d41d-a4be-b561-f6bd275f4c90,
 it is not referred by any cube
2018-11-09 02:15:28,474 ERROR [main] cache.RocksDBLookupBuilder:77 : error when 
put data to rocksDB
org.rocksdb.RocksDBException: While open a file for random read: 
lookup_cache/rocksdb/DEFAULT.TEST_COUNTRY/f19bc17c-d41d-a4be-b561-f6bd275f4c90/db/18.sst:
 No such file or directory
at org.rocksdb.RocksDB.put(Native Method)
at org.rocksdb.RocksDB.put(RocksDB.java:453)
at 
org.apache.kylin.dict.lookup.cache.RocksDBLookupBuilder.build(RocksDBLookupBuilder.java:74)
at 
org.apache.kylin.dict.lookup.cache.RocksDBLookupTableCacheTest.testRestoreCacheFromFiles(RocksDBLookupTableCacheTest.java:115)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:498)
at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at 
org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
at 
org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
at 
org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238)
at 
org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
at 
org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:379)
at 
org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:340)
at 
org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:125)
at 
org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:413)
{code}
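
In the log above, the state checker removes the cache folder about two seconds after the builder starts writing to it, because the folder is not yet referred to by any cube. One possible guard, sketched below under that reading, is to record which paths are under building and have the checker skip them. The class BuildAwareCacheChecker and its methods are hypothetical and are not taken from Kylin's RocksDBLookupTableCache.

```java
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch; not Kylin's actual implementation.
class BuildAwareCacheChecker {
    // Paths that a builder is currently writing to.
    private final Set<String> inProgress = ConcurrentHashMap.newKeySet();

    // The builder calls this before it starts writing a cache folder.
    void beginBuild(String cachePath) {
        inProgress.add(cachePath);
    }

    // The builder calls this when the build has finished or failed.
    void endBuild(String cachePath) {
        inProgress.remove(cachePath);
    }

    // The checker may remove a path only if no cube refers to it
    // and no build is in progress for it.
    boolean mayRemove(String cachePath, Set<String> referencedPaths) {
        return !referencedPaths.contains(cachePath) && !inProgress.contains(cachePath);
    }
}
```

This only narrows the window: a build could still start between the check and the delete, so a real fix would need the check and the removal to happen under the same lock.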
 

 





[jira] [Created] (KYLIN-3664) Hive metrics reporter HiveProducer doesn't support multiple instances on one host

2018-11-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3664:
---

 Summary: Hive metrics reporter HiveProducer doesn't support 
multiple instances on one host
 Key: KYLIN-3664
 URL: https://issues.apache.org/jira/browse/KYLIN-3664
 Project: Kylin
  Issue Type: Bug
  Components: Metrics
Reporter: Shaofeng SHI


HiveProducer.java uses a file named after the host name. If there are multiple 
Kylin instances on the same machine, they conflict when appending metrics to 
the HDFS file:

 
{code:java}
String hostName;
try {
    hostName = InetAddress.getLocalHost().getHostName();
} catch (UnknownHostException e) {
    hostName = "UNKNOWN";
}
CONTENT_FILE_NAME = hostName + "-part-";
{code}
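
One possible way to avoid the conflict is to add a per-instance identifier to the file name prefix. The sketch below assumes a random UUID generated at startup; the class MetricsFileName, its methods, and the choice of UUID are assumptions for illustration, not the fix adopted in Kylin.

```java
import java.net.InetAddress;
import java.net.UnknownHostException;
import java.util.UUID;

// Hypothetical helper; not the actual HiveProducer code.
class MetricsFileName {
    // Generates an identifier that differs between instances on one host.
    static String newInstanceId() {
        return UUID.randomUUID().toString();
    }

    // Builds the file name prefix from the host name plus the instance id.
    static String contentFilePrefix(String instanceId) {
        String hostName;
        try {
            hostName = InetAddress.getLocalHost().getHostName();
        } catch (UnknownHostException e) {
            hostName = "UNKNOWN";
        }
        return hostName + "-" + instanceId + "-part-";
    }
}
```

A configured port or process id would serve equally well as the identifier, as long as it is unique per instance on the host.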





[jira] [Created] (KYLIN-3652) Segment auto-merge should not be triggered in some cases

2018-10-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3652:
---

 Summary: Segment auto-merge should not be triggered in some cases
 Key: KYLIN-3652
 URL: https://issues.apache.org/jira/browse/KYLIN-3652
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata
Reporter: Shaofeng SHI


Segment auto-merge is triggered whenever a segment changes (added, removed, 
etc.). This causes problems. For example, when the merge job has a problem and 
the user wants to do something with an existing segment, discarding the 
auto-generated merge job immediately generates a new one, causing an endless 
loop.





[jira] [Created] (KYLIN-3651) JDBCResourceStore doesn't list all resources

2018-10-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3651:
---

 Summary: JDBCResourceStore doesn't list all resources
 Key: KYLIN-3651
 URL: https://issues.apache.org/jira/browse/KYLIN-3651
 Project: Kylin
  Issue Type: Bug
  Components: Metadata
Affects Versions: v2.5.0
Reporter: Shaofeng SHI


The JDBC resource store persists metadata into two tables:

kylin_default_instance

kylin_default_instance_log

 

A metadata entry such as "/execute_output" is saved to the _log table, but 
the "list" operation on "/" only lists the subfolders in the first table, not 
those in the second. This causes a partial metadata copy when using 
#ResourceTool.copy.
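
The expected behaviour can be illustrated with an in-memory sketch that lists the direct children of a folder across both key sets. The class TwoTableLister and the use of plain string collections in place of the two JDBC tables are assumptions for illustration; this is not JDBCResourceStore's actual code.

```java
import java.util.Collection;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical sketch; each collection stands in for the resource paths
// stored in one of the two tables.
class TwoTableLister {
    // Returns the direct children of the folder found in either table.
    static SortedSet<String> listChildren(String folder,
                                          Collection<String> mainKeys,
                                          Collection<String> logKeys) {
        String prefix = folder.endsWith("/") ? folder : folder + "/";
        SortedSet<String> children = new TreeSet<>();
        collect(prefix, mainKeys, children);
        collect(prefix, logKeys, children);
        return children;
    }

    private static void collect(String prefix, Collection<String> keys,
                                SortedSet<String> out) {
        for (String key : keys) {
            if (key.startsWith(prefix) && key.length() > prefix.length()) {
                // Cut the path at the next '/' to get the direct child.
                int end = key.indexOf('/', prefix.length());
                out.add(end < 0 ? key : key.substring(0, end));
            }
        }
    }
}
```

With this behaviour, listing "/" would include "/execute_output" even though its entries live only in the second table.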

 





[jira] [Created] (KYLIN-3648) Kylin may not add "kylin.storage.hbase.cluster-hdfs-config-file" to configuration properly

2018-10-24 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3648:
---

 Summary: Kylin may not add 
"kylin.storage.hbase.cluster-hdfs-config-file" to configuration properly
 Key: KYLIN-3648
 URL: https://issues.apache.org/jira/browse/KYLIN-3648
 Project: Kylin
  Issue Type: Bug
  Components: Storage - HBase
Reporter: Shaofeng SHI


According to this blog post, Configuration.addResource should be given a Path 
as input rather than a String:
conf.addResource(new Path("/home/hadoop/conf/core-site.xml"));
[https://dzone.com/articles/debugging-%E2%80%9Cwrong-fs-expected]

 

But Kylin's HBaseConnection adds the HBase cluster's HDFS configuration file 
as a String:
{code:java}
String hdfsConfigFile = KylinConfig.getInstanceFromEnv().getHBaseClusterHDFSConfigFile();
if (hdfsConfigFile == null || hdfsConfigFile.isEmpty()) {
    return;
}
Configuration hdfsConf = new Configuration(false);
hdfsConf.addResource(hdfsConfigFile);
{code}
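A String argument is looked up on the classpath, while a Path is read from the 
local file system, so the file may not be loaded as intended.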

 





[jira] [Created] (KYLIN-3646) Add instruction about rowkey sequence in "Advanced setting" page

2018-10-23 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3646:
---

 Summary: Add instruction about rowkey sequence in "Advanced 
setting" page
 Key: KYLIN-3646
 URL: https://issues.apache.org/jira/browse/KYLIN-3646
 Project: Kylin
  Issue Type: Improvement
  Components: Web 
Reporter: Shaofeng SHI


The position of a dimension in the rowkey is important for performance (both 
build and query), but this is not obvious to a normal user. We should add 
instructions to the web GUI so that users know which rules to follow when 
adjusting it.





[jira] [Created] (KYLIN-3641) Release 2.5.1

2018-10-21 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3641:
---

 Summary: Release 2.5.1
 Key: KYLIN-3641
 URL: https://issues.apache.org/jira/browse/KYLIN-3641
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI


Kylin 2.5.0 was released a month ago. We have collected a lot of good feedback 
and several bug reports. Now it is time to release 2.5.1.





[jira] [Created] (KYLIN-3637) Union All not working with count()

2018-10-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3637:
---

 Summary: Union All not working with count()
 Key: KYLIN-3637
 URL: https://issues.apache.org/jira/browse/KYLIN-3637
 Project: Kylin
  Issue Type: Bug
Reporter: Shaofeng SHI


From the user@kylin mailing list:

I ran the SQL "SELECT count(TRANS_ID) as TRANS_ID FROM KYLIN_SALES where
TRANS_ID <> 1 union all select count(TRANS_ID) as TRANS_ID FROM KYLIN_SALES"
on the sample cube.

I got this error:
Error while compiling generated Java code: public static class Record2_1
implements java.io.Serializable { public Long TRANS_ID; public long
_KY_COUNT__; public Record2_1() {} public boolean equals(Object o) { if
(this == o) { return true; } if (!(o instanceof Record2_1)) { return false;
} return java.util.Objects.equals(this.TRANS_ID, ((Record2_1) o).TRANS_ID)
&& this._KY_COUNT__ == ((Record2_1) o)._KY_COUNT__; } public int hashCode()
{ int h = 0; h = org.apache.calcite.runtime.Utilities.hash(h,
this.TRANS_ID); h = org.apache.calcite.runtime.Utilities.hash(h,
this._KY_COUNT__); return h; } public int compareTo(Record2_1 that) { int c;
c = org.apache.calcite.runtime.Utilities.compareNullsLast(this.TRANS_ID,
that.TRANS_ID); if (c != 0) { return c; } c =
org.apache.calcite.runtime.Utilities.compare(this._KY_COUNT__,
that._KY_COUNT__); if (c != 0) { return c; } return 0; } public String
toString() { return "{TRANS_ID=" + this.TRANS_ID + ", _KY_COUNT__=" +
this._KY_COUNT__ + "}"; } } public static class Record1_0 implements
java.io.Serializable { public long f0; public Record1_0() {} public boolean
equals(Object o) { if (this == o) { return true; } if (!(o instanceof
Record1_0)) { return false; } return this.f0 == ((Record1_0) o).f0; } public
int hashCode() { int h = 0; h = org.apache.calcite.runtime.Utilities.hash(h,
this.f0); return h; } public int compareTo(Record1_0 that) { final int c; c
= org.apache.calcite.runtime.Utilities.compare(this.f0, that.f0); if (c !=
0) { return c; } return 0; } public String toString() { return "{f0=" +
this.f0 + "}"; } } org.apache.calcite.DataContext root; public
org.apache.calcite.linq4j.Enumerable bind(final
org.apache.calcite.DataContext root0) { root = root0; final
org.apache.calcite.linq4j.Enumerable _inputEnumerable =
((org.apache.kylin.query.schema.OLAPTable)
root.getRootSchema().getSubSchema("DEFAULT").getTable("KYLIN_SALES")).executeOLAPQuery(root,
1); final org.apache.calcite.linq4j.AbstractEnumerable child = new
org.apache.calcite.linq4j.AbstractEnumerable(){ public
org.apache.calcite.linq4j.Enumerator enumerator() { return new
org.apache.calcite.linq4j.Enumerator(){ public final
org.apache.calcite.linq4j.Enumerator inputEnumerator =
_inputEnumerable.enumerator(); public void reset() {
inputEnumerator.reset(); } public boolean moveNext() { while
(inputEnumerator.moveNext()) { final Long inp0_ = (Long) ((Object[])
inputEnumerator.current())[0]; if (inp0_ != null && inp0_.longValue() != 1L)
{ return true; } } return false; } public void close() {
inputEnumerator.close(); } public Object current() { final Object[] current
= (Object[]) inputEnumerator.current(); return new Object[] { current[0],
current[11]}; } }; } }; final org.apache.calcite.linq4j.Enumerable
_inputEnumerable0 = ((org.apache.kylin.query.schema.OLAPTable)
root.getRootSchema().getSubSchema("DEFAULT").getTable("KYLIN_SALES")).executeOLAPQuery(root,
2); final org.apache.calcite.linq4j.AbstractEnumerable child1 = new
org.apache.calcite.linq4j.AbstractEnumerable(){ public
org.apache.calcite.linq4j.Enumerator enumerator() { return new
org.apache.calcite.linq4j.Enumerator(){ public final
org.apache.calcite.linq4j.Enumerator inputEnumerator =
_inputEnumerable0.enumerator(); public void reset() {
inputEnumerator.reset(); } public boolean moveNext() { return
inputEnumerator.moveNext(); } public void close() { inputEnumerator.close();
} public Object current() { final Object[] current = (Object[])
inputEnumerator.current(); return new Record2_1( (Long) current[0],
org.apache.calcite.runtime.SqlFunctions.toLong(current[11])); } }; } };
return
org.apache.calcite.linq4j.Linq4j.singletonEnumerable(child.aggregate(new
org.apache.calcite.linq4j.function.Function0() { public Object apply() {
long a0s0; a0s0 = 0; Record1_0 record0; record0 = new Record1_0();
record0.f0 = a0s0; return record0; } } .apply(), new
org.apache.calcite.linq4j.function.Function2() { public Record1_0
apply(Record1_0 acc, Object[] in) { acc.f0 = acc.f0 +
org.apache.calcite.runtime.SqlFunctions.toLong(in[1]); return acc; } public
Record1_0 apply(Object acc, Object in) { return apply( (Record1_0) acc,
(Object[]) in); } } , new org.apache.calcite.linq4j.function.Function1() {
public long apply(Record1_0 acc) { return acc.f0; } public Object
apply(Object acc) { return apply( (Record1_0) acc); } }
)).concat(org.apache.calcite.linq4j.Linq4j.singletonEnumerable(child1.aggregate(new
org.apache.calcite.linq4j.function.Func

[jira] [Created] (KYLIN-3635) Percentile calculation on Spark engine is not thread safe

2018-10-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3635:
---

 Summary: Percentile calculation on Spark engine is not thread safe
 Key: KYLIN-3635
 URL: https://issues.apache.org/jira/browse/KYLIN-3635
 Project: Kylin
  Issue Type: Bug
Affects Versions: v2.4.1, v2.4.0, v2.3.1, v2.3.0, v2.3.2, v2.5.0
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3626) Allow customization for Parquet path

2018-10-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3626:
---

 Summary: Allow customization for Parquet path
 Key: KYLIN-3626
 URL: https://issues.apache.org/jira/browse/KYLIN-3626
 Project: Kylin
  Issue Type: Sub-task
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3624) Convert cuboid to Parquet in Spark

2018-10-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3624:
---

 Summary: Convert cuboid to Parquet in Spark
 Key: KYLIN-3624
 URL: https://issues.apache.org/jira/browse/KYLIN-3624
 Project: Kylin
  Issue Type: Sub-task
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3625) Query engine for Parquet

2018-10-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3625:
---

 Summary: Query engine for Parquet
 Key: KYLIN-3625
 URL: https://issues.apache.org/jira/browse/KYLIN-3625
 Project: Kylin
  Issue Type: Sub-task
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3623) Convert cuboid to Parquet in MR

2018-10-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3623:
---

 Summary: Convert cuboid to Parquet in MR
 Key: KYLIN-3623
 URL: https://issues.apache.org/jira/browse/KYLIN-3623
 Project: Kylin
  Issue Type: Sub-task
  Components: Job Engine
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3622) Cube layout in Parquet

2018-10-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3622:
---

 Summary: Cube layout in Parquet
 Key: KYLIN-3622
 URL: https://issues.apache.org/jira/browse/KYLIN-3622
 Project: Kylin
  Issue Type: Sub-task
Reporter: Shaofeng SHI


The file and schema layout for Cubes in Parquet.





[jira] [Created] (KYLIN-3621) Parquet as Cube storage

2018-10-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3621:
---

 Summary: Parquet as Cube storage
 Key: KYLIN-3621
 URL: https://issues.apache.org/jira/browse/KYLIN-3621
 Project: Kylin
  Issue Type: New Feature
  Components: Query Engine, Storage - HBase
Reporter: Shaofeng SHI


As discussed in dev@kylin, we plan to make a new storage engine based on Apache 
Parquet and Apache Spark. The discussion is:

http://apache-kylin.74782.x6.nabble.com/DISCUSS-Columnar-storage-engine-for-Apache-Kylin-td11821.html#a11910





[jira] [Created] (KYLIN-3611) Upgrade Tomcat to 7.0.91, 8.5.34 or later

2018-10-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3611:
---

 Summary: Upgrade Tomcat to 7.0.91, 8.5.34 or later
 Key: KYLIN-3611
 URL: https://issues.apache.org/jira/browse/KYLIN-3611
 Project: Kylin
  Issue Type: Improvement
Reporter: Shaofeng SHI


h2. [SECURITY] CVE-2018-11784 Apache Tomcat - Open Redirect
 
 
 
CVE-2018-11784 Apache Tomcat - Open Redirect

Severity: Moderate

Vendor: The Apache Software Foundation

Versions Affected:
Apache Tomcat 9.0.0.M1 to 9.0.11
Apache Tomcat 8.5.0 to 8.5.33
Apache Tomcat 7.0.23 to 7.0.90
The unsupported 8.0.x release line has not been analysed but is likely
to be affected.

Description:
When the default servlet returned a redirect to a directory (e.g.
redirecting to '/foo/' when the user requested '/foo') a specially
crafted URL could be used to cause the redirect to be generated to any
URI of the attackers choice.

Mitigation:
Users of the affected versions should apply one of the following
mitigations:
- Upgrade to Apache Tomcat 9.0.12 or later.
- Upgrade to Apache Tomcat 8.5.34 or later.
- Upgrade to Apache Tomcat 7.0.91 or later.
- Use mapperDirectoryRedirectEnabled="true" and
  mapperContextRootRedirectEnabled="true" on the Context to ensure that
  redirects are issued by the Mapper rather than the default Servlet.
  See the Context configuration documentation for further important
  details.

Credit:
This vulnerability was found by Sergey Bobrov and reported responsibly
to the Apache Tomcat Security Team.

History:
2018-10-03 Original advisory

References:
[1] [http://tomcat.apache.org/security-9.html]
[2] [http://tomcat.apache.org/security-8.html]
[3] [http://tomcat.apache.org/security-7.html]
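
The affected ranges listed in the advisory can be turned into a quick version check. The following is only a sketch: the class and method names are hypothetical (they belong to neither Kylin nor Tomcat), it accepts plain `major.minor.patch` strings only (milestone versions such as 9.0.0.M1 are rejected), and it does not cover the 8.0.x line, which the advisory says was not analysed.

```java
/**
 * Illustrative helper, not part of Kylin or Tomcat: checks a Tomcat
 * version string against the affected ranges quoted in the advisory.
 */
final class TomcatOpenRedirectCheck {

    /** Parses "8.5.33" into {8, 5, 33}; any other shape is rejected. */
    static int[] parse(String version) {
        String[] parts = version.trim().split("\\.");
        if (parts.length != 3) {
            throw new IllegalArgumentException("Expected major.minor.patch: " + version);
        }
        int[] numbers = new int[3];
        for (int i = 0; i < 3; i++) {
            numbers[i] = Integer.parseInt(parts[i]);
        }
        return numbers;
    }

    /** Compares two parsed versions component by component. */
    static int compare(int[] a, int[] b) {
        for (int i = 0; i < 3; i++) {
            if (a[i] != b[i]) {
                return Integer.compare(a[i], b[i]);
            }
        }
        return 0;
    }

    /** True if low <= v <= high (both bounds inclusive). */
    static boolean inRange(int[] v, String low, String high) {
        return compare(v, parse(low)) >= 0 && compare(v, parse(high)) <= 0;
    }

    /** Ranges copied from the "Versions Affected" list; 8.0.x is not handled. */
    static boolean isAffected(String version) {
        int[] v = parse(version);
        return inRange(v, "9.0.0", "9.0.11")
            || inRange(v, "8.5.0", "8.5.33")
            || inRange(v, "7.0.23", "7.0.90");
    }

    public static void main(String[] args) {
        for (String v : new String[] {"7.0.90", "7.0.91", "8.5.33", "8.5.34"}) {
            System.out.println(v + " affected: " + isAffected(v));
        }
    }
}
```

The upper bounds are inclusive, so 7.0.91 and 8.5.34 (the targets named in the issue summary) fall outside the affected ranges.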





[jira] [Created] (KYLIN-3606) Unstable KylinUserManagerTest

2018-09-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3606:
---

 Summary: Unstable KylinUserManagerTest
 Key: KYLIN-3606
 URL: https://issues.apache.org/jira/browse/KYLIN-3606
 Project: Kylin
  Issue Type: Bug
Affects Versions: v2.6.0
Reporter: Shaofeng SHI


{code:java}
[INFO] Tests run: 3, Failures: 0, Errors: 0, 
Skipped: 0, Time elapsed: 4.187 s - in 
org.apache.kylin.rest.security.QueryWithTableACLTest
[INFO] Running 
org.apache.kylin.rest.security.KylinUserManagerTest
2018-09-30 04:01:45,114 INFO  [main] common.KylinConfig:113 : Destroy 
KylinConfig
2018-09-30 04:01:45,160 INFO  [main] common.KylinConfig:332 : Loading 
kylin-defaults.properties from 
/home/travis/build/apache/kylin/core-common/target/classes/kylin-defaults.properties
2018-09-30 04:01:45,169 INFO  [main] common.KylinConfig:287 : Use 
KYLIN_CONF=../examples/test_metadata/
2018-09-30 04:01:45,170 INFO  [main] common.KylinConfig:99 : Initialized a new 
KylinConfig from getInstanceFromEnv : 1932094362
2018-09-30 04:01:45,170 INFO  [main] common.KylinConfigBase:198 : Kylin Config 
was updated with kylin.metadata.url : ../examples/test_metadata/
2018-09-30 04:01:45,170 INFO  [main] common.KylinConfigBase:198 : Kylin Config 
was updated with kylin.env.hdfs-working-dir : file:///tmp/kylin
2018-09-30 04:01:45,193 INFO  [main] util.MultiNodeManagerTestBase:50 : Chosen 
port for CacheServiceTest is 45238
2018-09-30 04:01:45,194 INFO  [main] common.KylinConfigBase:198 : Kylin Config 
was updated with kylin.server.cluster-servers : localhost:45238
2018-09-30 04:01:45,194 INFO  [main] common.KylinConfigBase:198 : Kylin Config 
was updated with kylin.server.cluster-servers : localhost:45238
2018-09-30 04:01:45,194 INFO  [main] common.KylinConfigBase:198 : Kylin Config 
was updated with kylin.metadata.url : ../examples/test_metadata
2018-09-30 04:01:45,273 INFO  [Thread-2-SendThread(localhost:2181)] 
zookeeper.ClientCnxn:1029 : Opening socket connection to server 
localhost/127.0.0.1:2181. Will not attempt to authenticate using SASL (unknown 
error)
2018-09-30 04:01:45,274 INFO  [Thread-2-SendThread(localhost:2181)] 
zookeeper.ClientCnxn:1166 : Socket error occurred: localhost/127.0.0.1:2181: 
Connection refused
2018-09-30 04:01:45,275 INFO  [main] util.log:192 : Logging initialized @92503ms
2018-09-30 04:01:46,021 INFO  [main] server.Server:346 : jetty-9.3.22.v20171030
2018-09-30 04:01:46,225 INFO  [main] handler.ContextHandler:781 : Started 
o.e.j.s.ServletContextHandler@545311c{/,null,AVAILABLE}
2018-09-30 04:01:46,254 INFO  [main] server.AbstractConnector:278 : Started 
ServerConnector@4bdfeead{HTTP/1.1,[http/1.1]}{0.0.0.0:45238}
2018-09-30 04:01:46,254 INFO  [main] server.Server:414 : Started @93482ms
2018-09-30 04:01:46,255 INFO  [main] security.KylinUserManager:61 : 
Initializing KylinUserManager with config ../examples/test_metadata/
2018-09-30 04:01:46,256 INFO  [main] persistence.ResourceStore:88 : Using 
metadata url ../examples/test_metadata/ for resource store
2018-09-30 04:01:46,256 DEBUG [main] cachesync.CachedCrudAssist:118 : Reloading 
ManagedUser from 
/home/travis/build/apache/kylin/server/../examples/test_metadata/user
2018-09-30 04:01:46,257 DEBUG [main] cachesync.CachedCrudAssist:127 : Loaded 0 
ManagedUser(s) out of 0 resource
2018-09-30 04:01:46,257 INFO  [main] common.KylinConfig:432 : Creating new 
manager instance of class org.apache.kylin.metadata.cachesync.Broadcaster
2018-09-30 04:01:46,258 DEBUG [main] cachesync.Broadcaster:101 : 1 nodes in the 
cluster: [localhost:45238]
2018-09-30 04:01:46,259 INFO  [main] security.KylinUserManager:61 : 
Initializing KylinUserManager with config ../examples/test_metadata
2018-09-30 04:01:46,259 INFO  [main] persistence.ResourceStore:88 : Using 
metadata url ../examples/test_metadata for resource store
2018-09-30 04:01:46,259 DEBUG [main] cachesync.CachedCrudAssist:118 : Reloading 
ManagedUser from 
/home/travis/build/apache/kylin/server/../examples/test_metadata/user
2018-09-30 04:01:46,260 DEBUG [main] cachesync.CachedCrudAssist:127 : Loaded 0 
ManagedUser(s) out of 0 resource
2018-09-30 04:01:46,260 INFO  [main] common.KylinConfig:432 : Creating new 
manager instance of class org.apache.kylin.metadata.cachesync.Broadcaster
2018-09-30 04:01:46,260 DEBUG [main] cachesync.Broadcaster:101 : 1 nodes in the 
cluster: [localhost:45238]
2018-09-30 04:01:46,261 DEBUG [main] cachesync.CachedCrudAssist:190 : Saving 
ManagedUser at /user/U1
2018-09-30 04:01:46,263 DEBUG [pool-173-thread-1] cachesync.Broadcaster:113 : 
Servers in the cluster: [localhost:45238]
2018-09-30 04:01:46,375 INFO  [Thread-2-SendThread(localhost:2181)] 
zookeeper.ClientCnxn:1029 : Opening socket connection to server 
localhost/127.0.0.1:2181. Will not attempt to authenticate using SASL (unknown 
error)
2018-09-30 04:01:46,376 INFO  [Thread-2-SendThread(localhost:2181)] 
zookeeper.Clie

[jira] [Created] (KYLIN-3605) Fix security issues reported by snyk.io

2018-09-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3605:
---

 Summary: Fix security issues reported by snyk.io
 Key: KYLIN-3605
 URL: https://issues.apache.org/jira/browse/KYLIN-3605
 Project: Kylin
  Issue Type: Improvement
Reporter: Shaofeng SHI


[org.apache.commons:commons-compress|https://github.com/apache/commons-compress]
 defines an API for working with compression and archive formats.

Affected versions of this package are vulnerable to Directory Traversal.





[jira] [Created] (KYLIN-3597) Fix sonar reported static code issues

2018-09-25 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3597:
---

 Summary: Fix sonar reported static code issues
 Key: KYLIN-3597
 URL: https://issues.apache.org/jira/browse/KYLIN-3597
 Project: Kylin
  Issue Type: Improvement
  Components: Others
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3577) Mvn dependency warnings

2018-09-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3577:
---

 Summary: Mvn dependency warnings
 Key: KYLIN-3577
 URL: https://issues.apache.org/jira/browse/KYLIN-3577
 Project: Kylin
  Issue Type: Improvement
Affects Versions: v2.5.0
Reporter: Shaofeng SHI


[INFO] *--- maven-dependency-plugin:2.10:analyze-only (analyze) @ kylin-core-common ---*
[WARNING] Used undeclared dependencies found:
[WARNING]    com.fasterxml.jackson.core:jackson-annotations:jar:2.9.5:compile
[WARNING]    com.google.guava:guava:jar:14.0:provided
[WARNING]    commons-lang:commons-lang:jar:2.6:provided
[WARNING]    com.fasterxml.jackson.core:jackson-core:jar:2.9.5:compile
[WARNING]    org.apache.httpcomponents:httpcore:jar:4.2.4:compile
[WARNING]    io.dropwizard.metrics:metrics-core:jar:3.1.2:compile
[WARNING]    commons-io:commons-io:jar:2.4:provided
[WARNING]    commons-logging:commons-logging:jar:1.1.1:compile
[WARNING]    javax.xml.bind:jaxb-api:jar:2.2.2:provided
[WARNING]    commons-codec:commons-codec:jar:1.6:compile
[WARNING]    commons-cli:commons-cli:jar:1.2:provided
[WARNING] Unused declared dependencies found:
[WARNING]    io.dropwizard.metrics:metrics-jvm:jar:3.1.2:compile
[WARNING]    log4j:log4j:jar:1.2.17:provided
[WARNING]    org.slf4j:slf4j-log4j12:jar:1.7.21:provided
[WARNING]    mysql:mysql-connector-java:jar:5.1.8:provided
[WARNING]    org.slf4j:jcl-over-slf4j:jar:1.7.21:compile

[INFO] *--- maven-dependency-plugin:2.10:analyze-only (analyze) @ kylin-core-metadata ---*
[WARNING] Used undeclared dependencies found:
[WARNING]    org.apache.commons:commons-lang3:jar:3.4:compile
[WARNING]    com.fasterxml.jackson.core:jackson-annotations:jar:2.9.5:compile
[WARNING]    org.mockito:mockito-core:jar:1.10.19:test
[WARNING]    com.fasterxml.jackson.core:jackson-core:jar:2.9.5:compile
[WARNING]    org.objenesis:objenesis:jar:2.2:provided
[WARNING]    org.apache.calcite:calcite-core:jar:1.13.0-kylin-r4:compile
[WARNING]    org.apache.calcite.avatica:avatica:jar:1.10.0:compile
[WARNING] Unused declared dependencies found:
[WARNING]    org.slf4j:slf4j-api:jar:1.7.21:compile
[WARNING]    log4j:log4j:jar:1.2.17:provided
[WARNING]    org.slf4j:slf4j-log4j12:jar:1.7.21:provided
[WARNING]    org.apache.hadoop:hadoop-common:jar:2.7.1:provided
[WARNING]    org.apache.kylin:atopcalcite:jar:2.6.0-SNAPSHOT:compile
[WARNING]    commons-cli:commons-cli:jar:1.2:provided
[WARNING]    org.slf4j:jcl-over-slf4j:jar:1.7.21:compile

[INFO] *--- maven-dependency-plugin:2.10:analyze-only (analyze) @ kylin-core-dictionary ---*
[WARNING] Used undeclared dependencies found:
[WARNING]    com.fasterxml.jackson.core:jackson-annotations:jar:2.9.5:compile
[WARNING]    commons-lang:commons-lang:jar:2.6:provided
[WARNING]    com.google.guava:guava:jar:14.0:provided
[WARNING]    org.apache.kylin:kylin-core-common:jar:2.6.0-SNAPSHOT:compile
[WARNING]    commons-io:commons-io:jar:2.4:provided
[WARNING]    org.apache.calcite.avatica:avatica:jar:1.10.0:compile
[WARNING]    com.google.code.findbugs:jsr305:jar:3.0.1:provided
[WARNING] Unused declared dependencies found:
[WARNING]    org.slf4j:slf4j-api:jar:1.7.21:compile
[WARNING]    log4j:log4j:jar:1.2.17:provided
[WARNING]    org.slf4j:slf4j-log4j12:jar:1.7.21:provided
[WARNING]    org.slf4j:jcl-over-slf4j:jar:1.7.21:compile

[INFO] *--- maven-dependency-plugin:2.10:analyze-only (analyze) @ kylin-core-cube ---*
[WARNING] Used undeclared dependencies found:
[WARNING]    org.apache.commons:commons-lang3:jar:3.4:compile
[WARNING]    com.fasterxml.jackson.core:jackson-annotations:jar:2.9.5:compile
[WARNING]    commons-codec:commons-codec:jar:1.4:compile
[WARNING]    com.fasterxml.jackson.core:jackson-core:jar:2.9.5:compile
[WARNING]    org.apache.kylin:kylin-core-common:jar:2.6.0-SNAPSHOT:compile
[WARNING]    org.apache.commons:commons-math3:jar:3.1.1:provided
[WARNING]    org.apache.calcite.avatica:avatica:jar:1.10.0:compile
[WARNING]    com.fasterxml.jackson.core:jackson-databind:jar:2.9.5:compile
[WARNING] Unused declared dependencies found:
[WARNING]    org.slf4j:slf4j-api:jar:1.7.21:compile
[WARNING]    log4j:log4j:jar:1.2.17:provided
[WARNING]    org.slf4j:slf4j-log4j12:jar:1.7.21:provided
[WARNING]    com.esotericsoftware:kryo-shaded:jar:4.0.0:provided
[WARNING]    org.apache.hadoop:hadoop-common:jar:2.7.1:provided
[WARNING]    org.slf4j:jcl-over-slf4j:jar:1.7.21:compile

[INFO] *--- maven-dependency-plugin:2.10:analyze-only (analyze) @ kylin-core-metrics ---*
[WARNING] Unused declared dependencies found:
[WARNING]    log4j:log4j:jar:1.2.17:provided
[WARNING]    org.slf4j:slf4j-log4j12:jar:1.7.21:provided
[WARNING]    org.apache.hadoop:hadoop-common:jar:2.7.1:provided
[WARNING]    org.slf4j:jcl-over-slf4j:jar:1.7.21:compile

[INFO] *--- maven-dependency-plugin:2.10:analyze-only (analyze) @ kylin-core-job ---*
[WARNING] Used undeclared dependencies found:
[WARNING]    com.fasterxml.jackson.core:jackson-an

[jira] [Created] (KYLIN-3576) IllegalArgumentException: No enum constant org.apache.kylin.rest.service.JobService.JobSearchMode.

2018-09-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3576:
---

 Summary: IllegalArgumentException: No enum constant 
org.apache.kylin.rest.service.JobService.JobSearchMode.
 Key: KYLIN-3576
 URL: https://issues.apache.org/jira/browse/KYLIN-3576
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Reporter: Shaofeng SHI


{code:java}
2018-09-22 19:58:20,258 INFO  [main] execution.ExecutableManager:69 : Using metadata url: ../examples/test_metadata/
2018-09-22 19:58:20,265 ERROR [main] controller.JobController:81 : Invalid value for JobSearchMode: '', skip it.
java.lang.IllegalArgumentException: No enum constant org.apache.kylin.rest.service.JobService.JobSearchMode.
    at java.lang.Enum.valueOf(Enum.java:238)
    at org.apache.kylin.rest.service.JobService$JobSearchMode.valueOf(JobService.java:1236)
    at org.apache.kylin.rest.controller.JobController.list(JobController.java:79)
    at org.apache.kylin.rest.controller.JobControllerTest.testBasics(JobControllerTest.java:103)
    at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
    at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
    at java.lang.reflect.Method.invoke(Method.java:498)
    at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
    at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
    at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
    at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
    at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
    at org.springframework.test.context.junit4.statements.RunBeforeTestMethodCallbacks.evaluate(RunBeforeTestMethodCallbacks.java:75)
    at org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
    at org.springframework.test.context.junit4.statements.RunAfterTestMethodCallbacks.evaluate(RunAfterTestMethodCallbacks.java:86)
    at org.springframework.test.context.junit4.statements.SpringRepeat.evaluate(SpringRepeat.java:84)
{code}
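
The trace shows `Enum.valueOf` being called with an empty string, which always throws `IllegalArgumentException`. One possible way to treat a blank or unknown value as "not specified" without raising an exception is sketched below. The enum here is a hypothetical stand-in for `JobService.JobSearchMode` (its constant names are assumptions), and the helper class is not Kylin code.

```java
import java.util.Optional;

/** Illustrative only: lenient parsing of an enum request parameter. */
final class JobSearchModeParsing {

    // Hypothetical stand-in for JobService.JobSearchMode; constant names are assumed.
    enum JobSearchMode { CUBING_ONLY, CHECKPOINT_ONLY, ALL }

    /** Returns empty for null, blank, or unknown names instead of throwing. */
    static Optional<JobSearchMode> parse(String raw) {
        if (raw == null || raw.trim().isEmpty()) {
            return Optional.empty();
        }
        try {
            return Optional.of(JobSearchMode.valueOf(raw.trim()));
        } catch (IllegalArgumentException e) {
            // Unknown constant name: report "no value" to the caller.
            return Optional.empty();
        }
    }

    /** Convenience wrapper that substitutes a caller-chosen default. */
    static JobSearchMode parseOrDefault(String raw, JobSearchMode fallback) {
        return parse(raw).orElse(fallback);
    }

    public static void main(String[] args) {
        System.out.println(parseOrDefault("", JobSearchMode.ALL));
        System.out.println(parseOrDefault("CUBING_ONLY", JobSearchMode.ALL));
    }
}
```

Checking for a blank value before calling `valueOf` keeps the expected "parameter omitted" case out of the error log, while genuinely unknown names can still be logged by the caller if desired.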





[jira] [Created] (KYLIN-3571) Not build Spark in Kylin's binary package

2018-09-19 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3571:
---

 Summary: Not build Spark in Kylin's binary package
 Key: KYLIN-3571
 URL: https://issues.apache.org/jira/browse/KYLIN-3571
 Project: Kylin
  Issue Type: Improvement
  Components: Environment 
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3563) Web UI style upgrade

2018-09-17 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3563:
---

 Summary: Web UI style upgrade
 Key: KYLIN-3563
 URL: https://issues.apache.org/jira/browse/KYLIN-3563
 Project: Kylin
  Issue Type: Improvement
  Components: Web 
Reporter: Shaofeng SHI


Kylin's web GUI style has barely changed since the first day. We can 
introduce some changes to freshen it up.





[jira] [Created] (KYLIN-3554) Spark job failed but Yarn shows SUCCEED, causing Kylin move to next step

2018-09-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3554:
---

 Summary: Spark job failed but Yarn shows SUCCEED, causing Kylin 
move to next step
 Key: KYLIN-3554
 URL: https://issues.apache.org/jira/browse/KYLIN-3554
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3551) Spark job failed with "FileNotFoundException"

2018-09-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3551:
---

 Summary: Spark job failed with "FileNotFoundException" 
 Key: KYLIN-3551
 URL: https://issues.apache.org/jira/browse/KYLIN-3551
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Shaofeng SHI


java.io.FileNotFoundException: File does not exist: hdfs://sandbox.hortonworks.com:8020/kylin/kylin_default_instance/kylin-a3e39298-8dc3-21f2-cf16-0aa5e451c777/kylin_sales_cube_clone_clone/counter
    at org.apache.hadoop.hdfs.DistributedFileSystem$22.doCall(DistributedFileSystem.java:1319)
    at org.apache.hadoop.hdfs.DistributedFileSystem$22.doCall(DistributedFileSystem.java:1311)
    at org.apache.hadoop.fs.FileSystemLinkResolver.resolve(FileSystemLinkResolver.java:81)
    at org.apache.hadoop.hdfs.DistributedFileSystem.getFileStatus(DistributedFileSystem.java:1311)
    at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1752)
    at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1776)
    at org.apache.kylin.common.util.HadoopUtil.readFromSequenceFile(HadoopUtil.java:218)
    at org.apache.kylin.common.util.HadoopUtil.readFromSequenceFile(HadoopUtil.java:233)
    at org.apache.kylin.engine.spark.SparkExecutable.doWork(SparkExecutable.java:319)
    at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:163)
    at org.apache.kylin.job.execution.DefaultChainedExecutable.doWork(DefaultChainedExecutable.java:69)
    at org.apache.kylin.job.execution.AbstractExecutable.execute(AbstractExecutable.java:163)
    at org.apache.kylin.job.impl.threadpool.DefaultScheduler$JobRunner.run(DefaultScheduler.java:113)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
    at java.lang.Thread.run(Thread.java:748)





[jira] [Created] (KYLIN-3550) "kylin.source.hive.flat-table-field-delimiter" has extra "\" when create intermediate flat table

2018-09-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3550:
---

 Summary: "kylin.source.hive.flat-table-field-delimiter" has extra 
"\" when create intermediate flat table
 Key: KYLIN-3550
 URL: https://issues.apache.org/jira/browse/KYLIN-3550
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v2.4.0, v2.4.1
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3527) Hybrid couldn't save when there is only 1 cube

2018-09-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3527:
---

 Summary: Hybrid couldn't save when there is only 1 cube
 Key: KYLIN-3527
 URL: https://issues.apache.org/jira/browse/KYLIN-3527
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.5.0
Reporter: Shaofeng SHI
 Attachments: hybrid-not-saved.png

I created a hybrid with two cubes. Later I wanted to remove one cube from it, 
but I found that when only 1 cube is left, the "Submit" button is disabled.

The hybrid should allow 0, 1 or more cubes.

Besides, I found that the move-in and move-out icons were not displayed properly.





[jira] [Created] (KYLIN-3524) Clone cube reports "Overwriting conflict /project/learn_kylin.json" error

2018-09-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3524:
---

 Summary: Clone cube reports "Overwriting conflict 
/project/learn_kylin.json" error
 Key: KYLIN-3524
 URL: https://issues.apache.org/jira/browse/KYLIN-3524
 Project: Kylin
  Issue Type: Bug
  Components: Metadata
Affects Versions: v2.5.0
Reporter: Shaofeng SHI


{code:java}
2018-09-02 20:25:51,458 INFO [http-bio-7080-exec-4] service.CubeService:218 : New cube kylin_sales_cube_shao2 has 161 cuboids
2018-09-02 20:25:51,458 INFO [http-bio-7080-exec-4] cube.CubeManager:225 : Creating cube 'learn_kylin-->kylin_sales_cube_shao2' from desc 'kylin_sales_cube_shao2'
2018-09-02 20:25:51,458 INFO [http-bio-7080-exec-4] cube.CubeManager:316 : Updating cube instance 'kylin_sales_cube_shao2'
2018-09-02 20:25:51,459 DEBUG [http-bio-7080-exec-4] cachesync.CachedCrudAssist:190 : Saving CubeInstance at /cube/kylin_sales_cube_shao2.json
2018-09-02 20:25:51,460 DEBUG [pool-16-thread-1] cachesync.Broadcaster:113 : Servers in the cluster: [localhost:7070]
2018-09-02 20:25:51,460 DEBUG [pool-16-thread-1] cachesync.Broadcaster:123 : Announcing new broadcast to all: BroadcastEvent{entity=cube, event=create, cacheKey=kylin_sales_cube_shao2}
2018-09-02 20:25:51,461 DEBUG [http-bio-7080-exec-4] cachesync.CachedCrudAssist:190 : Saving ProjectInstance at /project/learn_kylin.json
2018-09-02 20:25:51,462 ERROR [http-bio-7080-exec-4] controller.BasicController:62 :
org.apache.kylin.common.persistence.WriteConflictException: Overwriting conflict /project/learn_kylin.json, expect old TS 1535685985266, but it is 1535695655000
    at org.apache.kylin.storage.hbase.HBaseResourceStore.checkAndPutResourceImpl(HBaseResourceStore.java:325)
    at org.apache.kylin.common.persistence.ResourceStore.checkAndPutResourceCheckpoint(ResourceStore.java:318)
    at org.apache.kylin.common.persistence.ResourceStore.putResource(ResourceStore.java:303)
    at org.apache.kylin.common.persistence.ResourceStore.putResource(ResourceStore.java:282)
    at org.apache.kylin.metadata.cachesync.CachedCrudAssist.save(CachedCrudAssist.java:192)
    at org.apache.kylin.metadata.project.ProjectManager.save(ProjectManager.java:371)
    at org.apache.kylin.metadata.project.ProjectManager.addRealizationToProject(ProjectManager.java:283)
    at org.apache.kylin.metadata.project.ProjectManager.moveRealizationToProject(ProjectManager.java:266)
    at org.apache.kylin.cube.CubeManager.createCube(CubeManager.java:232)
    at org.apache.kylin.rest.service.CubeService.createCubeAndDesc(CubeService.java:220)
    at org.apache.kylin.rest.service.CubeService$$FastClassBySpringCGLIB$$17a07c0e.invoke(<generated>)
    at org.springframework.cglib.proxy.MethodProxy.invoke(MethodProxy.java:204)
    at org.springframework.aop.framework.CglibAopProxy$CglibMethodInvocation.invokeJoinpoint(CglibAopProxy.java:738)
{code}
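
The exception message ("expect old TS ..., but it is ...") points to a timestamp-based check-and-put: a write is accepted only if the stored timestamp still matches the one the writer last read. The sketch below illustrates that general technique with an in-memory map. It is not Kylin's `ResourceStore` or `HBaseResourceStore` implementation; the class, the exception type, and the convention that a missing resource has timestamp 0 are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

/** Illustrative in-memory store with timestamp-based optimistic locking. */
final class TimestampCheckedStore {

    /** Hypothetical exception type, named after the idea in the log only. */
    static final class WriteConflict extends RuntimeException {
        WriteConflict(String message) {
            super(message);
        }
    }

    private final Map<String, Long> timestamps = new HashMap<>();
    private final Map<String, String> contents = new HashMap<>();

    /** Current timestamp of a resource; 0 means "does not exist" (assumption). */
    synchronized long timestampOf(String path) {
        return timestamps.getOrDefault(path, 0L);
    }

    /** Writes only if the stored timestamp equals the one the caller last read. */
    synchronized long checkAndPut(String path, String content, long expectedOldTs, long newTs) {
        long actual = timestampOf(path);
        if (actual != expectedOldTs) {
            throw new WriteConflict("Overwriting conflict " + path
                    + ", expect old TS " + expectedOldTs + ", but it is " + actual);
        }
        contents.put(path, content);
        timestamps.put(path, newTs);
        return newTs;
    }

    public static void main(String[] args) {
        TimestampCheckedStore store = new TimestampCheckedStore();
        String path = "/project/learn_kylin.json";
        long staleTs = store.checkAndPut(path, "v1", 0L, 1535685985266L);
        // Another writer updates the resource in the meantime.
        store.checkAndPut(path, "v2", staleTs, 1535695655000L);
        try {
            // A writer still holding the old timestamp is rejected.
            store.checkAndPut(path, "v3", staleTs, 1535695656000L);
        } catch (WriteConflict e) {
            System.out.println(e.getMessage());
        }
    }
}
```

Under this scheme a writer that cached an older copy of the resource must reload it before saving again, which is consistent with the conflict the log reports for `/project/learn_kylin.json`.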





[jira] [Created] (KYLIN-3521) Enable Cube Planner by default

2018-08-31 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3521:
---

 Summary: Enable Cube Planner by default
 Key: KYLIN-3521
 URL: https://issues.apache.org/jira/browse/KYLIN-3521
 Project: Kylin
  Issue Type: Improvement
Affects Versions: v2.5.0
Reporter: Shaofeng SHI


Cube planner can significantly reduce the number of cuboids to build. As it 
wasn't enabled by default in 2.3 and 2.4, many users don't know about it.

 

To let more users start using it, I suggest enabling it by default. As Cube 
planner only works when building the first segment, it only affects the cuboid 
scheduler of a new Cube. Old cubes will not be affected.

 

 





[jira] [Created] (KYLIN-3518) Coprocessor reports NPE when execute a query on HBase 2.0

2018-08-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3518:
---

 Summary: Coprocessor reports NPE when execute a query on HBase 2.0
 Key: KYLIN-3518
 URL: https://issues.apache.org/jira/browse/KYLIN-3518
 Project: Kylin
  Issue Type: Bug
  Components: Storage - HBase
Reporter: Shaofeng SHI


On HDP 3.0, after building a cube and running a simple count query, an NPE occurred:

 
{code:java}
2018-08-28 01:30:16,969 DEBUG [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
v2.CubeHBaseRPC:315 : hbase.rpc.timeout = 9 ms, use 81000 ms as timeout for 
coprocessor
2018-08-28 01:30:16,983 INFO [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
v2.CubeHBaseEndpointRPC:141 : Serialized scanRequestBytes 522 bytes, 
rawScanBytesString 44 bytes
2018-08-28 01:30:16,984 INFO [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
v2.CubeHBaseEndpointRPC:143 : The scan 67b41fc6 for segment 
kylin_sales_cube_clone[2012010100_2013010100] is as below with 1 
separate raw scans, shard part of start/end key is set to 0
2018-08-28 01:30:16,991 INFO [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
v2.CubeHBaseRPC:288 : Visiting hbase table KYLIN_5Q088VO5I0: cuboid require 
post aggregation, from 0 to 16384 Start: 
\x00\x00\x00\x00\x00\x00\x00\x00\x40\x00\x00\x00\x00 
(\x00\x00\x00\x00\x00\x00\x00\x00@\x00\x00\x00\x00) Stop: 
\x00\x00\x00\x00\x00\x00\x00\x00\x40\x00\xFF\xFF\xFF\x00 
(\x00\x00\x00\x00\x00\x00\x00\x00@\x00\xFF\xFF\xFF\x00), No Fuzzy Key
2018-08-28 01:30:16,991 DEBUG [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
v2.CubeHBaseEndpointRPC:148 : Submitting rpc to 1 shards starting from shard 0, 
scan range count 1
2018-08-28 01:30:17,010 INFO [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
common.KylinConfig:332 : Loading kylin-defaults.properties from 
file:/root/shaofengshi/apache-kylin-2.5.0-SNAPSHOT-bin/tomcat/webapps/kylin/WEB-INF/lib/kylin-core-common-2.5.0-SNAPSHOT.jar!/kylin-defaults.properties
2018-08-28 01:30:17,033 DEBUG [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
common.KylinConfig:291 : KYLIN_CONF property was not set, will seek KYLIN_HOME 
env variable
2018-08-28 01:30:17,051 INFO [pool-14-thread-1] hbase.HBaseConnection:110 : 
Creating coprocessor thread pool with max of 2048, core of 2048
2018-08-28 01:30:17,094 INFO [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
gtrecord.SequentialCubeTupleIterator:73 : Using SortedIteratorMergerWithLimit 
to merge segment results
2018-08-28 01:30:17,097 DEBUG [Query f7bf8004-b516-e372-18df-0d507075d471-71] 
enumerator.OLAPEnumerator:117 : return TupleIterator...
2018-08-28 01:30:21,607 INFO [kylin-coproc--pool9-t1] 
client.RpcRetryingCallerImpl:134 : Call exception, tries=6, retries=6, 
started=4410 ms ago, cancelled=false, msg=java.io.IOException
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:468)
at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:131)
at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:324)
at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:304)
Caused by: java.lang.NullPointerException
at 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService.visitCube(CubeVisitService.java:253)
at 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitService.callMethod(CubeVisitProtos.java:)
at org.apache.hadoop.hbase.regionserver.HRegion.execService(HRegion.java:8032)
at 
org.apache.hadoop.hbase.regionserver.RSRpcServices.execServiceOnRegion(RSRpcServices.java:2426)
at 
org.apache.hadoop.hbase.regionserver.RSRpcServices.execService(RSRpcServices.java:2408)
at 
org.apache.hadoop.hbase.shaded.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:42010)
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:409)
... 3 more
, details=row '' on table 'KYLIN_5Q088VO5I0' at 
region=KYLIN_5Q088VO5I0,,1535417272444.27b82cb4702db4557a98b9a7e60b7692., 
hostname=ignite03.com,16020,1534313612401, seqNum=2
2018-08-28 01:30:25,633 INFO [kylin-coproc--pool9-t1] 
client.RpcRetryingCallerImpl:134 : Call exception, tries=7, retries=7, 
started=8436 ms ago, cancelled=false, msg=java.io.IOException
at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:468)
at org.apache.hadoop.hbase.ipc.CallRunner.run(CallRunner.java:131)
at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:324)
at org.apache.hadoop.hbase.ipc.RpcExecutor$Handler.run(RpcExecutor.java:304)
Caused by: java.lang.NullPointerException
at 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.CubeVisitService.visitCube(CubeVisitService.java:253)
at 
org.apache.kylin.storage.hbase.cube.v2.coprocessor.endpoint.generated.CubeVisitProtos$CubeVisitService.callMethod(CubeVisitProtos.java:)
at org.apache.hadoop.hbase.regionserver.HRegion.execService(HRegion.java:8032)
at 
org.apache.hadoop.hbase.regionserver.RSRpcServices.execServiceOnRegi

[jira] [Created] (KYLIN-3517) Couldn't update coprocessor on HBase 2.0

2018-08-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3517:
---

 Summary: Couldn't update coprocessor on HBase 2.0
 Key: KYLIN-3517
 URL: https://issues.apache.org/jira/browse/KYLIN-3517
 Project: Kylin
  Issue Type: Bug
  Components: Storage - HBase
Reporter: Shaofeng SHI


On HDP 3.0, running the coprocessor update produced this error:

 
{code:java}
2018-08-28 00:24:26,683 ERROR [pool-7-thread-1] util.DeployCoprocessorCLI:383 : Error processing KYLIN_O9JRT8XOQ9
java.lang.UnsupportedOperationException: HTableDescriptor is read-only
    at org.apache.hadoop.hbase.client.ImmutableHTableDescriptor.getDelegateeForModification(ImmutableHTableDescriptor.java:59)
    at org.apache.hadoop.hbase.HTableDescriptor.removeCoprocessor(HTableDescriptor.java:768)
    at org.apache.kylin.storage.hbase.util.DeployCoprocessorCLI.resetCoprocessor(DeployCoprocessorCLI.java:300)
    at org.apache.kylin.storage.hbase.util.DeployCoprocessorCLI$ResetCoprocessorWorker.run(DeployCoprocessorCLI.java:375)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
    at java.lang.Thread.run(Thread.java:745)
{code}





[jira] [Created] (KYLIN-3513) Release 2.5.0

2018-08-24 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3513:
---

 Summary: Release 2.5.0
 Key: KYLIN-3513
 URL: https://issues.apache.org/jira/browse/KYLIN-3513
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3504) Release 2.4.1

2018-08-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3504:
---

 Summary: Release 2.4.1
 Key: KYLIN-3504
 URL: https://issues.apache.org/jira/browse/KYLIN-3504
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI








[jira] [Created] (KYLIN-3488) Support MySQL as Kylin metadata storage

2018-08-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3488:
---

 Summary: Support MySQL as Kylin metadata storage
 Key: KYLIN-3488
 URL: https://issues.apache.org/jira/browse/KYLIN-3488
 Project: Kylin
  Issue Type: New Feature
  Components: Metadata
Reporter: Shaofeng SHI


Kylin uses HBase as its metadata store, but in some cases users want to keep 
the metadata outside HBase.

Sonny Heer from mailing list mentioned:

"I'm fairly certain anyone using Kylin with AWS EMR will benefit from this.   
Having multiple hbase clusters across AZs is a huge benefit.  BTW only thing 
blocking at the moment is write operations happening from kylin query nodes."



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3472) TopN merge in Spark engine is slow

2018-07-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3472:
---

 Summary: TopN merge in Spark engine is slow
 Key: KYLIN-3472
 URL: https://issues.apache.org/jira/browse/KYLIN-3472
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3464) Less user confirmation

2018-07-23 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3464:
---

 Summary: Less user confirmation
 Key: KYLIN-3464
 URL: https://issues.apache.org/jira/browse/KYLIN-3464
 Project: Kylin
  Issue Type: Improvement
  Components: Web 
Reporter: Shaofeng SHI
 Attachments: 2018-07-22 下午3.07.10.png

Kylin web shows too many confirmation boxes; I need to click "OK" repeatedly 
to continue.

Suggest removing unnecessary notifications, or showing notifications that 
don't require confirmation. See the attachment.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3462) "dfs.replication=2" and compression not work in Spark cube engine

2018-07-22 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3462:
---

 Summary: "dfs.replication=2" and compression not work in Spark 
cube engine
 Key: KYLIN-3462
 URL: https://issues.apache.org/jira/browse/KYLIN-3462
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Affects Versions: v2.4.0, v2.3.1, v2.3.0
Reporter: Shaofeng SHI


In a comparison between Spark and MR cubing, I noticed that the cuboid files 
generated by the Spark engine are 3x larger than those from MR, and take 4x 
more disk space on HDFS than MR.

 

The reason is that "dfs.replication=2" doesn't take effect when Spark saves 
to HDFS, and compression is off by default.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3452) Optimize spark cubing memory footprint

2018-07-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3452:
---

 Summary: Optimize spark cubing memory footprint
 Key: KYLIN-3452
 URL: https://issues.apache.org/jira/browse/KYLIN-3452
 Project: Kylin
  Issue Type: Improvement
  Components: Spark Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3449) Should allow deleting a segment in NEW status

2018-07-13 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3449:
---

 Summary: Should allow deleting a segment in NEW status
 Key: KYLIN-3449
 URL: https://issues.apache.org/jira/browse/KYLIN-3449
 Project: Kylin
  Issue Type: Improvement
Reporter: Shaofeng SHI


Sometimes the job is dropped or is not created successfully, and the new 
segment becomes an orphan. So far Kylin doesn't allow deleting a segment in 
NEW status, which is a problem.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3446) Convert to HFile in spark reports ZK connection refused

2018-07-12 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3446:
---

 Summary: Convert to HFile in spark reports ZK connection refused
 Key: KYLIN-3446
 URL: https://issues.apache.org/jira/browse/KYLIN-3446
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Shaofeng SHI
 Fix For: v2.5.0


{code:java}
to server localhost/127.0.0.1:2181. Will not attempt to authenticate using SASL (unknown error)
2018-07-12 18:51:21,001 INFO [Scheduler 1109292714 Job 62f42193-20ff-4ca9-b898-52978a473bce-864] spark.SparkExecutable:38 : 18/07/12 18:51:21 WARN zookeeper.ClientCnxn: Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect
2018-07-12 18:51:21,002 INFO [Scheduler 1109292714 Job 62f42193-20ff-4ca9-b898-52978a473bce-864] spark.SparkExecutable:38 : java.net.ConnectException: Connection refused
2018-07-12 18:51:21,002 INFO [Scheduler 1109292714 Job 62f42193-20ff-4ca9-b898-52978a473bce-864] spark.SparkExecutable:38 : at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
2018-07-12 18:51:21,002 INFO [Scheduler 1109292714 Job 62f42193-20ff-4ca9-b898-52978a473bce-864] spark.SparkExecutable:38 : at sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:717)
2018-07-12 18:51:21,002 INFO [Scheduler 1109292714 Job 62f42193-20ff-4ca9-b898-52978a473bce-864] spark.SparkExecutable:38 : at org.apache.zookeeper.ClientCnxnSocketNIO.doTransport(ClientCnxnSocketNIO.java:361)
2018-07-12 18:51:21,002 INFO [Scheduler 1109292714 Job 62f42193-20ff-4ca9-b898-52978a473bce-864] spark.SparkExecutable:38 : at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1081)
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3443) Optimize cube job in Spark

2018-07-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3443:
---

 Summary: Optimize cube job in Spark
 Key: KYLIN-3443
 URL: https://issues.apache.org/jira/browse/KYLIN-3443
 Project: Kylin
  Issue Type: Improvement
  Components: Spark Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3442) Fact distinct columns in Spark

2018-07-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3442:
---

 Summary: Fact distinct columns in Spark
 Key: KYLIN-3442
 URL: https://issues.apache.org/jira/browse/KYLIN-3442
 Project: Kylin
  Issue Type: Improvement
  Components: Spark Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3441) Merge cube segments in Spark

2018-07-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3441:
---

 Summary: Merge cube segments in Spark
 Key: KYLIN-3441
 URL: https://issues.apache.org/jira/browse/KYLIN-3441
 Project: Kylin
  Issue Type: Improvement
  Components: Spark Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3440) Couldn't merge if there is an empty segment

2018-07-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3440:
---

 Summary: Couldn't merge if there is an empty segment
 Key: KYLIN-3440
 URL: https://issues.apache.org/jira/browse/KYLIN-3440
 Project: Kylin
  Issue Type: Bug
  Components: Web 
Affects Versions: v2.4.0, v2.3.1
Reporter: Shaofeng SHI
 Attachments: 2018-07-04 上午8.42.08.png

I have a cube with an empty segment (actually the segment is not empty; the 
job counter just wasn't fetched). When I try to merge the segments, Kylin 
reports an error and asks whether I want to force the merge. When I click 
"yes", the same error is reported again. In the HTTP request, the 
"forceMergeEmptySegment" parameter was not set.
{code:java}
org.apache.kylin.rest.exception.InternalErrorException: Empty cube segment found, couldn't merge unless 'forceMergeEmptySegment' set to true: [2012010100_2012080100, 2012080100_2018070100]
at org.apache.kylin.rest.controller.CubeController.buildInternal(CubeController.java:416)
at org.apache.kylin.rest.controller.CubeController.rebuild(CubeController.java:366)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at org.springframework.web.method.support.InvocableHandlerMethod.doInvoke(InvocableHandlerMethod.java:205)
at org.springframework.web.method.support.InvocableHandlerMethod.invokeForRequest(InvocableHandlerMethod.java:133)
at org.springframework.web.servlet.mvc.method.annotation.ServletInvocableHandlerMethod.invokeAndHandle(ServletInvocableHandlerMethod.java:97)
at org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.invokeHandlerMethod(RequestMappingHandlerAdapter.java:827)
at org.springframework.web.servlet.mvc.method.annotation.RequestMappingHandlerAdapter.handleInternal(RequestMappingHandlerAdapter.java:738)
at org.springframework.web.servlet.mvc.method.AbstractHandlerMethodAdapter.handle(AbstractHandlerMethodAdapter.java:85)
at org.springframework.web.servlet.DispatcherServlet.doDispatch(DispatcherServlet.java:967)
at org.springframework.web.servlet.DispatcherServlet.doService(DispatcherServlet.java:901)
at org.springframework.web.servlet.FrameworkServlet.processRequest(FrameworkServlet.java:970)
at org.springframework.web.servlet.FrameworkServlet.doPut(FrameworkServlet.java:883)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:653)
at org.springframework.web.servlet.FrameworkServlet.service(FrameworkServlet.java:846)
at javax.servlet.http.HttpServlet.service(HttpServlet.java:731)
at org.apache.catalina.core.ApplicationFilterChain.internalDoFilter(ApplicationFilterChain.java:303)
at org.apache.catalina.core.ApplicationFilterChain.doFilter(ApplicationFilterChain.java:208)
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3437) Optimize HBase connection pool in Kylin side

2018-07-02 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3437:
---

 Summary: Optimize HBase connection pool in Kylin side
 Key: KYLIN-3437
 URL: https://issues.apache.org/jira/browse/KYLIN-3437
 Project: Kylin
  Issue Type: Improvement
  Components: Query Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3435) Only keep base cuboid files on HDFS for future merge

2018-07-01 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3435:
---

 Summary: Only keep base cuboid files on HDFS for future merge
 Key: KYLIN-3435
 URL: https://issues.apache.org/jira/browse/KYLIN-3435
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI


Today Kylin keeps all cuboid data in HDFS for future merges. When merging, 
Kylin needs to re-encode the dimension values with the new dictionaries, for 
all cuboids.

 

If we keep only the base cuboid, a lot of disk space can be saved. On merge, 
first merge the base cuboids, then calculate the other cuboids from the new 
base cuboid.
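
As a rough illustration of "calculate others from the new base cuboid", the sketch below rolls a base cuboid up to a child cuboid by dropping dimensions and summing one measure. The class name, the in-memory map layout and the single additive `long` measure are all assumptions made for the example; they are not Kylin's storage format or its cubing code, and non-additive measures would need their own merge logic.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch, not Kylin code: derive a child cuboid from the base cuboid. */
final class CuboidRollupSketch {

    /**
     * Aggregates base cuboid rows into a child cuboid that keeps only the
     * dimensions at the given positions. Each row is (dimension values -> SUM measure).
     */
    static Map<List<String>, Long> rollUp(Map<List<String>, Long> baseCuboid, int[] keptDims) {
        Map<List<String>, Long> child = new LinkedHashMap<>();
        for (Map.Entry<List<String>, Long> row : baseCuboid.entrySet()) {
            // Build the child row key from the kept dimension positions only.
            List<String> key = new ArrayList<>(keptDims.length);
            for (int d : keptDims) {
                key.add(row.getKey().get(d));
            }
            // Rows that collapse onto the same key are summed.
            child.merge(key, row.getValue(), Long::sum);
        }
        return child;
    }
}
```

With this shape, only the merged base cuboid has to be kept on HDFS; every other cuboid is recomputed from it at merge time, trading extra computation for disk space.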



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3432) Merge segments from HBase snapshots

2018-06-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3432:
---

 Summary: Merge segments from HBase snapshots
 Key: KYLIN-3432
 URL: https://issues.apache.org/jira/browse/KYLIN-3432
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI


HBase has provided MultiTableSnapshotInputFormat since 1.2; Kylin can merge 
segments with it, so that the cuboid files can be deleted.

 
 # HBASE-13356



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3427) Convert to HFile in Spark

2018-06-26 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3427:
---

 Summary: Convert to HFile in Spark
 Key: KYLIN-3427
 URL: https://issues.apache.org/jira/browse/KYLIN-3427
 Project: Kylin
  Issue Type: Improvement
  Components: Spark Engine
Reporter: Shaofeng SHI


Some references:

https://www.opencore.com/blog/2016/10/efficient-bulk-load-of-hbase-using-spark/



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3422) Support multi-path of domain for kylin connection

2018-06-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3422:
---

 Summary: Support multi-path of domain for kylin connection
 Key: KYLIN-3422
 URL: https://issues.apache.org/jira/browse/KYLIN-3422
 Project: Kylin
  Issue Type: Improvement
  Components: Driver - JDBC, Driver - ODBC
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3419) Upgrade to Java 8

2018-06-19 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3419:
---

 Summary: Upgrade to Java 8
 Key: KYLIN-3419
 URL: https://issues.apache.org/jira/browse/KYLIN-3419
 Project: Kylin
  Issue Type: Improvement
  Components: Environment 
Reporter: Shaofeng SHI


Java 7 will not be supported. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3418) User interface for hybrid model

2018-06-19 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3418:
---

 Summary: User interface for hybrid model
 Key: KYLIN-3418
 URL: https://issues.apache.org/jira/browse/KYLIN-3418
 Project: Kylin
  Issue Type: Improvement
  Components: Web 
Reporter: Shaofeng SHI


The hybrid model is useful when a model changes. But there is currently no 
entry point for it in the GUI, so many users are unaware of the feature.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3415) Remove "external" module

2018-06-19 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3415:
---

 Summary: Remove "external" module
 Key: KYLIN-3415
 URL: https://issues.apache.org/jira/browse/KYLIN-3415
 Project: Kylin
  Issue Type: Task
  Components: Tools, Build and Test
Reporter: Shaofeng SHI


Kylin's "external" module was created to build a shaded jar that could be 
referenced from the "core-common" module. This function is no longer in real 
use, and it blocks the Apache release process, so we can safely remove it.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3414) Optimize the cleanup of project L2 cache

2018-06-17 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3414:
---

 Summary: Optimize the cleanup of project L2 cache
 Key: KYLIN-3414
 URL: https://issues.apache.org/jira/browse/KYLIN-3414
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata
Reporter: Shaofeng SHI


Whenever a cube changes, the level 2 cache of all projects is cleared. This 
can be optimized to flush only the cache of the affected project.
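
A minimal sketch of the difference between the two flush strategies, assuming a cache keyed first by project name. The class and method names here are hypothetical and do not correspond to Kylin's actual project manager or cache classes.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical sketch, not Kylin code: an L2 cache partitioned by project. */
final class ProjectL2CacheSketch {

    // project name -> (cache key -> cached value)
    private final Map<String, Map<String, Object>> cacheByProject = new ConcurrentHashMap<>();

    void put(String project, String key, Object value) {
        cacheByProject.computeIfAbsent(project, p -> new ConcurrentHashMap<>()).put(key, value);
    }

    Object get(String project, String key) {
        Map<String, Object> projectCache = cacheByProject.get(project);
        return projectCache == null ? null : projectCache.get(key);
    }

    /** Behavior described in the issue: any cube change wipes every project's cache. */
    void clearAll() {
        cacheByProject.clear();
    }

    /** Proposed behavior: only the project that owns the changed cube is flushed. */
    void clearProject(String project) {
        cacheByProject.remove(project);
    }
}
```

A cube-change listener would then call `clearProject` with the project that owns the cube, so other projects keep their warm caches.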



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3408) Release 2.4.0

2018-06-12 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3408:
---

 Summary: Release 2.4.0
 Key: KYLIN-3408
 URL: https://issues.apache.org/jira/browse/KYLIN-3408
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3383) Document on how to query Kylin from Spark

2018-05-16 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3383:
---

 Summary: Document on how to query Kylin from Spark
 Key: KYLIN-3383
 URL: https://issues.apache.org/jira/browse/KYLIN-3383
 Project: Kylin
  Issue Type: Improvement
  Components: Documentation
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3382) YARN job link wasn't displayed when job is running

2018-05-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3382:
---

 Summary: YARN job link wasn't displayed when job is running
 Key: KYLIN-3382
 URL: https://issues.apache.org/jira/browse/KYLIN-3382
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Shaofeng SHI


Affected steps: "Build cube with Spark"



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3381) Stop/abort cubing job doesn't stop Spark job

2018-05-15 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3381:
---

 Summary: Stop/abort cubing job doesn't stop Spark job
 Key: KYLIN-3381
 URL: https://issues.apache.org/jira/browse/KYLIN-3381
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3378) Support Kafka table join with Hive tables

2018-05-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3378:
---

 Summary: Support Kafka table join with Hive tables
 Key: KYLIN-3378
 URL: https://issues.apache.org/jira/browse/KYLIN-3378
 Project: Kylin
  Issue Type: Improvement
  Components: Streaming
Reporter: Shaofeng SHI


At this moment, if the data source is Kafka, only one table is allowed in the 
data model. In some cases, users expect to join the Kafka stream with Hive 
lookup tables.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3369) Reduce the data size sink from Kafka topic to HDFS

2018-05-06 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3369:
---

 Summary: Reduce the data size sink from Kafka topic to HDFS
 Key: KYLIN-3369
 URL: https://issues.apache.org/jira/browse/KYLIN-3369
 Project: Kylin
  Issue Type: Improvement
  Components: Streaming
Reporter: Shaofeng SHI


When building a cube from a Kafka topic, the first step is to sink the Kafka 
data to HDFS. Today's implementation persists all the fields of each message 
to disk, while in many cases only a couple of fields are needed for cubing. 
This wastes network bandwidth and disk space.
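
One way to picture the improvement is a projection step applied to each parsed message before it is written out. The sketch below assumes the message has already been parsed into a flat map and that the list of needed fields comes from the cube definition; both the class name and that input shape are illustrative assumptions, not Kylin's Kafka input code.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical sketch, not Kylin code: keep only the fields the cube needs. */
final class MessageProjector {

    /**
     * Returns a copy of the message containing only the needed fields,
     * in the order they are listed. Fields missing from the message are skipped.
     */
    static Map<String, Object> project(Map<String, Object> message, List<String> neededFields) {
        Map<String, Object> projected = new LinkedHashMap<>();
        for (String field : neededFields) {
            if (message.containsKey(field)) {
                projected.put(field, message.get(field));
            }
        }
        return projected;
    }
}
```

Only the projected map would then be serialized to HDFS, so unused fields never leave the consumer.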



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3368) "/kylin/kylin_metadata/metadata/" has many gargage for spark cubing

2018-05-04 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3368:
---

 Summary: "/kylin/kylin_metadata/metadata/" has many gargage for 
spark cubing
 Key: KYLIN-3368
 URL: https://issues.apache.org/jira/browse/KYLIN-3368
 Project: Kylin
  Issue Type: Bug
  Components: Spark Engine
Affects Versions: v2.3.0, v2.2.0
Reporter: Shaofeng SHI


When Spark is used as the cube engine, Kylin dumps metadata to HDFS under 
"/kylin/kylin_metadata/metadata/". Over time, many files are left there.

 

They should be deleted when the job finishes.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3355) Improve the HTTP return code of Rest API

2018-04-28 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3355:
---

 Summary: Improve the HTTP return code of Rest API
 Key: KYLIN-3355
 URL: https://issues.apache.org/jira/browse/KYLIN-3355
 Project: Kylin
  Issue Type: Improvement
Reporter: Shaofeng SHI


Kylin returns a 500 error for some invalid inputs, for example an invalid 
cube name, or when the number of jobs exceeds the maximum.

We need to refine this behavior: for invalid user input, return a 4XX HTTP code. 
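
The idea can be sketched as a single mapping from the failure type to a status code. The exception types and the specific codes chosen below (400, 404, 409) are assumptions for illustration only; the issue does not say which codes Kylin should use, and this is not Kylin's REST controller code.

```java
import java.util.NoSuchElementException;

/** Hypothetical sketch, not Kylin code: map failures to HTTP status codes. */
final class HttpStatusMapper {

    static int statusFor(Throwable error) {
        if (error instanceof IllegalArgumentException) {
            return 400; // malformed or otherwise invalid request input
        }
        if (error instanceof NoSuchElementException) {
            return 404; // e.g. the requested cube name does not exist
        }
        if (error instanceof IllegalStateException) {
            return 409; // e.g. the request conflicts with a job limit
        }
        return 500; // anything else is treated as a genuine server-side failure
    }
}
```

Keeping the mapping in one place makes it easier to audit which failures are reported as client errors and which as server errors.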



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3353) Merge job should not be blocked by "kylin.cube.max-building-segments"

2018-04-27 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3353:
---

 Summary: Merge job should not be blocked by 
"kylin.cube.max-building-segments"
 Key: KYLIN-3353
 URL: https://issues.apache.org/jira/browse/KYLIN-3353
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3350) Build cube incrementally from a file list

2018-04-25 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3350:
---

 Summary: Build cube incrementally from a file list
 Key: KYLIN-3350
 URL: https://issues.apache.org/jira/browse/KYLIN-3350
 Project: Kylin
  Issue Type: New Feature
  Components: Job Engine
Reporter: Shaofeng SHI


This is a new requirement.

In some cases, upstream components dump new data into files and expect Kylin 
to consume data from a given file list instead of from Hive.

 

 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3304) Instable UT CacheServiceTest

2018-03-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3304:
---

 Summary: Instable UT CacheServiceTest
 Key: KYLIN-3304
 URL: https://issues.apache.org/jira/browse/KYLIN-3304
 Project: Kylin
  Issue Type: Test
  Components: Tools, Build and Test
Reporter: Shaofeng SHI


Kylin integration test is unstable; it sometimes fails at CacheServiceTest:
*00:48:11* Failed tests:
*00:48:11*   CacheServiceTest.testCubeCRUD:237
*00:48:11* Tests in error:
*00:48:11*   CacheServiceTest.testMetaCRUD:316->waitForCounterAndClear:156 Runtime timeout



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3302) Upgrade commons-email

2018-03-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3302:
---

 Summary: Upgrade commons-email
 Key: KYLIN-3302
 URL: https://issues.apache.org/jira/browse/KYLIN-3302
 Project: Kylin
  Issue Type: Improvement
  Components: Integration
Affects Versions: v2.3.0, v2.2.0
Reporter: Shaofeng SHI


Check [CVE-2017-9801|https://nvd.nist.gov/vuln/detail/CVE-2017-9801]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3301) Upgrade opensaml

2018-03-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3301:
---

 Summary: Upgrade opensaml
 Key: KYLIN-3301
 URL: https://issues.apache.org/jira/browse/KYLIN-3301
 Project: Kylin
  Issue Type: Improvement
  Components: Integration
Affects Versions: v2.3.0
Reporter: Shaofeng SHI


opensaml 2.6.4 is reported to have a security issue (CVE-2015-1796); it needs to be upgraded.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3300) Upgrade jackson-databind

2018-03-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3300:
---

 Summary: Upgrade jackson-databind
 Key: KYLIN-3300
 URL: https://issues.apache.org/jira/browse/KYLIN-3300
 Project: Kylin
  Issue Type: Improvement
  Components: Integration
Reporter: Shaofeng SHI


jackson-databind 2.6.3 and 2.6.5 are reported to have a security issue 
(CVE-2017-7525); they need to be upgraded.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3294) Remove HBaseMROutput.java, RangeKeyDistributionJob.java and other sunset classes

2018-03-18 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3294:
---

 Summary: Remove HBaseMROutput.java, RangeKeyDistributionJob.java 
and other sunset classes
 Key: KYLIN-3294
 URL: https://issues.apache.org/jira/browse/KYLIN-3294
 Project: Kylin
  Issue Type: Improvement
  Components: Job Engine
Reporter: Shaofeng SHI


These are legacy classes; keeping them adds maintenance effort, especially 
when upgrading the HBase version. They should be deleted.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3285) "Value NNN not exists" error run executing query

2018-03-09 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3285:
---

 Summary:  "Value NNN not exists" error run executing query
 Key: KYLIN-3285
 URL: https://issues.apache.org/jira/browse/KYLIN-3285
 Project: Kylin
  Issue Type: Bug
  Components: Query Engine
Affects Versions: v2.3.0
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3265) Add "jobSearchMode" as a condition to "/kylin/jobs" API

2018-02-23 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3265:
---

 Summary: Add "jobSearchMode" as a condition to "/kylin/jobs" API
 Key: KYLIN-3265
 URL: https://issues.apache.org/jira/browse/KYLIN-3265
 Project: Kylin
  Issue Type: Improvement
  Components: REST Service
Affects Versions: v2.3.0
Reporter: Shaofeng SHI


In Kylin 2.3, the "/kylin/jobs" API returns both cubing jobs and checkpoint 
jobs, which changed the API's behavior because checkpoint jobs did not exist 
before. We should keep the API returning only cubing jobs to stay compatible 
with earlier versions, and extend it with another condition, 'jobSearchMode', 
to determine which kind of job should be returned. 
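
A minimal sketch of how such a condition could behave, assuming that an absent parameter falls back to cubing jobs only so that existing clients see the old result. The enum constants, class name and method names are illustrative assumptions, not the API that was eventually implemented.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch, not Kylin code: filter jobs by a search mode. */
final class JobSearchSketch {

    enum JobType { CUBING, CHECKPOINT }

    enum JobSearchMode { CUBING_ONLY, CHECKPOINT_ONLY, ALL }

    /** A missing or empty parameter keeps the pre-2.3 behavior (cubing jobs only). */
    static JobSearchMode parseMode(String param) {
        if (param == null || param.isEmpty()) {
            return JobSearchMode.CUBING_ONLY;
        }
        // valueOf throws IllegalArgumentException for an unknown mode name.
        return JobSearchMode.valueOf(param);
    }

    static List<JobType> filter(List<JobType> jobs, JobSearchMode mode) {
        List<JobType> result = new ArrayList<>();
        for (JobType job : jobs) {
            boolean keep = mode == JobSearchMode.ALL
                    || (mode == JobSearchMode.CUBING_ONLY && job == JobType.CUBING)
                    || (mode == JobSearchMode.CHECKPOINT_ONLY && job == JobType.CHECKPOINT);
            if (keep) {
                result.add(job);
            }
        }
        return result;
    }
}
```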



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3232) Need document for ops tools

2018-02-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3232:
---

 Summary: Need document for ops tools
 Key: KYLIN-3232
 URL: https://issues.apache.org/jira/browse/KYLIN-3232
 Project: Kylin
  Issue Type: Improvement
  Components: Documentation
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Kylin has many good ops tools, like CubeMigrationCLI, CubeMetaIngester, etc. 
They can greatly improve administrators' productivity, but there is currently 
no documentation for them (you need to read the code to know what they do).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3231) Improve the "how to upgrade" guide

2018-02-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3231:
---

 Summary: Improve the "how to upgrade" guide
 Key: KYLIN-3231
 URL: https://issues.apache.org/jira/browse/KYLIN-3231
 Project: Kylin
  Issue Type: Improvement
  Components: Documentation
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Some users complain that the "How to upgrade" document is not friendly: it 
needs to provide detailed steps for each version.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3230) Need documenation for the configurations/parameters

2018-02-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3230:
---

 Summary: Need documenation for the configurations/parameters
 Key: KYLIN-3230
 URL: https://issues.apache.org/jira/browse/KYLIN-3230
 Project: Kylin
  Issue Type: Improvement
  Components: Documentation
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Kylin has more and more configuration parameters, but there is no central 
document that explains these configurations and their impact. We need to add 
such a doc for users.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3229) Update tutorial for 2.3 release

2018-02-03 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3229:
---

 Summary: Update tutorial for 2.3 release
 Key: KYLIN-3229
 URL: https://issues.apache.org/jira/browse/KYLIN-3229
 Project: Kylin
  Issue Type: Sub-task
  Components: Documentation
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


The step-by-step tutorial on the Kylin website is out of date. It needs to be 
updated together with the release.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3177) Merged Streaming cube segments has no start/end time

2018-01-17 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3177:
---

 Summary: Merged Streaming cube segments has no start/end time
 Key: KYLIN-3177
 URL: https://issues.apache.org/jira/browse/KYLIN-3177
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine, Metadata
Affects Versions: v2.2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


Create a streaming cube, define auto-merge thresholds, and then build it 
repeatedly.

When some segments are merged into a bigger segment, the new segment's start 
time and end time are 0 (shown as 1970-01-01).

 

This bug is similar to KYLIN-3165 and was introduced in Kylin 2.2.


--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (KYLIN-3161) Enforce global dictionary for bitmap count distinct column

2018-01-11 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3161:
---

 Summary: Enforce global dictionary for bitmap count distinct column
 Key: KYLIN-3161
 URL: https://issues.apache.org/jira/browse/KYLIN-3161
 Project: Kylin
  Issue Type: Improvement
  Components: Metadata, Web 
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


For a bitmap-based count distinct column whose data type is not int, a global 
dictionary is required. But users can currently choose a normal dictionary, 
which may cause incorrect results.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KYLIN-3145) Support Kafka JSON message whose property name includes "_"

2018-01-01 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3145:
---

 Summary: Support Kafka JSON message whose property name includes 
"_"
 Key: KYLIN-3145
 URL: https://issues.apache.org/jira/browse/KYLIN-3145
 Project: Kylin
  Issue Type: Improvement
  Components: Streaming
Reporter: Shaofeng SHI


So far Kylin doesn't support JSON messages whose property names contain "_", 
because that conflicts with Kylin's parsing logic.

For example, given this JSON message:

{code}
{ 
 "user" : {
  "first_name" : "Tom",
  "age" : "20"
  }

}
{code}

When this topic is mapped to a table, "first_name" is mapped to the column 
"user_first_name". When Kylin parses the message, it splits the name on "_" 
and then tries to find "user" -> "first" -> "name"; as there is no "first" 
property, an error is reported.
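
One possible way to resolve the ambiguity, sketched here under the assumption that the message has been parsed into nested maps, is to try the longest candidate property name first at each level and backtrack when a branch leads nowhere. The class and method are hypothetical; this is not Kylin's parser and not necessarily the approach the fix will take.

```java
import java.util.Map;

/** Hypothetical sketch, not Kylin code: resolve a flattened column name against nested JSON. */
final class FlattenedKeyResolver {

    /**
     * Resolves a name such as "user_first_name" against a nested map.
     * Returns the matching value, or null when no path matches.
     */
    static Object resolve(Map<String, Object> node, String flatName) {
        // The whole remaining name may itself be a property, e.g. "first_name".
        if (node.containsKey(flatName)) {
            return node.get(flatName);
        }
        // Otherwise split at each "_" from right to left, so that the longest
        // candidate prefix is tried first, and descend into matching objects.
        for (int i = flatName.lastIndexOf('_'); i > 0; i = flatName.lastIndexOf('_', i - 1)) {
            String head = flatName.substring(0, i);
            String tail = flatName.substring(i + 1);
            Object child = node.get(head);
            if (child instanceof Map && !tail.isEmpty()) {
                @SuppressWarnings("unchecked")
                Object found = resolve((Map<String, Object>) child, tail);
                if (found != null) {
                    return found;
                }
            }
        }
        return null;
    }
}
```

For the message above, "user_first_name" is first tried as a whole, then as "user_first" + "name", and finally as "user" + "first_name", which matches.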



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KYLIN-3142) Too many temp partition files left in Hadoop /tmp folder

2017-12-29 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3142:
---

 Summary: Too many temp partition files left in Hadoop /tmp folder
 Key: KYLIN-3142
 URL: https://issues.apache.org/jira/browse/KYLIN-3142
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI


I have a cluster running several streaming cubes. After months of running, 
many files are left in the Hadoop /tmp folder, which has hurt the DFS's 
performance. After cleaning them up, the cluster is much more stable than before.

hadoop fs -ls /tmp/
SLF4J: Class path contains multiple SLF4J bindings.
SLF4J: Found binding in [jar:file:/usr/hdp/2.5.4.0-121/hadoop/lib/alluxio-core-client-runtime-1.6.1-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hdp/2.5.4.0-121/hadoop/lib/slf4j-log4j12-1.7.10.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: Found binding in [jar:file:/usr/hdp/2.5.4.0-121/hadoop/alluxio-core-client-runtime-1.6.1-jar-with-dependencies.jar!/org/slf4j/impl/StaticLoggerBinder.class]
SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
Found 46 items
-rw-r--r--   1 root supergroup        153 2017-12-29 11:33 /tmp/partitions_02e317ac-03de-4cec-8906-6b18bc974858
-rw-r--r--   1 root supergroup        153 2017-12-29 07:03 /tmp/partitions_03108f0b-7873-4610-9346-e1c12388b229
-rw-r--r--   1 root supergroup        153 2017-12-29 03:00 /tmp/partitions_032d22ed-2ed3-401f-8601-b4b9d6f7800b
-rw-r--r--   1 root supergroup        153 2017-12-29 02:41 /tmp/partitions_0869a983-4b20-40b7-a1f9-4ca9413ecd8d
-rw-r--r--   1 root supergroup        153 2017-12-29 04:03 /tmp/partitions_10c47970-d4c2-4af1-819b-93717591ff5d
-rw-r--r--   1 root supergroup        153 2017-12-29 06:03 /tmp/partitions_172e6684-f825-4101-8860-06aaad920c59
-rw-r--r--   1 root supergroup        153 2017-12-29 03:12 /tmp/partitions_249a224d-101d-4eff-b223-5453865e34b8
-rw-r--r--   1 root supergroup        153 2017-12-29 03:01 /tmp/partitions_24b74354-3b5d-44ad-984d-d9d7849988f9
-rw-r--r--   1 root supergroup       1341 2017-12-29 03:08 /tmp/partitions_27c0089b-ed0f-4a37-af4a-adc686203ac3



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KYLIN-3129) Startup error on EMR 5.8+

2017-12-21 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3129:
---

 Summary: Startup error on EMR 5.8+
 Key: KYLIN-3129
 URL: https://issues.apache.org/jira/browse/KYLIN-3129
 Project: Kylin
  Issue Type: Bug
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KYLIN-3062) Hide RAW measure

2017-11-26 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3062:
---

 Summary: Hide RAW measure
 Key: KYLIN-3062
 URL: https://issues.apache.org/jira/browse/KYLIN-3062
 Project: Kylin
  Issue Type: Task
Reporter: Shaofeng SHI


See the discussion here:
http://apache-kylin.74782.x6.nabble.com/Discuss-Disable-hide-quot-RAW-quot-measure-in-Kylin-web-GUI-td6636.html



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KYLIN-3050) AWS Kinesis as data source

2017-11-20 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3050:
---

 Summary: AWS Kinesis as data source
 Key: KYLIN-3050
 URL: https://issues.apache.org/jira/browse/KYLIN-3050
 Project: Kylin
  Issue Type: New Feature
Reporter: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KYLIN-3032) Cube size shows 0 but actually it isn't empty

2017-11-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3032:
---

 Summary: Cube size shows 0 but actually it isn't empty
 Key: KYLIN-3032
 URL: https://issues.apache.org/jira/browse/KYLIN-3032
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
 Environment: AWS EMR
Reporter: Shaofeng SHI
Assignee: Dong Li


On EMR, if working-dir is set to S3, then after building the cube, "Cube 
size" shows 0 while "Source count" is not 0.

This bug occurs when working-dir is not on the same file system as the Hadoop 
default FS.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KYLIN-3028) Build cube error when set S3 as working-dir

2017-11-10 Thread Shaofeng SHI (JIRA)
Shaofeng SHI created KYLIN-3028:
---

 Summary: Build cube error when set S3 as working-dir
 Key: KYLIN-3028
 URL: https://issues.apache.org/jira/browse/KYLIN-3028
 Project: Kylin
  Issue Type: Bug
  Components: Job Engine
Affects Versions: v2.2.0
Reporter: Shaofeng SHI
Assignee: Shaofeng SHI






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

