Hi,

Here is my observation: this process runs slowly the first time, but when I
build the cube again it completes in only a few minutes. I am sharing the log
from the second step. It looks like I am missing some MapReduce configuration here.

1st attempt:
Counters: 52
        File System Counters
                FILE: Number of bytes read=1404339530
                FILE: Number of bytes written=2100591948
                FILE: Number of read operations=0
                FILE: Number of large read operations=0
                FILE: Number of write operations=0
                HDFS: Number of bytes read=9312271
                HDFS: Number of bytes written=520516128
                HDFS: Number of read operations=59
                HDFS: Number of large read operations=0
                HDFS: Number of write operations=50
        Job Counters 
                Killed map tasks=1
                Killed reduce tasks=2
                Launched map tasks=3
                Launched reduce tasks=18
                Data-local map tasks=3
                Total time spent by all maps in occupied slots (ms)=357469920
                Total time spent by all reduces in occupied slots (ms)=3550570080
                Total time spent by all map tasks (ms)=1489458
                Total time spent by all reduce tasks (ms)=7397021
                Total vcore-milliseconds taken by all map tasks=1489458
                Total vcore-milliseconds taken by all reduce tasks=7397021
                Total megabyte-milliseconds taken by all map tasks=11439037440
                Total megabyte-milliseconds taken by all reduce tasks=113618242560
        Map-Reduce Framework
                Map input records=368816
                Map output records=104656
                Map output bytes=1021571917
                Map output materialized bytes=698471609
                Input split bytes=6220
                Combine input records=209310
                Combine output records=209308
                Reduce input groups=59717
                Reduce shuffle bytes=698471609
                Reduce input records=104654
                Reduce output records=0
                Spilled Records=313962
                Shuffled Maps =34
                Failed Shuffles=0
                Merged Map outputs=34
                GC time elapsed (ms)=5942
                CPU time spent (ms)=1203260
                Physical memory (bytes) snapshot=18783621120
                Virtual memory (bytes) snapshot=279152558080
                Total committed heap usage (bytes)=43746066432
        Shuffle Errors
                BAD_ID=0
                CONNECTION=0
                IO_ERROR=0
                WRONG_LENGTH=0
                WRONG_MAP=0
                WRONG_REDUCE=0
        File Input Format Counters 
                Bytes Read=0
        File Output Format Counters 
                Bytes Written=0
        org.apache.kylin.engine.mr.steps.FactDistinctColumnsMapper$RawDataCounter
                BYTES=55368910

2nd attempt:

Counters: 51
        File System Counters
                FILE: Number of bytes read=8209556
                FILE: Number of bytes written=25262878
                FILE: Number of read operations=0
                FILE: Number of large read operations=0
                FILE: Number of write operations=0
                HDFS: Number of bytes read=9336990
                HDFS: Number of bytes written=1747432
                HDFS: Number of read operations=87
                HDFS: Number of large read operations=0
                HDFS: Number of write operations=50
        Job Counters 
                Killed reduce tasks=1
                Launched map tasks=9
                Launched reduce tasks=17
                Data-local map tasks=9
                Total time spent by all maps in occupied slots (ms)=17504880
                Total time spent by all reduces in occupied slots (ms)=36970080
                Total time spent by all map tasks (ms)=72937
                Total time spent by all reduce tasks (ms)=77021
                Total vcore-milliseconds taken by all map tasks=72937
                Total vcore-milliseconds taken by all reduce tasks=77021
                Total megabyte-milliseconds taken by all map tasks=560156160
                Total megabyte-milliseconds taken by all reduce tasks=1183042560
        Map-Reduce Framework
                Map input records=368816
                Map output records=90281
                Map output bytes=12469333
                Map output materialized bytes=8850914
                Input split bytes=27990
                Combine input records=90281
                Combine output records=90272
                Reduce input groups=27059
                Reduce shuffle bytes=8850914
                Reduce input records=90272
                Reduce output records=0
                Spilled Records=180544
                Shuffled Maps =153
                Failed Shuffles=0
                Merged Map outputs=153
                GC time elapsed (ms)=11067
                CPU time spent (ms)=207100
                Physical memory (bytes) snapshot=24891486208
                Virtual memory (bytes) snapshot=339687297024
                Total committed heap usage (bytes)=63315640320
        Shuffle Errors
                BAD_ID=0
                CONNECTION=0
                IO_ERROR=0
                WRONG_LENGTH=0
                WRONG_MAP=0
                WRONG_REDUCE=0
        File Input Format Counters 
                Bytes Read=0
        File Output Format Counters 
                Bytes Written=0
        org.apache.kylin.engine.mr.steps.FactDistinctColumnsMapper$RawDataCounter
                BYTES=55368910
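
To put the two runs side by side, here is a minimal sketch in Python that
computes how many times larger each counter is in the 1st attempt than in the
2nd. The values are copied by hand from the two dumps above; the dictionary
layout and the helper name ratios are only illustrative and are not part of
Kylin or Hadoop. Note that Map input records is the same in both runs (368816),
while the map output and shuffle volumes differ widely.

```python
# Counter values copied by hand from the two dumps above:
# (1st attempt, 2nd attempt).
counters = {
    "Map output bytes": (1021571917, 12469333),
    "Reduce shuffle bytes": (698471609, 8850914),
    "HDFS bytes written": (520516128, 1747432),
    "Total map task time (ms)": (1489458, 72937),
    "Total reduce task time (ms)": (7397021, 77021),
}


def ratios(data):
    """Return {counter name: 1st attempt value / 2nd attempt value}."""
    return {name: first / second for name, (first, second) in data.items()}


for name, ratio in ratios(counters).items():
    print(f"{name}: 1st attempt is {ratio:.1f}x the 2nd")
```

This only quantifies the gap between the two attempts; it does not by itself
show which setting, if any, is responsible for it.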

