[jira] [Commented] (ASTERIXDB-2443) The current word tokenizer is too restricted.

2018-08-15 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-2443?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16581764#comment-16581764 ] Chen Li commented on ASTERIXDB-2443: This can be viewed as a short-term solution.

[jira] [Commented] (ASTERIXDB-2215) Filter is not properly applied for a secondary inverted index search

2017-12-29 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-2215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16306467#comment-16306467 ] Chen Li commented on ASTERIXDB-2215: A note: this bug was found when using DRUM to

[jira] [Commented] (ASTERIXDB-2156) Encounter error during feed

2017-11-07 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-2156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16243111#comment-16243111 ] Chen Li commented on ASTERIXDB-2156: [~lwhay]: in our current twittermap cloudberr

[jira] [Commented] (ASTERIXDB-2153) Fulltext does not handle the search option properly

2017-10-31 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-2153?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16233599#comment-16233599 ] Chen Li commented on ASTERIXDB-2153: For our record, this bug was found when we is

[jira] [Commented] (ASTERIXDB-2145) Recovery process fails on 100 datasets

2017-10-25 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-2145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16219636#comment-16219636 ] Chen Li commented on ASTERIXDB-2145: Agreed; we should recover the data sets one b

[jira] [Commented] (ASTERIXDB-2083) An inverted index-search generates OOM Exception.

2017-09-07 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-2083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16158140#comment-16158140 ] Chen Li commented on ASTERIXDB-2083: This bug is causing TwitterMap not stable. L

[jira] [Commented] (ASTERIXDB-1812) OutofMemoryError when group by on a non-existing field with 300k records (tweets)

2017-08-22 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16137428#comment-16137428 ] Chen Li commented on ASTERIXDB-1812: Cool! There is another Cloudberry-reported,

[jira] [Commented] (ASTERIXDB-1956) An edit-distance-check query generates "Unable to find free page in buffer cache after 1000 cycles (buffer cache undersized?)" Exception

2017-06-27 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16065248#comment-16065248 ] Chen Li commented on ASTERIXDB-1956: There seems to be a bug. Let's discuss this

[jira] [Commented] (ASTERIXDB-1762) AQL+ needs to be revised.

2017-01-04 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15799705#comment-15799705 ] Chen Li commented on ASTERIXDB-1762: This is a very cute solution :-) > AQL+ need

[jira] [Commented] (ASTERIXDB-1762) AQL+ needs to be revised.

2017-01-02 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15794138#comment-15794138 ] Chen Li commented on ASTERIXDB-1762: [~wangsaeu] I will go to office on Tuesday.

[jira] [Commented] (ASTERIXDB-1751) Some of spilled partition files are not deleted properly after the optimized hybrid hash join finishes.

2016-12-18 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15760144#comment-15760144 ] Chen Li commented on ASTERIXDB-1751: Does it mean the main reason is that a partit

[jira] [Commented] (ASTERIXDB-1255) Potential issues related to object creation in Jaccard Similarity evaluation

2016-12-16 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1255?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15755855#comment-15755855 ] Chen Li commented on ASTERIXDB-1255: Cool! > Potential issues related to object c

[jira] [Commented] (ASTERIXDB-1733) Hash Table used by hash-join doesn't conform to the budget.

2016-12-03 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1733?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15719137#comment-15719137 ] Chen Li commented on ASTERIXDB-1733: How easy is it to implement this logic? If i

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-10-31 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15624437#comment-15624437 ] Chen Li commented on ASTERIXDB-1556: [~wangsaeu] since our offices are so close, l

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-10-31 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15623144#comment-15623144 ] Chen Li commented on ASTERIXDB-1556: The results look very good to me! > Hash Tab

[jira] [Commented] (ASTERIXDB-1699) Inverted Index fail to match the keyword

2016-10-24 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15602556#comment-15602556 ] Chen Li commented on ASTERIXDB-1699: See if [~wangsaeu] can help? > Inverted Inde

[jira] [Commented] (ASTERIXDB-1704) Fuzzy-join query is slow

2016-10-23 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15600806#comment-15600806 ] Chen Li commented on ASTERIXDB-1704: [~wangsaeu] If the "hash-group by patch" is t

[jira] [Commented] (ASTERIXDB-1704) Fuzzy-join query is slow

2016-10-22 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15598715#comment-15598715 ] Chen Li commented on ASTERIXDB-1704: Make sure to use the same data set, same func

[jira] [Commented] (ASTERIXDB-1704) Fuzzy-join query is slow

2016-10-21 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1704?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15596673#comment-15596673 ] Chen Li commented on ASTERIXDB-1704: Let's use this issue to keep track of the per

[jira] [Commented] (ASTERIXDB-1700) edit-distance-check on the fields with the 2-gram and the 3-gram index generates a null pointer exception.

2016-10-21 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15596670#comment-15596670 ] Chen Li commented on ASTERIXDB-1700: Glad to know issue 1) is fixed. > edit-dista

[jira] [Commented] (ASTERIXDB-1700) edit-distance-check on the fields with the 2-gram and the 3-gram index generates a null pointer exception.

2016-10-19 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1700?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15590893#comment-15590893 ] Chen Li commented on ASTERIXDB-1700: Why does the plan use both indexes? One of t

[jira] [Commented] (ASTERIXDB-1487) Fuzzy select-join on inverted index poses inconsistent results.

2016-08-31 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15453128#comment-15453128 ] Chen Li commented on ASTERIXDB-1487: For our info: this issue is being investigate

[jira] [Commented] (ASTERIXDB-1003) an inverted index on a dataset with variable length primary key is not supported

2016-08-20 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429531#comment-15429531 ] Chen Li commented on ASTERIXDB-1003: I still don't think this issue is urgent. [~

[jira] [Commented] (ASTERIXDB-263) Better management of AQL.jj and AQLPLus.jj

2016-08-20 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-263?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429530#comment-15429530 ] Chen Li commented on ASTERIXDB-263: --- This is an old, big topic, and it doesn't seem t

[jira] [Commented] (ASTERIXDB-864) Supporting similarity queries on "fuzzy keywords": making fuzzy matching "fuzzier"

2016-08-20 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429528#comment-15429528 ] Chen Li commented on ASTERIXDB-864: --- This issue has a very low priority compared to o

[jira] [Commented] (ASTERIXDB-1077) Inverted index tests are slow compared to others

2016-08-20 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1077?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429529#comment-15429529 ] Chen Li commented on ASTERIXDB-1077: [~wangsaeu]: I remember you already removed s

[jira] [Commented] (ASTERIXDB-1049) Similarity query needs the total order of two involved sets.

2016-08-20 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15429527#comment-15429527 ] Chen Li commented on ASTERIXDB-1049: [~lwhay] and [~wangsaeu]: can you check the s

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-08-15 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15422056#comment-15422056 ] Chen Li commented on ASTERIXDB-1556: Cool. I mentioned this comment earlier: the

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-08-14 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15420500#comment-15420500 ] Chen Li commented on ASTERIXDB-1556: Regarding the "separate hash table" approach,

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-08-10 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15416284#comment-15416284 ] Chen Li commented on ASTERIXDB-1556: I had a discussion with [~dtabass] and [~wang

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-08-10 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15415446#comment-15415446 ] Chen Li commented on ASTERIXDB-1556: Interesting results. I will talk to Taewoo a

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-08-08 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15412262#comment-15412262 ] Chen Li commented on ASTERIXDB-1556: [~wangsaeu]: let's first make sure [~dtabass]

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-08-07 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15411167#comment-15411167 ] Chen Li commented on ASTERIXDB-1556: [~wangsaeu] Thanks. Can you be more specific

[jira] [Commented] (ASTERIXDB-1556) Hash Table used by External hash group-by doesn't conform to the budget.

2016-08-06 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15410680#comment-15410680 ] Chen Li commented on ASTERIXDB-1556: [~wangsaeu]: it will help the discussion if y

[jira] [Commented] (ASTERIXDB-1566) UTF8 got wrong when processing chinese char and beyond

2016-08-05 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15409694#comment-15409694 ] Chen Li commented on ASTERIXDB-1566: Seems our way of comparing characters is not

[jira] [Updated] (ASTERIXDB-1566) UTF8 got wrong when processing chinese char and beyond

2016-08-05 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chen Li updated ASTERIXDB-1566: --- Assignee: Jianfeng Jia > UTF8 got wrong when processing chinese char and beyond >

[jira] [Commented] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-08-04 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15408915#comment-15408915 ] Chen Li commented on ASTERIXDB-1556: I also think Option 1 is the best. > Prefix-

[jira] [Commented] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-08-03 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15406231#comment-15406231 ] Chen Li commented on ASTERIXDB-1556: Understood. For (2), how does this spilling

[jira] [Commented] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-08-03 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15406181#comment-15406181 ] Chen Li commented on ASTERIXDB-1556: [~wangsaeu]: The discussion results make sens

[jira] [Commented] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-08-02 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404394#comment-15404394 ] Chen Li commented on ASTERIXDB-1556: Thanks. The good news is that the observed m

[jira] [Comment Edited] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-08-02 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404360#comment-15404360 ] Chen Li edited comment on ASTERIXDB-1556 at 8/2/16 5:03 PM:

[jira] [Commented] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-08-02 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15404360#comment-15404360 ] Chen Li commented on ASTERIXDB-1556: It seems to be a bug? A correct implementati

[jira] [Commented] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-08-01 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15403443#comment-15403443 ] Chen Li commented on ASTERIXDB-1556: Per our discussion, it will be good to take

[jira] [Commented] (ASTERIXDB-1556) Prefix-based multi-way Fuzzy-join generates an exception.

2016-07-29 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15399908#comment-15399908 ] Chen Li commented on ASTERIXDB-1556: Per our discussion today, the next step is th

[jira] [Commented] (ASTERIXDB-1544) Omit the fuzzyjoin on inverted index

2016-07-25 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1544?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15392930#comment-15392930 ] Chen Li commented on ASTERIXDB-1544: We should fix it in the current review of Wen

[jira] [Commented] (ASTERIXDB-1538) LSMRTree index instance is created during the recovery process instead of LSMRTreeWithAntiMatterTuples index instance

2016-07-24 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15391351#comment-15391351 ] Chen Li commented on ASTERIXDB-1538: Never mind. Saw the changes in https://asteri

[jira] [Commented] (ASTERIXDB-1538) LSMRTree index instance is created during the recovery process instead of LSMRTreeWithAntiMatterTuples index instance

2016-07-24 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1538?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15391347#comment-15391347 ] Chen Li commented on ASTERIXDB-1538: Is it a bug easily fixable by Young-Seok? >

[jira] [Commented] (ASTERIXDB-1487) Fuzzy select-join on inverted index poses inconsistent results.

2016-07-24 Thread Chen Li (JIRA)
[ https://issues.apache.org/jira/browse/ASTERIXDB-1487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15391340#comment-15391340 ] Chen Li commented on ASTERIXDB-1487: Just curious: any progress on this issue, Wen