[jira] [Commented] (IGNITE-8610) Searching checkpoint / WAL history for rebalancing is not properly working in case of local/global WAL disabling

2018-06-15 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16513634#comment-16513634
 ] 

ASF GitHub Bot commented on IGNITE-8610:


Github user asfgit closed the pull request at:

https://github.com/apache/ignite/pull/4090


> Searching checkpoint / WAL history for rebalancing is not properly working in 
> case of local/global WAL disabling
> 
>
> Key: IGNITE-8610
> URL: https://issues.apache.org/jira/browse/IGNITE-8610
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: 2.5
>Reporter: Pavel Kovalenko
>Assignee: Pavel Kovalenko
>Priority: Major
> Fix For: 2.6
>
>
> After implementation IGNITE-6411 and IGNITE-8087 we can face with situation 
> when after some checkpoint, WAL was temporarily disabled and enabled again. 
> In this case we can't treat that checkpoint as start point to rebalance, 
> because WAL history after such checkpoint may contain gaps.
> We should rework our checkpoint / wal history searching mechanism and ignore 
> such checkpoints.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8610) Searching checkpoint / WAL history for rebalancing is not properly working in case of local/global WAL disabling

2018-06-06 Thread Alexey Goncharuk (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16503419#comment-16503419
 ] 

Alexey Goncharuk commented on IGNITE-8610:
--

[~Jokser], a few comments:
* In {{GridDhtPreloader}} you've added the following code:
{code}
if (!assignments.isEmpty() && grp.persistenceEnabled()) {
ctx.database().checkpointReadLock();

try {
((GridCacheDatabaseSharedManager) 
ctx.database()).lastCheckpointInapplicableForWalRebalance(grp.groupId());
}
finally {
ctx.database().checkpointReadUnlock();
}
}
{code}
I suggest to introduce such a method to the DatabaseSharedManager and have it 
empty for default implementation, while persistence-enabled implementation will 
acquire checkpoint read lock and du necessary work. This will hide both 
{{instanceof}} and {{if (persistenceEnabled())}}

* You've added a synchronous wait for partition re-creation in 
{{generateAssignments}}, which happens in exchange thread. Let's add our 
generic timed-spin-wait and warn if the wait is too long.

> Searching checkpoint / WAL history for rebalancing is not properly working in 
> case of local/global WAL disabling
> 
>
> Key: IGNITE-8610
> URL: https://issues.apache.org/jira/browse/IGNITE-8610
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: 2.5
>Reporter: Pavel Kovalenko
>Assignee: Pavel Kovalenko
>Priority: Major
> Fix For: 2.6
>
>
> After implementation IGNITE-6411 and IGNITE-8087 we can face with situation 
> when after some checkpoint, WAL was temporarily disabled and enabled again. 
> In this case we can't treat that checkpoint as start point to rebalance, 
> because WAL history after such checkpoint may contain gaps.
> We should rework our checkpoint / wal history searching mechanism and ignore 
> such checkpoints.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8610) Searching checkpoint / WAL history for rebalancing is not properly working in case of local/global WAL disabling

2018-06-04 Thread Ilya Lantukh (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16500245#comment-16500245
 ] 

Ilya Lantukh commented on IGNITE-8610:
--

Looks good.

> Searching checkpoint / WAL history for rebalancing is not properly working in 
> case of local/global WAL disabling
> 
>
> Key: IGNITE-8610
> URL: https://issues.apache.org/jira/browse/IGNITE-8610
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: 2.5
>Reporter: Pavel Kovalenko
>Assignee: Pavel Kovalenko
>Priority: Major
> Fix For: 2.6
>
>
> After implementation IGNITE-6411 and IGNITE-8087 we can face with situation 
> when after some checkpoint, WAL was temporarily disabled and enabled again. 
> In this case we can't treat that checkpoint as start point to rebalance, 
> because WAL history after such checkpoint may contain gaps.
> We should rework our checkpoint / wal history searching mechanism and ignore 
> such checkpoints.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (IGNITE-8610) Searching checkpoint / WAL history for rebalancing is not properly working in case of local/global WAL disabling

2018-05-29 Thread ASF GitHub Bot (JIRA)


[ 
https://issues.apache.org/jira/browse/IGNITE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16493856#comment-16493856
 ] 

ASF GitHub Bot commented on IGNITE-8610:


GitHub user Jokser opened a pull request:

https://github.com/apache/ignite/pull/4090

IGNITE-8610 Checkpoint history search fix + refactoring



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/gridgain/apache-ignite ignite-8610

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/ignite/pull/4090.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #4090


commit df3fb2a24d4d43771094e56fea6680f23e3790e6
Author: Pavel Kovalenko 
Date:   2018-05-18T09:37:13Z

IGNITE-8459 WIP

commit 54e88d22bfd4c0b61593b4a8f0d5c319288593b7
Author: Pavel Kovalenko 
Date:   2018-05-18T16:22:01Z

IGNITE-8459 WIP

commit e8aeeea9d6c30df2ee03bfada3a8a399cfef7b6b
Author: Pavel Kovalenko 
Date:   2018-05-21T12:42:33Z

IGNITE-8459 WIP

commit 7d58eb9e3c3fa2358c7937ce3f73715d850a33f1
Author: Pavel Kovalenko 
Date:   2018-05-24T19:46:24Z

IGNITE-8459 Rework.

commit bd8ef85034bbf7c03d6c8b40b36fa4398a3b23ca
Author: Pavel Kovalenko 
Date:   2018-05-24T19:50:11Z

IGNITE-8459 Remove trash.

commit b5addf2b44f8ebebc59823397e134d6796a7e3b6
Author: Pavel Kovalenko 
Date:   2018-05-25T01:18:45Z

IGNITE-8459 Fixed tests.

commit fe4d681ae4341accf047b8408a97a22d34f5520f
Author: Pavel Kovalenko 
Date:   2018-05-28T12:09:21Z

IGNITE-8459 Tests refactoring.

commit 702b88c7466555091c95890a4aa024d869c399f3
Author: Pavel Kovalenko 
Date:   2018-05-28T12:14:14Z

IGNITE-8459 Increase test running time.

commit 5b2ccaacafb2ef3c3e02730958c7708768b7c265
Author: Pavel Kovalenko 
Date:   2018-05-28T13:22:54Z

Merge branch 'master' into ignite-8459

# Conflicts:
#   
modules/core/src/test/java/org/apache/ignite/internal/processors/cache/persistence/IgnitePdsCacheRebalancingAbstractTest.java

commit cc2fda56ecc48ae878baffe5079d25c917a1280c
Author: Pavel Kovalenko 
Date:   2018-05-28T14:25:30Z

IGNITE-8482 Tests improvements.

commit edd27a9ba264581e1d0541c2aafc30451dc469b3
Author: Pavel Kovalenko 
Date:   2018-05-28T14:28:18Z

IGNITE-8482 Remove unused import.

commit e4e1cb92e016230dc8c8b270c2f8dfe72cc3d098
Author: Pavel Kovalenko 
Date:   2018-05-29T14:36:01Z

IGNITE-8610 WIP

commit bae83681794bd02e708647494eba094a4213a9f9
Author: Pavel Kovalenko 
Date:   2018-05-29T16:55:22Z

IGNITE-8610 WIP




> Searching checkpoint / WAL history for rebalancing is not properly working in 
> case of local/global WAL disabling
> 
>
> Key: IGNITE-8610
> URL: https://issues.apache.org/jira/browse/IGNITE-8610
> Project: Ignite
>  Issue Type: Bug
>  Components: cache
>Affects Versions: 2.5
>Reporter: Pavel Kovalenko
>Assignee: Pavel Kovalenko
>Priority: Major
> Fix For: 2.6
>
>
> After implementation IGNITE-6411 and IGNITE-8087 we can face with situation 
> when after some checkpoint, WAL was temporarily disabled and enabled again. 
> In this case we can't treat that checkpoint as start point to rebalance, 
> because WAL history after such checkpoint may contain gaps.
> We should rework our checkpoint / wal history searching mechanism and ignore 
> such checkpoints.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)