dosubot[bot] commented on issue #2849: URL: https://github.com/apache/incubator-hugegraph/issues/2849#issuecomment-3162962598
<!-- Answer --> This is a known and reproducible issue in HugeGraph 1.5.0 distributed deployments with HStore backend: if you take an HStore node offline and the number of available Store nodes drops below the configured minimum (set by `initial-store-count` in PD), the entire cluster becomes unavailable. This is expected behavior based on the Raft consensus and fault tolerance design—enough nodes must be online to maintain quorum and data consistency. If your `initial-store-count` is set to 4, losing any one HStore node will make the cluster unavailable; lowering this value (e.g., to 3) allows one node to go down without full outage, but reduces fault tolerance. See [discussion and config details](https://github.com/apache/incubator-hugegraph/issues/2812). Cluster instability and data out-of-sync are often caused by configuration inconsistencies (like mismatched PD/Store node lists, ports, or IPs), partition allocation not being triggered, or legacy parameters (such as `server.role`) remaining in configs. Make sure all PD and Store nodes have consistent configs, remove deprecated parameters, and check that partition allocation is triggered after all nodes register and leader election completes. Manual partition allocation can be done via the PD REST API `/partition/dispatch` if needed—see [partition allocation troubleshooting](https://github.com/apache/incubator-hugegraph/issues/2821) and [known cluster state issues](https://github.com/apache/incubator-hugegraph/issues/2847). There is no official documentation for distributed HStore cluster fault tolerance in 1.5.0; most troubleshooting relies on community issues. This scenario is under investigation by maintainers and is a known problem in the community. <!-- Dosu Comment Footer --> *To reply, just mention [@dosu](https://go.dosu.dev/dosubot).* --- <sup>How did I do? [Good](https://app.dosu.dev/response-feedback/5cc46cff-4fca-4c45-ad32-ed3cdb177823?feedback_type=great_response) | [Irrelevant](https://app.dosu.dev/response-feedback/5cc46cff-4fca-4c45-ad32-ed3cdb177823?feedback_type=irrelevant_answer) | [Incorrect](https://app.dosu.dev/response-feedback/5cc46cff-4fca-4c45-ad32-ed3cdb177823?feedback_type=incorrect_sources) | [Verbose](https://app.dosu.dev/response-feedback/5cc46cff-4fca-4c45-ad32-ed3cdb177823?feedback_type=too_verbose) | [Hallucination](https://app.dosu.dev/response-feedback/5cc46cff-4fca-4c45-ad32-ed3cdb177823?feedback_type=hallucination) | [Report 🐛](https://app.dosu.dev/response-feedback/5cc46cff-4fca-4c45-ad32-ed3cdb177823?feedback_type=bug_report) | [Other](https://app.dosu.dev/response-feedback/5cc46cff-4fca-4c45-ad32-ed3cdb177823?feedback_type=other)</sup> [](https://app.dosu.dev/e5e246fa-a522-4d8a-ab3c-e2588b5b501b/ask?utm_source=githu b) [](https://go.dosu.dev/discord-bot) [](https://twitter.com/intent/tweet?text=%40dosu_ai%20helped%20me%20solve%20this%20issue!&url=https%3A//github.com/apache/incubator-hugegraph/issues/2849) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
