[ 
https://issues.apache.org/jira/browse/CASSANDRA-2388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13048671#comment-13048671
 ] 

Mck SembWever edited comment on CASSANDRA-2388 at 6/13/11 5:54 PM:
-------------------------------------------------------------------

{quote}
bq.   public String[] sort_endpoints_by_proximity(String endpoint, String[] 
endpoints, boolean restrictToSameDC)
I don't think it makes sense to send the client endpoint to this call since the 
endpoint might not be a cassandra node. It's a reasonable assumption that the 
endpoint it's talking to is local enough to the client to use that.
{quote}
For the test set i was running against, RF=2, each split's has two endpoints 
always in different datacenters.

If the "local" endpoint is down then getLocations() will then call 
client.sort_endpoints_by_proximity(..) and this will fail (being the same 
endpoint).
It then makes a client connection through the "other" endpoint. \[see 
CFRR.describeDatacenter(..)].
This will presume the wrong datacenter and return itself as a valid endpoint. 
I need some way to know what the original datacenter is, even when it is down.

      was (Author: michaelsembwever):
    {quote}
bq.   public String[] sort_endpoints_by_proximity(String endpoint, String[] 
endpoints, boolean restrictToSameDC)
I don't think it makes sense to send the client endpoint to this call since the 
endpoint might not be a cassandra node. It's a reasonable assumption that the 
endpoint it's talking to is local enough to the client to use that.
{quote}
For the test set i was running against, RF=2, each split's has two endpoints 
always in different datacenters.

If the "local" endpoint is down then getLocations() will then call 
client.sort_endpoints_by_proximity(..)
The "local" (or initialAddress) will fail too naturally. 
It then makes a client connection through the "other" endpoint. \[see 
CFRR.describeDatacenter(..)].
This will presume the wrong datacenter and return itself as a valid endpoint. 
I need some way to know what the original datacenter is, even when it is down.
  
> ColumnFamilyRecordReader fails for a given split because a host is down, even 
> if records could reasonably be read from other replica.
> -------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: CASSANDRA-2388
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-2388
>             Project: Cassandra
>          Issue Type: Bug
>          Components: Hadoop
>            Reporter: Eldon Stegall
>            Assignee: Mck SembWever
>              Labels: hadoop, inputformat
>             Fix For: 0.8.1
>
>         Attachments: 0002_On_TException_try_next_split.patch, 
> CASSANDRA-2388.patch, CASSANDRA-2388.patch
>
>
> ColumnFamilyRecordReader only tries the first location for a given split. We 
> should try multiple locations for a given split.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to