[ https://issues.apache.org/jira/browse/SPARK-6101?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14734830#comment-14734830 ]
Rustam Aliyev edited comment on SPARK-6101 at 9/8/15 1:58 PM:
--------------------------------------------------------------

What's the status of this? The GitHub repo has not been updated for a while.

A few improvements I'd like to see:
# Use {{FilterExpression}} instead of the legacy {{ScanFilter}} for the {{scan}} operation.
# Leverage Parallel Scan (http://docs.aws.amazon.com/amazondynamodb/latest/developerguide/QueryAndScan.html#QueryAndScanParallelScan) so that multiple Spark workers pull data in parallel.

was (Author: rstml):

What's the status of this? GH repo has not been updated for a while.

Few improvements which I'd like to see:
# Use {FilterExpression} instead of legacy {ScanFilter} in the {scan}
# Leverage Parallel Scan (http://docs.aws.amazon.com/amazondynamodb/latest/developerguide/QueryAndScan.html#QueryAndScanParallelScan) and make multiple Spark workers pull data in parallel.

> Create a SparkSQL DataSource API implementation for DynamoDB
> ------------------------------------------------------------
>
>                 Key: SPARK-6101
>                 URL: https://issues.apache.org/jira/browse/SPARK-6101
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 1.2.0
>            Reporter: Chris Fregly
>            Assignee: Chris Fregly
>             Fix For: 1.6.0
>
>
> Similar to https://github.com/databricks/spark-avro and
> https://github.com/databricks/spark-csv
> Here's a good basis for a Java-based, high-level DynamoDB connector:
> https://github.com/sporcina/dynamodb-connector/
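
To make the two suggestions in the comment concrete, here is a minimal sketch of how one Scan request per segment might be built, with {{FilterExpression}} in place of {{ScanFilter}}. It is not taken from the connector under discussion: the helper name build_segment_scans, the table name and the attribute names are hypothetical, and only the request keys (TableName, Segment, TotalSegments, FilterExpression, ExpressionAttributeNames, ExpressionAttributeValues) come from the DynamoDB Scan API.

```python
from typing import Any, Dict, List


def build_segment_scans(
    table_name: str,
    total_segments: int,
    filter_expression: str,
    expression_names: Dict[str, str],
    expression_values: Dict[str, Any],
) -> List[Dict[str, Any]]:
    """Return one Scan request per segment; each could run on its own worker."""
    if total_segments < 1:
        raise ValueError("total_segments must be at least 1")
    return [
        {
            "TableName": table_name,
            "Segment": segment,  # zero-based index of this worker's segment
            "TotalSegments": total_segments,  # identical in every request
            "FilterExpression": filter_expression,  # used instead of ScanFilter
            "ExpressionAttributeNames": dict(expression_names),
            "ExpressionAttributeValues": dict(expression_values),
        }
        for segment in range(total_segments)
    ]


# Hypothetical table and attribute names, for illustration only.
requests = build_segment_scans(
    table_name="events",
    total_segments=4,
    filter_expression="#yr >= :min_year",
    expression_names={"#yr": "year"},
    expression_values={":min_year": {"N": "2015"}},
)
```

Under these assumptions, a data source could map each request to one Spark partition and pass it as keyword arguments to a DynamoDB client's scan call. Each segment would still have to follow LastEvaluatedKey with ExclusiveStartKey to read all of its pages.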