[ 
https://issues.apache.org/jira/browse/SOLR-10360?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15940964#comment-15940964
 ] 

ASF GitHub Bot commented on SOLR-10360:
---------------------------------------

GitHub user hgadre opened a pull request:

    https://github.com/apache/lucene-solr/pull/173

    [SOLR-10360] Remove an extra space from Hadoop distcp cmd used by Sol…

    …r backup/restore

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/hgadre/lucene-solr SOLR-10360_fix

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/lucene-solr/pull/173.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #173
    
----
commit ae9c60094e3007e3bac42a222f47c296abf8adae
Author: Hrishikesh Gadre <[email protected]>
Date:   2017-03-24T18:47:56Z

    [SOLR-10360] Remove an extra space from Hadoop distcp cmd used by Solr 
backup/restore

----


> Solr HDFS snapshot export fails due to FileNotFoundException error
> ------------------------------------------------------------------
>
>                 Key: SOLR-10360
>                 URL: https://issues.apache.org/jira/browse/SOLR-10360
>             Project: Solr
>          Issue Type: Bug
>      Security Level: Public(Default Security Level. Issues are Public) 
>    Affects Versions: 6.3
>         Environment: SOLR deployed along with a HADOOP cluster (HDFS +M/R 1).
>            Reporter: Hrishikesh Gadre
>            Priority: Minor
>
> The Solr snapshot export command (implemented in the snapshotscli.sh) uses 
> Hadoop distcp tool to copy the Solr index files to the desired location. When 
> the cluster is configured with MR1 framework (instead of YARN), this command 
> fails with following error,
> {noformat}
> Err:With failures, global counters are inaccurate; consider running with -i
> Copy failed: java.io.FileNotFoundException: File does not exist: 
> /user/systest/ /backups/mysnap/copylistings/shard1
>       at 
> org.apache.hadoop.hdfs.server.namenode.INodeFile.valueOf(INodeFile.java:66)
>       at 
> org.apache.hadoop.hdfs.server.namenode.INodeFile.valueOf(INodeFile.java:56)
>       at 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getBlockLocationsInt(FSNamesystem.java:2007)
>       at 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getBlockLocations(FSNamesystem.java:1977)
>       at 
> org.apache.hadoop.hdfs.server.namenode.FSNamesystem.getBlockLocations(FSNamesystem.java:1890)
>       at 
> org.apache.hadoop.hdfs.server.namenode.NameNodeRpcServer.getBlockLocations(NameNodeRpcServer.java:572)
> {noformat}
> During investigation I found that an extra space in the distcp command was 
> causing this failure. For example,
> This command fails with the error mentioned above,
> {noformat}
> hadoop distcp -f  ' /backups/mysnap/copylistings/shard1' 
> /backups/mysnap/snapshot.shard1
> {noformat}
> Removing the space from the copylisting directory path fixes this issue.
> {noformat}
> hadoop distcp -f  '/backups/mysnap/copylistings/shard1' 
> /backups/mysnap/snapshot.shard1
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to