Re: Spark / Scala code not recognising the path?

Abhijeet Kumar Fri, 08 Jun 2018 23:35:20 -0700

Can you please tell me the estimated time, so that my program can wait for
that long?
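
If a wait does turn out to be necessary, polling with a timeout may be safer
than guessing one fixed sleep duration. Below is a minimal sketch using only
the Scala standard library. The names WaitUntil, waitUntil, timeoutMs and
intervalMs are hypothetical and are not part of Spark or Hadoop.

```scala
import scala.annotation.tailrec

object WaitUntil {
  /** Re-evaluates `condition` every `intervalMs` milliseconds until it
    * returns true or roughly `timeoutMs` milliseconds have elapsed.
    * Returns true if the condition held, false if the wait timed out.
    */
  def waitUntil(timeoutMs: Long, intervalMs: Long)(condition: () => Boolean): Boolean = {
    // Use the monotonic clock so wall-clock adjustments do not affect the wait.
    val deadline = System.nanoTime() + timeoutMs * 1000000L

    @tailrec
    def loop(): Boolean =
      if (condition()) true                           // condition met, stop polling
      else if (System.nanoTime() >= deadline) false   // timed out, give up
      else {
        Thread.sleep(intervalMs)                      // pause before the next check
        loop()
      }

    loop()
  }
}
```

With the fs value from the quoted code, a call could look like
WaitUntil.waitUntil(15000L, 500L)(() => fs.exists(new Path("/data/temp_insight"))).
This only helps if a delay is really the cause. If the check still returns
false once the timeout expires, the reason is probably something else.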


Thanks,
Abhijeet Kumar
> On 09-Jun-2018, at 12:01 PM, Jörn Franke <jornfra...@gmail.com> wrote:
> 
> It takes some time for the information about the file creation to propagate.
> 
> On 9. Jun 2018, at 08:07, Abhijeet Kumar <abhijeet.ku...@sentienz.com> wrote:
> 
>> I'm modifying a CSV file that is stored in HDFS and then writing it back to
>> HDFS with Spark:
>> 
>> val fs = FileSystem.get(spark.sparkContext.hadoopConfiguration)
>> csv_file.coalesce(1).write
>>   .format("csv")
>>   .mode("overwrite")
>>   .save("hdfs://localhost:8020/data/temp_insight")
>> Thread.sleep(15000)
>> println(fs.exists(new Path("/data/temp_insight")))
>> Output:
>> 
>> false
>> While the thread was sleeping for 15 seconds, I checked HDFS with the
>> command:
>> 
>> hdfs dfs -ls /data/temp_insight
>> Output:
>> 
>> 18/06/08 17:48:18 WARN util.NativeCodeLoader: Unable to load native-hadoop library for your platform... using builtin-java classes where applicable
>> -rw-r--r--   3 abhijeet supergroup          0 2018-06-08 17:48 /data/temp_insight/_SUCCESS
>> -rw-r--r--   3 abhijeet supergroup        201 2018-06-08 17:48 /data/temp_insight/part-00000-7bffb826-f18d-4022-b089-da85565525b7-c000.csv
>> 
>> To cross-check whether the code is resolving paths against HDFS, I added
>> one more println statement with a path that already exists in HDFS. It
>> prints true in that case.
>> 
>> So, what could be the reason?
>> 
>> Thanks,
>> 
>> Abhijeet Kumar
