It probably does not like the colons in the path name “…20:04:27+00:00/…”, especially if you’re running on a Windows box.
On 24 Apr 2021, at 1:29 am, Regin Quinoa <sweatr...@gmail.com<mailto:sweatr...@gmail.com>> wrote: Hi, I am using pyspark sql to load files into table following ```LOAD DATA LOCAL INPATH '/user/hive/warehouse/students' OVERWRITE INTO TABLE test_load;``` https://spark.apache.org/docs/latest/sql-ref-syntax-dml-load.html<https://aus01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fspark.apache.org%2Fdocs%2Flatest%2Fsql-ref-syntax-dml-load.html&data=04%7C01%7Cscoy%40infomedia.com.au%7C9d59e3a62b624695d07508d906769325%7C45d5407150f849caa59f9457123dc71c%7C0%7C1%7C637547928618653970%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C1000&sdata=rTIXT15%2BiEoONH21s5%2B%2BU6BJJ4ntpYQFaTJjLAJICj8%3D&reserved=0> It complains pyspark.sql.utils.AnalysisException: load data input path does not exist when the path string has timestamp in the directory structure like XX/XX/2021-03-02T20:04:27+00:00/file.parquet It works with path without timestamp. How to work it around? [http://downloads.ifmsystems.com/data/marketing/images/signatures/driving-force-newsletter.jpg]<https://www.infomedia.com.au/driving-force/?utm_campaign=200630%20Email%20Signature&utm_source=Internal&utm_medium=Email&utm_content=Driving%20Force> This email contains confidential information of and is the copyright of Infomedia. It must not be forwarded, amended or disclosed without consent of the sender. If you received this message by mistake, please advise the sender and delete all copies. Security of transmission on the internet cannot be guaranteed, could be infected, intercepted, or corrupted and you should ensure you have suitable antivirus protection in place. By sending us your or any third party personal details, you consent to (or confirm you have obtained consent from such third parties) to Infomedia’s privacy policy. http://www.infomedia.com.au/privacy-policy/