By default, this data is on DBFS, and your code needs to know how to access it. Python's local file APIs don't know about DBFS, which is why the call is failing.
But there is a workaround: DBFS is mounted on the nodes at /dbfs, so you just need to prepend that to your file name. Instead of /user/delta_test/_delta_log/00000000000000000000.json, use /dbfs/user/delta_test/_delta_log/00000000000000000000.json
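As a rough sketch of the path rewrite described above: the helper name to_fuse_path is made up for illustration and is not part of any Databricks API, and the read only runs where the /dbfs mount actually exists. Delta log files hold one JSON object per line, so the file is parsed line by line.

```python
import json
import os


def to_fuse_path(dbfs_path: str) -> str:
    """Map a DBFS path to its location under the /dbfs mount (hypothetical helper)."""
    # Drop an optional "dbfs:" scheme, then prefix the path with the mount point.
    if dbfs_path.startswith("dbfs:"):
        dbfs_path = dbfs_path[len("dbfs:"):]
    return "/dbfs/" + dbfs_path.lstrip("/")


local_path = to_fuse_path("/user/delta_test/_delta_log/00000000000000000000.json")

# Only attempt the read where the mount is available (it is not on every cluster).
if os.path.exists(local_path):
    with open(local_path) as f:
        # One JSON action per line in a Delta log file.
        actions = [json.loads(line) for line in f if line.strip()]
```

Once the path points under /dbfs, plain open() and any other local-file library can read it without knowing about DBFS.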
Update: on Community Edition with DBR 7+, this mount is disabled. The workaround there is to use the dbutils.fs.cp command to copy the file from DBFS to a local directory, such as /tmp or /var/tmp, and then read it from there:
dbutils.fs.cp("/file_on_dbfs", "file:///tmp/local_file")
Please note that if you don't specify a URI scheme, the path refers to DBFS by default; to refer to a local file you need to use the file:// prefix (see docs).
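Putting the copy and the read together might look roughly like this. It is a sketch, not a definitive implementation: the function name read_json_lines_via_local_copy and the default local path are assumptions for illustration, and dbutils is passed in explicitly because it is only predefined inside a Databricks notebook.

```python
import json


def read_json_lines_via_local_copy(dbutils, dbfs_path, local_path="/tmp/local_file"):
    """Copy a file from DBFS to the driver's local disk, then parse it (hypothetical helper)."""
    # Source has no scheme, so it is resolved against DBFS;
    # the "file://" prefix on the target selects the local filesystem.
    dbutils.fs.cp(dbfs_path, "file://" + local_path)
    with open(local_path) as f:
        # Assumes newline-delimited JSON, as in a Delta log file.
        return [json.loads(line) for line in f if line.strip()]


# In a notebook, where dbutils already exists:
# actions = read_json_lines_via_local_copy(
#     dbutils, "/user/delta_test/_delta_log/00000000000000000000.json")
```

The copy lands on the driver node only, so this suits small files read by driver-side Python code rather than data processed on the executors.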