cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

How to get the modification time of files from a notebook command?

User16869510359
Esteemed Contributor
 
1 ACCEPTED SOLUTION

Accepted Solutions

User16869510359
Esteemed Contributor

The below code snippet is useful to get the modification time of files.

%scala
import scala.util.Try
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}
import org.apache.hadoop.io.IOUtils
import java.io.IOException
val conf = sc.hadoopConfiguration
val pathObj=new Path("s3a://my_bucket/location/")
val obj= pathObj.getFileSystem(conf)
val fs = pathObj.getFileSystem(conf)
display(fs.listStatus(pathObj).map(x=>(x.getModificationTime,x.getPath.toString)).toSeq.toDF().withColumn("time",from_unixtime($"_1" /1000)).withColumnRenamed("_2","path").drop("_1"))

View solution in original post

1 REPLY 1

User16869510359
Esteemed Contributor

The below code snippet is useful to get the modification time of files.

%scala
import scala.util.Try
import org.apache.hadoop.conf.Configuration
import org.apache.hadoop.fs.{FileSystem, Path}
import org.apache.hadoop.io.IOUtils
import java.io.IOException
val conf = sc.hadoopConfiguration
val pathObj=new Path("s3a://my_bucket/location/")
val obj= pathObj.getFileSystem(conf)
val fs = pathObj.getFileSystem(conf)
display(fs.listStatus(pathObj).map(x=>(x.getModificationTime,x.getPath.toString)).toSeq.toDF().withColumn("time",from_unixtime($"_1" /1000)).withColumnRenamed("_2","path").drop("_1"))

Welcome to Databricks Community: Lets learn, network and celebrate together

Join our fast-growing data practitioner and expert community of 80K+ members, ready to discover, help and collaborate together while making meaningful connections. 

Click here to register and join today! 

Engage in exciting technical discussions, join a group with your peers and meet our Featured Members.