- 4724 Views
- 5 replies
- 2 kudos
The date field is getting changed while reading data from the source .xls file into the dataframe. In the source Excel file all columns are strings, but I am not sure why the date column alone behaves differently. In the source file the date is 1/24/1947. In the PySpark datafram...
Latest Reply
How about using inferSchema one single time to create a correct DF, then creating a schema from the DF's schema? Something like this, for example:
from pyspark.sql.types import StructType
# Save schema from the original DataFrame into json:
schema_json = df.s...
4 More Replies
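A minimal sketch of what that reply describes, assuming the spreadsheet is read with the com.crealytics.spark.excel connector and that spark is the Databricks-provided SparkSession; the file path is a placeholder, not the original poster's location:
from pyspark.sql.types import StructType
import json

# Read once with inferSchema so Spark works out the column types (hypothetical path).
df = (spark.read.format("com.crealytics.spark.excel")
      .option("header", "true")
      .option("inferSchema", "true")
      .load("/mnt/data/source.xlsx"))

# Save the inferred schema as JSON so it can be reused without re-inferring.
schema_json = df.schema.json()

# Rebuild the schema object and apply it explicitly on subsequent reads.
schema = StructType.fromJson(json.loads(schema_json))
df_fixed = (spark.read.format("com.crealytics.spark.excel")
            .option("header", "true")
            .schema(schema)
            .load("/mnt/data/source.xlsx"))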
- 4914 Views
- 4 replies
- 10 kudos
The date field is getting changed while reading data from the source .xls file into the dataframe. In the source Excel file all columns are strings, but I am not sure why the date column alone behaves differently. In the source file the date is 1/24/2022. In the dataframe it is ...
Latest Reply
Hi Team, @Merca Ovnerud, I am also facing the same issue. Below is the code snippet which I am using:
df = spark.read.format("com.crealytics.spark.excel").option("header", "true").load("/mnt/dataplatform/Tenant_PK/Results.xlsx")
I have a couple of date colum...
3 More Replies
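One possible workaround sketch for this situation: supply an explicit all-string schema so the Excel connector cannot coerce the date column, then parse the date yourself afterwards. The column names below are assumptions, not the actual headers in Results.xlsx:
from pyspark.sql.types import StructType, StructField, StringType
from pyspark.sql.functions import to_date

# Hypothetical headers; replace with the real columns of Results.xlsx.
schema = StructType([
    StructField("id", StringType(), True),
    StructField("report_date", StringType(), True),
])

# With an explicit string schema the connector keeps the cell text as-is.
df = (spark.read.format("com.crealytics.spark.excel")
      .option("header", "true")
      .schema(schema)
      .load("/mnt/dataplatform/Tenant_PK/Results.xlsx"))

# Parse the string into a proper date once it is safely in the DataFrame.
df = df.withColumn("report_date", to_date("report_date", "M/d/yyyy"))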
- 1236 Views
- 0 replies
- 2 kudos
I have been trying to extract one date field from Cosmos which looks as below: "lastModifiedDate" : { "$date" : 1668443121840 }. When the above field is extracted using Databricks it gets converted into a date format which looks like this...
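For context, the "$date" value above is an epoch timestamp in milliseconds. A minimal PySpark sketch of turning such a value into a readable timestamp; the column name is an assumption, not the poster's actual field name:
from pyspark.sql.functions import col, from_unixtime

# Assume the raw epoch-millisecond value landed in a column named lastModifiedDate_ms.
# from_unixtime expects seconds, so divide by 1000 before casting to timestamp.
df = df.withColumn(
    "lastModifiedDate_ts",
    from_unixtime(col("lastModifiedDate_ms") / 1000).cast("timestamp")
)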
- 832 Views
- 0 replies
- 0 kudos
Consider a table that is partitioned on a date field, but I'm filtering on a column that is not a partition column. With this filter condition, are all the files parsed to produce the required result set, or does any data skipping happen?
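A minimal sketch of the scenario, assuming a Delta table and placeholder names; the question is whether the filter on the non-partition column below still benefits from data skipping:
# Hypothetical table: partitioned by event_date, filtered on customer_id.
(df.write.format("delta")
   .partitionBy("event_date")
   .save("/mnt/data/events"))

# Partition pruning cannot help with this filter; whether files are skipped
# depends on Delta's per-file column statistics for customer_id.
result = (spark.read.format("delta")
          .load("/mnt/data/events")
          .filter("customer_id = 42"))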