Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

Erik
by Valued Contributor II
  • 2885 Views
  • 1 reply
  • 3 kudos

Resolved! How to combine medallion architecture and delta live-tables nicely?

Like many of you, we have implemented a "medallion architecture" (raw/bronze/silver/gold layers), with each layer stored on a separate storage account. We only create proper Hive tables for the gold-layer tables, so our Power BI users connecting to the da...

Latest Reply
merca
Valued Contributor II
  • 3 kudos

I can answer the first question: You can define data storage by setting the `path` parameter for tables. The "storage path" in pipeline settings will then only hold checkpoints (and some other pipeline stuff) and data will be stored in the correct acc...

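As a rough sketch of the workaround merca describes, a DLT table can pin its data to a specific storage account so that the pipeline's "storage path" holds only checkpoints. The table name and abfss path below are hypothetical, not taken from the thread:

```sql
-- Hypothetical gold-layer table; the LOCATION path is an assumption.
-- Data lands in the named storage account, while the pipeline's own
-- storage path keeps only checkpoints and pipeline state.
CREATE OR REFRESH LIVE TABLE gold_orders
LOCATION 'abfss://gold@mystorageaccount.dfs.core.windows.net/gold_orders'
AS SELECT * FROM LIVE.silver_orders;
```

In the Python API, the equivalent is the `path` argument to the table decorator, as the reply notes.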
Kasi
by New Contributor II
  • 579 Views
  • 0 replies
  • 0 kudos

Unable to execute 6.1 and 6.2 examples

Hi All, I am unable to execute the "Classroom-Setup-06.1" and "Classroom-Setup-06.2" setups in the Data Engineering course. On checking, I found that the "DA = DBAcademyHelper()" statement is not executing in the include section of the code. I am using the community ...

User16790091296
by Contributor II
  • 13257 Views
  • 6 replies
  • 1 kudos

How to delete from a temp view or equivalent in spark sql databricks?

I need to delete from a temp view in Databricks, but it looks like I can only merge, select, and insert. Maybe I missed something, but I did not find any documentation on this.

Latest Reply
crazy_horse
New Contributor II
  • 1 kudos

What about:

%sql
DROP TABLE IF EXISTS xxxxx

5 More Replies
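Since a temp view only supports read-style operations, a common workaround (sketched below with made-up view and table names) is to redefine the view without the rows you want gone, rather than deleting from it:

```sql
-- Hypothetical names. A temp view cannot be deleted from directly,
-- but it can be re-created minus the unwanted rows.
CREATE OR REPLACE TEMP VIEW my_view AS
SELECT *
FROM my_view_backing_table
WHERE status <> 'stale';
```

The `DROP TABLE IF EXISTS` suggestion in the reply removes the whole object; for a temp view, `DROP VIEW IF EXISTS` is the matching statement.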
Bin
by New Contributor
  • 1077 Views
  • 0 replies
  • 0 kudos

How to do an "overwrite" output mode using spark structured streaming without deleting all the data and the checkpoint

I have this Delta lake in ADLS as a sink for Spark Structured Streaming. We usually append new data from our data source to our Delta lake, but there are some cases when we find errors in the data and need to reprocess everything. So what ...

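One common pattern for the full-reprocess case described above (table and source names here are illustrative, not from the post) is a one-off batch overwrite of the Delta table, followed by restarting the stream against a fresh checkpoint location so old offsets are not replayed:

```sql
-- Illustrative names. Batch-replace the table contents in place,
-- keeping the table's history and identity:
INSERT OVERWRITE delta_sink
SELECT * FROM corrected_source;

-- Afterwards, restart the streaming write with a NEW checkpointLocation;
-- reusing the old checkpoint would replay stale progress state.
```

This avoids deleting the table itself; only the stream's checkpoint directory needs to change.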
mp
by New Contributor II
  • 2279 Views
  • 4 replies
  • 6 kudos

Resolved! How can I convert a parquet into delta table?

I am looking to migrate my legacy warehouse data. How can I convert a parquet into delta table?

Latest Reply
Kaniz_Fatma
Community Manager
  • 6 kudos

Hi @Manish P, you have three options for converting a Parquet table to a Delta table. Convert the files to Delta Lake format and then create a Delta table:

CONVERT TO DELTA parquet.`/data-pipeline/`
CREATE TABLE events USING DELTA LOCATION '/data-pipelin...

3 More Replies
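The reply above is cut off; for reference, the documented conversion has roughly this shape (the paths are the placeholders used in the reply, not real locations):

```sql
-- Convert the Parquet files in place to Delta format (placeholder path):
CONVERT TO DELTA parquet.`/data-pipeline/`;

-- Then register a table over the converted location:
CREATE TABLE events USING DELTA LOCATION '/data-pipeline/';
```

`CONVERT TO DELTA` rewrites only the transaction log, not the data files, so it is typically cheap relative to rewriting the table.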
ilarsen
by Contributor
  • 707 Views
  • 0 replies
  • 1 kudos

Trouble referencing a column that has been added by schema evolution (Auto Loader with Delta Live Tables)

Hi, I have a Delta Live Tables pipeline, using Auto Loader, to ingest from JSON files. I need to do some transformations - in this case, converting timestamps. Except one of the timestamp columns does not exist in every file. This is causing the DLT p...

serg-v
by New Contributor III
  • 1753 Views
  • 3 replies
  • 0 kudos

Running large window spark structured streaming aggregations with small slide duration

I want to run aggregations on large windows (90 days) with a small slide duration (5 minutes). The straightforward solution leads to giant state, around hundreds of gigabytes, which doesn't look acceptable. Are there any best practices for doing this? Now I conside...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Sergey Volkov, thanks for your question. Here are some fantastic articles on EWMA and event-time aggregation in Apache Spark's Structured Streaming. Please have a look, and let us know if that helps. https://towardsdatascience.com/time-series-from-s...

2 More Replies
SailajaB
by Valued Contributor III
  • 1590 Views
  • 2 replies
  • 8 kudos

Resolved! How to restrict Azure users from using "Launch Workspace" to log in to the ADB workspace as admin when the user has the Owner or Contributor role

Hi, is there any way to disable the "Launch Workspace" option in the Azure portal for ADB? We grant user access at the resource-group level, so we need to restrict users who are part of the Owner or Contributor role from launching the ADB workspace as admin. Thank you.

Latest Reply
none_ranjeet
New Contributor III
  • 8 kudos

Deny Assignments don't block a subscription Contributor from launching the workspace and becoming admin. Actually, I haven't found any way to block that after many tries of different methods.

1 More Replies
Malcoln_Dandaro
by New Contributor
  • 1524 Views
  • 0 replies
  • 0 kudos

Is there any way to navigate/access cloud files using the direct abfss URI (no mount) with default python functions/libs like open() or os.listdir()?

Hello, today on our workspace we access everything via mount points. We plan to change to "abfss://" for security, governance, and performance reasons. The problem is that sometimes we interact with files using "Python only" code, and apparently ...

danny_edm
by New Contributor
  • 584 Views
  • 0 replies
  • 0 kudos

collect_set weird result when Photon enabled

Cluster: DBR 10.4 LTS with Photon
Sample schema: seq_no (decimal), type (string)
Sample data (seq_no, type): (1, A), (1, A), (2, A), (2, B), (2, B)
Command: F.size(F.collect_set(F.col("type")).over(Window.partitionBy("seq_no")))...

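The expected semantics of the expression in the post can be sanity-checked with a plain-Python stand-in (the sample rows below are taken from the post's data; only the window expression is being imitated):

```python
from collections import defaultdict

# Plain-Python stand-in for F.size(F.collect_set("type")).over(Window.partitionBy("seq_no")):
# each row receives the number of distinct `type` values within its seq_no partition.
rows = [(1, "A"), (1, "A"), (2, "A"), (2, "B"), (2, "B")]

distinct_per_seq = defaultdict(set)
for seq_no, typ in rows:
    distinct_per_seq[seq_no].add(typ)

# Every row of seq_no 1 should see a count of 1; every row of seq_no 2, a count of 2.
result = [(seq_no, typ, len(distinct_per_seq[seq_no])) for seq_no, typ in rows]
print(result)
```

If the Photon-enabled cluster returns anything other than these per-partition distinct counts, that would support the bug report.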
Mamdouh_Dabjan
by New Contributor III
  • 3386 Views
  • 6 replies
  • 2 kudos

Importing a large csv file into databricks free

Basically, I have a large CSV file that does not fit in a single worksheet; I can only use it in Power Query. I am trying to import this file into my Databricks notebook. I imported it and created a table using that file. But when I saw the table, i...

Latest Reply
weldermartins
Honored Contributor
  • 2 kudos

Hello, if you manually open one of the parts of the CSV file, does the view look different?

5 More Replies
yannickmo
by New Contributor III
  • 5497 Views
  • 8 replies
  • 14 kudos

Resolved! Adding JAR from Azure DevOps Artifacts feed to Databricks job

Hello, we have some Scala code which is compiled and published to an Azure DevOps Artifacts feed. The issue is we're now trying to add this JAR to a Databricks job (through Terraform) to automate the creation. To do this I'm trying to authenticate using ...

Latest Reply
alexott
Valued Contributor II
  • 14 kudos

As of right now, Databricks can't use non-public Maven repositories, as the resolving of Maven coordinates happens in the control plane. That's different from the R and Python libraries. As a workaround, you may try to install libraries via an init script or ...

7 More Replies
User16752245312
by New Contributor III
  • 4546 Views
  • 2 replies
  • 2 kudos

How can I automatically capture the heap dump on the driver and executors in the event of an OOM error?

If you have a job that repeatedly runs into an out-of-memory (OOM) error, either on the driver or the executors, automatically capturing the heap dump on the OOM event will help you debug the memory issue and identify the cause of the error. Spark config: spark.execu...

Latest Reply
John_360
New Contributor II
  • 2 kudos

Is it necessary to use exactly that HeapDumpPath? I find I'm unable to get driver heap dumps with a different path but otherwise the same configuration. I'm using spark_version 10.4.x-cpu-ml-scala2.12.

1 More Replies
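The Spark config quoted in the post is truncated; the usual shape of this setting is the standard JVM heap-dump flags passed through the extra-Java-options properties. The dump path below is an assumption (any directory writable by the JVM), and as John_360's reply suggests, the exact path can matter on Databricks:

```
spark.driver.extraJavaOptions -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/dbfs/heapDumps
spark.executor.extraJavaOptions -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/dbfs/heapDumps
```

These go in the cluster's Spark config; the JVM then writes an `.hprof` file to the given path when an OutOfMemoryError is thrown.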

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group