cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

Bollam
by New Contributor II
  • 496 Views
  • 0 replies
  • 0 kudos

Utility catalog and data governance

Attended the Data and AI Summit 2023 and gained insights into the utility catalog and services that it has to offer, definitely  going to try the data governance as it's a game changer. 

  • 496 Views
  • 0 replies
  • 0 kudos
Ankith
by New Contributor
  • 2660 Views
  • 1 replies
  • 1 kudos

Converting column of XML strings to column of Jsons

Hi,I want to convert column of XML strings to column of Json in PySpark., using withcolumn and xmltodict method as UDF, is giving Json with '=' instead of ':' in the dictionary. Please let me know if there is any alternative for this.

  • 2660 Views
  • 1 replies
  • 1 kudos
Latest Reply
DessertKid
New Contributor II
  • 1 kudos

To convert a column of XML strings to a column of JSON in PySpark, you can use the `from_json` function along with the `xmltodict` library. However, instead of using a UDF with `withColumn`, you can use the `select` function to transform the column.

  • 1 kudos
AlanF
by New Contributor II
  • 333 Views
  • 0 replies
  • 0 kudos

Great Summit

Having a great time at the community hub at the Summit. Highly recommend!

  • 333 Views
  • 0 replies
  • 0 kudos
Lakehouse
by New Contributor
  • 1646 Views
  • 2 replies
  • 0 kudos

Is Rust the future if analytics?

Today I walked into a session that talked about a fairly new language - Rust. The name can mislead you, I believe taking a look at the roots of how to best use CPU cycles is a game changer and Rust is traversing new areas that others might have ignor...

  • 1646 Views
  • 2 replies
  • 0 kudos
Latest Reply
Tom-Coffin
New Contributor II
  • 0 kudos

Yes, Rust is definitely part of the future.  It brings performance and simplicity to us.  I think it will add to the community, rather than replacing.  Scala and R will never go away, Python will always be strong, but Rust gives us one other tool in ...

  • 0 kudos
1 More Replies
kingmobil
by New Contributor
  • 315 Views
  • 0 replies
  • 0 kudos

Summit

Really enjoying my time at the summit!

  • 315 Views
  • 0 replies
  • 0 kudos
Orthoscope
by New Contributor
  • 396 Views
  • 0 replies
  • 0 kudos

Epic Keynote

Wow what was cool, so many of my workarounds have materialized into solutions!!

  • 396 Views
  • 0 replies
  • 0 kudos
abetogi
by New Contributor III
  • 412 Views
  • 0 replies
  • 0 kudos

AI

At Chevron we actively use Databricks to provide answers to business users. It was extremely interesting to see the use LakeHouseIQ initiatives as it can expedite how fast our users can receive their answers/reports. Is there any documentation that I...

  • 412 Views
  • 0 replies
  • 0 kudos
IS8
by New Contributor
  • 248 Views
  • 0 replies
  • 0 kudos

Summit

Great conference. Exited about upcoming LakehouseIQ feature. 

  • 248 Views
  • 0 replies
  • 0 kudos
DiegoG
by New Contributor
  • 348 Views
  • 0 replies
  • 0 kudos

Lot of new learns

the new way to optimize my notebook is something that I learned and I will execute soon 

  • 348 Views
  • 0 replies
  • 0 kudos
tariq
by New Contributor III
  • 6754 Views
  • 2 replies
  • 5 kudos

Databricks reading from a zip file

I have mounted an Azure Blob Storage in the Azure Databricks workspace filestore. The mounted container has zipped files with csv files in them. What is the best way to read the zipped files and write into a delta table?@sasikumar sagabala​ 

  • 6754 Views
  • 2 replies
  • 5 kudos
Latest Reply
Rishitha
New Contributor III
  • 5 kudos

Hello @Debayan  I recently came across the similar scenario, is there a way to do this via autoloader. We have zip Folders added daily to our AWS S3 bucket and we want to be able to unzip and load the csv files continuously (Autoloading)

  • 5 kudos
1 More Replies
Zhudocode
by New Contributor II
  • 1260 Views
  • 2 replies
  • 2 kudos

Resolved! What's the purpose of data governance

Forgive me for a nooby question, but what is the point of data governance if everyone is working at the same company? Is it for insider purposes?

  • 1260 Views
  • 2 replies
  • 2 kudos
Latest Reply
Datajoe
Contributor
  • 2 kudos

Hey Zhudocode, Actually answered this is person, but Data Governance fundamentally is about the appropriate, efficient and effective use of data. Appropriate use has to do with ethical ai, use of personal information, and policy around confidential a...

  • 2 kudos
1 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels