Data Engineering

Forum Posts

Sorted by:

by Stita • New Contributor II

10-06-2022 4:34:37 AM

3405 Views
1 replies
2 kudos

Resolved! How do we pass the row tags dynamically while reading a XML file into a dataframe?

I have a set of xml files where the row tags change dynamically. How can we achieve this scenario in databricks.df1=spark.read.format('xml').option('rootTag','XRoot').option('rowTag','PL1PLLL').load("dbfs:/FileStore/tables/ins/")We need to pass a val...

Data Engineering

3405 Views
1 replies
2 kudos

10-06-2022 4:34:37 AM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

10-14-2022 4:42:33 AM

2 kudos

If it is dynamically for the whole file, you can just use variabletag = 'PL1PLLL' df1=spark.read.format('xml').option('rootTag','XRoot').option('rowTag' ,tag).load("dbfs:/FileStore/tables/ins/file.xml")

2 kudos

10-14-2022 4:42:33 AM

by PriyaTech • New Contributor

09-26-2022 11:47:08 PM

3936 Views
1 replies
2 kudos

Resolved! Converting Dataframe into Nested xml

e.g.dataframe is having firstname,lastname,middlename,id,salaryI need to convert dataframe in xml file but in nested format.output as nested xml<Name> <firatname> <middlename> <lastname> </Name><id></id><salary></salary>Anyone has ides ho...

Data Engineering

3936 Views
1 replies
2 kudos

09-26-2022 11:47:08 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

09-27-2022 2:42:38 AM

2 kudos

databricks has a xml connector:https://docs.databricks.com/data/data-sources/xml.htmlBasically you just define a df with the correct structure and write it to xml.To create a nested df, here you can find some info.

2 kudos

09-27-2022 2:42:38 AM

by wyzer • Contributor II

04-12-2022 5:12:10 AM

5371 Views
8 replies
4 kudos

Unable to read an XML file of 9 GB

Hello,We have a large XML file (9 GB) that we can't read.We have this error : VM size limitBut how can we change the VM size limit ?We have tested many clusters, but no one can read this file.Thank you for your help.

Data Engineering

5371 Views
8 replies
4 kudos

04-12-2022 5:12:10 AM

View Replies

Latest Reply

jose_gonzalez
Databricks Employee

07-25-2022 2:14:39 PM

4 kudos

Hi @Salah K.,Just a friendly follow-up. Did any of the responses help you to resolve your question? if it did, please mark it as best. Otherwise, please let us know if you still need help.

4 kudos

07-25-2022 2:14:39 PM

7 More Replies

by Ben_Spark • New Contributor III

04-14-2022 3:11:54 AM

7595 Views
4 replies
2 kudos

Resolved! Databricks Spark XML parser : support for namespace declared at the ancestor level.

I'm trying to use Spark-XML API and I'm facing issue with the XSD validation option.Actually when I parser an XML file using the "rowValidationXSDPath" option the parser can't recognize the Prefixes/Namespaces declared at the root level. For this to...

Data Engineering

7595 Views
4 replies
2 kudos

04-14-2022 3:11:54 AM

View Replies

Latest Reply

Ben_Spark
New Contributor III

05-11-2022 6:34:54 AM

2 kudos

Hi sorry for the late response got busy looking for a permanent solution to this problem .At the end we are giving up on the XSDpath parser. This option does not work when Prefixes namespaces are declared at the ancestor level .Thank you anyway for ...

2 kudos

05-11-2022 6:34:54 AM

3 More Replies

Databricks Community

Resolved! How do we pass the row tags dynamically while reading a XML file into a dataframe?

Resolved! Converting Dataframe into Nested xml

Unable to read an XML file of 9 GB

Resolved! Databricks Spark XML parser : support for namespace declared at the ancestor level.