Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

vr
by Contributor
  • 5831 Views
  • 5 replies
  • 6 kudos

Resolved! How to avoid trimming in EXPLAIN?

I am looking at the EXPLAIN EXTENDED plan for a statement. In the == Physical Plan == section, I go down to the FileScan node and see a lot of ellipses, like +- FileScan parquet schema.table[Time#8459,TagName#8460,Value#8461,Quality#8462,day#8...

Latest Reply
SS2
Valued Contributor
  • 6 kudos

I also faced the same issue.

4 More Replies
Retko
by Contributor
  • 19835 Views
  • 5 replies
  • 8 kudos

Databricks notebook sometimes takes too long to run a query (even on an empty table)

Hi, sometimes I notice that running a query takes too long, even for simple queries, and the next time I run the same query it runs much faster. I have a cluster running (DBR 10.4 LTS • 5 workers) and it constantly has several workers. An example of a query is s...

Latest Reply
j_afanador
Contributor II
  • 8 kudos

Probably the cluster is always in use and the query always falls into the processing queue, or the cluster auto-stops every time you use it.

4 More Replies
augustin
by New Contributor II
  • 5286 Views
  • 5 replies
  • 5 kudos

Mount an unencrypted AWS EFS in AWS Databricks

Hi, I want to mount an unencrypted AWS EFS in AWS Databricks. When I do:
mount -t nfs4 -o nfsvers=4.1,rsize=1048576,wsize=1048576,hard,timeo=600,retrans=2,noresvport fs-abcdef.efs.region.amazonaws.com:/ /mnt/efs-uncrypted
I get this error:
mount.nfs4: moun...

Latest Reply
Andrei_Radulesc
Contributor III
  • 5 kudos

"To support NFS under LXC, some of the apparmor protections need to be lifted." (see https://theorangeone.net/posts/mount-nfs-inside-lxc/)

4 More Replies
sqlshep
by New Contributor III
  • 3957 Views
  • 3 replies
  • 1 kudos
Latest Reply
sqlshep
New Contributor III
  • 1 kudos

It's broken again. I am seeing this several times a week, and it is offline for hours at a time.

2 More Replies
hitesh1
by New Contributor III
  • 8186 Views
  • 1 replies
  • 5 kudos

java.util.NoSuchElementException: key not found

Hello, we are using Azure Databricks with a Standard DS14_V2 cluster with Runtime 9.1 LTS, Spark 3.1.2 and Scala 2.12, and we frequently face the issue below when running our ETL pipeline. As part of the operation that is failing there are several joins...

Latest Reply
Aviral-Bhardwaj
Esteemed Contributor III
  • 5 kudos

Hey man, please use these configurations in your cluster and it will work:
spark.sql.storeAssignmentPolicy LEGACY
spark.sql.parquet.binaryAsString true
spark.speculation false
spark.sql.legacy.timeParserPolicy LEGACY
If it won't work, let me know what problem...

Jack
by New Contributor II
  • 7614 Views
  • 1 replies
  • 1 kudos

Python: Generate new dfs from a list of dataframes using for loop

I have a list of dataframes (for this example 2) and want to apply a for-loop to the list of frames to generate 2 new dataframes. To start, here is my starting dataframe called df_final: First, I create 2 dataframes: df2_b2c_fast, df2_b2b_fast: for x i...

Latest Reply
Aviral-Bhardwaj
Esteemed Contributor III
  • 1 kudos

thanks

isaac_gritz
by Databricks Employee
  • 1645 Views
  • 1 replies
  • 6 kudos

Databricks Security Review

Conducting a security review or vendor assessment of Databricks and looking to learn more about our security features, compliance information, and privacy policies? You can find the latest on Databricks security features, architecture, compliance and ...

Latest Reply
Aviral-Bhardwaj
Esteemed Contributor III
  • 6 kudos

thanks man

SRK
by Contributor III
  • 3160 Views
  • 3 replies
  • 5 kudos

Resolved! I ran into an issue when trying to use Auto Loader to read JSON files from Azure ADLS Gen2. I get this issue for specific files only. I checked that the files are good and not corrupted.

I ran into an issue when trying to use Auto Loader to read JSON files from Azure ADLS Gen2. I get this issue for specific files only. I checked that the files are good and not corrupted. Following is the issue: java.lang.IllegalArgumentException:...

Latest Reply
SRK
Contributor III
  • 5 kudos

I got the issue resolved. The problem was that we had duplicate columns in the schema files by mistake, and that caused the error. However, the error message is totally misleading, which is why I wasn't able to fix it sooner.
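
A quick check along these lines might catch the same mistake earlier. This is only a sketch: `find_duplicate_columns` is a made-up helper name, and it assumes the column names have already been read out of the schema file into a plain list. Names are compared case-insensitively because Spark resolves column names case-insensitively by default.

```python
from collections import Counter
from typing import Iterable, List


def find_duplicate_columns(names: Iterable[str]) -> List[str]:
    """Return the lower-cased column names that occur more than once."""
    counts = Counter(name.lower() for name in names)
    return sorted(name for name, n in counts.items() if n > 1)


# Hypothetical column list, standing in for the names in a schema file.
columns = ["id", "event_time", "payload", "ID"]
print(find_duplicate_columns(columns))  # ['id']
```

An empty list means no duplicates were found, so the schema file is not the cause.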

2 More Replies
KVNARK
by Honored Contributor II
  • 1660 Views
  • 2 replies
  • 12 kudos

Resolved! How to get a list of users who created tables in different workspaces and the operations they have performed.

Hi, I have 10 workspaces linked to different departments. We have 4 users overall doing some activity on these 10 workspaces. I want to get the list of which users are operating on which tables and what operations they have performed, and all in all ...

Latest Reply
Aviral-Bhardwaj
Esteemed Contributor III
  • 12 kudos

Hi Ranjit, for tables, I believe it's hard, but if you want to combine all 10 workspaces you can use the Databricks API for cluster lists https://docs.databricks.com/dev-tools/api/latest/index.html and then you can check their IAM roles to understand w...

1 More Replies
Thanapat_S
by Contributor
  • 4179 Views
  • 3 replies
  • 5 kudos

Resolved! How could I export an Alert object for deployment to another Azure Databricks resource?

Introduction: I would like to use the Alert feature to monitor job status (from a log table) in Databricks SQL. So, I have written a query in a query notebook (or object) to return results from the log table. Also, I have set the alert object for monitoring and tri...

Latest Reply
Harun
Honored Contributor
  • 5 kudos

I am not seeing any direct option to export or version control the alert object other than the migrate option. Check this link, it might help you in another way: https://docs.databricks.com/sql/api/queries-dashboards.html

2 More Replies
Paradox_Parijat
by New Contributor III
  • 2615 Views
  • 5 replies
  • 8 kudos

Hello World! This is my first Databricks community post. Looking forward to contributing from my end. Peace out! @Dinesh Mergu

Hello World! This is my first Databricks community post. Looking forward to contributing from my end. Peace out! @Dinesh Mergu

Latest Reply
Harshjot
Contributor III
  • 8 kudos

Welcome !!

4 More Replies
KVNARK
by Honored Contributor II
  • 1496 Views
  • 2 replies
  • 6 kudos

Resolved! Scope of Data Governance in Databricks

Scope of Data Governance in Databricks: how can we implement it, and is there any data limit for implementing it? I would also like to know more about the cost.

Latest Reply
KVNARK
Honored Contributor II
  • 6 kudos

I see. Thank you @karthik p. Got it.

1 More Replies
Taha_Hussain
by Databricks Employee
  • 3132 Views
  • 1 replies
  • 5 kudos

Ask your technical questions at Databricks Office Hours! November 16 - 8:00 AM - 9:00 AM PT: Register Here. November 30 - 11:00 AM - 12:00 PM PT: Regist...

Ask your technical questions at Databricks Office Hours!
November 16 - 8:00 AM - 9:00 AM PT: Register Here
November 30 - 11:00 AM - 12:00 PM PT: Register Here
Databricks Office Hours connects you directly with experts to answer all your Databricks quest...

Latest Reply
Taha_Hussain
Databricks Employee
  • 5 kudos

Q&A Recap from 11/30 Office Hours
Q: What is the downside of using z-ordering and auto optimize? It seems like there could be a tradeoff with writing small files (whereas it is good at reading a larger file), is that true?
A: By default, Delta Lake on ...

Ancil
by Contributor II
  • 17763 Views
  • 11 replies
  • 1 kudos

Can anyone please suggest how we can efficiently loop through a PySpark DataFrame?

Scenario: I have a DataFrame with more than 1000 rows, each row having a file path and a result data column. I need to loop through each row and write files to the file path, with data from the result column. What is the easiest and most time-effective way ...

Latest Reply
NhatHoang
Valued Contributor II
  • 1 kudos

Hi, I agree with Werners: try to avoid looping over a PySpark DataFrame. If your DataFrame is small, as you said, only about 1000 rows, you may consider using pandas. Thanks.
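
For a DataFrame this small, the suggestion above could look roughly like the sketch below: bring the rows to the driver and write them with an ordinary Python loop. The helper name `write_rows` and the column names `file_path` and `result` are assumptions for illustration, since the original post does not name them. The helper only needs (path, data) pairs, so it works with anything that yields them.

```python
import os
from typing import Iterable, Tuple


def write_rows(rows: Iterable[Tuple[str, str]]) -> int:
    """Write each (file_path, result) pair to disk; return the number of files written."""
    written = 0
    for file_path, result in rows:
        parent = os.path.dirname(file_path)
        if parent:
            os.makedirs(parent, exist_ok=True)  # create missing directories
        with open(file_path, "w", encoding="utf-8") as handle:
            handle.write(result)
        written += 1
    return written


# With a small Spark DataFrame `df` (assumed columns: file_path, result):
#   pdf = df.toPandas()
#   write_rows(zip(pdf["file_path"], pdf["result"]))
```

This only makes sense while the data fits comfortably in driver memory, which should hold for about 1000 rows unless the result column is very large.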

10 More Replies
