Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
I would like to know if there is any difference if I save dataframe during tranformation to itself as first code or to new dataframe as second example.Thankslog_df = log_df.withColumn("process_timestamp",from_utc_timestamp(lit(current_timestamp()),"E...
Hi fellasi am working on databricks using icebergat first i have configured my notebook as belowspark.conf.set("spark.sql.catalog.spark_catalog","org.apache.iceberg.spark.SparkCatalog")spark.conf.set("spark.sql.catalog.spark_catalog.type", "hadoop")s...
Hi,we are using databricks jdbc https://mvnrepository.com/artifact/com.databricks/databricks-jdbc/2.6.33it seems like there is a thread leakage when getConnection failscould anyone advice?can be reproduced with @Test
void databricksThreads() {...
Hi,none of the above suggestion will not work...we already contacted databricks jdbc team, thread leakage was confirmed and was fixed in version 2.6.34https://mvnrepository.com/artifact/com.databricks/databricks-jdbc/2.6.34this leakage still exist if...
Hi,I need to process nearly 30 files from different locations and insert records to RDS. I am using multi-threading to process these files parallelly like below. Test data: I have configuration like below based on column 4: If column 4=0:...
Regarding --files option in spark submit task of Databricks jobs, would like to understand how it works and what is the syntax to pass multiple files to --files? I tried using --files and --py-files and my understanding is, it should make available t...
Hi, could you please check if this helps: https://docs.databricks.com/en/files/index.html
Also please tag @Debayan​ with your next response which will notify me, Thank you!
Hi, You can try checking the below resources:
https://www.databricks.com/resources/ebook/migration-guide-hadoop-to-databricks
https://www.databricks.com/solutions/migration/hadoop
https://www.databricks.com/blog/2021/08/06/5-key-steps-to-successfull...
Hi, You can try checking the below resources:
https://www.databricks.com/resources/ebook/migration-guide-hadoop-to-databricks
https://www.databricks.com/solutions/migration/hadoop
https://www.databricks.com/blog/2021/08/06/5-key-steps-to-successfull...
Hi, You can try checking the below resources on Hadoop migration:
https://www.databricks.com/resources/ebook/migration-guide-hadoop-to-databricks
https://www.databricks.com/solutions/migration/hadoop
https://www.databricks.com/blog/2021/08/06/5-key-...
Hi, You can try checking the below resources:
https://www.databricks.com/resources/ebook/migration-guide-hadoop-to-databricks
https://www.databricks.com/solutions/migration/hadoop
https://www.databricks.com/blog/2021/08/06/5-key-steps-to-successfull...
I am using a notebook to copy over my database on a schedule (I had no success connecting through the Data Explorer UI). When I run the notebook on its own, it works. When I run it as a scheduled job, I get this error. org.apache.spark.SparkSQLExcept...
Hi, the error code is minimal, could you please post the whole error if that is possible?
Also please tag @Debayan​ with your next response which will notify me, Thank you!
Hi Databricks TeamI had scheduled my exam on 6th sep 2023, during exam same pop up came up, stating that I am looking in some other direction. I told them that my laptop mouse is not working properly, so I was looking at it. But still they suspended ...
Hello Team,I encountered Pathetic experience while attempting my 1st DataBricks certification. Abruptly, Proctor asked me to show my desk, after showing he/she asked multiple times.. wasted my time and then suspended my exam.I want to file a complain...
Sub: My exam Datbricks Data Engineer Associate got suspended_need immediate help please (10/09/2023)I encountered Pathetic experience while attempting my DataBricks Data engineer certification. Abruptly, Proctor asked me to show my desk, after showin...
Hi, I need to process nearly 30 files from different locations and insert records to RDS. I am using multi-threading to process these files parallelly like below. Test data: I have configuration like below based on column 4: If colum...
Looks like you are comparing to strings like "1", not values like 1 in your filter condition. It's hard to say, there are some details missing like the rest of the code and the DF schema, and what output you are observing.
Need help recovering account owner.Problem: Account owner cannot sign in with its password after SSO was configured. Account owner is a DL for team ownership so it doesn't have an AWS account and can't configure in AD group since it has "+" in the em...
By default SHS uses spark.history.fs.numReplayThreads = 25% of avaliable cores (Number of threads that will be used by history server to process event logs)How can we increase the number of cores for Spark History Server ?