Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

SamCallister
by New Contributor II
  • 19386 Views
  • 8 replies
  • 3 kudos

Dynamic Partition Overwrite for Delta Tables

Spark supports dynamic partition overwrite for Parquet tables when you set spark.conf.set("spark.sql.sources.partitionOverwriteMode", "dynamic") before writing to a partitioned table. With Delta tables, it appears you need to manually specif...
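
For readers landing on this thread, here is a minimal sketch of a dynamic partition overwrite against a Delta table. It assumes a Delta Lake release that honors the partitionOverwriteMode writer option (older releases may not). The helper name, the path, and the partition columns are hypothetical, and df is expected to be a Spark DataFrame.

```python
from typing import Any, Sequence


def overwrite_touched_partitions(df: Any, path: str, partition_cols: Sequence[str]) -> None:
    """Overwrite only the partitions present in `df`, leaving the others intact.

    `df` is expected to be a pyspark.sql.DataFrame. The helper name, `path`,
    and `partition_cols` are illustrative, not taken from the thread.
    """
    (
        df.write.format("delta")
        .mode("overwrite")
        # Assumption: the Delta Lake version in use supports this writer option.
        .option("partitionOverwriteMode", "dynamic")
        .partitionBy(*partition_cols)
        .save(path)
    )
```

On a cluster this might be called as overwrite_touched_partitions(updates_df, "/mnt/lake/events", ["event_date"]), where every argument is a placeholder. When the partitions to replace are known up front, the replaceWhere writer option is the explicit alternative.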

Latest Reply
alijen
New Contributor II
  • 3 kudos

@SamCallister wrote: Spark supports dynamic partition overwrite for Parquet tables when you set spark.conf.set("spark.sql.sources.partitionOverwriteMode", "dynamic") before writing to a partitioned table. With Delta tables, it appears you need ...

  • 3 kudos
7 More Replies
AleksandraFrolo
by New Contributor III
  • 1243 Views
  • 0 replies
  • 0 kudos

Web scraping with Databricks

Hello, what is the easiest way to do web scraping in Databricks? Suppose that from http://automated.pythonanywhere.com I need to grab the element "/html/body/div[1]/div/h1[1]" and return its text. How can I do it? Can somebody write ...
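
A minimal sketch of the XPath lookup described in the question, using only the Python standard library. The sample markup is a hypothetical, well-formed stand-in (the real page was not fetched), and text_at_xpath is an illustrative helper name. Real pages are often not well-formed XML, in which case an HTML-aware parser would be needed instead of xml.etree.ElementTree.

```python
import xml.etree.ElementTree as ET
from typing import Optional

# Hypothetical stand-in for the page content; not the real site's markup.
SAMPLE_HTML = """<html>
  <body>
    <div>
      <div>
        <h1>First heading</h1>
        <h1>Second heading</h1>
      </div>
    </div>
    <div><p>Other content</p></div>
  </body>
</html>"""


def text_at_xpath(markup: str, xpath: str) -> Optional[str]:
    """Return the stripped text of the first element matching an absolute XPath."""
    root = ET.fromstring(markup)
    prefix = "/" + root.tag
    if not xpath.startswith(prefix + "/"):
        raise ValueError(f"XPath must start with {prefix}/")
    # ElementTree only accepts paths relative to an element, so rewrite
    # "/html/body/..." as "./body/..." before searching from the root.
    relative = "." + xpath[len(prefix):]
    node = root.find(relative)
    if node is None or node.text is None:
        return None
    return node.text.strip()


print(text_at_xpath(SAMPLE_HTML, "/html/body/div[1]/div/h1[1]"))
```

In a notebook, the markup string would come from an HTTP response body rather than a constant.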

Erik_L
by Contributor II
  • 1890 Views
  • 1 replies
  • 1 kudos

Structured Streaming from TimescaleDB?

I realize that the best practice would be to integrate our service with Kafka as a streaming source for Databricks, but given that the service already stores data into TimescaleDB, how can I stream data from TimescaleDB into DBX? Debezium doesn't wor...

Latest Reply
-werners-
Esteemed Contributor III
  • 1 kudos

https://github.com/noctarius/timescaledb-event-streamer/ might help.

  • 1 kudos
DatabricksPract
by New Contributor II
  • 9202 Views
  • 2 replies
  • 2 kudos

Resolved! Get metadata of tables in hive metastore

Hi team, I have a requirement to get the metadata of the tables available in the Databricks Hive metastore. Is there any way to get the metadata of all the tables instead of looping through them with DESCRIBE table_name? As the Hive metastore does not support inf...

Latest Reply
DatabricksPract
New Contributor II
  • 2 kudos

@Tharun-Kumar - Thanks for your quick reply, it worked.

  • 2 kudos
1 More Replies
brickster_2018
by Databricks Employee
  • 2963 Views
  • 2 replies
  • 0 kudos

Resolved! The driver is temporarily unavailable

My job fails with Driver is temporarily unavailable. Apparently, it's permanently unavailable, because the job is not pausing but failing.

Latest Reply
Chalki
New Contributor III
  • 0 kudos

I am facing the same issue. I am writing in batches using a simple for loop, with no collect statements inside the loop. I am rewriting partitions in dynamic partition overwrite mode in a huge, wide Delta table (several TB). The incr...

  • 0 kudos
1 More Replies
Kratik
by New Contributor III
  • 8079 Views
  • 10 replies
  • 1 kudos

How to run sql query/notebook present on S3 on databricks cluster?

I have code files (.hql) in S3 that were running on Hive, but now I need to run them on a Databricks cluster. I can rename the files to .sql and add the comment "-- Databricks notebook source" at the top of each file so that it is treated as a single-cell notebook....

Latest Reply
Debayan
Databricks Employee
  • 1 kudos

Hi, sorry I missed seeing the post. Did the issue get resolved?

  • 1 kudos
9 More Replies
PriyanshuKumar
by New Contributor
  • 1913 Views
  • 1 replies
  • 1 kudos

try_to_timestamp not working as expected

try_to_timestamp should return NULL for the following expression, but it throws an error: try_to_timestamp('2019-02-28 23:59:59.000', 'yyyy-MM-dd HH:mm:ss'). I understand the above expression is wrong, as the date format does no...

Latest Reply
Kumaran
Databricks Employee
  • 1 kudos

Hi @PriyanshuKumar, thank you for your question in the Databricks community. The try_to_timestamp() function is designed to attempt to convert a string to a timestamp based on a specified pattern and to return null if it fails to do so. However, in t...
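
The null-on-failure contract described in this reply can be illustrated with a plain-Python analogy. This is not Spark's implementation and says nothing about why the original expression raised an error. try_parse_timestamp is a hypothetical helper, and the strptime patterns are Python's, not Spark's datetime patterns.

```python
from datetime import datetime
from typing import Optional


def try_parse_timestamp(value: str, pattern: str) -> Optional[datetime]:
    """Parse `value` with `pattern`; return None instead of raising on a mismatch."""
    try:
        return datetime.strptime(value, pattern)
    except ValueError:
        # A pattern that does not consume the whole string (here, the ".000"
        # fractional seconds) is reported by strptime as a ValueError.
        return None


# The pattern lacks fractional seconds, so parsing fails and None comes back.
print(try_parse_timestamp("2019-02-28 23:59:59.000", "%Y-%m-%d %H:%M:%S"))

# Adding %f for the fractional part lets the same input parse successfully.
print(try_parse_timestamp("2019-02-28 23:59:59.000", "%Y-%m-%d %H:%M:%S.%f"))
```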

  • 1 kudos
Mihai_Cog
by Contributor
  • 7540 Views
  • 5 replies
  • 8 kudos

Resolved! Change Data Feed Databricks

Hello, I am doing some testing with the Change Data Feed feature using Databricks and PySpark (with the Delta format, of course), and I don't understand something:
  • I created a table
  • Saved some data inside
  • Enabled the Change Data Feed feature
  • Applied a merge with a dat...

Latest Reply
Tharun-Kumar
Databricks Employee
  • 8 kudos

@Mihai_Cog You have to split your merge statement into 2 parts (Update and Insert/Delete).
MERGE INTO test t USING src s ON s.Id = t.Id and s.date_field = t.date_field and s.fields <> t.fields WHEN MATCHED THEN UPDATE SET *
MERGE INTO test t USING sr...

  • 8 kudos
4 More Replies
Smitha1
by Valued Contributor II
  • 7092 Views
  • 10 replies
  • 9 kudos

Resolved! Request for reattempt voucher. Databricks Certified Associate Developer for Apache Spark 3.0 exam

Hi, I took the Databricks Certified Associate Developer for Apache Spark 3.0 exam today but narrowly missed the pass mark: I got 68.33% and the pass mark is 70%. I am planning to reattempt the exam. Could you kindly give me another opportunity and provide a reattempt voucher...

Latest Reply
shriya
New Contributor II
  • 9 kudos

Hi, I took the Databricks Certified Associate Developer for Apache Spark 3.0 Python exam yesterday but missed the pass mark by about three percentage points: I got 66.66% and the pass mark is 70%. I am planning to reattempt the exam. Could you kindly give me another opportunity and provide reat...

  • 9 kudos
9 More Replies
-werners-
by Esteemed Contributor III
  • 6550 Views
  • 6 replies
  • 12 kudos

Resolved! SSRS (on-prem) on Databricks SQL

Has anybody succeeded in querying Databricks SQL with an on-prem SSRS (so an on-prem Report Server and Report Builder)? I managed to create a connection that works (according to the connection test, anyway), but the moment I try to create a dataset on t...

Latest Reply
Haider93
New Contributor III
  • 12 kudos

Hi @-werners-, I am able to build a connection between Microsoft Visual Studio and Databricks using the Simba Spark ODBC driver. I can query Delta tables in Databricks from Microsoft Visual Studio (SSRS). However, when I deploy the report t...

  • 12 kudos
5 More Replies
Ronis
by New Contributor
  • 6290 Views
  • 4 replies
  • 1 kudos

SSRS Connect to Databricks

Hi, I need to connect to Databricks and run queries from Microsoft SSRS. Is it possible? How do you make the connection?

Latest Reply
Tharun-Kumar
Databricks Employee
  • 1 kudos

@Haider93 https://www.databricks.com/company/partners/technology is the list of supported Databricks technology partners. SSRS is not yet part of it. We do have official support for Power BI. You can use this document to mimic it for SSRS - http...

  • 1 kudos
3 More Replies
qasimhassan
by Contributor
  • 743 Views
  • 0 replies
  • 0 kudos

Why no Practice Test for Databricks Professional Exam ?

Hi Databricks Team/Community, I'm planning to take the Databricks Data Engineering Professional certification, but I have seen that Databricks doesn't provide any practice test for this professional/advanced certification. I don't know why. If anyone is rea...

andresfp14
by New Contributor II
  • 27970 Views
  • 2 replies
  • 0 kudos

Can you delete a widget, or force a value to it?

Hello everyone! Basically, I have a dropdown widget, and at some point I want to change the options available, but still maintain the same selected option. The problems that I find are: - If I want to delete the widget and create a new one, it seems...

Latest Reply
Gim
Contributor
  • 0 kudos

I know this is 5+ years late, but I would like to add an answer (or workaround) to this in case anyone else is looking for the same solution. What I did was essentially 'reset' the widget whenever I had to assign (or force) a value to it. #Cell #1 ...

  • 0 kudos
1 More Replies
Swostiman
by New Contributor II
  • 5921 Views
  • 5 replies
  • 4 kudos

Consuming data from databricks[Hive metastore] sql endpoint using pyspark

I was trying to read some Delta data from a Databricks [Hive metastore] SQL endpoint using PySpark, but I found that all the values of the table after fetching are the same as the column names. Even when I try to just show the data it giv...

Latest Reply
sucan
New Contributor II
  • 4 kudos

I encountered the same issue, and downgrading to 2.6.22 resolved it for me.

  • 4 kudos
4 More Replies
pg1
by New Contributor II
  • 1369 Views
  • 2 replies
  • 1 kudos

How to expand display of workspace name in top right window?

How can I expand the display of the workspace name in the top-right window? My workspace name is a little long and gets cut off, and I could not find a way to expand the name window.

Latest Reply
Tharun-Kumar
Databricks Employee
  • 1 kudos

@pg1 Is your workspace Unity Catalog enabled? We have identified this behaviour in Unity Catalog enabled workspaces and are already working on showing the entire workspace name on hover.

  • 1 kudos
1 More Replies
