Home | Databricks

Community Activity

by Dharinip • New Contributor

4 hours ago

17 Views
1 reply
0 kudos

How to decide on creating views vs Tables in Gold layer?

We have the following use case: We receive raw data from an application, and it is ingested into the Iron layer. The raw data is in JSON format. The Bronze layer will be the first level of transformation. The flattening of the JSON file happens ...

Data Engineering

Latest Reply

madams
New Contributor III

an hour ago

0 kudos

Hi Dharinip. I've had similar conversations internally with developers asking very similar things. This is my general advice for this situation, but I think there are a lot of considerations in how you create your gold layer. How large is this datas...

by trentlglover • New Contributor

7 hours ago

16 Views
1 reply
0 kudos

Notebooks running long in workflow

I have deployed a new Databricks environment for development. I've copied a workflow from production to this environment with exactly the same compute configuration. Four notebooks that complete within minutes do not complete after 2 hours in develop...

Data Engineering

Latest Reply

Alberto_Umana
Databricks Employee

3 hours ago

0 kudos

Hi @trentlglover, It sounds like you're experiencing a significant performance issue with your notebooks in the new development environment. Here are a few potential areas to investigate: Cluster Configuration: Even though you mentioned that the comp...

by ML • Databricks Employee

4 hours ago

16 Views
0 replies
0 kudos

Next user group meetup! December 10th.

Hello Phoenix Friends! Our final user group meeting for 2024 (where did the time go!) will be Tuesday December 10th. Register here. In addition to food, beverages, networking, and connecting, we have a couple of great speakers. We are thrilled to ha...

Greater Phoenix

by Dp15 • Contributor

4 hours ago

14 Views
0 replies
0 kudos

Executing Python code inside a SQL Function

Hi, I am trying to create a SQL UDF that runs some Python code involving PySpark, but I am not able to create a Spark session inside the Python section of the function. Here is how my code looks: CREATE OR REPLACE FUNCTION test.getValuesFro...

Data Engineering

by isai-ds • New Contributor

5 hours ago

7 Views
0 replies
0 kudos

Salesforce LakeFlow Connect - Deletion of Salesforce records

Hello, I am new to Databricks and to data engineering. I am running a POC to sync data between a Salesforce sandbox and Databricks using LakeFlow Connect. I have already made the connection and I successfully synced data between Salesforce and Databr...

Data Engineering

by Brad • Contributor II

3 weeks ago

227 Views
7 replies
0 kudos

Why do latestOffset and getBatch take so long?

Hi team, Kinesis -> delta table raw -> job with trigger=availableNow -> delta table target. The Kinesis -> delta table raw stream runs continuously. The job runs daily with trigger=availableNow. The job reads from raw, does some transformation, and runs a MER...

Data Engineering

Latest Reply

Brad
Contributor II

5 hours ago

0 kudos

Thanks. I traced it with logs but cannot figure out which part makes applying the 18000 versions slow. It is the same with CDF if I feed a big range to the table_changes function. Any ideas on this?

6 More Replies

by 15460 • New Contributor

5 hours ago

11 Views
0 replies
0 kudos

Idempotency token

Hi Team, I have used an idempotency token in my DAG code to avoid duplicate runs. Note: the idempotency token is given as a static value. Issue: if the DAG fails once ... because of this idempotency token, Airflow is not allowing to connect to DBX ... can you please help me...

Data Engineering
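
One possible direction for the static-token problem described in this post, offered as a sketch rather than a confirmed fix: derive the token from values that identify a single attempt, so a retry gets a fresh token while an accidental resubmission of the same attempt does not. The function name, the separator, and the choice of inputs (DAG id, run id, try number) are all hypothetical; the 64-character cap is an assumption about the Jobs API limit and should be checked against the API reference.

```python
import hashlib

# Assumption: the token must stay within 64 characters; verify against the Jobs API docs.
MAX_TOKEN_LENGTH = 64


def make_idempotency_token(dag_id: str, run_id: str, try_number: int) -> str:
    """Build a token that is stable for one attempt but changes on each retry.

    All three inputs are hypothetical names for values an orchestrator
    would normally expose at run time.
    """
    raw = f"{dag_id}|{run_id}|{try_number}"
    # A SHA-256 hex digest is exactly 64 characters, so the slice is only a safeguard.
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()[:MAX_TOKEN_LENGTH]


first_attempt = make_idempotency_token("example_dag", "scheduled__2024-12-01", 1)
same_attempt = make_idempotency_token("example_dag", "scheduled__2024-12-01", 1)
retry_attempt = make_idempotency_token("example_dag", "scheduled__2024-12-01", 2)
```

Whether a retry should reuse the token of the failed attempt depends on what counts as a duplicate run in the pipeline; dropping `try_number` from the inputs keeps the token fixed per DAG run instead.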

by RajeshRK • Contributor

03-17-2023 4:06:45 AM

8855 Views
7 replies
3 kudos

Resolved! Download event, driver, and executor logs

Hi Team, I can see logs in the Databricks console by navigating to workflow -> job name -> logs. These logs are very generic, like stdout, stderr, and log4j-active.log. How can I download event, driver, and executor logs at once for a job? Regards, Rajesh.

Data Engineering

Latest Reply

RajeshRK
Contributor

03-21-2023 2:13:22 AM

3 kudos

@Kaniz Fatma @John Lourdu @Vidula Khanna Hi Team, I managed to download logs using the Databricks command line as below: Installed the Databricks command line on my desktop (pip install databricks-cli). Configured the Databricks cluster URL and perso...

6 More Replies

by Gaurav_Lokhande • New Contributor II

2 weeks ago

197 Views
5 replies
0 kudos

We are trying to connect to AWS RDS MySQL instance from DBX with PySpark using JDBC

We are trying to connect to AWS RDS MySQL instance from DBX with PySpark using JDBC: jdbc_df = (spark.read.format("jdbc").options(url=f"jdbc:mysql://{creds['host']}:{creds['port']}/{creds['database']}", driver="com.mysql.cj.jdbc.Driver", dbtable="(SE...

Data Engineering
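
The connection pattern quoted in this post can be factored into a small helper that builds the reader options, which makes the URL easy to inspect before Spark is involved. This is a minimal sketch: the URL template and driver class come from the post, while the function name, the `user` and `password` keys in `creds`, the `src` subquery alias, and the sample values are hypothetical.

```python
def build_mysql_jdbc_options(creds: dict, query: str) -> dict:
    """Assemble JDBC reader options for a MySQL source.

    `creds` is assumed to hold host, port, database, user and password;
    the last two key names are illustrative, not taken from the post.
    """
    url = f"jdbc:mysql://{creds['host']}:{creds['port']}/{creds['database']}"
    return {
        "url": url,
        "driver": "com.mysql.cj.jdbc.Driver",
        # A subquery passed as dbtable needs an alias; "src" is a placeholder.
        "dbtable": f"({query}) AS src",
        "user": creds["user"],
        "password": creds["password"],
    }


# Hypothetical values, for illustration only.
example_creds = {
    "host": "db.example.internal",
    "port": 3306,
    "database": "sales",
    "user": "reader",
    "password": "not-a-real-password",
}
options = build_mysql_jdbc_options(example_creds, "SELECT 1")

# With a Spark session available, the options could then be passed as:
#   jdbc_df = spark.read.format("jdbc").options(**options).load()
```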

Latest Reply

VZLA
Databricks Employee

Wednesday

0 kudos

Yes, that seems correct for the inbound traffic at least: Control plane services, including webapp: nvirginia.cloud.databricks.com, 3.237.73.224/28; SCC relay: tunnel.us-east-1.cloud.databricks.com; SCC relay for PrivateLink: tunnel.privatelink.us-east-1...

4 More Replies

by guapsilva • Visitor

7 hours ago

22 Views
0 replies
0 kudos

Not received my certificate after passing Data Engineer Associate exam

Hello, I took my exam and passed it successfully, but I still haven't received my certificate. Can you help? I need to deliver the certificate to my manager. My ID is Gustavo.aps06@gmail.com. Thanks.

DELETE

by pinaki1 • New Contributor III

2 weeks ago

182 Views
3 replies
2 kudos

Serverless compute databricks

1. How do we connect an S3 bucket to Databricks, since DBFS mounts are not supported?
2. In serverless compute, Spark Context (sc), spark.sparkContext, and sqlContext are not supported. Does that mean it will not leverage the power of distributed processing?
3. Wha...

Data Engineering

Latest Reply

User16653924625
Databricks Employee

7 hours ago

2 kudos

Please see this documentation for accessing cloud storage by setting up Unity Catalog objects (Storage Credential and External Location): https://docs.databricks.com/en/connect/unity-catalog/cloud-storage/index.html

2 More Replies

by hkmodi • New Contributor

yesterday

56 Views
3 replies
0 kudos

Perform row_number() filter in autoloader

I have created an Auto Loader job that reads data from S3 (files with no extension) containing JSON, using (cloudFiles.format, text). This job is supposed to run every 4 hours and read all the new data that has arrived. But before writing into a delta table...

Data Engineering
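
For readers unfamiliar with the row_number() filter named in the title, the idea is to keep only the first-ranked row within each key after ordering, which for a descending order means the latest row per key. The sketch below illustrates that semantics in plain Python on in-memory rows; it is not the poster's code and not an Auto Loader implementation, and the function name, column names, and sample rows are hypothetical.

```python
from typing import Any, Dict, Iterable, List

Row = Dict[str, Any]


def keep_latest_per_key(rows: Iterable[Row], key: str, order_by: str) -> List[Row]:
    """Keep one row per `key`: the one with the largest `order_by` value.

    This mirrors filtering on row_number() = 1 over a partition by `key`
    ordered by `order_by` descending. Ties keep the first row seen.
    """
    latest: Dict[Any, Row] = {}
    for row in rows:
        current = latest.get(row[key])
        if current is None or row[order_by] > current[order_by]:
            latest[row[key]] = row
    return list(latest.values())


# Hypothetical sample batch: two versions of id 1, one of id 2.
batch = [
    {"id": 1, "updated_at": 1, "value": "old"},
    {"id": 1, "updated_at": 3, "value": "new"},
    {"id": 2, "updated_at": 2, "value": "only"},
]
deduplicated = keep_latest_per_key(batch, key="id", order_by="updated_at")
```

On a stream, the same per-batch logic would need to be applied to each micro-batch rather than to the stream as a whole, which is one reason replies in threads like this often suggest doing deduplication in a later layer.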

Latest Reply

szymon_dybczak
Contributor III

12 hours ago

0 kudos

Hi @hkmodi, basically, as @daniel_sahal said, the bronze layer should reflect the source system. The silver layer is dedicated to deduplication/cleaning/enrichment of the dataset. If you still need to deduplicate at the bronze layer, you have 2 options: - use me...

2 More Replies

by 17780 • New Contributor II

04-05-2023 9:57:36 PM

6775 Views
4 replies
0 kudos

How to delete Databricks Account

I created and used a Databricks Account for testing purposes. I want to delete that account. In the Databricks Account Web UI, there is no menu to delete an account. How should I delete it?

Data Engineering

Latest Reply

Sridhar15082003
New Contributor

9 hours ago

0 kudos

I created and used a Databricks Account for testing purposes. I want to delete that account. In the Databricks Account Web UI, there is no menu to delete an account. How should I delete it?

3 More Replies

by Kyle2 • New Contributor II

09-25-2024 8:36:23 AM

537 Views
2 replies
0 kudos

Databricks JDBC driver fails with socket read timeout

We work with an application that connects to our Databricks serverless SQL warehouse via the Databricks JDBC driver. It runs a few thousand SQL select statements every day, and a small percentage of them fail with the following error details: java.sql...

Warehousing & Analytics