Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

Phani1
by Valued Contributor II
  • 1284 Views
  • 1 replies
  • 1 kudos

Resolved! DLT best practices

Hi Team, could you please recommend best practices for implementing Delta Live Tables? Regards, Phanindra

Latest Reply
Ryan_Chynoweth
Esteemed Contributor
  • 1 kudos

Hi Phani, what exactly are you looking for with best practices? At a high level:
  • Always provide an external storage location (S3, ADLS, GCS) for your pipeline
  • Use Auto Scaling!
  • Python imports can be leveraged to reuse code
With regards to providing a st...

NOOR_BASHASHAIK
by Contributor
  • 6142 Views
  • 1 replies
  • 2 kudos

Resolved! Azure Databricks PATs expire even before validity

Hi all, we have this issue in our environment: even though we set a 365-day validity when generating Databricks PATs, the PATs expire every now and then. Is there any problem with the command we use: curl --location --request POST 'https://<<HOST_NA...

Latest Reply
karthik_p
Esteemed Contributor
  • 2 kudos

@NOOR BASHA SHAIK It looks like you are providing 365 days; can you please post your response? If you don't provide any lifetime, the token should be valid indefinitely. Can you please set a 90-day validity and test?
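
To make the lifetime arithmetic concrete, here is a minimal Python sketch that builds the request body for a token with a given validity. It assumes the Databricks Token API (`POST /api/2.0/token/create`) with its `lifetime_seconds` and `comment` fields; the helper name `build_token_request` and the comment text are hypothetical, the truncated curl command above is not reproduced, and no request is actually sent.

```python
import json

SECONDS_PER_DAY = 24 * 60 * 60


def build_token_request(days=None, comment=""):
    """Build a JSON-serializable body for a token-create request.

    Assumption: the endpoint accepts `lifetime_seconds` and `comment`.
    Leaving out `lifetime_seconds` models "no lifetime provided".
    """
    body = {"comment": comment}
    if days is not None:
        # The lifetime is expressed in seconds, not days.
        body["lifetime_seconds"] = days * SECONDS_PER_DAY
    return body


# Hypothetical comment text; only the lifetime values matter here.
one_year = build_token_request(days=365, comment="ci-token (hypothetical)")
ninety_days = build_token_request(days=90, comment="ci-token (hypothetical)")
no_lifetime = build_token_request(comment="ci-token (hypothetical)")

print(json.dumps(one_year))
```

One thing that may be worth checking in the posted response is the unit of the value that was sent: a `lifetime_seconds` of 365 would mean about six minutes, not a year.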

Chinu
by New Contributor III
  • 832 Views
  • 1 replies
  • 1 kudos

API to get Databricks Status AWS.

Hi, do you have an API endpoint to call to get the Databricks status for AWS? Thanks,

Latest Reply
karthik_p
Esteemed Contributor
  • 1 kudos

@Chinu Lee There is a webhook/Slack integration that can be used to fetch status: https://docs.databricks.com/resources/status.html#webhook Are you specifically looking for the status of your account's workspace, or the one above?

marcin-sg
by New Contributor III
  • 1161 Views
  • 1 replies
  • 2 kudos

Create (account wide) groups without account admin permissions

The use case is quite simple: each environment, i.e. each Databricks workspace (prod, test, dev), will be created with Terraform by a separate service principal (which, for isolation purposes, should not have account-wide admin permission), but will belong to the...

Latest Reply
marcin-sg
New Contributor III
  • 2 kudos

Another request would be to assign a workspace to a metastore without account admin permission, for a similar reason.

Anonymous
by Not applicable
  • 4503 Views
  • 6 replies
  • 2 kudos

Resolved! Delta Sharing - Unity Catalog difference

Delta Sharing and Unity Catalog both have elements of data sharing. Can you please explain when one would use Delta Sharing vs Unity Catalog?

Latest Reply
DBXC
Contributor
  • 2 kudos

Based on the Databricks reply from the post below: "Unity Catalog does not currently support separating data by workspace or Azure subscription. As you noted, data from all catalogs within a region can be accessed by any workspace within that region,...

5 More Replies
deficiant_codge
by Contributor II
  • 5335 Views
  • 10 replies
  • 11 kudos

Resolved! Unable to run the following queries

I am unable to run the following queries:
ALTER TABLE iot_events ADD ATTRIBUTE pii ON email
ALTER TABLE users ADD ATTRIBUTE pii ON phone
GRANT SELECT ON DATABASE iot_data HAVING ATTRIBUTE NOT IN (pii) TO product_managers
and
GRANT SELECT ON iot_events TO ...

Latest Reply
karthik_p
Esteemed Contributor
  • 11 kudos

@Kaniz Fatma Can anyone from Databricks help explain why the attribute-based access control functionality is not working in Unity Catalog? @Rahul Mishra The commands are below:
ALTER TABLE iot_events ADD ATTRIBUTE pii ON email
ALTER TABLE users ADD ATTRIBUTE pii ON phoneG...

9 More Replies
goldentown
by New Contributor III
  • 2859 Views
  • 1 replies
  • 2 kudos

Resolved! The Jupyter notebook doesn't update imports after updating the .py file

Please help. Here's an example: I have one .py file and one .ipynb, and the .py file contains the test function, but after adding the new function test1, it doesn't appear in the .ipynb, even after re-running the .py file and reimporting it in the .ipynb. How...

Latest Reply
goldentown
New Contributor III
  • 2 kudos

%load_ext autoreload
%autoreload 2
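
The autoreload magics only exist inside IPython/Jupyter. As a rough plain-Python illustration of the same idea for a single module, the sketch below uses the standard library's `importlib.reload`. The module name `helpers_demo` and the temporary directory are hypothetical stand-ins for the .py file in the question; this shows the underlying mechanism, not what the autoreload extension does internally.

```python
import importlib
import sys
import tempfile
from pathlib import Path

# Skip .pyc caching so this demo always recompiles from the source file.
sys.dont_write_bytecode = True

# Create a throwaway module that initially defines only `test`.
module_dir = Path(tempfile.mkdtemp())
module_path = module_dir / "helpers_demo.py"  # hypothetical module name
module_path.write_text("def test():\n    return 'test'\n")

sys.path.insert(0, str(module_dir))
importlib.invalidate_caches()
import helpers_demo

has_test1_before = hasattr(helpers_demo, "test1")

# Simulate editing the .py file: add a second function, `test1`.
module_path.write_text(
    "def test():\n    return 'test'\n\n"
    "def test1():\n    return 'test1'\n"
)

# A plain `import helpers_demo` would reuse the cached module object;
# reload() re-executes the source in the existing module namespace.
importlib.reload(helpers_demo)
has_test1_after = hasattr(helpers_demo, "test1")

print(has_test1_before, has_test1_after)
```

Names bound earlier with `from helpers_demo import test` keep pointing at the old objects after a reload, which is one reason the autoreload extension is usually more convenient in notebooks.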

ivanychev
by Contributor II
  • 1669 Views
  • 2 replies
  • 2 kudos

Does anyone run Databricks jobs using Docker + ARM (Graviton) instances?

Graviton instances do not support Container Services on paper (https://docs.databricks.com/clusters/graviton.html#unsupported-features), but if you build an ARM Docker image and run it on Graviton, it will work. Does anyone use this combination in...

Latest Reply
Debayan
Esteemed Contributor III
  • 2 kudos

Graviton is not supported by Databricks Container Services. How are you planning to run it on Databricks? Please tag @Debayan​ with your next comment so that I will get notified. Thank you!

1 More Replies
shawnbarrick
by New Contributor III
  • 8864 Views
  • 2 replies
  • 2 kudos

Resolved! How to resolve SAT driver errors

I was able to follow the SAT setup instructions, but ran into the same error whether I ran it "manually" or via terraform. The initialization seemed to run fine. Can anyone suggest any steps to troubleshoot this?

Latest Reply
shawnbarrick
New Contributor III
  • 2 kudos

Thanks - I also spoke with Arun, who was very helpful. Our Databricks admin users all require an Okta login, which is causing the error. We're looking into a "break glass" admin user for this purpose.

1 More Replies
xneg
by Contributor
  • 4042 Views
  • 5 replies
  • 4 kudos

Is there a way to clone job cluster or edit cluster using JSON?

I've created a workflow job (let's say job A) and set up a job cluster configuration for it. Now I want to create another workflow job (job B) that uses almost the same job cluster settings. I can see the cluster settings in JSON (for both jobs) but I can't ed...

Latest Reply
artsheiko
Honored Contributor
  • 4 kudos

You can also use the Terraform exporter with the -match flag to get a .tf definition for job A. Once initialized, you can define job B. Another option is to use dbx.
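
Independently of the tooling suggested above, the copy-the-JSON idea from the question can be sketched in Python. This assumes the job settings follow the Jobs API 2.1 shape, where shared clusters sit under `job_clusters` as entries with a `job_cluster_key` and a `new_cluster` spec. Every value below (cluster key, Spark version, node type, worker counts) is a hypothetical placeholder, and nothing is sent to a workspace.

```python
import copy

# Hypothetical settings for job A, shaped like Jobs API 2.1 job settings.
job_a_settings = {
    "name": "job A",
    "job_clusters": [
        {
            "job_cluster_key": "main_cluster",
            "new_cluster": {
                "spark_version": "13.3.x-scala2.12",
                "node_type_id": "i3.xlarge",
                "num_workers": 2,
            },
        }
    ],
}


def clone_job_clusters(source_settings, overrides=None):
    """Deep-copy the job cluster specs, applying optional per-cluster overrides."""
    clusters = copy.deepcopy(source_settings.get("job_clusters", []))
    for cluster in clusters:
        cluster["new_cluster"].update(overrides or {})
    return clusters


# Job B reuses job A's cluster definition with one changed setting.
job_b_settings = {
    "name": "job B",
    "job_clusters": clone_job_clusters(job_a_settings, {"num_workers": 4}),
}

print(job_b_settings["job_clusters"][0]["new_cluster"])
```

The deep copy matters here: with a shallow copy, changing job B's `new_cluster` would silently modify job A's settings as well.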

4 More Replies
Leszek
by Contributor
  • 2175 Views
  • 1 replies
  • 1 kudos

IDENTITY column duplication when using BY DEFAULT parameter

Hi, I created a Delta table with an identity column using this syntax:
Id BIGINT GENERATED BY DEFAULT AS IDENTITY
My steps:
1) Created the table with Id using the syntax above.
2) Added two rows with Id = 1 and Id = 2 (BY DEFAULT allows that).
3) Run Insert (wit...

Latest Reply
dileep_vikram
New Contributor II
  • 1 kudos

Use the ALTER command below to sync the identity column:
ALTER TABLE table_name CHANGE COLUMN col_name SYNC IDENTITY
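
To illustrate why duplicates can appear and what a sync step changes, here is a toy Python model. It is only a sketch under stated assumptions, not Delta Lake's implementation: it assumes the generated counter is not advanced by explicitly written values, that the step is positive, and that syncing moves the counter past the largest stored value. The class and method names are hypothetical.

```python
class IdentityColumn:
    """Toy model of a GENERATED BY DEFAULT AS IDENTITY column (positive step only)."""

    def __init__(self, start=1, step=1):
        self.step = step
        self.next_value = start  # metadata: the next value to hand out
        self.rows = []           # identity values actually stored

    def insert(self, explicit_id=None):
        if explicit_id is not None:
            # Assumption: explicit values are stored as-is and the
            # counter metadata is left untouched.
            self.rows.append(explicit_id)
            return explicit_id
        value = self.next_value
        self.next_value += self.step
        self.rows.append(value)
        return value

    def sync_identity(self):
        # Advance the counter until it is past the largest stored value.
        if self.rows:
            while self.next_value <= max(self.rows):
                self.next_value += self.step


col = IdentityColumn()
col.insert(explicit_id=1)   # step 2 in the post: explicit Id = 1
col.insert(explicit_id=2)   # step 2 in the post: explicit Id = 2
duplicate = col.insert()    # generated value collides with an existing Id
col.sync_identity()
after_sync = col.insert()   # generated value is now past the stored maximum

print(duplicate, after_sync, col.rows)
```

In this model the first generated value repeats an Id that was written explicitly, and generated values only become unique again after the sync step.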

DataEngineer92
by New Contributor II
  • 1470 Views
  • 3 replies
  • 0 kudos

databricks-connect in Azure DevOps Pipeline jobs runs not showing on remote cluster

Hi Team, I am trying to run an Azure DevOps pipeline as described in the blog below: https://benalexkeen.com/unit-testing-with-databricks-part-2/ The pipeline runs successfully; however, I am not able to see any runs on the remote cluster. Does databric...

Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hi @Rey Jhon, thank you for posting your question in our community! We are happy to assist you. To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your...

2 More Replies
thib
by New Contributor III
  • 4526 Views
  • 3 replies
  • 2 kudos

Can we use multiple git repos for a job running multiple tasks?

I have a job running multiple tasks:
  • Task 1 runs a machine learning pipeline from git repo 1
  • Task 2 runs an ETL pipeline from git repo 1
Task 2 is actually a generic pipeline and should not be checked into repo 1, and will be made available in another re...

Latest Reply
trijit
New Contributor II
  • 2 kudos

The way to go about this would be to create Databricks Repos in the workspace and then use them when setting up the tasks. This way we can refer to multiple repos in different tasks.

2 More Replies
JonsData
by New Contributor II
  • 1328 Views
  • 2 replies
  • 1 kudos

Databricks Extension on Azure using SPN

Is there any extension for deploying Databricks in Azure DevOps using SPN?

Latest Reply
Anonymous
Not applicable
  • 1 kudos

Hi @Amadin Naomi, thank you for posting your question in our community! We are happy to assist you. To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers ...

1 More Replies
