Community Discussions

by k2 • Visitor

7 hours ago

28 Views
0 replies
0 kudos

log delivery are not creating data in s3 bucket

Hiii, Does anyone have an idea about the typical duration for Databricks to create logs in an S3 bucket using the databricks_mws_log_delivery Terraform resource? I've implemented the code provided in the Databricks official documentation, but I've be...

Community Discussions

Reply

28 Views
0 replies
0 kudos

7 hours ago

by anonymous_567 • New Contributor II

12 hours ago

81 Views
3 replies
0 kudos

Autoloader update table when new changes are made

Hello,Everyday a new file of the same name gets sent to my storage account with old and new data appended at the end. Columns may also be added during one of these file updates. This file does a complete overwrite of the previous file. Is it possibl...

Community Discussions

Reply

81 Views
3 replies
0 kudos

12 hours ago

View Replies

Latest Reply

data-grassroots
New Contributor

7 hours ago

0 kudos

This may be helpful - the bit on allow overwritehttps://docs.databricks.com/en/ingestion/auto-loader/faq.html

0 kudos

7 hours ago

2 More Replies

by Miguel_Grafana • Visitor

11 hours ago

32 Views
0 replies
0 kudos

Azure Oauth Passthrough with the Go Driver

Can anyone point me towards some resources for achieving this? I already have the token.Trying with: dbsql.WithAccessToken(settings.Token)But I'm getting the following error:Unable to load OAuth Config: request error after 1 attempt(s): unexpected HT...

Community Discussions

Reply

32 Views
0 replies
0 kudos

11 hours ago

by Alexandru • New Contributor II

Friday

118 Views
3 replies
0 kudos

Resolved! vscode python project for development

Hi,I'm trying to set up a local development environment using python / vscode / poetry. Also, linting is enabled (Microsoft pylance extension) and the python.analysis.typeCheckingMode is set to strict.We are using python files for our code (.py) whit...

Community Discussions

Reply

118 Views
3 replies
0 kudos

Friday

View Replies

Latest Reply

artsheiko
Valued Contributor III

yesterday

0 kudos

Hi Alexandru, Take a look at VSCode extension for Databricks : https://marketplace.visualstudio.com/items?itemName=databricks.databricks

0 kudos

yesterday

2 More Replies

by databricksdev • New Contributor II

13 hours ago

25 Views
0 replies
0 kudos

Can we customize job run name when running azure data bricks notebook jobs from azure data factory

Hi All,we are executing databricks notebook activity inside the child pipeline thru ADF. we are getting child pipeline name in job name while executing databricks job. Is it possible to get master pipeline name as job name or customize job name thr...

Community Discussions

Reply

25 Views
0 replies
0 kudos

13 hours ago

by Archana_Mathan • Visitor

15 hours ago

21 Views
1 replies
1 kudos

Maintaining Order Consistency: Table Creation in Databricks SQL vs. DLT Pipeline

I have a CTE table with the below names as values. My objective is to create another table by concatenating all the rows from the CTE table in ascending order, resulting in the final output sequence: "Abi, Rahul, ram, Siva". When executing the query ...

Community Discussions

Reply

21 Views
1 replies
1 kudos

15 hours ago

View Replies

Latest Reply

-werners-
Esteemed Contributor III

15 hours ago

1 kudos

when writing, order is not guaranteed due to the nature of distributed processing.If you want the order to be guaranteed, you should order it when reading the data.Your query does not write any data, DLT does, that is the difference.

1 kudos

15 hours ago

by amit_jbs • New Contributor

yesterday

56 Views
1 replies
0 kudos

In databricks deployment .py files getting converted to notebooks

A critical issue has arisen that is impacting our deployment planning for our client. We have encountered a challenge with our Azure CI/CD pipeline integration, specifically concerning the deployment of Python files (.py). Despite our best efforts, w...

Community Discussions

Reply

56 Views
1 replies
0 kudos

yesterday

View Replies

Latest Reply

-werners-
Esteemed Contributor III

18 hours ago

0 kudos

What is your pipeline? We propagate notebooks using Azure Devops Repos with PRs and merges. like that files do not get converted.

0 kudos

18 hours ago

by Nagarathna • New Contributor

Monday

57 Views
2 replies
0 kudos

File not found error when trying to read json file from aws s3 using with open.

I am trying to reading json from aws s3 using with open in databricks notebook using shared cluster.Error message:No such file or directory:'/dbfs/mnt/datalake/input_json_schema.json'In single instance cluster the above error is not found.

Community Discussions

Reply

57 Views
2 replies
0 kudos

Monday

View Replies

Latest Reply

Nagarathna
New Contributor

yesterday

0 kudos

Hey,Thanks for suggesting this approach.But I want to know why the json file cannot be read from AWS S3 bucket using "with open" in python with shared instance mode cluster. The code works perfectly fine if I'm using a single instance mode cluster.co...

0 kudos

yesterday

1 More Replies

by Hogan • New Contributor

Monday

81 Views
1 replies
0 kudos

Can browse external Storage, but can not create a Table from there - VNET, ADLSGen2

Hi there!Hope somebody here can help me. We have created a new Databricks Account on Azure with the ARM template for VNET injection.We have all the subnets etc., unitiy catalog active and the connector for databricks.I want now to create my first tab...

Community Discussions

Reply

81 Views
1 replies
0 kudos

Monday

View Replies

Latest Reply

Hogan
New Contributor

yesterday

0 kudos

Hi,To solve this problem, the following Microsoft documentation can be used to configure the NCC to enable the connection between the private Azure storage and the serverless resources.https://learn.microsoft.com/en-us/azure/databricks/security/netwo...

0 kudos

yesterday

by sai_sathya • New Contributor III

Thursday

164 Views
6 replies
1 kudos

DataFrame to CSV write has issues due to multiple commas inside an row value

Hi alliam working on a data containing JSON fields with embedded commas into CSV format. iam facing challenges due to the commas within the JSON being misinterpreted as column delimiters during the conversion process.i tried several methods to modify...

Community Discussions

Reply

164 Views
6 replies
1 kudos

Thursday

View Replies

Latest Reply

artsheiko
Valued Contributor III

yesterday

1 kudos

Hi Sai, I assume that the problem comes not from the PySpark, but from Excel. I tried to reproduce the error and didn't find the way - that a good thing, right ? Please try the following : df.write.format("csv").save("/Volumes/<my_catalog_name>/<m...

1 kudos

yesterday

5 More Replies

by Nithya_r • New Contributor II

Thursday

85 Views
1 replies
0 kudos

Access Delta sharing from Azure Data Factory

I recently got access to delta sharing and I am looking to access the data from the tables in share through ADF. I used linked services such as REST API and HTTP and successfully established connection using the credential file token and http path, h...

Community Discussions

Reply

85 Views
1 replies
0 kudos

Thursday

View Replies

Latest Reply

artsheiko
Valued Contributor III

yesterday

0 kudos

Hey, I think you'll need to use a Databricks activity instead of Copy See : https://learn.microsoft.com/en-us/azure/data-factory/connector-overview#integrate-with-more-data-storeshttps://learn.microsoft.com/en-us/azure/data-factory/transform-data-dat...

0 kudos

yesterday

by databird • New Contributor II

2 weeks ago

729 Views
4 replies
1 kudos

Redefine ETL strategy with pypskar approach

Hey everyone!I've some previous experience with Data Engineering, but totally new in Databricks and Delta Tables.Starting this thread hoping to ask some questions and asking for help on how to design a process.So I have essentially 2 delta tables (sa...

Community Discussions

Reply

729 Views
4 replies
1 kudos

2 weeks ago

View Replies

Latest Reply

artsheiko
Valued Contributor III

yesterday

1 kudos

Hi @databird , You can review the code of each demo by opening the content via "View the Notebooks" or by exploring the following repo : https://github.com/databricks-demos (you can try to search for "merge" to see all the occurrences, for example) T...

1 kudos

yesterday

3 More Replies

by sai_sathya • New Contributor III

yesterday

41 Views
0 replies
0 kudos

fetching metadata for tables in a database stored in unity catalogue

Hi everyoneiam trying to fetch the metadata of every columns from an table and every tables from the database under an catalogue for that iam trying to use the samples catalogue that provided by databricks and get details for tpch database that provi...

Community Discussions

Reply

41 Views
0 replies
0 kudos

yesterday

by vinay076 • New Contributor II

a week ago

304 Views
2 replies
0 kudos

There is no certification number in my Databricks certificate that i had received after passing the

I enrolled myself for the Databricks data engineer certification recently and gave a shot at the exam and i did clear it successfully. I have received the certificate in the form of a pdf file along with a URL in which i can see my certificate and ba...

Community Discussions

Reply

304 Views
2 replies
0 kudos

a week ago

View Replies

Latest Reply

Cert-Team
Honored Contributor III

yesterday

0 kudos

Hi @vinay076 Thanks for asking! Our support team can provide you with a credential ID. Please file a ticket with our support team, give them your email associated with your certification, and they can get you the credential ID.

0 kudos

yesterday

1 More Replies

by VabethRamirez • New Contributor II

2 weeks ago

999 Views
5 replies
4 kudos

Resolved! How obtain a list of workflows in Databricks?

I need to obtain a list of my Databricks workflows with their job IDs in a notebook Databricks

Community Discussions

Reply

999 Views
5 replies
4 kudos

2 weeks ago

View Replies

Latest Reply

artsheiko
Valued Contributor III

yesterday

4 kudos

Hi @VabethRamirez , Also, instead of using directly the API, you can use databricks Python sdk : %pip install databricks-sdk --upgrade dbutils.library.restartPython()from databricks.sdk import WorkspaceClient w = WorkspaceClient() job_list = w.jobs...

4 kudos

yesterday

4 More Replies

Databricks

Forum Posts

log delivery are not creating data in s3 bucket

Autoloader update table when new changes are made

Azure Oauth Passthrough with the Go Driver

Resolved! vscode python project for development

Can we customize job run name when running azure data bricks notebook jobs from azure data factory

Maintaining Order Consistency: Table Creation in Databricks SQL vs. DLT Pipeline

In databricks deployment .py files getting converted to notebooks

File not found error when trying to read json file from aws s3 using with open.

Can browse external Storage, but can not create a Table from there - VNET, ADLSGen2

DataFrame to CSV write has issues due to multiple commas inside an row value

Access Delta sharing from Azure Data Factory

Redefine ETL strategy with pypskar approach

fetching metadata for tables in a database stored in unity catalogue

There is no certification number in my Databricks certificate that i had received after passing the

Resolved! How obtain a list of workflows in Databricks?

vscode python project for development

Is it possible to get Azure Databricks cluster met...

Can we get SQL Serverless warehouses monitoring da...

Notebook Detached java.net.SocketTimeoutException:...

Pros and cons of physically separating data in dif...