Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Soh_m
by New Contributor
  • 879 Views
  • 1 reply
  • 0 kudos

Error accessing Managed Table with Row Level Security using Databricks Cluster

Hi Everyone, We are trying to implement Row-Level Security on a Delta table and have tested it (i.e. SQL Execution API, SQL editor, SQL notebook) using SQL Serverless in Unity Catalog. But when we tried to access the table having RLS in a notebook using PySpark w...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Soh_m, When dealing with JSON data in your streaming source, you have a couple of options for extracting fields. Let’s explore the trade-offs between using the colon sign operator and the schema+from_json function: Colon Sign Operator: The c...
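
To make the reply's trade-off concrete, here is a minimal PySpark sketch of both approaches (illustrative only: it assumes a DataFrame df with a JSON string column named value, and a view named events for the SQL form):

from pyspark.sql.functions import from_json, col
from pyspark.sql.types import StructType, StructField, LongType

# Colon-sign operator: no schema required, but extracted fields come back as strings
spark.sql("SELECT value:user.id AS user_id FROM events")

# Explicit schema + from_json: typed columns; malformed records become null
schema = StructType([StructField("user", StructType([StructField("id", LongType())]))])
df.select(from_json(col("value"), schema).alias("j")).select("j.user.id")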

CBL
by New Contributor
  • 556 Views
  • 1 reply
  • 0 kudos

Schema Evolution in Azure databricks

Hi All - In my scenario, I am loading data from hundreds of JSON files. The problem is that fields/columns go missing when a JSON file contains new fields. Full load: while writing JSON to Delta, use the option ("mergeSchema", "true") so that we do not miss new columns. Inc...
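
For reference, a hedged sketch of the full-load pattern described above (the paths are placeholders):

# Write the JSON batch to Delta, letting new columns evolve the table schema
(spark.read.json("/mnt/raw/json/")
    .write.format("delta")
    .mode("append")
    .option("mergeSchema", "true")
    .save("/mnt/bronze/target"))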

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @CBL, Handling schema evolution during incremental data loads is crucial to ensure data consistency and prevent issues when new fields are introduced. Let’s explore some strategies for schema comparison in incremental loads: Checksum-based In...
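
One minimal way to sketch such a comparison before an incremental load (illustrative paths; assumes the target Delta table already exists):

incoming = spark.read.json("/mnt/raw/json/new_batch/")
existing = spark.read.format("delta").load("/mnt/bronze/target")

# Columns present in the new files but not yet in the table signal schema drift
new_cols = set(incoming.schema.fieldNames()) - set(existing.schema.fieldNames())
if new_cols:
    print(f"Schema drift detected; new columns: {new_cols}")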

jabori
by New Contributor
  • 900 Views
  • 1 reply
  • 0 kudos

How can I pass job parameters to a dbt task?

I have a dbt task that will use dynamic parameters from the job: {"start_time": "{{job.start_time.[timestamp_ms]}}"}. My SQL is edited like this: select 1 as id union all select null as id union all select {start_time} as id. This causes the task to fail. How...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @jabori, To correctly pass the start_time parameter in your dbt task, you can utilize dynamic value references provided by Databricks. These templated variables are replaced with appropriate values during task execution. Here’s how you can mod...
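
In outline (a sketch based on those dynamic value references, not a verified configuration): have the dbt command pass the templated value through dbt's --vars flag, e.g. dbt run --vars '{"start_time": {{job.start_time.[timestamp_ms]}}}', and reference it inside the model as select {{ var("start_time") }} as id rather than with bare braces like {start_time}.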

Phani1
by Valued Contributor
  • 786 Views
  • 1 reply
  • 0 kudos

What are optimized solutions for moving on-premises Hadoop data

Hi Team, What are optimized solutions for moving on-premises Hadoop/HDFS Parquet data to Databricks as Delta files? Regards, Phanindra

Data Engineering
delta
hadoop
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Phani1, Migrating data from on-premises Hadoop to Databricks as Delta files involves several key steps. Let’s break it down: Administration: In Hadoop, you’re dealing with a monolithic distributed storage and computing platform. It consists ...
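
As a rough illustration of the final step, once the Parquet files have been copied to cloud storage (e.g. with DistCp or a transfer service; the path and partition column below are placeholders):

# One-time, in-place conversion of a partitioned Parquet directory to Delta
spark.sql("""
    CONVERT TO DELTA parquet.`/mnt/landing/hdfs_export/events`
    PARTITIONED BY (event_date DATE)
""")

CONVERT TO DELTA only writes a transaction log over the existing files, which is usually far cheaper than reading the Parquet and rewriting it as Delta.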

chakradhar545
by New Contributor
  • 324 Views
  • 1 reply
  • 0 kudos

DatabricksThrottledException Error

Hi, Our scheduled job occasionally runs into the error below and fails. Any leads or thoughts on why we hit this once in a while, and how to fix it? shaded.databricks.org.apache.hadoop.fs.s3a.DatabricksThrottledException: Instantiate s...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @chakradhar545, The error message you’re encountering indicates a throttling issue when interacting with Amazon S3 using Databricks. Let’s break down the error and explore potential solutions: Error Details: The error message mentions two key...
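
If the root cause is S3 request-rate throttling, one common mitigation (hedged: these are standard Hadoop S3A connector settings, and suitable values depend on the workload) is to give the connector a larger retry budget in the cluster's Spark configuration:

spark.hadoop.fs.s3a.retry.limit 10
spark.hadoop.fs.s3a.retry.interval 1000ms
spark.hadoop.fs.s3a.attempts.maximum 20

Reducing the number of concurrent tasks that hit the same S3 prefix can also help.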

Surya0
by New Contributor III
  • 2254 Views
  • 4 replies
  • 0 kudos

Resolved! Unit hive-metastore.service not found

Hi Everyone, I've encountered an issue while trying to make use of the hive-metastore capability in Databricks to create a new database and table for our latest use case. The specific command I used was "create database if not exists newDB". However, ...

Latest Reply
rakeshprasad1
New Contributor III
  • 0 kudos

@Surya0: I am facing the same issue. The stack trace is: Could not connect to address=(host=consolidated-northeuropec2-prod-metastore-2.mysql.database.azure.com)(port=3306)(type=master) : Socket fail to connect to host:consolidated-northeuropec2-prod-metast...

3 More Replies
alexgv12
by New Contributor III
  • 744 Views
  • 1 reply
  • 0 kudos

how to deploy sql functions in pool

We have some function definitions that have to be available to our BI tools, e.g. CREATE FUNCTION CREATEDATE(year INT, month INT, day INT) RETURNS DATE RETURN make_date(year, month, day); How can we always have this function definition in our ...

Latest Reply
alexgv12
New Contributor III
  • 0 kudos

Looking at some alternatives with other Databricks components, I think that a CI/CD process should be created where the view can be created through the Databricks API. https://docs.databricks.com/api/workspace/functions/create https://community.databr...
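
A minimal sketch of what that CI/CD step could execute, e.g. from a notebook task (assuming Unity Catalog, with placeholder catalog and schema names):

# Run once per deployment; the function then persists in the metastore
spark.sql("""
    CREATE OR REPLACE FUNCTION main.shared.CREATEDATE(year INT, month INT, day INT)
    RETURNS DATE
    RETURN make_date(year, month, day)
""")

Persisting the function in Unity Catalog makes it visible to every warehouse and cluster, which sidesteps having to recreate it per pool.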

jim12321
by New Contributor II
  • 469 Views
  • 2 replies
  • 0 kudos

Databricks CLI how to start a job and pass the parameters?

I am trying to start job ID 85218616788189 and pass one parameter, 'demo', in Windows Shell. This works: databricks jobs run-now 85218616788189. If I try this one: databricks jobs run-now --json '{"job_id":85218616788189,"notebook_params": {"demo":"parameter...

Latest Reply
VVS29
New Contributor II
  • 0 kudos

Hi Jim, I think the right syntax would be something like this: databricks jobs run-now --job-id 85218616788189 --notebook-params '{"demo":"parameter successful"}'. Let me know if that worked!
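
One Windows-specific caveat (an assumption about the failure mode, since the original error text is cut off): cmd.exe does not treat single quotes as quoting, so the JSON typically needs double quotes with the inner quotes escaped, e.g. databricks jobs run-now --job-id 85218616788189 --notebook-params "{\"demo\":\"parameter successful\"}".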

1 More Reply
dbal
by New Contributor III
  • 1199 Views
  • 2 replies
  • 0 kudos

Resolved! Spark job task fails with "java.lang.NoClassDefFoundError: org/apache/spark/SparkContext$"

Hi. I am trying to run a Spark job in Databricks (Azure) using the JAR type. I can't figure out why the job fails by not finding the SparkContext. Databricks Runtime: 14.3 LTS (includes Apache Spark 3.5.0, Scala 2.12). Error message: java.lang.NoCl...

Latest Reply
dbal
New Contributor III
  • 0 kudos

Update 2: I found the reason in the documentation. This is documented under "Access Mode", and it is a limitation of the Shared access mode. Link: https://learn.microsoft.com/en-us/azure/databricks/compute/access-mode-limitations#spark-api-limitations...

1 More Reply
msgrac
by New Contributor II
  • 487 Views
  • 2 replies
  • 0 kudos

Can't remove file on ADLS using dbutils.fs.rm because URL contains illegal character

The URL contains a "[" within, and I've tried to encode the path from "[" to "%5B%27", but it didn't work:
from urllib.parse import quote
path = ""
encoded_path = quote(path)

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @msgrac, To encode it, you should use %5B instead of trying to encode it as “%5B%27”.
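
A small sketch of that fix (the path is hypothetical, and whether dbutils.fs.rm accepts the escaped form may depend on the runtime version):

path = "abfss://container@account.dfs.core.windows.net/data/file[1].json"  # hypothetical path
encoded_path = path.replace("[", "%5B").replace("]", "%5D")  # escape only the brackets
dbutils.fs.rm(encoded_path)

Note that quote(path) with its default arguments also percent-encodes the ":" in the URI scheme, which may be why the earlier attempt failed.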

1 More Reply
Tam
by New Contributor III
  • 809 Views
  • 2 replies
  • 0 kudos

TABLE_REDIRECTION_ERROR in AWS Athena After Databricks Upgrade to 14.3 LTS

I have a Databricks pipeline set up to create Delta tables on AWS S3, using Glue Catalog as the Metastore. I was able to query the Delta table via Athena successfully. However, after upgrading Databricks Cluster from 13.3 LTS to 14.3 LTS, I began enc...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Tam, It appears that you’ve encountered a TABLE_REDIRECTION_ERROR while working with your Databricks pipeline, AWS S3, Glue Catalog, and Athena. Let’s break down the issue and explore potential solutions: AWS Glue as a Catalog for Databric...

1 More Reply
Coders
by New Contributor II
  • 704 Views
  • 2 replies
  • 0 kudos

How to perform a deep clone for data migration from one data lake to another?

 I'm attempting to migrate data from Azure Data Lake to S3 using deep clone. The data in the source Data Lake is stored in Parquet format and partitioned. I've tried to follow the documentation from Databricks, which suggests that I need to register ...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Coders, It appears that you’re encountering an issue while attempting to migrate data from Azure data lake to S3 using deep clone. Let’s break down the problem and explore potential solutions. Error Explanation: The error message you receive...
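
For reference, a sketch of the register-then-clone flow the documentation suggests (paths are placeholders; cloning Parquet sources requires a sufficiently recent runtime):

# 1) Register the partitioned Parquet data as Delta, in place
spark.sql("""
    CONVERT TO DELTA parquet.`abfss://src@account.dfs.core.windows.net/events`
    PARTITIONED BY (dt DATE)
""")

# 2) Deep clone the registered table across clouds to S3
spark.sql("""
    CREATE OR REPLACE TABLE delta.`s3://my-bucket/events`
    DEEP CLONE delta.`abfss://src@account.dfs.core.windows.net/events`
""")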

1 More Reply
data-warriors
by New Contributor
  • 507 Views
  • 1 reply
  • 0 kudos

Recovery of a deleted Databricks workspace

Hi Team, I accidentally deleted our Databricks workspace, which had all our artefacts and control plane, and was the primary resource for our team's working environment. Could anyone please help, on priority, with the recovery/restoration mechanis...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @data-warriors, I understand the urgency of your situation. Unfortunately, once a Databricks subscription is cancelled, all associated workspaces are permanently deleted and cannot be recovered.

Poonam17
by New Contributor II
  • 670 Views
  • 1 reply
  • 2 kudos

Not able to deploy a cluster in Databricks Community Edition

Hello team, I am not able to launch a Databricks cluster in Community Edition; it gets terminated automatically. Can someone please help here? Regards, Poonam

Latest Reply
kakalouk
New Contributor II
  • 2 kudos

I face the exact same problem. The message I get is this: "Bootstrap Timeout: Node daemon ping timeout in 780000 ms for instance i-062042a9d4be8725e @ 10.172.197.194. Please check network connectivity between the data plane and the control plane."

TheDataEngineer
by New Contributor
  • 881 Views
  • 1 reply
  • 0 kudos

'replaceWhere' clause in spark.write for a partitioned table

Hi, I want to be clear about the 'replaceWhere' clause in spark.write. Here is the scenario: I would like to add a column to a few existing records. The table is already partitioned on the "PickupMonth" column. Here is an example without 'replaceWhere': spark.read \ .f...

Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @TheDataEngineer, Let’s dive into the details of the replaceWhere clause in Spark’s Delta Lake. The replaceWhere option is a powerful feature in Delta Lake that allows you to overwrite a subset of a table during write operations. Specifically, ...
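
A condensed sketch of that subset overwrite for the scenario above (the path and month value are illustrative):

# Overwrite only rows matching the predicate; other partitions are untouched
(df.write.format("delta")
    .mode("overwrite")
    .option("replaceWhere", "PickupMonth = '12'")
    .save("/mnt/delta/trips"))

Note that Delta rejects the write if the incoming DataFrame contains rows that fall outside the replaceWhere predicate.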
