Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

Phani1
by Valued Contributor
  • 433 Views
  • 1 reply
  • 0 kudos

Databricks cell-level code parallel execution through the Python threading library

Hi Team, We are currently planning to implement Databricks cell-level code parallel execution through the Python threading library. We are interested in understanding the resource consumption and allocation process from the cluster. Are there any pot...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Phani1, Implementing Databricks cell-level code parallel execution through the Python threading library can be beneficial for performance, but there are some considerations to keep in mind. Let’s break it down: Resource Consumption and Alloca...
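The approach under discussion can be sketched with the standard threading library. This is a minimal illustration with made-up task names, not Databricks-specific code: all threads share the driver's Python process, so the GIL limits CPU-bound Python work, and threads mainly pay off when each task spends its time waiting on Spark jobs or I/O.

```python
import threading

results = {}

def run_task(name, n):
    # stand-in for a notebook cell's workload (in practice, often a Spark action)
    results[name] = sum(range(n))

# launch three independent tasks concurrently
threads = [
    threading.Thread(target=run_task, args=(f"task_{i}", 1000 * (i + 1)))
    for i in range(3)
]
for t in threads:
    t.start()
for t in threads:
    t.join()  # wait for all tasks before reading results
```

When each thread triggers a Spark action, the cluster schedules those concurrent jobs across its executors, so resource consumption is governed by the cluster's scheduler rather than by the Python threads themselves.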

Fresher
by New Contributor II
  • 287 Views
  • 1 reply
  • 0 kudos

Users are deleted/unsynced from Azure AD to Databricks

In Azure AD, it shows that users are synced to Databricks, but in Databricks the user is shown as not being part of the group. The user is missing from only one group; he is still part of the remaining groups. All the syncing worked fine until yesterday. I don't know ...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Fresher, It sounds like you’re experiencing an issue with user synchronization between Azure AD and Databricks. Let’s troubleshoot this together! Here are some steps you can take to resolve the issue: Check SCIM Provisioning Configuration: En...

chloeh
by New Contributor II
  • 278 Views
  • 1 reply
  • 0 kudos

Chaining window aggregations in SQL

In my SQL data transformation pipeline, I'm doing chained/cascading window aggregations: for example, I want to do average over the last 5 minutes, then compute average over the past day on top of the 5 minute average, so that my aggregations are mor...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @chloeh, You’re working with a Spark SQL data transformation pipeline involving chained window aggregations. Let’s look at your code snippet and see if we can identify the issue. First, let’s break down the steps you’ve implemented: You’re read...
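The chaining pattern being discussed can be sketched outside Spark. The snippet below uses SQLite's window functions (available via the standard library) purely to illustrate the shape; the table and column names are made up, and in Spark SQL the same CTE-chaining applies, usually with time-range frames rather than row counts.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE readings (ts INTEGER, value REAL)")
con.executemany("INSERT INTO readings VALUES (?, ?)",
                [(i, float(i)) for i in range(10)])

rows = con.execute("""
    WITH smoothed AS (          -- first-level aggregation (e.g. the 5-minute average)
        SELECT ts,
               AVG(value) OVER (
                   ORDER BY ts ROWS BETWEEN 2 PRECEDING AND CURRENT ROW
               ) AS short_avg
        FROM readings
    )
    SELECT ts,                  -- second-level aggregation computed on top of the first
           AVG(short_avg) OVER (
               ORDER BY ts ROWS BETWEEN 4 PRECEDING AND CURRENT ROW
           ) AS long_avg
    FROM smoothed
    ORDER BY ts
""").fetchall()
```

The key point is that the second window reads the output column of the first via the CTE, which is what makes the aggregations cascade instead of both operating on the raw rows.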

jainshasha
by New Contributor III
  • 1361 Views
  • 12 replies
  • 2 kudos

Job Cluster in Databricks workflow

Hi, I have configured 20 different workflows in Databricks, each with a job cluster with a different name. All 20 workflows are scheduled to run at the same time, but even with a different job cluster configured in each of them, they run sequentially w...

Latest Reply
emora
New Contributor II
  • 2 kudos

Honestly, you shouldn't have any kind of limitation executing different workflows. I did a test case in my Databricks, and if your workflows use a job cluster you shouldn't have a limitation. But I did all my tests in Azure, and just for you to kn...

11 More Replies
namankhamesara
by New Contributor II
  • 411 Views
  • 1 reply
  • 0 kudos

Error while running Databricks modules

Hi Databricks Community, I am following https://customer-academy.databricks.com/learn/course/1266/data-engineering-with-databricks?generated_by=575333&hash=6edddab97f2f528922e2d38d8e4440cda4e5302a this course provided by Databricks. In this, when I am ...

Data Engineering
databrickscommunity
Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @namankhamesara, Thank you for reaching out! It appears there might be an issue with accessing the data for your course. To expedite your request and resolve this issue promptly, please list your concerns on our ticketing portal. Our support staff...

Shazam
by New Contributor
  • 295 Views
  • 1 reply
  • 0 kudos

Ingestion time clustering -Initial load

As per the info available, ingestion time clustering makes use of the time a file is written or ingested into Databricks. In a use case where there is a new Delta table and an ETL which runs in a timely fashion (say daily) inserting records, I am able to ...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Shazam, Great questions! Let’s break down each scenario: Initial Data Migration: When migrating data from an existing platform to Databricks, you might have a large initial load of records. In this case, ingestion time clustering can still be...

Anske
by New Contributor III
  • 912 Views
  • 6 replies
  • 1 kudos

Resolved! DLT apply_changes applies only deletes and inserts, not updates

Hi, I have a DLT pipeline that applies changes from a source table (cdctest_cdc_enriched) to a target table (cdctest) with the following code: dlt.apply_changes(    target = "cdctest",    source = "cdctest_cdc_enriched",    keys = ["ID"],    sequence_by...

Data Engineering
Delta Live Tables
Latest Reply
Kaniz_Fatma
Community Manager
  • 1 kudos

Hi @Anske, It seems you’re encountering an issue with your Delta Live Tables (DLT) pipeline where updates from the source table are not being correctly applied to the target table. Let’s troubleshoot this together! Pipeline Update Process: Whe...

5 More Replies
halox6000
by New Contributor III
  • 321 Views
  • 1 reply
  • 0 kudos

How do I stop PySpark from outputting text

I am using a tqdm progress bar to monitor the amount of data records I have collected via API. I am temporarily writing them to a file in the DBFS, then uploading to a Spark DataFrame. Each time I write to a file, I get a message like 'Wrote 8873925 ...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @halox6000, To stop the progress bar output from tqdm, you can use the disable argument. Set it to True to silence any tqdm output. In fact, it will not only hide the display but also skip the progress bar calculations entirely. Here’s an examp...
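tqdm's disable flag silences the bar itself. For messages printed by other Python code (such as the 'Wrote ... bytes' lines mentioned in the question), one standard-library option is temporarily redirecting stdout, sketched below; note this only captures Python-side prints, not output that comes from the JVM.

```python
import contextlib
import io

def noisy_write():
    # stand-in for a chatty call that prints as a side effect
    print("Wrote 8873925 bytes")
    return 42

buffer = io.StringIO()
with contextlib.redirect_stdout(buffer):
    result = noisy_write()  # the message lands in the buffer, not the notebook
```

After the with block, result is still available and buffer.getvalue() holds the suppressed text, so nothing is lost if it is later needed for debugging.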

MrD
by New Contributor
  • 304 Views
  • 1 reply
  • 0 kudos

Issue with autoscaling the cluster

Hi All, My job is breaking because the cluster is not able to autoscale. Below is the log. Could this be due to AWS VMs not spinning up, or an issue with the Databricks configuration? Has anyone faced this before? TERMINATING Compute terminated. Reason:...

Latest Reply
koushiknpvs
New Contributor III
  • 0 kudos

Hey MrD, I faced this issue while running Azure VMs. A restart and re-attaching the cluster helped me. Please let me know if that works for you.

smukhi
by New Contributor II
  • 808 Views
  • 2 replies
  • 0 kudos

Encountering Error UNITY_CREDENTIAL_SCOPE_MISSING_SCOPE

As of this morning we started receiving the following error message on a Databricks job with a single PySpark notebook task. The job has not had any code changes in 2 months. The cluster configuration has also not changed. The last successful run of ...

Latest Reply
smukhi
New Contributor II
  • 0 kudos

As advised, I double-confirmed that no code or cluster configuration was changed (I even got a second set of eyes on it that confirmed the same). I was able to find a "fix" which puts a band-aid on the issue: I was able to pinpoint that the issue seems to...

1 More Replies
Wolfoflag
by New Contributor II
  • 617 Views
  • 1 reply
  • 0 kudos

Threads vs Processes (Parallel Programming) Databricks

Hi Everyone, I am trying to implement parallel processing in Databricks, and all the resources online point to using ThreadPool from Python's multiprocessing.pool library or the concurrent.futures library. These libraries offer methods for creating async...

Latest Reply
Wojciech_BUK
Valued Contributor III
  • 0 kudos

I am not a super expert, but I have been using Databricks for a while, and I can say that when you use any Python library like asyncio, ThreadPool and so on, it is good only for some maintenance things, small API calls, etc. When you want to leverage s...
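The division of labor described above can be sketched as follows; the fetch function is a made-up stand-in for a small I/O-bound API call, the kind of driver-side work where a thread pool fits, while heavy data processing is better left to Spark's own distributed execution.

```python
from multiprocessing.pool import ThreadPool

def fetch(item_id):
    # placeholder for a small API request; returns a fake payload
    return {"id": item_id, "ok": True}

# a small pool of threads runs the I/O-bound calls concurrently on the driver
with ThreadPool(processes=4) as pool:
    payloads = pool.map(fetch, range(8))
```

Because these calls mostly wait on the network, the GIL is not a bottleneck here; the same code with CPU-heavy work in fetch would see little speedup from threads.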

digui
by New Contributor
  • 2434 Views
  • 4 replies
  • 0 kudos

Issues when trying to modify log4j.properties

Hi y'all. I'm trying to export metrics and logs to AWS CloudWatch, but while following their tutorial to do so, I ended up facing this error when trying to initialize my cluster with an init script they provided. This is the part where the script fail...

Latest Reply
cool_cool_cool
New Contributor II
  • 0 kudos

@digui Did you figure out what to do? We're facing the same issue; the script works for the executors. I was thinking of adding an if that checks whether log4j.properties exists and modifies it only if it does.

3 More Replies
Menegat
by New Contributor
  • 327 Views
  • 1 reply
  • 0 kudos

VACUUM seems to be deleting Autoloader's log files.

Hello everyone, I have a workflow setup that updates a few Delta tables incrementally with Auto Loader three times a day. Additionally, I run a separate workflow that performs VACUUM and OPTIMIZE on these tables once a week. The issue I'm facing is that...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @Menegat, It seems you’re encountering an issue with your Delta tables during incremental updates. Let’s dive into this and explore potential solutions. Delta Live Tables and Incremental Updates: Delta Live Tables allow for incremental updates...

georgef
by New Contributor
  • 314 Views
  • 1 reply
  • 0 kudos

Cannot import relative python paths

Hello, Some variations of this question have been asked before, but there doesn't seem to be an answer for the following simple use case: I have the following file structure on a Databricks Asset Bundles project: src --dir1 ----file1.py --dir2 ----file2...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @georgef, It appears that you’re encountering issues with importing modules within a Databricks Asset Bundles (DABs) project. Let’s explore some potential solutions to address this problem. Bundle Deployment and Import Paths: When deploying a ...
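A common workaround for cross-directory imports (not an official DABs feature; the dir1/file1.py layout below just mirrors the post, built in a temp directory so the sketch is self-contained) is to put the project root on sys.path before importing across sibling directories:

```python
import os
import sys
import tempfile

# recreate the post's layout: <root>/dir1/file1.py defining a helper
root = tempfile.mkdtemp()
os.makedirs(os.path.join(root, "dir1"))
with open(os.path.join(root, "dir1", "file1.py"), "w") as f:
    f.write("def helper():\n    return 'ok'\n")

# make the root importable; in a bundle this would be the src/ directory
sys.path.append(root)
from dir1.file1 import helper  # resolves as a namespace package, no __init__.py needed
```

In a deployed bundle the equivalent step usually derives the root from the running file's location (for example via os.path.dirname) rather than hard-coding a path.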

lindsey
by New Contributor
  • 845 Views
  • 1 reply
  • 0 kudos

"Error: cannot read mws credentials: invalid Databricks Account configuration" on TF Destroy

I have a terraform project that creates a workspace in Databricks, assigns it to an existing metastore, then creates external location/storage credential/catalog. The apply works and all expected resources are created. However, without touching any r...

Latest Reply
Kaniz_Fatma
Community Manager
  • 0 kudos

Hi @lindsey, It seems you’re encountering an issue with Terraform and Databricks when trying to destroy resources. Let’s explore some potential solutions to address this problem: Resource Order in Terraform Configuration: Ensure that the databric...
