cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

Punit_Prajapati
by Databricks Partner
  • 5215 Views
  • 3 replies
  • 6 kudos

Resolved! SERVERLESS SQL WAREHOUSE

Hello All,I have two questions regarding the serverless SQL warehouse which are following:1.) If I create a small Serverless SQL Warehouse in Databricks that shows 12 DBUs/hour, will I be charged 12 DBUs even if I don’t run any queries in that hour? ...

  • 5215 Views
  • 3 replies
  • 6 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 6 kudos

Shua42 hits the nail on the head. If I can be so bold as to summarize: You are only charged when the Warehouse is running regardless of how much or how little you use it.  We do have an auto stop feature you can configure. Essentially, you set a time...

  • 6 kudos
2 More Replies
640913
by New Contributor III
  • 16164 Views
  • 3 replies
  • 1 kudos

%pip install requirements.txt - path not found

Hi everyone,I was just testing things out to come up with a reasonable way of working with version management in DB and was inspired by the commands specified here. For my team and I, it makes no sense to put the requirements file in the dbfs locatio...

  • 16164 Views
  • 3 replies
  • 1 kudos
Latest Reply
actualhuman_012
New Contributor III
  • 1 kudos

Hello, were you able to find a solution to this?

  • 1 kudos
2 More Replies
notwarte
by New Contributor III
  • 2689 Views
  • 4 replies
  • 0 kudos

Unity Catalog storage amounts

Hi,I am using Azure and I do have predictive optimization enable on the catalog. I have wrote a script to calculate the data amounts of all of the tables -  looping over all of the tables and running "describe detail".All of the tables amount to ~ 1....

wiselka_1-1744630645723.png
  • 2689 Views
  • 4 replies
  • 0 kudos
Latest Reply
Isi
Honored Contributor III
  • 0 kudos

Hey @notwarte,Using the __databricks_internal catalog to trace the underlying storage location is a solid approach for investigating their footprint.Regarding your question about storage duplication: yes, materialized views in Databricks do store a p...

  • 0 kudos
3 More Replies
Malthe
by Valued Contributor II
  • 1219 Views
  • 1 replies
  • 0 kudos

Parametrize DLT pipeline

If I'm using Databricks Asset Bundles, how would I parametrize a DLT pipeline based on a static configuration file.In pseudo-code, I would have a .py-file:import dlt # Something that pulls a pipeline resource (or artifact) and parses from JSON table...

  • 1219 Views
  • 1 replies
  • 0 kudos
Latest Reply
Emmitt18Lefebvr
New Contributor II
  • 0 kudos

Hello!To parametrize a Databricks DLT pipeline with a static configuration file using Asset Bundles, include your JSON/YAML config file in the bundle. In your DLT pipeline code, read this file using Python's file I/O (referencing its deployed path). ...

  • 0 kudos
dc-rnc
by Contributor
  • 5255 Views
  • 3 replies
  • 2 kudos

Resolved! DAB | Set tag based on job parameter

Hi Community.Since I wasn't able to find a way to set a job tag dynamically at runtime based on a parameter that is passed to the job, I was wondering if it is possible or if there is an equivalent way to do it.Thank you. Regards.

  • 5255 Views
  • 3 replies
  • 2 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 2 kudos

Based on the provided context, it appears that there isn't a direct way within Databricks to dynamically set job tags at runtime based on a parameter passed to the job. However, there are alternative approaches you can consider to work around this li...

  • 2 kudos
2 More Replies
cs_de
by New Contributor II
  • 2631 Views
  • 4 replies
  • 3 kudos

How do I deploy or run one job if I have multiple jobs in a Databricks Asset Bundle?

How do I deploy or run a single job if I have 2 or more jobs defined in my asset bundle?$databricks bundle deploy job1 #does not work I do not see a flag to identify what job to run.

  • 2631 Views
  • 4 replies
  • 3 kudos
Latest Reply
mark_ott
Databricks Employee
  • 3 kudos

I haven't done it with multiple jobs, but I think under resources you name multiple jobs, then when you deploy you just call that job key.  

  • 3 kudos
3 More Replies
Chris_sh
by Databricks Partner
  • 4910 Views
  • 2 replies
  • 1 kudos

[STREAMING_TABLE_OPERATION_NOT_ALLOWED.REQUIRES_SHARED_COMPUTE]

Currently trying to refresh a Delta Live Table using a Full Refresh but an error keeps coming up saying that we have to use a shared cluster or a SQL warehouse. I've tried both a shared cluster and a SQL warehouse and the same error keeps coming up. ...

  • 4910 Views
  • 2 replies
  • 1 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 1 kudos

You are not using "No Isolation Shared" mode, right?  Also, can you share the chunk of code that is causing the failure? Thanks, Louis.

  • 1 kudos
1 More Replies
guest0
by New Contributor III
  • 2938 Views
  • 6 replies
  • 3 kudos

Spark UI Simulator Not Accessible

Hello,The Spark UI Simulator is not accessible since the last few days. I was able to refer to it last week, at https://www.databricks.training/spark-ui-simulator/index.html. I already have access to partner academy (if that is any relevant).  <Error...

Data Engineering
simulator
spark-ui
  • 2938 Views
  • 6 replies
  • 3 kudos
Latest Reply
guest0
New Contributor III
  • 3 kudos

Just a short update: the request I raised was closed saying there is no active support contract with the org (from the email I used) to look into this. Perhaps someone else could try raising a request using the link above.

  • 3 kudos
5 More Replies
Vasu_Kumar_T
by Databricks Partner
  • 1864 Views
  • 3 replies
  • 1 kudos

Data Migration using Bladebridge

Hi,We are planning to migrate from Teradata to Databricks using Bladebridge. Going through various portals, I am not able to conclude the component that facilitates Data movement between Teradata and databricks.Please clarify end to end tool and acti...

  • 1864 Views
  • 3 replies
  • 1 kudos
Latest Reply
RiyazAliM
Honored Contributor
  • 1 kudos

I'm not aware if blade bridge has data movement tool handy with them.I don't see anything advertised by them though.Let me know if you find anything on this.

  • 1 kudos
2 More Replies
yashojha1995
by New Contributor
  • 1265 Views
  • 1 replies
  • 0 kudos

Error while running update statement using delta lake linked service through ADF

Hi All, I am getting the below error while running an update query in a lookup activity using the delta lake linked service:ErrorCode=AzureDatabricksCommandError,Hit an error when running the command in Azure Databricks. Error details: <span class='a...

  • 1265 Views
  • 1 replies
  • 0 kudos
Latest Reply
RiyazAliM
Honored Contributor
  • 0 kudos

Hi @yashojha1995 EOL while scanning string literal hints that there might be a syntax error in the update query.could you share your update query here, and any other info such as how are you creating a Linked service to your delta lake? Does it mean ...

  • 0 kudos
Dharinip
by Contributor
  • 9491 Views
  • 5 replies
  • 3 kudos

Resolved! How to decide on creating views vs Tables in Gold layer?

We have the following use case:We receive raw form of data from an application and that is ingested in the Iron Layer. The raw data is in the JSON FormatThe Bronze layer will the first level of transformation. The flattening of the JSON file happens ...

  • 9491 Views
  • 5 replies
  • 3 kudos
Latest Reply
artus2050189155
New Contributor II
  • 3 kudos

The whole medallion architecture is unnecesarily complex.   Bronze, Silver, Gold.  Some places I have seen people do -  RAW , Trusted RAW , Silver, Trusted Silver, Gold

  • 3 kudos
4 More Replies
manish_tanwar
by New Contributor III
  • 4225 Views
  • 5 replies
  • 4 kudos

Databricks streamlit app for data ingestion in a table

I am using this code in a notebook to save data row on table. And it is working perfectly. And now I am using the same function to save data from a chatbot in streamlit chatbot application of databricks and I am getting error for ERROR ##############...

  • 4225 Views
  • 5 replies
  • 4 kudos
Latest Reply
pradeepvatsvk
New Contributor III
  • 4 kudos

Hi @manish_tanwar  how can we work with streamlit apps in databricks , i have a use case where i want to ingest data from different csv files and ingest it into delta tables 

  • 4 kudos
4 More Replies
harman
by New Contributor II
  • 2480 Views
  • 3 replies
  • 0 kudos

Serverless Compute

Hi Team,We are using Azure Databricks Serverless Compute to execute workflows and notebooks. My question is :Does serverless compute support Maven library installations?I appreciate any insights or suggestions you might have. Thanks in advance for yo...

  • 2480 Views
  • 3 replies
  • 0 kudos
Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

So, it appears that the there is conflicting documentation on this topic.  I checked with our internal documention and what I found was that you CANNOT install JDBC or ODBC drivers on Serverless.  See limitations here: https://docs.databricks.com/aws...

  • 0 kudos
2 More Replies
annagriv
by New Contributor II
  • 7121 Views
  • 6 replies
  • 5 kudos

Resolved! How to get git commit ID of the repository the script runs on?

I have a script in a repository on DataBricks. The script should log the current git commit ID of the repository. How can that be implemented? I tried various command, for example: result = subprocess.run(['git', 'rev-parse', 'HEAD'], stdout=subproce...

  • 7121 Views
  • 6 replies
  • 5 kudos
Latest Reply
bestekov
New Contributor II
  • 5 kudos

Here is a version of @vr 's solution that can be run from any folder within the rep. It uses regex to extract the root from the path in the form of \Repos\<username>\<some-repo:import os import re from databricks.sdk import WorkspaceClient w = Worksp...

  • 5 kudos
5 More Replies
Vasu_Kumar_T
by Databricks Partner
  • 1545 Views
  • 3 replies
  • 0 kudos

Default Code generated by Bladebridge converter

Hello all ,1. What is the default code generated by Bladebridge converter.for eg : When we migrate Teradat, Oracle to Databricks using Bladebridge whats the default code base.2.If the generated code is PYSPARK, do I have any control over the generate...

  • 1545 Views
  • 3 replies
  • 0 kudos
Latest Reply
RiyazAliM
Honored Contributor
  • 0 kudos

Hello @Vasu_Kumar_T - We've used Bladebridge to convert from Redshift to Databricks. Bladebridge can definetly convert to Spark SQL, not sure about Scala Spark though.

  • 0 kudos
2 More Replies
Labels