Get Started Discussions

by Sudheer2 • New Contributor III

10-18-2024 3:12:21 AM

370 Views
1 replies
0 kudos

Migrating ML Model Experiments Using Python REST APIs

Hi everyone,I’m looking to migrate ML model experiments from a source Databricks workspace to a target workspace. Specifically, I want to use Python and the available REST APIs for this process.Can anyone help me on this!Thanks in advance!

Get Started Discussions

Reply

370 Views
1 replies
0 kudos

10-18-2024 3:12:21 AM

View Replies

Latest Reply

gchandra
Databricks Employee

10-18-2024 12:56:22 PM

0 kudos

You can use https://github.com/mlflow/mlflow-export-import utility. The example given below doesn't use Python but uses CLI and CICD pipeline to do the same. https://medium.com/@gchandra/databricks-copy-ml-models-across-unity-catalog-metastores-188...

0 kudos

10-18-2024 12:56:22 PM

by redapplesonly • New Contributor II

10-17-2024 12:38:46 PM

1709 Views
2 replies
3 kudos

Resolved! Access Databricks Table with Simple Python3 Script

Hi, I'm super new to Databricks. I'm trying to do a little API scripting against my company's DB instance.I have this supersimple python (ver 3) which is meant to run a remote host. The script tries to a simple SQL query against my Databricks instan...

Get Started Discussions

Reply

1709 Views
2 replies
3 kudos

10-17-2024 12:38:46 PM

View Replies

Latest Reply

redapplesonly
New Contributor II

10-18-2024 12:51:47 PM

3 kudos

@gchandra Yes! This is the documentation I was seeking! Thank you so much

3 kudos

10-18-2024 12:51:47 PM

1 More Replies

by Linda19 • New Contributor

07-18-2024 12:06:43 AM

1804 Views
3 replies
2 kudos

What is the Best Postman Alternative?

Hey guys, I have been using Postman for quite some time now and getting disappointed recently and want to make a swtich. Is there something better than Postman? I've heard about that APIDog is much easier to use with a much better UI, and support all...

Get Started Discussions

Reply

1804 Views
3 replies
2 kudos

07-18-2024 12:06:43 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

10-18-2024 5:29:09 AM

2 kudos

There can be only one:curl

2 kudos

10-18-2024 5:29:09 AM

2 More Replies

by Phani1 • Valued Contributor II

10-17-2024 7:43:33 AM

841 Views
1 replies
0 kudos

incremental loads without date column

Hi All,We are facing a situation where our data source is Snowflake, and the data is saved in a storage location(adls) in parquet format. However, the tables or data lack a date column or any incremental column for performing incremental loads to Dat...

Get Started Discussions

Reply

841 Views
1 replies
0 kudos

10-17-2024 7:43:33 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

10-18-2024 1:32:46 AM

0 kudos

Ideally you would have some change tracking system (cdc f.e.) on the source tables (Streams in the case of Snowflake, Introduction to Streams | Snowflake Documentation).But that is not the case.So I think you approach is ok. You cannot track what is...

0 kudos

10-18-2024 1:32:46 AM

by abubakar-saddiq • New Contributor

10-17-2024 7:33:31 AM

2227 Views
2 replies
1 kudos

How to Pass Dynamic Parameters (e.g., Current Date) in Databricks Workflow UI?

I'm setting up a job in the Databricks Workflow UI and I want to pass a dynamic parameter, like the current date (run_date), each time the job runs.In Azure Data Factory, I can use expressions like @utcnow() to calculate this at runtime. However, I w...

Get Started Discussions

Reply

2227 Views
2 replies
1 kudos

10-17-2024 7:33:31 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

10-18-2024 12:16:47 AM

1 kudos

As szymon mentioned, dynamic parameter values exist, but the functionality is still far from what Data Factory has to offer.I am pretty sure though that this will be extended.So for the moment I suggest you do the value derivation in data factory, an...

1 kudos

10-18-2024 12:16:47 AM

1 More Replies

by oleprince • New Contributor

10-14-2023 9:03:23 AM

5339 Views
1 replies
0 kudos

Delta table definition - Identity column

Hello,Would anyone know if it is possible to create a delta table using Python that includes a column that is generated by default as identity (identity column for which the value inserted can be manually overriden)?There seems to be a way to create ...

Get Started Discussions

Reply

5339 Views
1 replies
0 kudos

10-14-2023 9:03:23 AM

View Replies

Latest Reply

gmiguel
Contributor

10-17-2024 9:29:22 AM

0 kudos

Hi @oleprince ,As far as I know, it's not possible yet to create tables with Identity columns using pyspark (DeltaTable api). You can create generated columns, but Identity columns are not allowed.The only way to achieve this is through Spark Sql.

0 kudos

10-17-2024 9:29:22 AM

by david_nagy • New Contributor III

10-17-2024 4:13:30 AM

1830 Views
7 replies
1 kudos

Databricks bundle

Hey, I am new to Databricks, and I am trying to test the mlops-stack bundle. Within that bundle there is a feature-engineering workflow and I have a problem to make it run. The main problem is the following.the bundle specified the target to be $bund...

Get Started Discussions

Reply

1830 Views
7 replies
1 kudos

10-17-2024 4:13:30 AM

View Replies

Latest Reply

david_nagy
New Contributor III

10-17-2024 6:43:50 AM

1 kudos

Yes it is.

1 kudos

10-17-2024 6:43:50 AM

6 More Replies

by fiverrpromotion • New Contributor

10-14-2024 2:19:25 AM

585 Views
1 replies
0 kudos

Addressing Memory Constraints in Scaling XGBoost and LGBM: A Comprehensive Approach for High-Volume

Scaling XGBoost and LightGBM models to handle exceptionally large datasets—those comprising billions to tens of billions of rows—presents a formidable computational challenge, particularly when constrained by the limitations of in-memory processing o...

Get Started Discussions

Reply

585 Views
1 replies
0 kudos

10-14-2024 2:19:25 AM

View Replies

Latest Reply

earntodiessaz
New Contributor II

10-17-2024 3:22:27 AM

0 kudos

Well, that's a superb article! Thank you for this great information, you write very well which I like very much. I am really impressed by your post. run 3

0 kudos

10-17-2024 3:22:27 AM

by Phani1 • Valued Contributor II

10-16-2024 11:32:57 PM

754 Views
1 replies
0 kudos

Oracle -> Oracle Golden Gate ->Databricks Delta lake

Hi All,We have a situation where we are collecting data from different Oracle instances.The customer is using Oracle GoldenGate to replicate this data into a storage location.From there, we can use Auto Loader or Delta Live Tables to read Avro files ...

Get Started Discussions

Reply

754 Views
1 replies
0 kudos

10-16-2024 11:32:57 PM

View Replies

Latest Reply

szymon_dybczak
Esteemed Contributor III

10-17-2024 1:48:29 AM

0 kudos

Hi @Phani1 ,In my opinion this is really good setup. You have push scenario where Oracle GoldenGate is responsible for delivering data into storage, so you don't have to bother about extraction part. And autoloader is the best choice when it comes t...

0 kudos

10-17-2024 1:48:29 AM

by Phani1 • Valued Contributor II

10-16-2024 11:31:15 PM

253 Views
0 replies
0 kudos

Delta Lake to Oracle Essbase

Hi All,How can we connect Databricks Delta Lake to Essbase in OCI? We know that Essbase supports JDBC/ODBC. Is it possible to use Python or PySpark to read from Delta Lake and load the data into Essbase? I think using JDBC/ODBC might affect performan...

Get Started Discussions

Reply

253 Views
0 replies
0 kudos

10-16-2024 11:31:15 PM

by Phani1 • Valued Contributor II

10-16-2024 4:19:28 AM

549 Views
0 replies
0 kudos

Denodo Connection Parameters.

Hi All,We are establishing a connection from Denodo to Databricks. During the development phase, we utilized a personal access token associated with developer account. However, this approach is not considered a best practice for production environm...

Get Started Discussions

Reply

549 Views
0 replies
0 kudos

10-16-2024 4:19:28 AM

by itamarwe • New Contributor II

07-14-2024 5:26:02 AM

1162 Views
2 replies
1 kudos

Google PubSub for DLT - Error

I'm trying to create a delta live table from a Google PubSub stream.Unfortunately I'm getting the following error:org.apache.spark.sql.streaming.StreamingQueryException: [PS_FETCH_RETRY_EXCEPTION] Task in pubsub fetch stage cannot be retried. Partiti...

Get Started Discussions

Reply

1162 Views
2 replies
1 kudos

07-14-2024 5:26:02 AM

View Replies

Latest Reply

itamarwe
New Contributor II

07-17-2024 12:59:07 AM

1 kudos

Hi @Retired_mod, it was indeed a permissions issue. Nevertheless, I must admit that the error message is slightly misleading.Thanks.

1 kudos

07-17-2024 12:59:07 AM

1 More Replies

by Surajv • New Contributor III

03-19-2024 3:22:24 AM

1437 Views
2 replies
0 kudos

Restrict access of user/entity to hitting only specific Databricks Rest APIs

Hi community,Assume I generate a personal access token for an entity. Post generation, can I restrict the access of the entity to specific REST APIs? In other words, consider this example where once I use generate the token and setup a bearer token b...

Get Started Discussions

Reply

1437 Views
2 replies
0 kudos

03-19-2024 3:22:24 AM

View Replies

Latest Reply

Panda
Valued Contributor

10-15-2024 5:31:41 AM

0 kudos

@Surajv You have to rely on access control settings on resources and entities (users or service principals or create some cluster policies), rather than directly restricting the API endpoints at the token level.Note: API access based on fine-grained ...

0 kudos

10-15-2024 5:31:41 AM

1 More Replies

by Chris_Shehu • Valued Contributor III

07-27-2023 12:38:26 PM

1350 Views
1 replies
1 kudos

Feature Request: GUI: Additional Collapse options

When you're using a very large notebook sometimes it gets frustrating scrolling through all the code blocks. It would be nice to have a few additional options to make this easier. 1) Add a collapse all code cells button to the top.2) Add a collapse a...

Get Started Discussions

Enhancement

Feature

GUI

Request

Reply

1350 Views
1 replies
1 kudos

07-27-2023 12:38:26 PM

View Replies

Latest Reply

fdawoud
New Contributor II

10-14-2024 2:47:18 PM

1 kudos

this feature please

1 kudos

10-14-2024 2:47:18 PM

by qwerty3 • Contributor

10-14-2024 12:58:45 PM

726 Views
1 replies
0 kudos

Resolved! Does a queued databricks job incur cost?

Does a queued databricks job incur cost?

Get Started Discussions

Reply

726 Views
1 replies
0 kudos

10-14-2024 12:58:45 PM

View Replies

Latest Reply

filipniziol
Esteemed Contributor

10-14-2024 1:05:00 PM

0 kudos

Hi @qwerty3 ,No, it does not. When a job is queued (waiting for an available cluster or resources), there is no compute usage, so no charges are incurred for Databricks units (DBUs) or cloud infrastructure (VMs). The queued state is essentially a wai...

0 kudos

10-14-2024 1:05:00 PM

Databricks Community

Forum Posts

Migrating ML Model Experiments Using Python REST APIs

Resolved! Access Databricks Table with Simple Python3 Script

What is the Best Postman Alternative?

incremental loads without date column

How to Pass Dynamic Parameters (e.g., Current Date) in Databricks Workflow UI?

Delta table definition - Identity column

Databricks bundle

Addressing Memory Constraints in Scaling XGBoost and LGBM: A Comprehensive Approach for High-Volume

Oracle -> Oracle Golden Gate ->Databricks Delta lake

Delta Lake to Oracle Essbase

Denodo Connection Parameters.

Google PubSub for DLT - Error

Restrict access of user/entity to hitting only specific Databricks Rest APIs

Feature Request: GUI: Additional Collapse options

Resolved! Does a queued databricks job incur cost?

Join Us as a Local Community Builder!

using Azure Databricks vs using Databricks directl...

Simple notebook sync

Cluster by auto pyspark

DBX Community Pending Answers

Is there a way to iterate over a combination of pa...