cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

Sudheer2
by New Contributor III
  • 370 Views
  • 1 replies
  • 0 kudos

Migrating ML Model Experiments Using Python REST APIs

Hi everyone,I’m looking to migrate ML model experiments from a source Databricks workspace to a target workspace. Specifically, I want to use Python and the available REST APIs for this process.Can anyone help me on this!Thanks in advance!

  • 370 Views
  • 1 replies
  • 0 kudos
Latest Reply
gchandra
Databricks Employee
  • 0 kudos

You can use https://github.com/mlflow/mlflow-export-import  utility. The example given below doesn't use Python but uses CLI and CICD pipeline to do the same.  https://medium.com/@gchandra/databricks-copy-ml-models-across-unity-catalog-metastores-188...

  • 0 kudos
redapplesonly
by New Contributor II
  • 1709 Views
  • 2 replies
  • 3 kudos

Resolved! Access Databricks Table with Simple Python3 Script

Hi, I'm super new to Databricks. I'm trying to do a little API scripting against my company's DB instance.I have this supersimple python (ver 3) which is meant to run a remote host.  The script tries to a simple SQL query against my Databricks instan...

  • 1709 Views
  • 2 replies
  • 3 kudos
Latest Reply
redapplesonly
New Contributor II
  • 3 kudos

@gchandra Yes!  This is the documentation I was seeking!  Thank you so much

  • 3 kudos
1 More Replies
Linda19
by New Contributor
  • 1804 Views
  • 3 replies
  • 2 kudos

What is the Best Postman Alternative?

Hey guys, I have been using Postman for quite some time now and getting disappointed recently and want to make a swtich. Is there something better than Postman? I've heard about that APIDog is much easier to use with a much better UI, and support all...

  • 1804 Views
  • 3 replies
  • 2 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 2 kudos

There can be only one:curl

  • 2 kudos
2 More Replies
Phani1
by Valued Contributor II
  • 841 Views
  • 1 replies
  • 0 kudos

incremental loads without date column

Hi All,We are facing a situation where our data source is Snowflake, and the data is saved in a storage location(adls) in parquet format. However, the tables or data lack a date column or any incremental column for performing incremental loads to Dat...

  • 841 Views
  • 1 replies
  • 0 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

Ideally you would have some change tracking system (cdc f.e.) on the source tables (Streams in the case of Snowflake, Introduction to Streams | Snowflake Documentation).But that is not the case.So I think you approach is ok.  You cannot track what is...

  • 0 kudos
abubakar-saddiq
by New Contributor
  • 2227 Views
  • 2 replies
  • 1 kudos

How to Pass Dynamic Parameters (e.g., Current Date) in Databricks Workflow UI?

I'm setting up a job in the Databricks Workflow UI and I want to pass a dynamic parameter, like the current date (run_date), each time the job runs.In Azure Data Factory, I can use expressions like @utcnow() to calculate this at runtime. However, I w...

  • 2227 Views
  • 2 replies
  • 1 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 1 kudos

As szymon mentioned, dynamic parameter values exist, but the functionality is still far from what Data Factory has to offer.I am pretty sure though that this will be extended.So for the moment I suggest you do the value derivation in data factory, an...

  • 1 kudos
1 More Replies
oleprince
by New Contributor
  • 5339 Views
  • 1 replies
  • 0 kudos

Delta table definition - Identity column

Hello,Would anyone know if it is possible to create a delta table using Python that includes a column that is generated by default as identity (identity column for which the value inserted can be manually overriden)?There seems to be a way to create ...

  • 5339 Views
  • 1 replies
  • 0 kudos
Latest Reply
gmiguel
Contributor
  • 0 kudos

Hi @oleprince ,As far as I know, it's not possible yet to create tables with Identity columns using pyspark (DeltaTable api). You can create generated columns, but Identity columns are not allowed.The only way to achieve this is through Spark Sql.  

  • 0 kudos
david_nagy
by New Contributor III
  • 1830 Views
  • 7 replies
  • 1 kudos

Databricks bundle

Hey, I am new to Databricks, and I am trying to test the mlops-stack bundle. Within that bundle there is a feature-engineering workflow and I have a problem to make it run. The main problem is the following.the bundle specified the target to be $bund...

  • 1830 Views
  • 7 replies
  • 1 kudos
Latest Reply
david_nagy
New Contributor III
  • 1 kudos

Yes it is.

  • 1 kudos
6 More Replies
fiverrpromotion
by New Contributor
  • 585 Views
  • 1 replies
  • 0 kudos

Addressing Memory Constraints in Scaling XGBoost and LGBM: A Comprehensive Approach for High-Volume

Scaling XGBoost and LightGBM models to handle exceptionally large datasets—those comprising billions to tens of billions of rows—presents a formidable computational challenge, particularly when constrained by the limitations of in-memory processing o...

  • 585 Views
  • 1 replies
  • 0 kudos
Latest Reply
earntodiessaz
New Contributor II
  • 0 kudos

Well, that's a superb article! Thank you for this great information, you write very well which I like very much. I am really impressed by your post. run 3

  • 0 kudos
Phani1
by Valued Contributor II
  • 754 Views
  • 1 replies
  • 0 kudos

Oracle -> Oracle Golden Gate ->Databricks Delta lake

Hi All,We have a situation where we are collecting data from different Oracle instances.The customer is using Oracle GoldenGate to replicate this data into a storage location.From there, we can use Auto Loader or Delta Live Tables to read Avro files ...

  • 754 Views
  • 1 replies
  • 0 kudos
Latest Reply
szymon_dybczak
Esteemed Contributor III
  • 0 kudos

Hi @Phani1 ,In my opinion this is really good setup. You have push scenario where Oracle GoldenGate  is responsible for delivering data into storage, so you don't have to bother about extraction part. And autoloader is the best choice when it comes t...

  • 0 kudos
Phani1
by Valued Contributor II
  • 253 Views
  • 0 replies
  • 0 kudos

Delta Lake to Oracle Essbase

Hi All,How can we connect Databricks Delta Lake to Essbase in OCI? We know that Essbase supports JDBC/ODBC. Is it possible to use Python or PySpark to read from Delta Lake and load the data into Essbase? I think using JDBC/ODBC might affect performan...

  • 253 Views
  • 0 replies
  • 0 kudos
Phani1
by Valued Contributor II
  • 549 Views
  • 0 replies
  • 0 kudos

Denodo Connection Parameters.

 Hi All,We are establishing a connection from Denodo to Databricks. During the development phase, we utilized a personal access token associated with  developer account. However, this approach is not considered a best practice for production environm...

  • 549 Views
  • 0 replies
  • 0 kudos
itamarwe
by New Contributor II
  • 1162 Views
  • 2 replies
  • 1 kudos

Google PubSub for DLT - Error

I'm trying to create a delta live table from a Google PubSub stream.Unfortunately I'm getting the following error:org.apache.spark.sql.streaming.StreamingQueryException: [PS_FETCH_RETRY_EXCEPTION] Task in pubsub fetch stage cannot be retried. Partiti...

  • 1162 Views
  • 2 replies
  • 1 kudos
Latest Reply
itamarwe
New Contributor II
  • 1 kudos

Hi @Retired_mod, it was indeed a permissions issue. Nevertheless, I must admit that the error message is slightly misleading.Thanks.

  • 1 kudos
1 More Replies
Surajv
by New Contributor III
  • 1437 Views
  • 2 replies
  • 0 kudos

Restrict access of user/entity to hitting only specific Databricks Rest APIs

Hi community,Assume I generate a personal access token for an entity. Post generation, can I restrict the access of the entity to specific REST APIs? In other words, consider this example where once I use generate the token and setup a bearer token b...

  • 1437 Views
  • 2 replies
  • 0 kudos
Latest Reply
Panda
Valued Contributor
  • 0 kudos

@Surajv You have to rely on access control settings on resources and entities (users or service principals or create some cluster policies), rather than directly restricting the API endpoints at the token level.Note: API access based on fine-grained ...

  • 0 kudos
1 More Replies
Chris_Shehu
by Valued Contributor III
  • 1350 Views
  • 1 replies
  • 1 kudos

Feature Request: GUI: Additional Collapse options

When you're using a very large notebook sometimes it gets frustrating scrolling through all the code blocks. It would be nice to have a few additional options to make this easier. 1) Add a collapse all code cells button to the top.2) Add a collapse a...

Get Started Discussions
Enhancement
Feature
GUI
Request
  • 1350 Views
  • 1 replies
  • 1 kudos
Latest Reply
fdawoud
New Contributor II
  • 1 kudos

this feature please

  • 1 kudos
qwerty3
by Contributor
  • 726 Views
  • 1 replies
  • 0 kudos

Resolved! Does a queued databricks job incur cost?

Does a queued databricks job incur cost?

  • 726 Views
  • 1 replies
  • 0 kudos
Latest Reply
filipniziol
Esteemed Contributor
  • 0 kudos

Hi @qwerty3 ,No, it does not. When a job is queued (waiting for an available cluster or resources), there is no compute usage, so no charges are incurred for Databricks units (DBUs) or cloud infrastructure (VMs). The queued state is essentially a wai...

  • 0 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels