Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.

Forum Posts

pargit2
by New Contributor II
  • 1176 Views
  • 2 replies
  • 0 kudos

feature store

I need to build a feature store for the data science team that will return one big df after one-hot encoding for almost every dimension, join, and group by. Should I create one feature store for the final output that contains all the relevant data, or create featur...

Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Here are some things to consider: The best practice for designing a feature store in your scenario depends on balancing scalability, maintainability, and the dynamic nature of some dimensions like doctor names. Here's an outlined recommendation bas...

  • 0 kudos
1 More Replies
VigneshJaisanka
by New Contributor II
  • 1552 Views
  • 2 replies
  • 0 kudos

Databricks DLT ADLS Access issue

We have a DLT pipeline configured with an SPN inside the notebook, which was working fine. After the credentials expired, we created a new one and updated it in the notebook. Now the pipeline is not able to read from ADLS. The SPN and my UserId are having co...

Latest Reply
SP_6721
Honored Contributor
  • 0 kudos

Hi @VigneshJaisanka The issue likely comes from a permissions or configuration mismatch. Here are a few things worth checking: Make sure the SPN is set as the pipeline owner and has the necessary permissions on the ADLS resource. If you’re using Unity ...

  • 0 kudos
1 More Replies
mooze456
by New Contributor
  • 589 Views
  • 1 reply
  • 0 kudos

Delta Sharing & UC: Understanding the Initial Empty Predicate Query

We're testing our Delta Sharing server with Unity Catalog (UC) and noticed a behavior where a simple query like SELECT COUNT(1) FROM table_name WHERE col1 = 'value' triggers two /query requests to our server. The initial request arrives with empty pre...

Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

The initial /query request during a Delta Sharing operation with Unity Catalog serves a critical purpose in the query lifecycle. It is intended to retrieve the schema and basic metadata of the table, which helps in query planning and optimization. Th...

  • 0 kudos
Pratikmsbsvm
by Contributor
  • 1461 Views
  • 2 replies
  • 0 kudos

Migration of Power BI reports from Synapse to Databricks SQL (DBSQL)

We have 250 Power BI reports built on top of Azure Synapse, and we are now migrating from Azure Synapse to Databricks (DB SQL). How should we plan the cutover and strategy for Power BI? We are just seeking the high-level points we have to take care of in planning. Any techie ...

Latest Reply
NandiniN
Databricks Employee
  • 0 kudos

While your account Solution Architect (SA) will be able to guide you, if you still want to check what peers did here https://community.databricks.com/t5/warehousing-analytics/migrate-azure-synapse-analytics-data-to-databricks/td-p/90663 and here http...

  • 0 kudos
1 More Replies
NIK251
by New Contributor III
  • 2468 Views
  • 3 replies
  • 1 kudos

Resolved! Delta Live Table Pipeline

I get an error message when I try to create a Delta Live Table pipeline. My error is: com.databricks.pipelines.common.errors.deployment.DeploymentException: Failed to launch pipeline cluster 1207-112912-8e84v9h5: Encountered Quota Exhaustion issue in ...

Latest Reply
NIK251
New Contributor III
  • 1 kudos

Thanks sir, I solved it.

  • 1 kudos
2 More Replies
DebIT2011
by New Contributor III
  • 10352 Views
  • 4 replies
  • 9 kudos

Choosing between Azure Data Factory (ADF) and Databricks PySpark notebooks

I’m working on a project where I need to pull large datasets from Cosmos DB into Databricks for further processing, and I’m trying to decide whether to use Azure Data Factory (ADF) or Databricks PySpark notebooks for the extraction and processing tas...

Latest Reply
Johns404
New Contributor II
  • 9 kudos

Hi @DebIT2011, You're facing a classic architectural decision between orchestration with ADF versus direct transformation using Databricks PySpark notebooks. Both tools are powerful but serve different purposes depending on your project needs. Below i...

  • 9 kudos
3 More Replies
makerandcoder12
by New Contributor
  • 892 Views
  • 1 reply
  • 0 kudos

How can I leverage Databricks for building end-to-end machine learning pipelines?

I’ve been following practical tutorials on makerandcoder, which often showcase hands-on machine learning projects using Python, scikit-learn, and Spark. I’m looking to scale my projects using the Databricks platform for better collaboration, data han...

Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

Databricks enables the creation of scalable, end-to-end machine learning (ML) pipelines by providing a comprehensive and collaborative platform that integrates key components for data handling, experimentation, and model deployment. Here’s how Databr...

  • 0 kudos
rafal_walisko
by New Contributor II
  • 2495 Views
  • 1 reply
  • 0 kudos

Optimal Strategies for downloading large query results with Databricks API

Hi everyone, I'm currently facing an issue with handling a large amount of data using the Databricks API. Specifically, I have a query that returns a significant volume of data, sometimes resulting in over 200 chunks. My initial approach was to retriev...

Latest Reply
Datagyan
New Contributor II
  • 0 kudos

I am also facing the same issue. One approach I will try tomorrow: create a job that uses a serverless job cluster. Whenever a user clicks the download button in the UI, this should trigger the job. The job will read the table as dat...

  • 0 kudos
arnas
by New Contributor II
  • 957 Views
  • 3 replies
  • 0 kudos

S3 limited bucket permissions

Hi, can I run Databricks on a limited/restricted S3 bucket folder, with no access to the bucket root level, as it is restricted per project folder in IAM? i.e. s3://mybucket/myproject_abc/ Now I configured all permissions as per documentation https://docs.databricks....

Latest Reply
arnas
New Contributor II
  • 0 kudos

Thanks, but no thanks, spam resides in JUNK folder

  • 0 kudos
2 More Replies
MOUNIKASIMHADRI
by New Contributor
  • 19050 Views
  • 6 replies
  • 1 kudos

Insufficient Permissions Issue on Databricks

I have encountered a technical issue on Databricks. While executing commands both in Spark and SQL within the Databricks environment, I’ve run into permission-related errors when selecting files from DBFS. "org.apache.spark.SparkSecurityException: [IN...

Latest Reply
NandiniN
Databricks Employee
  • 1 kudos

Please refer to some of the other community articles with the no module error https://community.databricks.com/t5/data-engineering/udf-importing-from-other-modules/td-p/58988

  • 1 kudos
5 More Replies
Fz1
by New Contributor III
  • 4818 Views
  • 4 replies
  • 1 kudos

DLT Pipeline unable to find custom Libraries/Wheel packages

We have our DLT pipeline and we need to import our custom libraries packaged in wheel files. We are on Azure DBX and we are using Az DevOps CI/CD to build and deploy the wheel packages on our DBX environment. At the top of our DLT notebook we are impo...

Get Started Discussions
dbfs
dlt
Libraries
python
wheel
Latest Reply
Laurence_Fishbu
New Contributor II
  • 1 kudos

You might want to verify the file path and permissions within your CI/CD process—sometimes the context in which the pipeline runs lacks proper DBFS mount visibility. We've encountered similar visibility inconsistencies while working on data aggregati...

  • 1 kudos
3 More Replies
Sudheer2
by New Contributor III
  • 625 Views
  • 1 reply
  • 0 kudos

How to Migrate Legacy Dashboards from hive_metastore to Unity Catalog using Python

Hi all, After updating the legacy dashboard APIs, I’m looking to migrate legacy dashboards from the hive_metastore to Unity Catalog in Databricks. Specifically, I want to programmatically: Migrate SQL queries used in dashboards; Retain or recreate the as...

Latest Reply
Louis_Frolio
Databricks Employee
  • 0 kudos

For your consideration: To migrate legacy dashboards from the Hive Metastore to Unity Catalog in Databricks programmatically while retaining SQL queries, data visualizations, and ensuring compatibility with Unity Catalog schemas and tables using Py...

  • 0 kudos
dbsuersu
by New Contributor II
  • 5904 Views
  • 3 replies
  • 4 kudos

ArcGIS Connection

Hi, I am trying to connect to an ArcGIS instance using Databricks. Is this possible? After connecting, I am trying to read the data into a DataFrame. Please help me with this request. If it's not possible to connect, please provide an alternative. Than...

Latest Reply
GISWhammy
New Contributor II
  • 4 kudos

I am trying to set up a direct ODBC or JDBC connection from ArcGIS Pro and ArcGIS Enterprise Server; has anyone done this successfully? I was able to make a successful DSN connection, but no tables are being delivered; I did not use a connection stri...

  • 4 kudos
2 More Replies
Kuchnhi
by New Contributor III
  • 3697 Views
  • 11 replies
  • 7 kudos

Facing issues while upgrading DBR version from 9.1 LTS to 15.4 LTS

Dear all, I am upgrading the DBR version from 9.1 LTS to 15.4 LTS in Azure Databricks. For that, I have created a new cluster with DBR 15.4 and attached an init script for installing application dependencies. The cluster started successfully, but it takes 30 min. ...

Latest Reply
SmithPoll
New Contributor III
  • 7 kudos

Just to add, you might also want to check the cluster logs (driver and init script logs) for any hidden errors or timeouts during startup. Sometimes dependencies silently fail to install, even if the cluster appears to be running. If possible, try brea...

  • 7 kudos
10 More Replies
