Community Articles
Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practices. Join the conversation today and unlock a wealth of collective wisdom to enhance your experience and drive success.

Forum Posts

kemp
by New Contributor II
  • 238 Views
  • 1 replies
  • 0 kudos

I built ar-io-mlflow: Open-Source MLflow Plugin for Verifiable & Tamper-Proof AI Provenance

Hey everyone! I've built and open-sourced ar-io-mlflow. This is a plugin that adds cryptographic provenance across the ML lifecycle (training runs, model registration, stage promotions, inference, and datasets). What it does: Creates signed Ed25519 crypto...

Latest Reply
kemp
New Contributor II
  • 0 kudos

I also opened a discussion on GitHub. I'm not sure which is the right place, so sorry if this isn't it: https://github.com/mlflow/mlflow/discussions/23355

SumitSingh
by Contributor III
  • 16379 Views
  • 13 replies
  • 43 kudos

From Associate to Professional: My Learning Plan to ace all Databricks Data Engineer Certifications

In today’s data-driven world, the role of a data engineer is critical in designing and maintaining the infrastructure that allows for the efficient collection, storage, and analysis of large volumes of data. Databricks certifications hold significan...

Latest Reply
RASHMI_BHOKARE
New Contributor II
  • 43 kudos

Hello, can you help me with study material for the associate-level exam? Where can we get it, the free version if possible? Any notes?

szymon_dybczak
by Esteemed Contributor III
  • 266 Views
  • 0 replies
  • 1 kudos

Access Databricks data using external systems

For a long time, one of the hardest questions in lakehouse architecture was: How do we let external engines access governed data without bypassing governance? Databricks is making this pattern much cleaner with Unity Catalog external access. The idea is...

emma_s
by Databricks Employee
  • 1364 Views
  • 1 replies
  • 5 kudos

Create an MCP for Azure DevOps To Use With Genie Code

Overview: Prompted by a customer question, I wanted to see what was possible in terms of MCP integration into Genie Code. To try this out, I decided to look at Azure DevOps, as it's a common workflow to want to see your tickets alongside the ...

Community Articles
azure devops
Genie Code
MCP
Latest Reply
AndrewWooster
New Contributor II
  • 5 kudos

Azure DevOps now has a remote MCP server. This would be much easier to use than creating a function for individual ADO API endpoints as you described above. How can I configure a connection to this remote MCP from Databricks? I'd like to use EntraID...

ericka-lorenz
by New Contributor II
  • 576 Views
  • 1 replies
  • 0 kudos

Databricks vs. BigQuery Through a Workload Lens

I came across a blog post comparing Databricks and Google BigQuery for AI-ready data teams. The workload angle stood out. That feels like a useful way to frame the discussion here in the Databricks Community. A lot of platform questions come back to t...

Latest Reply
balajij8
Contributor III
  • 0 kudos

Evaluating pure analytics capabilities is an outdated framework that treats the data warehouse as an isolated silo. Databricks is moving aggressively to handle the entire enterprise footprint, including BI and the agentic universe. With the maturity of Data...

szymon_dybczak
by Esteemed Contributor III
  • 414 Views
  • 1 replies
  • 2 kudos

Finally! A simple way to validate whether your Materialized View can actually refresh incrementally

One of the more frustrating things when working with materialized views in Databricks was checking whether a view had refreshed incrementally. One way to verify it was by checking the event log, but that required running the pipeline and executing a ...

Latest Reply
Amit117
New Contributor II
  • 2 kudos

Hi. This is very helpful. Any idea whether incremental refresh is also possible for non-algebraic functions like median? I was looking for a solution that works for late-arriving data and came across this. I also could not find any docum...

szymon_dybczak
by Esteemed Contributor III
  • 577 Views
  • 0 replies
  • 1 kudos

Mitigation for "error downloading Terraform" during bundle deployments

If your CI/CD pipelines suddenly started failing out of nowhere with this error: "error downloading Terraform: unable to verify checksums signature: openpgp: key expired" and you’re using the Databricks CLI, you’re probably hitting the same issue I did. Th...

SD_KCM
by New Contributor II
  • 1201 Views
  • 1 replies
  • 0 kudos

Azure Databricks Exclusive groups

A new permissions primitive to prevent data crossover between use cases hosted on the same Databricks workspace.

Latest Reply
SD_KCM
New Contributor II
  • 0 kudos

Medium link: https://medium.com/@kacn12872/azure-databricks-exclusive-groups-garantir-létanchéité-entre-cas-d-usage-sur-la-lakehouse-e340ce28f332

truplusphi
by New Contributor III
  • 232 Views
  • 0 replies
  • 1 kudos

TruProxy - Live Cost Estimator - Clusters

Hi everyone, I'm continuing to build a live cost estimator for Databricks to get immediate cost estimates every second instead of having to wait for the system tables to update (see Live Cost Estimator - Databricks Community - 156374). I've finished t...

RamuPilla
by New Contributor II
  • 435 Views
  • 0 replies
  • 1 kudos

Databricks Kafka Multi-Stream Ingestion Architecture: Scaling Beyond Single-Stream Bottlenecks

The Real Problem: Kafka Source Parallelism in Spark
Before discussing foreachBatch, multi-table writes, or any specific use case, it helps to understand the underlying issue. This is a problem with how Spark Structured Streaming consumes from Kafka, a...

DushendRaghavan
by New Contributor II
  • 1832 Views
  • 1 replies
  • 2 kudos

How to handle MERGE with Schema Evolution in Delta Lake

Hi everyone, Schema evolution during MERGE is one of the trickiest parts of building robust Delta Lake pipelines. Databricks actually has a native SQL syntax for this, plus Python API options for...

Latest Reply
nayan_wylde
Esteemed Contributor II
  • 2 kudos

Great post. I would also consider the following points: Guardrails: schema evolution is powerful, but it can also accidentally add garbage columns if upstream sends unexpected fields. Recommendation: validate/allowlist schema changes in higher envir...
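
As a rough illustration of the allowlist guardrail mentioned in this reply, here is a minimal sketch in plain Python. It is not taken from the post or from any Databricks API: the function names, the `ALLOWED_NEW_COLUMNS` set, and the column names are all hypothetical, and the check is assumed to run before the MERGE is issued.

```python
from typing import Iterable, List

# Hypothetical allowlist: new columns that upstream is permitted to introduce.
ALLOWED_NEW_COLUMNS = {"discount_code", "updated_at"}


def unexpected_columns(
    target_columns: Iterable[str],
    source_columns: Iterable[str],
    allowed_new: Iterable[str] = ALLOWED_NEW_COLUMNS,
) -> List[str]:
    """Return source columns that are neither in the target nor allowlisted.

    Names are compared case-insensitively (an assumption of this sketch).
    """
    known = {c.lower() for c in target_columns} | {c.lower() for c in allowed_new}
    return sorted(c for c in source_columns if c.lower() not in known)


def check_schema_change(
    target_columns: Iterable[str],
    source_columns: Iterable[str],
    allowed_new: Iterable[str] = ALLOWED_NEW_COLUMNS,
) -> None:
    """Raise before the MERGE runs if the source carries non-allowlisted columns."""
    extra = unexpected_columns(target_columns, source_columns, allowed_new)
    if extra:
        raise ValueError(f"Unexpected columns in source: {extra}")
```

In a Spark job, the two column lists could plausibly come from each DataFrame's `columns` attribute, with the check called just before a schema-evolving MERGE; that wiring is an assumption and is left out here.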

Hammad-Arbisoft
by New Contributor II
  • 529 Views
  • 1 replies
  • 2 kudos

How Databricks Genie Turns Collaboration Tools into AI-Powered Intelligence Platforms

Most organizations don’t have a data problem anymore. They have a data access and usability problem. The dashboards exist. The warehouses are modernized. The lakehouse is running. Yet business teams still wait days for answers because analytics remains...

Latest Reply
Hammad-Arbisoft
New Contributor II
  • 2 kudos

The Dashboard Era Is Ending. Conversation Is Replacing It.
