- 4063 Views
- 3 replies
- 16 kudos
(Episode 2: Reading Excel Files) - Learning Databricks one brick at a time, using the Free Edition
Episode 2: Reading Excel FilesLearning Databricks one brick at a time, using the Free Edition.You can download the accompanying Notebook and Excel files used in the demonstration over on my GitHub:Excel Files & Notebook: https://github.com/BSanalyst...
- 4063 Views
- 3 replies
- 16 kudos
- 16 kudos
Thanks for this, @BS_THE_ANALYST. Hugely beneficial.
- 16 kudos
- 569 Views
- 0 replies
- 1 kudos
Migrate External Tables to Managed
With managed tables, you can reduce your storage and compute costs thanks to predictive optimization or file list caching. Now it is really time to migrate external tables to managed ones, thanks to ALTER SET MANAGED functionality. Read more: - h...
- 569 Views
- 0 replies
- 1 kudos
- 2274 Views
- 1 replies
- 4 kudos
Hadoop Walked So Databricks Could Run
Are you familiar with this scenario: Your data team spends 80% of their time fixing infrastructure issues instead of extracting insights.In today’s data-driven world, organisations are drowning in data but starving for actionable insights. Traditiona...
- 2274 Views
- 1 replies
- 4 kudos
- 569 Views
- 0 replies
- 1 kudos
Validating pointer-based Delta comparison architecture using flatMapGroupsWithState in Structured St
Hi everyone,I’m leading an implementation where we’re comparing events from two real-time streams — a Source and a Target — in Databricks Structured Streaming (Scala).Our goal is to identify and emit “delta” differences between corresponding records ...
- 569 Views
- 0 replies
- 1 kudos
- 17579 Views
- 6 replies
- 3 kudos
Editing value of widget parameter within notebook code
I have a notebook with a text widget where I want to be able to edit the value of the widget within the notebook and then reference it in SQL code. For example, assuming there is a text widget named Var1 that has input value "Hello", I would want to ...
- 17579 Views
- 6 replies
- 3 kudos
- 3 kudos
It seems that only way to use parameters in sql code block is to use dbutils.widget and you cannot change those parameters without removing widget and setting it up again in code
- 3 kudos
- 651 Views
- 0 replies
- 1 kudos
Databricks Policies and Bundles Inheritance: Let Policies Rule Your DABS
Just the policy_id can specify the entire cluster configuration. Yes, we can inherit default and fixed values from policies. Updating runtime version for 100s of jobs, for example, is much easier this way. Read more: - https://databrickster.med...
- 651 Views
- 0 replies
- 1 kudos
- 1383 Views
- 1 replies
- 3 kudos
Relationship in databricks Genie
Now you can define relations also directly in Genie. It includes options like “Many to One”, “One to Many”, “One to One”, “Many to Many”. Read more: - https://databrickster.medium.com/relationship-in-databricks-genie-f8bf59a9b578 - https://www.su...
- 1383 Views
- 1 replies
- 3 kudos
- 2539 Views
- 1 replies
- 3 kudos
Real-Time Mode in Apache Spark Structured Streaming
Real-Time Mode in Spark StreamingApache Spark™ Structured Streaming has been the backbone of mission-critical pipelines for years — from ETL to near real-time analytics and machine learning.Now, Databricks has introduced something game-changing: Real...
- 2539 Views
- 1 replies
- 3 kudos
- 3 kudos
And now in Public Preview! Thank you for writing this up, @Yogesh_Verma_.
- 3 kudos
- 2019 Views
- 1 replies
- 4 kudos
How Databricks Helped Me See Data Engineering Differently
Over the years working as a data engineer, I’ve started to see my role very differently. In the beginning, most of my focus was on building pipelines—extracting, transforming, and loading data so it could land in the right place. Pipelines were the g...
- 2019 Views
- 1 replies
- 4 kudos
- 4 kudos
@Brahmareddy thanks for this! .Think you've nailed it on the head there. If the stakeholders trust the data and there's integrity, governance, and a single source of truth, you've got a recipe for a great product! Love this take @Brahmareddy . Really...
- 4 kudos
- 1576 Views
- 2 replies
- 6 kudos
(Teaser CHALLENGE - Community Data Pull) - Upcoming Challenge For the Community
Hey everybody, I've been dying to share this with the community. Over the last few weeks, I've been thinking about how I can do a Data Pull from the Community to highlight some of the cool stuff we all do! 拾. Below is a snippet of visual from the Dat...
- 1576 Views
- 2 replies
- 6 kudos
- 6 kudos
I'll aim to have the data for the challenge sorted and ready for next week! . I want to strip out some of the columns and figure out where is best to host the data . Potentially I could have it on the Databricks Marketplace or Github.All the best,BS
- 6 kudos
- 11663 Views
- 1 replies
- 2 kudos
9 Powerful 🚀 Spark Optimization Techniques in Databricks (With Real Examples)
IntroductionOne of our ETL pipelines used to take 10 hours to complete. After tuning and scaling in Databricks, it finished in just about 1 hour — a 90% reduction in runtime.That’s the power of Spark tuning.Databricks, built on Apache Spark, is a po...
- 11663 Views
- 1 replies
- 2 kudos
- 2 kudos
This is a fantastic breakdown of Spark optimization techniques, @savlahanish27!Definitely helpful for anyone working on performance tuning in Databricks.
- 2 kudos
- 1962 Views
- 0 replies
- 4 kudos
Semantic Modelling and Data Products using Agent
Hey Community!We have built something cool for Data Engineers in Databricks!Raw Files -> Semantic Model -> Data Products without writing ETL/ELT Code.Demo/ Guide - https://youtu.be/wjQYXrBwA-oNotebook - https://github.com/Intugle/data-tools/blob/main...
- 1962 Views
- 0 replies
- 4 kudos
- 4794 Views
- 2 replies
- 7 kudos
Data Quality at Scale: My Experience Using Databricks and AWS
Over the past few years working as a data engineer, I’ve seen how quickly companies are moving their platforms to Databricks and AWS. The flexibility and scale these platforms provide are amazing, but one challenge always comes up again and again: ho...
- 4794 Views
- 2 replies
- 7 kudos
- 7 kudos
Hi @Brahmareddy very good insights , I can summarize this as follows: Area Best Practice ExampleSchema ManagementDefine schemas in JSON/YAML, enforce with Delta LakeGovernanceUse Unity Catalog for access, lineage, and ownershipMonitoringSet up Lakeho...
- 7 kudos
- 10755 Views
- 4 replies
- 6 kudos
From Pilots to Production: Unlocking Enterprise AI Agents with Agent Bricks
Many enterprises today launch AI agents with high hopes but more often than not, those pilots never reach production. The culprit? Complexity, poor evaluations, ballooning costs, and governance gaps.Why do so many AI agent pilots never make it to pro...
- 10755 Views
- 4 replies
- 6 kudos
- 6 kudos
Agent Bricks looks solid for scaling AI, and I’ve seen platforms like Agentra.io also tackle the enterprise workflow side of this challenge.
- 6 kudos
- 14348 Views
- 4 replies
- 1 kudos
Databricks Academy Labs Coupon Instructions
To see all Databricks training and enablement offerings, please visit our Learning Library and Certifications Catalog. To use your Databricks Academy Labs coupons, please - Head to Databricks Academy Across the top navigation, select Subscriptions C...
- 14348 Views
- 4 replies
- 1 kudos
- 1 kudos
is this coupon still valid, not able to use it ?
- 1 kudos
-
Access Data
1 -
Access Delta Tables
1 -
ADF Linked Service
1 -
ADF Pipeline
1 -
Advanced Data Engineering
6 -
agent bricks
2 -
Agentic AI
3 -
AI
3 -
AI Agents
5 -
AI Readiness
1 -
AIBI
1 -
Analytics
1 -
Analytics Engineering
1 -
Apache spark
3 -
Apache Spark 3.0
2 -
ApacheSpark
1 -
Architecture
5 -
Associate Certification
2 -
Audit
1 -
Auto-loader
1 -
Automation
1 -
AWSDatabricksCluster
2 -
Azure
3 -
Azure databricks
3 -
Azure Databricks Delta Table
1 -
Azure Databricks Job
2 -
Azure Delta Lake
3 -
Azure devops integration
1 -
Azure Unity Catalog
2 -
AzureDatabricks
2 -
BI
1 -
BI Integrations
1 -
Big data
1 -
Billing and Cost Management
2 -
Blog
1 -
Caching
2 -
CDC
3 -
CDF
1 -
Certification
1 -
Certification Badge
1 -
Certification Exam
1 -
CICD
2 -
CICDForDatabricksWorkflows
1 -
Cluster
1 -
Cluster Policies
1 -
Cluster Pools
1 -
Collect
1 -
Community Event
1 -
CommunityArticle
2 -
Cost Optimization Effort
2 -
CostOptimization
2 -
custom compute policy
1 -
CustomLibrary
1 -
DABs
1 -
DAIS 0206
3 -
DAIS 2026
2 -
Dashboards
2 -
Data
1 -
Data Analysis with Databricks
1 -
Data Architecture
2 -
Data Driven AI Roadmap
1 -
Data Engineering
16 -
Data Governance
5 -
Data Ingestion
2 -
Data Ingestion & connectivity
1 -
data layout
1 -
Data Mesh
1 -
data optimization
1 -
Data Processing
1 -
Data Quality
2 -
Data warehouse
1 -
Data Warehousing
1 -
databricks
3 -
Databricks App
1 -
Databricks Apps
2 -
Databricks Assistant
2 -
Databricks Certified
1 -
Databricks Community
1 -
Databricks Dashboard
2 -
Databricks Delta Table
2 -
Databricks Demo Center
1 -
Databricks genAI associate
1 -
databricks genie
1 -
Databricks Job
2 -
Databricks Lakeflow
3 -
Databricks Lakehouse
2 -
Databricks Migration
3 -
Databricks Mlflow
1 -
Databricks News
1 -
Databricks Notebooks
1 -
Databricks Pyspark
3 -
Databricks Serverless
1 -
Databricks Support
1 -
Databricks Training
1 -
Databricks Unity Catalog
3 -
Databricks Workflows
3 -
DatabricksAutomation
1 -
DatabricksML
1 -
DatabricksOptimization
1 -
DataEngineering
1 -
DBR Versions
1 -
Declartive Pipelines
2 -
DeepLearning
1 -
Delta Lake
11 -
Delta Lake Files
1 -
Delta Live Table
2 -
Delta Live Tables
1 -
Delta Time Travel
1 -
Delta-lake
1 -
DeltaLake
1 -
DevOps
2 -
DimensionTables
1 -
DLT
2 -
DLT Pipelines
3 -
DLT-Meta
1 -
Dns
1 -
Dynamic
1 -
ETL Pipelines
2 -
fastapi
1 -
Free Databricks
3 -
Free Edition
1 -
GenAI
1 -
GenAI agent
2 -
GenAI and LLMs
4 -
GenAIGeneration AI
2 -
Generation AI
1 -
Generative AI
2 -
Generative AI Engineer
1 -
Genie
3 -
Git
1 -
Google Bigquery
1 -
Google cloud
1 -
Governance
2 -
Governed Tag
1 -
hackathon
1 -
Hive metastore
1 -
Hubert Dudek
42 -
Hybrid Lakehouse
1 -
Kafka streaming
2 -
LakeBase
4 -
Lakeflow
1 -
Lakeflow Pipelines
1 -
Lakehouse
3 -
Lakehouse Migration
1 -
Langchain
1 -
LangGraph
1 -
Lazy Evaluation
1 -
Learning
1 -
Library Installation
1 -
Lineage
2 -
LiquidClustering
2 -
Live Tables CDC
1 -
Llama
1 -
LLM
1 -
LLMs
1 -
Machine Learning
1 -
mcp
2 -
Medallion Architecture
3 -
MERGE Performance
2 -
Metadata
2 -
Metric Views
2 -
Microsoft Teams
1 -
Migration
1 -
Migrations
1 -
mosic ai search
1 -
MSExcel
3 -
Multi-Table Transactions
1 -
Multiagent
3 -
Networking
2 -
New Features
1 -
NotMvpArticle
1 -
Optimize Command
1 -
Partitioning
3 -
Partner
1 -
Performance
2 -
Performance Tuning
3 -
PII
1 -
Powerbi
1 -
PredictiveOptimization
1 -
Private Link
1 -
Pyspark
6 -
Pyspark Code
1 -
Pyspark Databricks
1 -
Pytest
1 -
Python
1 -
Reading-excel
2 -
Row Level Security
1 -
SAP
2 -
Sap Hana Driver
1 -
Scala Code
1 -
Scd Type 2
1 -
Scripting
1 -
SDK
1 -
Security
1 -
Semantic Layer
1 -
Serverless
2 -
slack
1 -
Spark
6 -
Spark Caching
1 -
Spark Performance
1 -
SparkSQL
1 -
SQL
3 -
Sql Scripts
2 -
SQL Serverless
1 -
streaming
1 -
streamlit
1 -
Structured streaming
1 -
Students
2 -
Support Ticket
1 -
Sync
1 -
Training
1 -
Tutorial
3 -
UCSD
1 -
Unit Test
1 -
Unity Catalog
12 -
Unity Cataloge
1 -
Unity Catlog
1 -
University Alliance
1 -
VACUUM Command
1 -
Variant
1 -
Warehousing
1 -
Workflow Jobs
1 -
Workflows
9 -
Zerobus
2 -
Zordering
1
- « Previous
- Next »
| User | Count |
|---|---|
| 85 | |
| 75 | |
| 67 | |
| 62 | |
| 44 |