- 1654 Views
- 2 replies
- 2 kudos
Data load issue
I have a job in Databricks which completed successfully, but the data has not been written into the target table. I have checked all the possible ways; everything in the code is correct: target table name, source table name, etc. It is a Fu...
This looks like a misconfigured Query Watchdog, specifically the config below: spark.conf.get("spark.databricks.queryWatchdog.outputRatioThreshold"). Please check the value of this config; it is 1000 by default. Also, we recommend using Jobs Comput...
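A minimal sketch of that check, assuming a notebook with an active `spark` session (the reset value is illustrative and matches the stated default):

```python
# Minimal sketch: inspect the Query Watchdog output ratio threshold, which can
# cancel queries that produce far more output rows than input rows when set too low.
current = spark.conf.get("spark.databricks.queryWatchdog.outputRatioThreshold")
print(f"outputRatioThreshold = {current}")  # default is 1000

# Illustrative only: restore the default for this session if it was lowered.
spark.conf.set("spark.databricks.queryWatchdog.outputRatioThreshold", 1000)
```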
- 333 Views
- 1 replies
- 1 kudos
Delta UniForm
When we save a Delta table using the UniForm option, we see a 50% drop in table size. When we enable UniForm on an existing Delta table after the fact, we see no change in data size. Is this expected, or are others seeing this as well?
What format are you starting with? e.g. CSV -> Delta.
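For reference, a minimal sketch of enabling UniForm on an existing table (table name illustrative; properties per the Databricks UniForm documentation). Since this step only adds Iceberg metadata next to the existing Parquet data files, seeing no size change after the fact is plausible:

```python
# Minimal sketch (table name illustrative): add UniForm Iceberg metadata to an
# existing Delta table. This writes Iceberg metadata alongside the existing
# Parquet data files; it does not rewrite or recompress the data itself.
spark.sql("""
  ALTER TABLE main.default.existing_table SET TBLPROPERTIES (
    'delta.enableIcebergCompatV2' = 'true',
    'delta.universalFormat.enabledFormats' = 'iceberg'
  )
""")
```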
- 449 Views
- 1 replies
- 2 kudos
Resolved! AutoLoader Pros/Cons When Extracting Data (Cross-Post)
Cross-posting from: https://community.databricks.com/t5/data-engineering/autoloader-pros-cons-when-extracting-data/td-p/127400
Hi there, I am interested in using AutoLoader, but I'd like to get a bit of clarity on whether it makes sense in my case. Based on e...
You’ve already identified data duplication as a potential con of landing the data first, but there are several benefits to this approach that might not be immediately obvious. Schema Inference and Evolution: AutoLoader can automatically infer the sche...
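A minimal sketch of the schema inference and evolution being described, assuming an active `spark` session (paths and evolution mode illustrative):

```python
# Minimal sketch (paths illustrative): Auto Loader infers the schema on the first
# run, persists it at schemaLocation, and can add new columns as they appear.
df = (
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.schemaLocation", "/Volumes/main/default/_schemas/landing")
    .option("cloudFiles.schemaEvolutionMode", "addNewColumns")
    .load("/Volumes/main/default/landing/")
)
```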
- 421 Views
- 3 replies
- 2 kudos
Resolved! Python module import with Dedicated access mode
I currently have a repo connected in Databricks and I was able to correctly import a Python module from the src folder located in the same root. Since I am using a Machine Learning runtime, I am forced to choose a Dedicated (formerly: Single user) access m...
Thanks @szymon_dybczak! I confirm that it's a permission issue and assigning "CAN MANAGE" solves it. I still find it not really intuitive, since the goal is to use a shared cluster (with ML runtime) for development purposes. I mean, it would make sense ...
- 257 Views
- 0 replies
- 0 kudos
Automating technical documentation in ETL pipelines using LLMs
Generate pipeline documentation using LLMs and rich metadata extracts. As enterprise data environments expand, the complexity of maintaining accurate and current documentation across ETL pipelines has intensified. While modern platforms such as Databri...
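The post is only previewed here, but a minimal sketch of the metadata-extraction step it describes might look like this, assuming Unity Catalog's information_schema (catalog and schema names illustrative):

```python
# Minimal sketch (catalog/schema illustrative): pull column-level metadata from
# Unity Catalog's information_schema as raw material for an LLM documentation prompt.
rows = spark.sql("""
  SELECT table_name, column_name, data_type, comment
  FROM main.information_schema.columns
  WHERE table_schema = 'default'
""").collect()

prompt = "Write documentation for these columns:\n" + "\n".join(
    f"{r.table_name}.{r.column_name}: {r.data_type} ({r.comment or 'no comment'})"
    for r in rows
)
```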
- 423 Views
- 1 replies
- 1 kudos
Using mergeSchema with spark.read.csv for inconsistent schemas
The Problem: A common data engineering challenge is reading a directory of CSV files where the schemas are inconsistent. For example, some files might have columns in a different order, or be missing certain columns altogether. The standard behavior o...
Very nice example and explanation, it helped me a lot, thanks!
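The thread's worked example isn't fully visible in this preview; a minimal sketch of one common approach is to read each file separately and union by column name, so missing columns become nulls (paths illustrative):

```python
# Minimal sketch (paths illustrative): align inconsistent CSV schemas by unioning
# on column names; columns missing from a file are filled with nulls.
from functools import reduce

files = ["/Volumes/main/default/raw/a.csv", "/Volumes/main/default/raw/b.csv"]
dfs = [spark.read.option("header", "true").csv(f) for f in files]
merged = reduce(lambda a, b: a.unionByName(b, allowMissingColumns=True), dfs)
merged.printSchema()
```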
- 214 Views
- 1 replies
- 0 kudos
Unity Catalog tool function with custom parameters not being used
I have created a UC tool that takes in a few custom STRING parameters. I gave this tool to an AI agent using the Mosaic AI Agent Framework with hardcoded parameter values for testing. The issue is that my AI agent hallucinates and injects its own AI-gener...
Hi @Boban12335, can we get the UC function definition to understand your problem better? Best Regards, Nivethan V
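For context, a minimal sketch of a UC tool function whose parameter COMMENT spells out how the argument must be supplied (all names hypothetical); agent frameworks generally rely on these comments when filling in tool arguments:

```python
# Minimal sketch (all names hypothetical): a UC SQL function with an explicit
# parameter COMMENT; the agent reads function and parameter comments to decide
# how to populate arguments instead of inventing values.
spark.sql("""
  CREATE OR REPLACE FUNCTION main.default.lookup_order_status(
    order_id STRING COMMENT 'Exact order ID provided by the user; never invent or rewrite this value'
  )
  RETURNS STRING
  COMMENT 'Returns the status of the order with the given ID'
  RETURN (SELECT status FROM main.default.orders WHERE id = order_id)
""")
```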
- 378 Views
- 3 replies
- 3 kudos
Resolved! AutoLoader - Write To Console (Notebook Cell) Long Running Issue
Hi there, I am likely misunderstanding how to use AutoLoader properly while developing/testing. I am trying to write a simple AutoLoader notebook cell to *read* the contents of a path with JSON files, and *write* them to console (i.e. notebook cell) i...
Hi @ChristianRRL, it looks like spark.readStream with Auto Loader creates a continuous streaming job by default, which means it keeps running while waiting for new files. To avoid this, you can control the behaviour using trigger(availableNow=True), w...
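A minimal sketch of that pattern, assuming an active `spark` session (paths illustrative): the stream drains everything currently in the path, then stops on its own instead of running indefinitely:

```python
# Minimal sketch (paths illustrative): availableNow processes the current backlog
# in batches and then terminates, rather than waiting for new files forever.
query = (
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "json")
    .option("cloudFiles.schemaLocation", "/Volumes/main/default/_schemas/demo")
    .load("/Volumes/main/default/landing/")
    .writeStream
    .format("console")
    .option("checkpointLocation", "/Volumes/main/default/_checkpoints/demo")
    .trigger(availableNow=True)
    .start()
)
query.awaitTermination()  # returns once the existing files are processed
```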
- 2897 Views
- 2 replies
- 3 kudos
Resolved! Documentation for spatial SQL public preview - Where is it?
Hi everybody, since DBR 17.1 the spatial SQL functions (st_point(), st_distancesphere, ...) are in public preview. The functionality is presented in the talk Geospatial Insights With Databricks SQL: Techniques and Applications, or discussed here in the fo...
Is this what you were after? https://docs.databricks.com/aws/en/sql/language-manual/sql-ref-st-geospatial-functions
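A minimal sketch using two of the functions the question names (coordinates illustrative; assumes DBR 17.1+ with the preview functions available):

```python
# Minimal sketch (coordinates illustrative; assumes DBR 17.1+ where the ST_*
# functions are in public preview): great-circle distance in meters between two
# points, constructed as (longitude, latitude).
spark.sql("""
  SELECT st_distancesphere(
    st_point(-122.4194, 37.7749),  -- San Francisco
    st_point(-73.9857, 40.7484)    -- New York
  ) AS distance_m
""").show()
```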
- 269 Views
- 1 replies
- 1 kudos
Resolved! Run_type has some null
Just wondering — we know that the run_type column in the job run timeline usually has only three values: JOB_RUN, SUBMIT_RUN, and WORKFLOW_RUN. So why do we also see a null value there? Any reason?
Hi @Danish1105, one possible explanation for the null values is this note in the documentation: "Not populated for rows emitted before late August 2024." In the case of my workspace this seems valid; I have only nulls wh...
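A minimal sketch to verify this explanation in your own workspace, assuming the system.lakeflow.job_run_timeline system table and its period_start_time column:

```python
# Minimal sketch (assumes access to system.lakeflow): check whether null run_type
# values cluster before late August 2024, as the documentation note suggests.
spark.sql("""
  SELECT run_type,
         MIN(period_start_time) AS earliest,
         MAX(period_start_time) AS latest,
         COUNT(*)               AS row_count
  FROM system.lakeflow.job_run_timeline
  GROUP BY run_type
""").show()
```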
- 1173 Views
- 3 replies
- 2 kudos
PERMISSION_DENIED: Cannot access Spark Connect when trying to run serverless Databricks Connect
I am not able to run a file as "run as workflow" nor "run with databricks connect" when I choose a serverless run on my paid account. However, I can perform this action in my Free Edition account. See error: pyspark.errors.exceptions.connect.SparkCon...
Hi @ivan7256, this might be because serverless compute isn't enabled for workflows in your paid workspace.
- 645 Views
- 3 replies
- 5 kudos
Databricks Free Edition Needs Transparency About Data Access
When I first discovered the Databricks Free Edition, I thought it was a generous offering for data enthusiasts, researchers, and developers who just needed a personal sandbox. No cost. Easy setup. Promises of productivity. But what caught me off guar...
Thanks again for all the perspectives shared so far. I want to re-emphasize that the Databricks Free Edition offers real value. For data enthusiasts, learners, and builders, it’s a genuinely powerful environment to get hands-on without jumping throug...
- 4650 Views
- 3 replies
- 8 kudos
Resolved! Rss feeds for databricks releases
Hi, are there any RSS feeds for the Databricks platform, SQL & runtime releases? We have a big tech stack, so it is sometimes hard to keep up with the ever-changing technologies. We use RSS feeds to keep up with all of that. Can't find anything for...
Databricks recently published an RSS feed for all their updates. As far as I can find, it is only for AWS at the moment: https://docs.databricks.com/aws/en/feed.xml
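A minimal sketch of consuming that feed programmatically, assuming the third-party feedparser package (`pip install feedparser`):

```python
# Minimal sketch (assumes the feedparser package): poll the Databricks release
# feed and print the most recent entries.
import feedparser

feed = feedparser.parse("https://docs.databricks.com/aws/en/feed.xml")
for entry in feed.entries[:5]:
    print(entry.title, "-", entry.link)
```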
- 81343 Views
- 7 replies
- 7 kudos
Resolved! How to create temporary table in databricks
Hi Team, I have a requirement where I need to create a temporary table, not a temporary view. Can you tell me how to create a temporary table in Databricks?
I see, thanks for sharing. Can you mark the solution that worked for you, @abueno, as Accepted?
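The accepted solution isn't visible in this preview. As far as I know, Databricks SQL has no CREATE TEMPORARY TABLE statement, so one common workaround (all names illustrative) is a short-lived table in a scratch schema that is dropped when the job finishes:

```python
# Minimal sketch (all names illustrative): emulate a temporary table with a
# short-lived table in a scratch schema, cleaned up at the end of the job.
spark.sql("""
  CREATE OR REPLACE TABLE main.scratch.tmp_orders AS
  SELECT * FROM main.default.orders WHERE order_date >= '2024-01-01'
""")
try:
    print(spark.table("main.scratch.tmp_orders").count())  # work with the table
finally:
    spark.sql("DROP TABLE IF EXISTS main.scratch.tmp_orders")
```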
- 472 Views
- 1 replies
- 0 kudos
CLI: Export-dir provides LatestClone
Hi everyone, I want to download the current Databricks codebase out of a workspace and tried via `databricks workspace export-dir /Sandbox/foo .`. Surprisingly, some of the subfolders appear twice in the export target: one with the expected name (`...
Hi @holunder, this could be because the backend stores both the original and cloned versions of folders, even if only one appears in the web UI. The Databricks CLI exports everything from the backend, not just what's visible in the UI.