- 1685 Views
- 5 replies
- 7 kudos
From Associate to Professional: My Learning Plan to ace all Databricks Data Engineer Certifications
In today’s data-driven world, the role of a data engineer is critical in designing and maintaining the infrastructure that allows for the efficient collection, storage, and analysis of large volumes of data. Databricks certifications holds significan...
- 1685 Views
- 5 replies
- 7 kudos
- 7 kudos
@ms_ccg You are correct. I got that error too. Seems like Databricks has removed some of these. I would suggest you to search for those separately via Databricks Academy or external resources. Let me know if you need any help.
- 7 kudos
- 876 Views
- 0 replies
- 0 kudos
Monitoring a Streaming Job
If you have a streaming job, you need to check the batch metrics to be able to understand the stream progress. However, here are some other suggestions which we can use to monitor a streaming job and be stuck in a "hung" state. Streaming Listeners sp...
- 876 Views
- 0 replies
- 0 kudos
- 751 Views
- 0 replies
- 0 kudos
Why configure a job timeout?
If you use Databricks Jobs for your workloads, it is possible you might have run into a situation where you find your jobs to be in "hung" state. Before cancelling the job it is important to collect the thread dump as I described here to be able to f...
- 751 Views
- 0 replies
- 0 kudos
- 2623 Views
- 0 replies
- 1 kudos
Databricks Community Edition Login - Sign Up/Sign In/Forgot Password
Sign Up Go to https://www.databricks.com/try-databricksFill in the 2 steps box on the right hand side Note - It is important to select the Personal use section in the above step. Sign In Enter your details here https://accounts.cloud.databricks.com/...
- 2623 Views
- 0 replies
- 1 kudos
- 1115 Views
- 1 replies
- 0 kudos
A handy tool called spark-column-analyser
I just wanted to share a tool I built called spark-column-analyzer. It's a Python package that helps you dig into your Spark DataFrames with ease.Ever spend ages figuring out what's going on in your columns? Like, how many null values are there, or h...
- 1115 Views
- 1 replies
- 0 kudos
- 0 kudos
An example added to README in GitHubDoing analysis for column PostcodeJson formatted output{"Postcode": {"exists": true,"num_rows": 93348,"data_type": "string","null_count": 21921,"null_percentage": 23.48,"distinct_count": 38726,"distinct_percentage"...
- 0 kudos
- 738 Views
- 0 replies
- 2 kudos
Schema evolution clause added to SQL merge syntax
You can now add the WITH SCHEMA EVOLUTION clause to a SQL merge statement to enable schema evolution for the operation. For more information: https://docs.databricks.com/en/delta/update-schema.html#sql-evo #Databricks
- 738 Views
- 0 replies
- 2 kudos
- 1085 Views
- 0 replies
- 1 kudos
VariantType + Parse_json()
In Spark 4.0, there are no more data type mismatches when converting dynamic JSONs, as the new data type VariantType comes with a new function to parse JSONs. Stay tuned for 4.0 release.
- 1085 Views
- 0 replies
- 1 kudos
- 1244 Views
- 0 replies
- 1 kudos
Type widening is in Public Preview
You can now enable type widening on tables backed by Delta Lake. Tables with type widening enabled allow changing the type of columns to a wider data type without rewriting underlying data files. For more information:https://docs.databricks.co...
- 1244 Views
- 0 replies
- 1 kudos
- 928 Views
- 1 replies
- 0 kudos
How to convert txt files to delta tables
Hello members of Databricks's comunity,I am currently working on a project where we collect data from machines, that data is in .txt format. The data is currently in an Azure container, I need to clean the files and convert them to delta tables, how ...
- 928 Views
- 1 replies
- 0 kudos
- 0 kudos
https://docs.databricks.com/en/ingestion/add-data/upload-data.html
- 0 kudos
- 551 Views
- 0 replies
- 0 kudos
RocksDB for storing state stream
Now, you can keep the state of stateful streaming in RocksDB. For example, retrieving keys from memory to check for duplicate records inside the watermark is now faster. #databricks
- 551 Views
- 0 replies
- 0 kudos
- 662 Views
- 0 replies
- 1 kudos
State of stateful streaming
For stateful streaming in #databricks, you can now easily read what is in the state.
- 662 Views
- 0 replies
- 1 kudos
- 1430 Views
- 4 replies
- 0 kudos
Unable to mount GCS bucket with underscores in the name
I have two buckets with the same configurations and labels.One is named my-bucket and the other is my_bucket. I am able to mount my-bucket but get an opaque error message when trying to mount my_bucket. Is this known/expected behavior? Are underscore...
- 1430 Views
- 4 replies
- 0 kudos
- 0 kudos
Hi @legobricks , Curious on the error that you are getting. However, for GCS - https://cloud.google.com/storage/docs/buckets#naming I do see underscores are allowed but there is also a note below: You can use a bucket name in a DNS record as part of ...
- 0 kudos
- 2060 Views
- 2 replies
- 1 kudos
Debug stream
One of the easiest ways to debug a stream is to stream it to memory and query it with SQL #databricks
- 2060 Views
- 2 replies
- 1 kudos
- 1028 Views
- 0 replies
- 0 kudos
Financial Crime detection with the help of Apache Spark, Data Mesh and Data Lake
For those interested in Data Mesh and Data Lakes for FinCrime detection:Data mesh is a relatively new architectural concept for data management that emphasizes domain-driven data ownership and self-service data availability. It promotes the decentral...
- 1028 Views
- 0 replies
- 0 kudos
- 1119 Views
- 0 replies
- 0 kudos
Hiring Databricks Data Architect roles
Hi,I am a recruiter and I am looking for places to post some data bricks I have coming out. I have several fully remote, high-level data databricks, architect roles. Of course I will post to LinkedIn, but I was just curious if there are any other pla...
- 1119 Views
- 0 replies
- 0 kudos
- 1742 Views
- 1 replies
- 2 kudos
Databricks Notebook Workflow
Exciting news for Databricks users! The ability to view job details within the notebook workflow section, particularly for multithreaded jobs, is available now. Instead of manually inspecting each job for failures, this feature enables us to swiftly ...
- 1742 Views
- 1 replies
- 2 kudos
Connect with Databricks Users in Your Area
Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.
If there isn’t a group near you, start one and help create a community that brings people together.
Request a New Group-
ADF Linked Service
1 -
ADF Pipeline
1 -
Append blob
1 -
AWS
1 -
Azure databricks
1 -
Azure DevOps
1 -
Azure devops integration
1 -
ChangingSchema
1 -
CICD
1 -
CICDForDatabricksWorkflows
1 -
Clone
1 -
Cluster
1 -
Cluster Pools
1 -
compute policies
1 -
compute policy
1 -
Cost
1 -
Cost Optimization Effort
1 -
custom compute policy
1 -
CustomLibrary
1 -
Data Engineering
1 -
Databricks Delta Table
1 -
Databricks Demo Center
1 -
Databricks jobs
1 -
Databricks Migration
1 -
Databricks Mlflow
1 -
Databricks Support
1 -
Databricks Unity Catalog
2 -
Databricks Workflows
1 -
DatabricksML
1 -
DatabricksWorkflowsCICD
1 -
Date
1 -
Delta Lake
3 -
Devops
1 -
DimensionTables
1 -
Dns
1 -
Dynamic
1 -
Hive metastore
1 -
Jobs & Workflows
1 -
LakeFlow
1 -
Library Installation
1 -
Mlops
1 -
Networking
1 -
Partner
1 -
Private Link
1 -
Pyspark Code
1 -
Question
1 -
Scala Code
1 -
Schema
1 -
Schema Evaluation
1 -
Serverless SQL Datawarehouse
1 -
Spark
4 -
SparkSQL
1 -
Support Ticket
1 -
Sync
1 -
ucx
1 -
Unity Catalog
2 -
Unity Catlog
1 -
Workflow Jobs
1 -
Workflows
1
- « Previous
- Next »
User | Count |
---|---|
40 | |
12 | |
10 | |
7 | |
6 |