Get Started Resources
Explore essential resources to kickstart your journey with Databricks. Access tutorials, guides, and...
Stay updated on Databricks events, including webinars, conferences, and workshops. Discover opportun...
Find answers to common questions and troubleshoot issues with Databricks support FAQs. Access helpfu...
Explore in-depth articles, tutorials, and insights on data analytics and machine learning in the Dat...
Dive into a collaborative space where members like YOU can exchange knowledge, tips, and best practi...
Stay up-to-date with the latest announcements from Databricks. Learn about product updates, new feat...
Community-produced videos to help you leverage Databricks in your Data & AI journey. Tune in to expl...
This post is written by Ellen Hirt, Senior Specialist Solutions Engineer, and Pascal Vogel, Solutions Architect. Over the past year, Databricks has supported many teams with building Retrieval Augmented Generation (RAG) applications. We've noticed th...
Great post! Do you have a recommended set of best practices for clustering queries when doing embedding model evaluation and selection? I've found that without clustering, users have to choose to either use a broad average on their whole dataset, or ...
Introduction Announced at the Data + AI Summit in June 2023, Lakehouse Federation in Databricks is a groundbreaking new capability that allows you to query data across external data sources - including Snowflake, Synapse, many others and even Databri...
Thanks @mido1978 for your reply. So if I understand correctly, it's been looked at by the Databricks development team, and that's good news! (For now, yes, we are already copying the Delta tables out of OneLake to Gen2, which obviously defeats the purp...
This is part 2 of a two-part series on Structured Extraction with LLM on Databricks. Read here for part 1! Introduction In part 1 of this series, I demonstrated how to use a large language model (LLM) with structured output and AI_QUERY to perform ...
The global economy in 2024 is a tale of two forces: optimism sparked by falling interest rates, and the uncertainty caused by geopolitical unrest. As companies focus on sustainable growth, such a volatile environment inevitably tests their resilience...
The world of artificial intelligence (AI) and data analytics is about to get a significant boost, thanks to Databricks’ collaboration with NVIDIA. This work brings together the cutting-edge capabilities of Databricks’ Mosaic AI platform and NVIDIA AI...
What is a Databricks Workspace IP Access List? The Databricks Workspace IP Access List is a security feature that allows administrators to control access to the Databricks workspace by specifying which IP addresses or IP ranges are allowed or denied a...
Organizations on Microsoft Azure commonly use Microsoft Dynamics 365 as their CRM or ERP application. Leveraging these applications' data for Business Intelligence, Machine Learning, or Artificial Intelligence gives organizations a real competitive ...
Apologies, my above comment on the datetime format is wrong, of course; the format in the article is correct, I just forgot to add the legacy timeParserPolicy settings. # To parse timestamps correctly: spark.conf.set("spark.sql.legacy.timeParserPolicy", "...
Data sharing among collaborators has become increasingly important across industries, but it must be done in compliance with data protection regulations like CCPA and GDPR. Databricks offers two key solutions to enable secure and compliant data colla...
Is there a part 2 of this series?
Have you ever wondered what the difference between managed and external tables is? Olivia Zhang, a Solutions Architect at Databricks, goes over the differences and explains when to use what using an example in this quick video! Target Audience - Dat...
Here is a basic difference between managed table and external table: Managed Tables: Managed tables are fully controlled by Databricks, including both the data and metadata lifecycle. Data is stored in a Databricks-managed storage location configured b...
Databricks Observability using Grafana and Prometheus As software systems scale, the amount of data they process and the work they do grows with them. This is not limited only to cloud-native services but also data pipelines that support them. When y...
Thank you @aleksandar. Can you provide the link for the Grafana dashboard file?
Upskill on SQL analytics and BI with three short self-paced videos As organizations seek to democratize their data, there is an increasing demand to enable users to better understand and work with data. Learn how to better understand and analyze data...
In this video, a Senior Specialist Solutions Architect at Databricks goes over observability on Databricks and how it can be achieved using system tables. Observability is the ability to monitor and understand how an application is behaving. Thi...
Looking to securely connect your private resources to Databricks Serverless? In our latest blog series, we explore how Private Link offers a seamless, secure connection between your cloud environments and Databricks' serverless offerings. Learn how t...
Private and dedicated connectivity patterns for Databricks Serverless using Private Link allow organizations to securely connect to Databricks services without exposing their traffic to the public internet. This enhances security, compliance, and net...
Data teams face many challenges in quickly accessing the right data, primarily due to data fragmentation, the time and cost involved in consolidating data, and difficulties in managing data governance across many systems. With Lakehouse Federation capabiliti...
This blog talks about common ways in which Hive-style partitioning is used as a workaround for efficient data storage. Liquid Clustering improves on partitioning and Z-ordering techniques by simplifying data layout decisions while optimizing query performan...
You mentioned a target size in the picture; what does it refer to? Also, does it resolve the ConcurrentAppendException?