Engage in discussions on data warehousing, analytics, and BI solutions within the Databricks Community. Share insights, tips, and best practices for leveraging data for informed decision-making.
Here's your Data + AI Summit 2024 - Warehousing & Analytics recap as you use intelligent data warehousing to improve performance and increase your organization’s productivity with analytics, dashboards and insights.
Keynote: Data Warehouse presente...
Today was the first day of summit and learned about how different models can be deployed for different decision points. I'm curious to know what users have found to be model control issues regarding this. Thanks!
@Eric Kieft​ :In Databricks, the Recents view shows the recently accessed notebooks, dashboards, and folders. However, it does not show the exact location of the item. To determine the location of an item in the Recents view, you can try the followin...
We have noticed that users can schedule SQL queries, but currently we haven't found a way to find these scheduled queries (this does not show up in the jobs workplane). Therefore, we don't know that people scheduled this. The only way is to look at t...
@Paulo Rijnberg​ :In Databricks, you can use the following approaches to prevent users from scheduling SQL queries and to receive notifications when such queries are scheduled:Cluster-level permissionsJobs APINotification hooksAudit logs and monitori...
Hi @Shubham Agagwral​ Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answ...
I am building a data pipeline using Delta Live table in Azure Databricks to move data from a raw data table to a feature table and model inference results table. However, I am concerned about the potential for duplication issues in future operations,...
Hi @Chengcheng Guo​ Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.
Hi, I'm currently starting to use SQL Warehouse, and we have most of our lake in a compression different than snappy.
How can I set the SQL warehouse to use a compression like gzip, zstd, on CREATE, INSERT, etc?
Tried this:
set spark.sql.parquet.comp...
Hi @Alejandro Martinez​ Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.
I created the following model:
which calls get_identifier_information() which is as follows:
This is how I log the model
And this is the error I am running into:
RuntimeError: It appears that you are attempting to reference SparkContext...
Hi @Nikhil Gajghate​ Great to meet you, and thanks for your question! Let's see if your peers in the community have an answer to your question. Thanks.
Hello, we're receiving an error when running glue jobs to try and connect to and read from a Databricks SQL endpoint.
Hello, we're receiving an error when running glue jobs to try and connect to and read from a Databricks SQL endpoint.
An error occ...
Hello @Vidula Khanna​ @Debayan Mukherjee​ ,I wanted to give you an update that might be helpful for your future customers, we worked with @Pavan Kumar Chalamcharla​ and through lots of trial and error we figured out a combination that works for SQL e...
I am trying to identify errors coming from Databricks. So I can handle them in my code.Sometimes I get a descriptive error, that points me to the exact problem, but then if I run the exact same test, I sometimes get "request error: invalid operation ...
Can you share what command were you executing ? Also were you currently doing exception handling within your code with try/catch exception ? Can you also check the driver logs during the time error happened ? The driver logs should have more detail...
For purpose of posting video, can we post it on Google Drive, and share relevant link while submitting? Video will be made accessible to anyone with the link.Tagging @Karen Bajza-Terlouw​ and @Michelle Brain​
Hello! The video should be uploaded to and made publicly visible on YouTube, Vimeo, Facebook Video, or Youku so that it can playback on Devpost. This makes it easier for the judges
https://community.databricks.com/s/question/0D58Y00009fClizSAC/ssh-connection-with-paramiko i read in this post that you can help me with that problem, please can you give me more advice what i have to do to connect to an sftp through ssh
Hi!Could you please let me know what your current blocker is? Are you looking for a code snippet that can help you get file through sftp? Or if you are looking for the spark config that whitelist the port? Or if you are blocked by some other error?
Availability of SQL Warehouse to Data Science and Engineering persona
​
Hi All,
Now we can use SQL Warehouse in our notebook execution.
It's in preview now and soon will be GA.
Hi team,
I am looking for a way to find DBU cost for DLT clusters, does it get stored anywhere I have been looking into event_logs but did not find information related to cost. it does have cluster resource utilization details.
here is what I found...
Hi @Chhaya Vishwakarma​ Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best an...