- 3018 Views
- 1 replies
- 0 kudos
Extending the duration of Ganglia metrics logs
Any insights on how to analyse Ganglia Metrics logs for an extended duration of time, not just 15 minute snapshots? We need to visualize cluster CPU utilization for the duration of cluster uptime
- 3018 Views
- 1 replies
- 0 kudos
- 0 kudos
One option here would be integration with Observability tools such as Datadog which can capture the cluster metrics on a more NRT basis. More details are here - https://docs.datadoghq.com/integrations/databricks/?tab=driveronly
- 0 kudos
- 1409 Views
- 0 replies
- 0 kudos
What's the right batch size in deep learning training?
Using Ganglia you can monitor how busy is the GPU(s). Increasing the batch size would increase that utilization. Bigger batches improve how well each batch updates the model (up to a point) with more accurate gradients. That in turn can allow trainin...
- 1409 Views
- 0 replies
- 0 kudos
- 4004 Views
- 0 replies
- 0 kudos
What's Early Stopping in Hyperopt? When should it be used?
It’s advantageous to stop running trials if progress has stopped. Hyperopt offers an early_stop_fn parameter, which specifies a function that decides when to stop trials before max_evals has been reached. Hyperopt provides a function no_progress...
- 4004 Views
- 0 replies
- 0 kudos
- 1731 Views
- 1 replies
- 0 kudos
Resolved! How to determine if am using the same DBR minor version?
DBR minor version details are not exposed. However, in the documentation, it mentioned Databricks performs maintenance releases every 2 weeks. How can I determine if I am using the same minor version
- 1731 Views
- 1 replies
- 0 kudos
- 0 kudos
The below code snippet can help to determine the DBR Hash string for the DBR version. DBR hash string is unique for the DBR minor version. val scalaVersion = scala.util.Properties.versionString val hadoopVersion = org.apache.hadoop.util.VersionInf...
- 0 kudos

- 1778 Views
- 2 replies
- 0 kudos
- 1778 Views
- 2 replies
- 0 kudos
- 0 kudos
Delete workspace doesn't delete the root bucket. You could choose to use the same root bucket for more than one workspace ( though not recommended ) It is recommended to automate the infrastructure creation via terraform or quickstart so that cleanup...
- 0 kudos

- 2851 Views
- 1 replies
- 0 kudos
Monitoring jobs
Are there any event streams that are or could be exposed in AWS (such as Cloudwatch Eventbridge events or SNS messages? In particular I'm interested in events that detail jobs being run. The use case here would be for monitoring jobs from our web app...
- 2851 Views
- 1 replies
- 0 kudos
- 0 kudos
You could write code to call the PutLogEvents api at the beginning of each job to write out custom events to cloudwatch / or use aws sdk to send and SNS notification and route it to a desired consumer.
- 0 kudos
- 2529 Views
- 1 replies
- 0 kudos
Azure Databricks Repos and HIPAA
Are Repos HIPAA compliant, or is there a plan and timeline to support this? Customer is getting a warning when trying to enable the Repos feature in a HIPAA deployment on Azure Databricks.
- 2529 Views
- 1 replies
- 0 kudos
- 0 kudos
There is a plan to support this. For timeline, please reach out to your Databricks account team.
- 0 kudos
- 1889 Views
- 1 replies
- 0 kudos
- 1889 Views
- 1 replies
- 0 kudos
- 0 kudos
Unfortunately this is not possible. The default user workspace name will be the user's email address.
- 0 kudos
- 1407 Views
- 1 replies
- 0 kudos
What do I need to think about for Disaster Recovery planning?
I am working on a disaster recovery plan for my environment which includes Databricks. Where do I start with my planning? What all do I need to consider when building a DR plan?
- 1407 Views
- 1 replies
- 0 kudos
- 0 kudos
Depending on your RPO/RTOs there are different recovery solution strategies that could be considered (active/passive, active/active) for Databricks deployments. A detailed explanation of these approaches are mentioned here
- 0 kudos
- 1481 Views
- 1 replies
- 0 kudos
Can you use credential passthrough for users running jobs?
I would like it if I could make it so that the credentials of the user who initiates a job are used as the credentials for the job run. Is this possible?
- 1481 Views
- 1 replies
- 0 kudos
- 0 kudos
Is this in Azure? If so, it is not supported currently. https://docs.microsoft.com/en-us/azure/databricks/security/credential-passthrough/adls-passthrough
- 0 kudos
- 2017 Views
- 1 replies
- 0 kudos
- 2017 Views
- 1 replies
- 0 kudos
- 0 kudos
Yes. We support this.Please see https://docs.databricks.com/administration-guide/workspace/storage.html#modify-the-storage-location-for-notebook-results and https://docs.databricks.com/administration-guide/workspace/storage.html#configure-the-storage...
- 0 kudos
- 1470 Views
- 1 replies
- 0 kudos
- 1470 Views
- 1 replies
- 0 kudos
- 0 kudos
All Databricks workspaces are deployed with this setting as of Q1 of 2021.
- 0 kudos
- 1551 Views
- 1 replies
- 0 kudos
- 1551 Views
- 1 replies
- 0 kudos
- 0 kudos
Yes. Users can have access to multiple workspaces. Workspaces are units of collaboration.
- 0 kudos
- 2124 Views
- 1 replies
- 1 kudos
Resolved! If I use cluster pools, am I charged for the machines in the pool that are not in active use?
- 2124 Views
- 1 replies
- 1 kudos
- 1 kudos
Databricks only charges for compute time while machines are being used. If a machine is on "IDLE" then Databricks does not charge you for those machines. The cloud providers will charge you for the machines that are running regardless if they are IDL...
- 1 kudos
- 1807 Views
- 1 replies
- 0 kudos
- 1807 Views
- 1 replies
- 0 kudos
- 0 kudos
Yes, you need an Azure Databricks account on the Premium plan to enable Databricks SQL
- 0 kudos
Join Us as a Local Community Builder!
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now-
Access control
1 -
Apache spark
1 -
AWS
5 -
Azure
7 -
Azure databricks
5 -
Billing
2 -
Cluster
1 -
Compliance
1 -
Data Ingestion & connectivity
5 -
Databricks Runtime
1 -
Databricks SQL
2 -
DBFS
1 -
Dbt
1 -
Delta
4 -
Delta Sharing
1 -
DLT Pipeline
1 -
GA
1 -
Gdpr
1 -
Github
1 -
Partner
16 -
Public Preview
1 -
Service Principals
1 -
Unity Catalog
1 -
Workspace
2
- « Previous
- Next »
User | Count |
---|---|
43 | |
33 | |
25 | |
17 | |
10 |