cancel
Showing results for 
Search instead for 
Did you mean: 
Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

User16765131552
by Databricks Employee
  • 2801 Views
  • 1 replies
  • 0 kudos

Resolved! Databricks SQL dashboard refresh

In Databricks SQL, can you prohibit a dashboard from being refreshed?

  • 2801 Views
  • 1 replies
  • 0 kudos
Latest Reply
User16765131552
Databricks Employee
  • 0 kudos

It looks like this can be done by not granting CAN_RUN to a user/grouphttps://docs.databricks.com/sql/user/security/access-control/dashboard-acl.html#dashboard-permissions

  • 0 kudos
Anonymous
by Not applicable
  • 1481 Views
  • 1 replies
  • 0 kudos
  • 1481 Views
  • 1 replies
  • 0 kudos
Latest Reply
User16826994223
Databricks Employee
  • 0 kudos

One way to increase the CIDR range id the ip list is available or create a complete different work space on same vpc with different subnets

  • 0 kudos
User16826992666
by Databricks Employee
  • 1606 Views
  • 1 replies
  • 0 kudos

Okta Integration

My company uses Okta as a SSO provider. Can I integrate Okta with Databricks for a SSO experience?

  • 1606 Views
  • 1 replies
  • 0 kudos
Latest Reply
brickster_2018
Databricks Employee
  • 0 kudos

Yes, okta is among the supported identity providers. Read more here : https://docs.databricks.com/administration-guide/users-groups/single-sign-on/index.html

  • 0 kudos
MoJaMa
by Databricks Employee
  • 1412 Views
  • 1 replies
  • 0 kudos
  • 1412 Views
  • 1 replies
  • 0 kudos
Latest Reply
MoJaMa
Databricks Employee
  • 0 kudos

Yes. There is a property called dbus_per_hour that you can add to your cluster policy.See https://docs.databricks.com/administration-guide/clusters/policies.html#cluster-policy-virtual-attribute-pathsHere's an example policy that uses it:https://docs...

  • 0 kudos
brickster_2018
by Databricks Employee
  • 7832 Views
  • 1 replies
  • 0 kudos

Resolved! My cluster is running an init script, and I want to see what's going on.

After I log in to the workspace, where can I find the logs?

  • 7832 Views
  • 1 replies
  • 0 kudos
Latest Reply
brickster_2018
Databricks Employee
  • 0 kudos

See public docs: https://docs.databricks.com/clusters/init-scripts.html#cluster-scoped-init-script-logsDon't forget to enable cluster log delivery: https://docs.databricks.com/clusters/configure.html#cluster-log-deliveryNote that this only works for ...

  • 0 kudos
User16790091296
by Databricks Employee
  • 2611 Views
  • 1 replies
  • 0 kudos
  • 2611 Views
  • 1 replies
  • 0 kudos
Latest Reply
User16790091296
Databricks Employee
  • 0 kudos

At a high level a Lakehouse must contain the following properties:Open direct access data formats (Apache Parquet, Delta Lake etc.)First class support for machine learning and data science workloadsstate of the art performance Databricks is the firs...

  • 0 kudos
User16826987838
by Databricks Employee
  • 3414 Views
  • 1 replies
  • 0 kudos

Extending the duration of Ganglia metrics logs

 Any insights on how to analyse Ganglia Metrics logs for an extended duration of time, not just 15 minute snapshots? We need to visualize cluster CPU utilization for the duration of cluster uptime

  • 3414 Views
  • 1 replies
  • 0 kudos
Latest Reply
aladda
Databricks Employee
  • 0 kudos

One option here would be integration with Observability tools such as Datadog which can capture the cluster metrics on a more NRT basis. More details are here - https://docs.datadoghq.com/integrations/databricks/?tab=driveronly

  • 0 kudos
User16789201666
by Databricks Employee
  • 4906 Views
  • 0 replies
  • 0 kudos

What's Early Stopping in Hyperopt? When should it be used?

It’s advantageous to stop running trials if progress has stopped. Hyperopt offers an  early_stop_fn  parameter, which specifies a function that decides when to stop trials before  max_evals  has been reached. Hyperopt provides a function  no_progress...

  • 4906 Views
  • 0 replies
  • 0 kudos
brickster_2018
by Databricks Employee
  • 2406 Views
  • 1 replies
  • 0 kudos

Resolved! How to determine if am using the same DBR minor version?

DBR minor version details are not exposed. However, in the documentation, it mentioned Databricks performs maintenance releases every 2 weeks. How can I determine if I am using the same minor version

  • 2406 Views
  • 1 replies
  • 0 kudos
Latest Reply
brickster_2018
Databricks Employee
  • 0 kudos

The below code snippet can help to determine the DBR Hash string for the DBR version. DBR hash string is unique for the DBR minor version. val scalaVersion = scala.util.Properties.versionString   val hadoopVersion = org.apache.hadoop.util.VersionInf...

  • 0 kudos
Anonymous
by Not applicable
  • 2920 Views
  • 2 replies
  • 0 kudos
  • 2920 Views
  • 2 replies
  • 0 kudos
Latest Reply
sajith_appukutt
Databricks Employee
  • 0 kudos

Delete workspace doesn't delete the root bucket. You could choose to use the same root bucket for more than one workspace ( though not recommended ) It is recommended to automate the infrastructure creation via terraform or quickstart so that cleanup...

  • 0 kudos
1 More Replies
Anonymous
by Not applicable
  • 3448 Views
  • 1 replies
  • 0 kudos

Monitoring jobs

Are there any event streams that are or could be exposed in AWS (such as Cloudwatch Eventbridge events or SNS messages? In particular I'm interested in events that detail jobs being run. The use case here would be for monitoring jobs from our web app...

  • 3448 Views
  • 1 replies
  • 0 kudos
Latest Reply
sajith_appukutt
Databricks Employee
  • 0 kudos

You could write code to call the PutLogEvents api at the beginning of each job to write out custom events to cloudwatch / or use aws sdk to send and SNS notification and route it to a desired consumer.

  • 0 kudos