cancel
Showing results for 
Search instead for 
Did you mean: 
Administration & Architecture
Explore discussions on Databricks administration, deployment strategies, and architectural best practices. Connect with administrators and architects to optimize your Databricks environment for performance, scalability, and security.
cancel
Showing results for 
Search instead for 
Did you mean: 
Data + AI Summit 2024 - Data Lakehouse Architecture


Forum Posts

alexlod
by New Contributor III
  • 4225 Views
  • 2 replies
  • 1 kudos

How to monitor a python wheel job with Prometheus?

Hi Community,We have a Databricks job with a single Python wheel task that runs our streaming pyspark job. The job runs on a single-node compute cluster and consumes from Kafka.Our monitoring stack is Prometheus + Grafana.I want the job's metrics to ...

  • 4225 Views
  • 2 replies
  • 1 kudos
Latest Reply
amelia1
New Contributor II
  • 1 kudos

Hi I'm trying to use the metrics registry object inside an UDF function, but I can't because it's not serializable due to Lock. Our goal is to be able to count the number of messages parsed, and the number of messages we can't parsed (due to exceptio...

  • 1 kudos
1 More Replies
leelee3000
by Databricks Employee
  • 1024 Views
  • 0 replies
  • 0 kudos

Handling Kafka Topics with Avro Schema

Our input data resides in a Kafka topic, and we utilize the Kafka schema registry with Avro schemas. While I can retrieve the schema from the registry, I am facing challenges creating a Spark DataFrame that correctly serializes data for streaming rea...

  • 1024 Views
  • 0 replies
  • 0 kudos
leelee3000
by Databricks Employee
  • 671 Views
  • 0 replies
  • 0 kudos

Scalable API/binary lookups

We sometimes process large dataframes that contain a column of IP addresses and we need to associate an Autonomous System Number (ASN) per IP address. The ASN information is provided by MaxMind in the form of a binary data file only accessible via a ...

  • 671 Views
  • 0 replies
  • 0 kudos
leelee3000
by Databricks Employee
  • 1122 Views
  • 0 replies
  • 0 kudos

auto-deleted workspace on GCP

Our production Databricks workspaces were auto-deleted when our subscription from our GCP account was canceled due to system error.  We have a backup of the GCS buckets that Databricks was running on (not workspace exports).  Is it possible to recove...

  • 1122 Views
  • 0 replies
  • 0 kudos
552532
by New Contributor II
  • 1494 Views
  • 1 replies
  • 0 kudos

Databricks Private link connectivity for External SaaS Application

We need your guidance on completing the set-up around private link set-up with a Customer who is in the same region in AWS where our application is hosted in AWS. Our Customer has already enabled Private Link in their account and they are using custo...

  • 1494 Views
  • 1 replies
  • 0 kudos
Latest Reply
552532
New Contributor II
  • 0 kudos

Hi Databricks Support,We followed the instructions above, but we are getting an error when registering the endpoint in customer databricks environment using "Registering Endpoint".Following is the error message we see:"INVALID_PARAMETER_VALUE" Endpoi...

  • 0 kudos
alexometis
by New Contributor III
  • 7064 Views
  • 2 replies
  • 3 kudos

System Tables Preview - retention period?

The new System Tables for billing, pricing & compute look really useful and easier to consume than getting it via the APIs.However I can't see in the documentation:Does data only start being gathered when you turn them on or is there immediately a hi...

  • 7064 Views
  • 2 replies
  • 3 kudos
Latest Reply
Avvar2022
Contributor
  • 3 kudos

@Retired_mod  -We are customer of databricks. Have databricks premium workspace with unity catalog enabled. and we have also legacy workspaces (non-unity enabled).I can see history is available for all workspaces (unity and non-unity) in same meta st...

  • 3 kudos
1 More Replies
smart5mk
by New Contributor III
  • 1940 Views
  • 0 replies
  • 0 kudos

Destination Path of Cloned Notebooks

Hi, for my project I need to get destination paths of cloned notebooks. But when I run the query to get them: ''SELECT DISTINCT request_params.destinationPathFROM system.access.auditWHERE service_name = "notebook"andaction_name = 'cloneNotebook'LIMIT...

  • 1940 Views
  • 0 replies
  • 0 kudos
JDL
by New Contributor III
  • 12169 Views
  • 4 replies
  • 2 kudos

Get number of rows in delta lake table from metadata without count(*)

Hello folks,Is there a way with sql query to get count from delta table metadata without doing count(*) on each of table? Wondering, if this information is stored in any of INFORMATION_SCHEMA tables.I have a use-case to get counts from 1000's of delt...

  • 12169 Views
  • 4 replies
  • 2 kudos
Latest Reply
SSundaram
Contributor
  • 2 kudos

 Here is a related one.https://community.databricks.com/t5/data-engineering/how-to-get-the-total-number-of-records-in-a-delta-table-from-the/td-p/20441

  • 2 kudos
3 More Replies
karthik_p
by Esteemed Contributor
  • 1526 Views
  • 3 replies
  • 0 kudos

Disaster Recovery Issue

We are trying to create Disaster Recovery for UC enabled Workspaces in Azure. our UC metastore are in different regions.1. we are trying to use Deep Clone2. In source we are adding region2 metastore as external location3. able to do deep cloneproblem...

  • 1526 Views
  • 3 replies
  • 0 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

Right I get it.Actually cloning it as external seems logical to me, for the moment, as unity cannot manage the other metastore.For the moment I would go with cloning the data and then creating an external table of that.Not ideal, but at least you hav...

  • 0 kudos
2 More Replies
Kaviana
by New Contributor III
  • 1570 Views
  • 1 replies
  • 0 kudos

error when trying to create a cluster in databricks

We are trying to create the cluster within the databricks workspace but it is generating the error attach

  • 1570 Views
  • 1 replies
  • 0 kudos
Latest Reply
" src="" />
This widget could not be displayed.
This widget could not be displayed.
This widget could not be displayed.
  • 0 kudos

This widget could not be displayed.
We are trying to create the cluster within the databricks workspace but it is generating the error attach

This widget could not be displayed.
  • 0 kudos
This widget could not be displayed.
JDL
by New Contributor III
  • 2828 Views
  • 1 replies
  • 1 kudos

Tie Parquet files in Azure ADLS to Databricks table

Hello All,I have databricks delta table with files residing in Azure Data Lake. I understand, when I load create table and load data from databricks, it creates respective folder and files for table in ADLS. I am wondering if there is reverse way to ...

  • 2828 Views
  • 1 replies
  • 1 kudos
Latest Reply
JDL
New Contributor III
  • 1 kudos

Thanks @Retired_mod this is helpful

  • 1 kudos
RonMoody
by New Contributor II
  • 6957 Views
  • 1 replies
  • 0 kudos

service principal table accesses not showing up in system.audit

When we run jobs using service principals system.audit doesn't show any table accesses (getTable). Volume (getVolume) shows up for service principals. Same query when run as a user shows up in system.audit. I know system.audit is in public preview. W...

  • 6957 Views
  • 1 replies
  • 0 kudos
Latest Reply
RonMoody
New Contributor II
  • 0 kudos

hi @Retired_mod thanks so much for your reply ! I was referring to https://docs.databricks.com/en/administration-guide/system-tables/audit-logs.html which is part of databricks core offering and isn't related to ServiceNow's offering. I am assuming t...

  • 0 kudos
cltj
by New Contributor III
  • 1775 Views
  • 1 replies
  • 1 kudos

Three level name space naming standard

Hi all, I have not been successful in getting a good grip of the naming conventions for the three level name space. Initially i learned about bronze, silver and gold, but i am confused where to put this. The obvious choice may be to use the {catalog}...

  • 1775 Views
  • 1 replies
  • 1 kudos
Latest Reply
" src="" />
This widget could not be displayed.
This widget could not be displayed.
This widget could not be displayed.
  • 1 kudos

This widget could not be displayed.
Hi all, I have not been successful in getting a good grip of the naming conventions for the three level name space. Initially i learned about bronze, silver and gold, but i am confused where to put this. The obvious choice may be to use the {catalog}...

This widget could not be displayed.
  • 1 kudos
This widget could not be displayed.
aayusha3
by New Contributor II
  • 2703 Views
  • 4 replies
  • 2 kudos

Internal error: Attach your notebook to a different compute or restart the current compute.

I am currently using a personal computer cluster [13.3 LTS (includes Apache Spark 3.4.1, Scala 2.12)] on GCP attached to a notebook. After running a few command lines without an issue, I end up getting this error  Internal error. Attach your notebook...

aayusha3_0-1700411861325.png
  • 2703 Views
  • 4 replies
  • 2 kudos
Latest Reply
amandaolens
New Contributor III
  • 2 kudos

@Martin74  same here martin.

  • 2 kudos
3 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Labels