Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
Data + AI Summit 2024 - Data Engineering & Streaming

Forum Posts

parimalpatil28
by New Contributor III
  • 997 Views
  • 0 replies
  • 0 kudos

Looking to upload a file to DBFS using "/api/2.0/dbfs/put"

Hello, I am trying to upload a file from a local Linux machine to DBFS using request.post(<URI>,<Headers>, params={"path": "dbfs:/tmp", "contents": local_path}) and am getting the error b'{"error_code":"INVALID_PARAMETER_VALUE","message":"You must provid...
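For readers hitting a similar error, here is a minimal sketch of how a request to this endpoint is commonly shaped. It assumes the endpoint wants a JSON body whose `contents` field holds the base64-encoded file bytes (not a local path) and whose `path` names a target file rather than a directory. The helper name `build_dbfs_put_request`, the host, and the token are hypothetical placeholders. The request is only built here, not sent, and this is not a confirmed diagnosis of the truncated error above.

```python
import base64
import json
import urllib.request


def build_dbfs_put_request(host, token, dbfs_path, local_path, overwrite=True):
    """Build (but do not send) a POST request for /api/2.0/dbfs/put."""
    # Read the local file and base64-encode its bytes; the local path itself
    # is never sent to the workspace.
    with open(local_path, "rb") as f:
        encoded = base64.b64encode(f.read()).decode("ascii")

    body = {
        "path": dbfs_path,      # target file in DBFS (hypothetical example: /tmp/hello.txt)
        "contents": encoded,    # base64-encoded file contents
        "overwrite": overwrite,
    }
    return urllib.request.Request(
        url=f"{host.rstrip('/')}/api/2.0/dbfs/put",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Sending it would be a matter of `urllib.request.urlopen(req)`. Inline `contents` is meant for small files; larger uploads usually go through the streaming `create`, `add-block`, and `close` calls of the same API.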

804082
by New Contributor III
  • 1869 Views
  • 2 replies
  • 1 kudos

Backup/Export Databricks SQL Column Comments

We've had users make comments on tables/columns throughout Databricks SQL using the Data Explorer UI. I'm looking for a way to backup these comments, but when I run DESCRIBE TABLE, the comment column is always null despite being non-null in Data Expl...

Latest Reply
shan_chandra
Databricks Employee
  • 1 kudos

@804082 - Markdown does not render when returned by DESCRIBE statements. We can view the comments in the Data Explorer UI. Reference: https://docs.databricks.com/en/data/markdown-data-comments.html#document-data-with-markdown-comments

1 More Replies
venkat94
by New Contributor
  • 884 Views
  • 1 replies
  • 0 kudos

Databricks Job Runs API

/api/2.1/jobs/runs/list currently returns all job runs that executed within the time range we provide as input. Is there any way to get only the runs with a specific status (for example, only successful ones)?

Data Engineering
API
azure
Databricks
jobruns
Latest Reply
BilalAslamDbrx
Databricks Employee
  • 0 kudos

@venkat94 thanks for the feedback. We are working on updating the Jobs Runs API so you can filter runs by status e.g. only success. Stay tuned in the next couple of months.
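Until server-side filtering is available, one workaround is to filter the response on the client. The sketch below assumes each entry in the `runs` array carries a `state` object with a `result_state` field, as in the Jobs API 2.1 response format. The function name `successful_runs` and the sample payload are hypothetical and trimmed down for illustration.

```python
def successful_runs(runs_list_response):
    """Return only the runs whose result_state is SUCCESS."""
    runs = runs_list_response.get("runs", [])
    return [
        run for run in runs
        # Runs that are still in progress have no result_state yet.
        if run.get("state", {}).get("result_state") == "SUCCESS"
    ]


# Hypothetical, trimmed-down response body for illustration only.
sample_response = {
    "runs": [
        {"run_id": 1, "state": {"life_cycle_state": "TERMINATED", "result_state": "SUCCESS"}},
        {"run_id": 2, "state": {"life_cycle_state": "TERMINATED", "result_state": "FAILED"}},
        {"run_id": 3, "state": {"life_cycle_state": "RUNNING"}},
    ],
    "has_more": False,
}

print([run["run_id"] for run in successful_runs(sample_response)])  # [1]
```

When the response reports `has_more` as true, the same filter would need to be applied to each page of results.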

dbdude
by New Contributor II
  • 1539 Views
  • 1 replies
  • 0 kudos

Re-running DLT Pipeline Does Not Add Data After Delete

I am using DLT with Unity Catalog and managed tables. The first table in this pipeline is a live streaming table. I first ran this in the SQL editor: DELETE FROM my_table; This appears to have deleted all the records, which I wanted since now when...

Latest Reply
BilalAslamDbrx
Databricks Employee
  • 0 kudos

@Mo is correct! 

Julie285720
by New Contributor
  • 1533 Views
  • 0 replies
  • 0 kudos

SQL Merge condition

Hi guys, I have a question regarding this merge step. I am new to Databricks and trying to study data warehousing, but I couldn't figure it out by myself and need your help. I appreciate your help in advance. I got this questi...

XavierPereVives
by New Contributor II
  • 2065 Views
  • 1 replies
  • 0 kudos

Azure Shared Clusters - Py4J Security Exception on non-whitelisted classes

When I try to use a third-party JAR on an Azure shared cluster, which is installed via Maven and which I can successfully import, I get the following message: py4j.security.Py4JSecurityException: Method public static org.apache.spark.sql.Column com.da...

Latest Reply
XavierPereVives
New Contributor II
  • 0 kudos

Thanks Kaniz. I must use a shared cluster because I'm reading from a DLT table stored in Unity Catalog. https://docs.databricks.com/en/data-governance/unity-catalog/compute.html My understanding is that shared clusters are enforcing the Py4J policy I ...

alemo
by New Contributor III
  • 1882 Views
  • 3 replies
  • 1 kudos

Delta live table UC Kinesis: options overwriteschema, ignorechanges not supported for data sourc

I am trying to build a DLT pipeline in UC with Kinesis as the producer. My first table looks like: @dlt.create_table(table_properties={"pipelines.autoOptimize.managed": "true"}, spark_conf={"spark.databricks.delta.schema.autoMerge.enabled": "true"},) def feed_chu...

Latest Reply
Corbin
Databricks Employee
  • 1 kudos

If you use the "Preview" Channel in the "Advanced" section of the DLT Pipeline, this error should resolve itself. This fix is planned to make it into the "Current" channel by Aug 31, 2023

2 More Replies
vroste
by New Contributor III
  • 2060 Views
  • 0 replies
  • 0 kudos

Delta Live Tables maintenance schedule

I have a DLT pipeline that runs every day and an automatically executed maintenance job that runs within 24 hours every day. The maintenance operations are costly. Is it possible to change the schedule to once a week or so?

scvbelle
by New Contributor III
  • 3684 Views
  • 3 replies
  • 3 kudos

Resolved! DLT failure: ABFS does not allow files or directories to end with a dot

My DLT pipeline, outlined below, generically cleans identifier tables. After successfully creating the initial streaming tables from the append-only sources, it fails when trying to create the second, cleaned tables with the following: It'**bleep** cl...

Data Engineering
abfss
azure
dlt
engineering
Latest Reply
Priyanka_Biswas
Databricks Employee
  • 3 kudos

Hi @scvbelle, the error message you're seeing comes from an IllegalArgumentException raised because Azure Blob File System (ABFS) does not allow files or directories to end with a dot. This error is thrown by the trailingPeriod...
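A pre-flight check on generated table or directory names can surface this before the pipeline runs. The sketch below is a hypothetical helper, not part of DLT or the ABFS driver: it just splits a path on `/` and reports any segment that ends with a dot. The storage account and container names in the usage line are made up.

```python
def segments_ending_with_dot(path):
    """Return the path segments that end with a dot, in order of appearance."""
    return [segment for segment in path.split("/") if segment.endswith(".")]


# Hypothetical path for illustration only.
bad = segments_ending_with_dot(
    "abfss://container@account.dfs.core.windows.net/tables/id./part-0000"
)
print(bad)  # ['id.']
```

An empty list means no segment would trip the trailing-dot restriction. Names derived from source columns or identifiers are a common place for a stray trailing dot to appear, so they are worth checking first.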

2 More Replies
kinsun
by New Contributor II
  • 19808 Views
  • 5 replies
  • 1 kudos

Resolved! DBFS and Local File System Doubts

Dear Databricks Expert, I have some doubts when dealing with DBFS and the local file system. Case 01: Copy a file from ADLS to DBFS. I am able to do so with the Python code below: #spark.conf.set("fs.azure.account.auth.type", "OAuth") spark.conf.set("fs.a...

Latest Reply
Anonymous
Not applicable
  • 1 kudos

Hi @KS LAU, thank you for posting your question in our community! We are happy to assist you. To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers your q...

4 More Replies
Madison
by New Contributor II
  • 10903 Views
  • 1 replies
  • 0 kudos

AnalysisException: [ErrorClass=INVALID_PARAMETER_VALUE] Missing cloud file system scheme

I am trying to follow along with the Apache Spark Programming training module, where the instructor creates an events table from a Parquet file like this: %sql CREATE TABLE IF NOT EXISTS events USING parquet OPTIONS (path "/mnt/training/ecommerce/events/events.par...

Data Engineering
Databricks SQL
Latest Reply
Madison
New Contributor II
  • 0 kudos

@Retired_mod Thanks for your response. I didn't provide a cloud file system scheme in the path while creating the table using the DataFrame API, but I was still able to create the table. %python # File location and type file_location = "/mnt/training/ecom...
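To see what "missing scheme" means for a given path string, a small check with the standard library can help. This is only an illustrative sketch: `has_filesystem_scheme` is a hypothetical helper, the example paths are made up, and it says nothing about why the SQL and DataFrame code paths treat scheme-less paths differently.

```python
from urllib.parse import urlparse


def has_filesystem_scheme(path):
    """Return True if the path starts with a scheme such as dbfs:, s3:// or abfss://."""
    return urlparse(path).scheme != ""


# Hypothetical paths for illustration only.
print(has_filesystem_scheme("/mnt/example/events"))        # False
print(has_filesystem_scheme("dbfs:/mnt/example/events"))   # True
print(has_filesystem_scheme("s3://example-bucket/events")) # True
```

A path that returns False here is the kind that would need a scheme prefix added before being passed somewhere that requires a fully qualified location.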

meystingray
by New Contributor II
  • 3650 Views
  • 0 replies
  • 0 kudos

Azure Databricks: Cannot create volumes or tables

If I try to create a volume, I get this error: Failed to access cloud storage: AbfsRestOperationException exceptionTraceId=fa207c57-db1a-406e-926f-4a7ff0e4afdd When I try to create a table, I get this error: Error creating table[RequestId=4b8fedcf-24b3-...

Nino
by Contributor
  • 8023 Views
  • 8 replies
  • 5 kudos

Resolved! Where in Hive Metastore can the s3 locations of Databricks tables be found?

I have a few Databricks clusters, some share a single Hive Metastore (HMS), call them PROD_CLUSTERS, and an additional cluster, ADHOC_CLUSTER, which has its own HMS. All my data is stored in S3, as Databricks delta tables: PROD_CLUSTERS have read-wri...

Data Engineering
HMS
metastore
Latest Reply
Nino
Contributor
  • 5 kudos

Something went wrong there; here's the last sentence: I expected "location" to be the S3 path, but that's not always so (elaborated in the original post). Thanks!

7 More Replies
