cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

ChinmayU
by New Contributor
  • 2217 Views
  • 0 replies
  • 0 kudos

java.time.LocalDate exception when a date column is used with "IN" operator in replace where clause

Hi, we recently made an upgrade to our Databricks warehouse, transitioning from SQL Classic to SQL PRO. However, we encountered the following error message when attempting to execute the "INSERT INTO" table query with a "REPLACE WHERE" predicate that...

Data Engineering
Databricks
LocalDate exception
replace where
Unity Catalog
  • 2217 Views
  • 0 replies
  • 0 kudos
FabriceDeseyn
by Contributor
  • 1815 Views
  • 1 replies
  • 0 kudos

Databricks-connect VSCode debugging pandas_api not working

HiI am using the databricks extension on VSCode and am running against an issue since two days, prior it worked fine. I receive an error when I want to use Pandas-on-Spark during debugging.from databricks.connect import DatabricksSession spark = Data...

FabriceDeseyn_0-1689838667900.png FabriceDeseyn_1-1689838868477.png
Data Engineering
databricks VSCode extension
databricks-connect
  • 1815 Views
  • 1 replies
  • 0 kudos
Latest Reply
FabriceDeseyn
Contributor
  • 0 kudos

Additional info:It seems that the issue comes from the 1.1.0 version of the databricks extension in VSCode.Downgrading to 1.0.0 solves my issue.

  • 0 kudos
PhillT
by New Contributor
  • 4593 Views
  • 1 replies
  • 2 kudos

SQL expr undefined function 'LEN'

Getting this error message on our production cluster when I run a notebook that uses the SQL expr function that call the LEN() funciton example code:df = df.withColumn("POL", expr("CASE WHEN SRC_SYSTEM = 'X' THEN CONCAT('08' , SUBSTRING(POL, 3, LEN(P...

  • 4593 Views
  • 1 replies
  • 2 kudos
Latest Reply
daniel_sahal
Databricks MVP
  • 2 kudos

@PhillT There's no "LEN" function. You should use "LENGTH" instead.https://spark.apache.org/docs/2.3.0/api/sql/index.html#length

  • 2 kudos
numersoz
by New Contributor III
  • 5802 Views
  • 3 replies
  • 5 kudos

Resolved! Z-Ordering Timestamp Column

Hi,I've large Delta Table for IoT data for over 10K different sensors with timestamp, sensor name and value columns at 1 second precision.Query pattern is usually random 5-100 sensors at a time. But typically involves specific year/month/day interval...

  • 5802 Views
  • 3 replies
  • 5 kudos
Latest Reply
Oliver_Angelil
Valued Contributor II
  • 5 kudos

@numersoz did you z-order on the timestamp column or on less granular columns, like Year, Month, or Day. timestamp column is very granular (high cardinality) since it also includes hour, minute, second...

  • 5 kudos
2 More Replies
DarthObert
by New Contributor II
  • 1303 Views
  • 1 replies
  • 0 kudos

Databricks intellisence adds a second aliad

Hi all, A couple of weeks ago I noticed that whenever I use the intellisense to autocomplete my column names in the sql editor, it adds a second alias.  For example if i have a table (Table1) and I alias it in my query (i.e. Table1 as a) if I use the...

  • 1303 Views
  • 1 replies
  • 0 kudos
Latest Reply
DarthObert
New Contributor II
  • 0 kudos

And how do I fix it? 

  • 0 kudos
agar08
by New Contributor
  • 557 Views
  • 0 replies
  • 0 kudos

ava.net.SocketTimeoutException - ReadTimeOut

Databricks notebook is connecting ADLS Gen2 using Service principal authentication and the setup is working fine. The notebook is able to read/write files to ADLS Gen2. However, occasionally, we are seeing below error in the production environment:ja...

  • 557 Views
  • 0 replies
  • 0 kudos
jdhao
by New Contributor II
  • 4602 Views
  • 4 replies
  • 0 kudos

Why can't I query a table from a cluster, but can query from another cluster in the same workspace

I have two clusters A, B under the same azure databricks workspace. Under cluster A, inside my notebook, I tried to query a table: `SELECT * FROM some_table LIMIT 5`.  It shows some permission errors. Under cluster B, if I run the same sql query, it ...

  • 4602 Views
  • 4 replies
  • 0 kudos
Latest Reply
Lakshay
Databricks Employee
  • 0 kudos

Check for any spark config or init script differences in the two clusters.

  • 0 kudos
3 More Replies
gopikrsna925
by New Contributor
  • 2385 Views
  • 0 replies
  • 0 kudos

Azure Databricks: leading zeros in decimal integer literals are not permitted

Hey team,Need your help.I am trying to run the below python code in a data bricks notebook, which is part of parsing an XML file, exploding the element. This works great for all the other elements with no numbers and elements not starting with a zero...

Data Engineering
leading
zeros
  • 2385 Views
  • 0 replies
  • 0 kudos
Phani1
by Databricks MVP
  • 1476 Views
  • 1 replies
  • 0 kudos

Streaming tables vs DLT

Have a couple of questions wrt Streaming tables, Kindly help us with this.1)Can we create streaming tables without a DLT pipeline?2)Can we create streaming tables in Databricks SQL?3)What we observe in Streaming tables, it  support Kafka and event lo...

Data Engineering
dlt
sql
Streaming tables
  • 1476 Views
  • 1 replies
  • 0 kudos
Latest Reply
Lakshay
Databricks Employee
  • 0 kudos

HiPlease refer to the document: https://docs.databricks.com/sql/language-manual/sql-ref-syntax-ddl-create-streaming-table.html#create-streaming-tableI think this should help you answer your questions.

  • 0 kudos
thomasthomas
by New Contributor II
  • 11980 Views
  • 4 replies
  • 0 kudos

How can I insert into 2 tables within one database transaction with spark SQL / pyspark?

Hi all,I have a postgres database that contains two tables: A and B.Also, I have 2 delta tables, called C and D. My task is to push the data from A to C and B to D - and if something fails, then leave everything as is.With python it is easy. Set up t...

Data Engineering
Spark JDBC Commit
  • 11980 Views
  • 4 replies
  • 0 kudos
Latest Reply
Anonymous
Not applicable
  • 0 kudos

Hi @thomasthomas  We haven't heard from you since the last response from @daniel_sahal â€‹, and I was checking back to see if her suggestions helped you. Or else, If you have any solution, please share it with the community, as it can be helpful to oth...

  • 0 kudos
3 More Replies
handreassa
by New Contributor III
  • 4098 Views
  • 3 replies
  • 1 kudos

Resolved! How to manage cluster permissions

How to manage different workspaces’ clusters permissions?

  • 4098 Views
  • 3 replies
  • 1 kudos
Latest Reply
KoenZandvliet
New Contributor III
  • 1 kudos

Which {request_object_type} to use for setting permissions for a cluster? "cluster", "clusters", "compute" does not work.

  • 1 kudos
2 More Replies
malati
by New Contributor II
  • 1809 Views
  • 2 replies
  • 0 kudos

Unable to login or reset password Community Edition

I'm not able to login databricks community edition from last 5 days, getting  Invalid email address or password issue even though its correct credentials. i tried to reset password but unable to reset password also. what is the issue. can you guys pl...

  • 1809 Views
  • 2 replies
  • 0 kudos
Latest Reply
malati
New Contributor II
  • 0 kudos

im getting mails saying that if issue is fixed then mark one as a accepted solution but unfortunately issue is still not yet fixed , can someone look into it and fix issue please ??  

  • 0 kudos
1 More Replies
bstanie
by New Contributor
  • 1826 Views
  • 1 replies
  • 0 kudos

A problem with formating python code to pep8

Hi there,Anyone here experienced any problems with formatiing python code inside databricks notebook cells? I'm constantly having this weird problem, both with very short/simple and complex code, in different notebooks. Sometimes it somehow works, an...

  • 1826 Views
  • 1 replies
  • 0 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

that is interesting.Can you check if you you use a version > 11.2.  If not you have to install black and tokenize-rt.Can you check if you use a custom pyproject.toml file?  It might be related to that.(You also need edit permission on the notebook bu...

  • 0 kudos
dzm
by New Contributor
  • 2337 Views
  • 1 replies
  • 0 kudos

Using Libreoffice in Databricks

Hi Community, I'm using Databricks E2, and need to convert pptx files to pdf files.This can be done in either a python or an R notebook using #LibreofficeTo achieve this I'd have to download LibreOffice; I'm not too sure on how to do that. Would I ha...

Data Engineering
pdf
pptx
python
R
  • 2337 Views
  • 1 replies
  • 0 kudos
Latest Reply
-werners-
Esteemed Contributor III
  • 0 kudos

I suppose by Libreoffice you mean the sdk, without the frontend?You will have to install the jar as a library on the compute cluster.From that moment on, you can use the classes in your code.If you cannot run the jar from a command line, it might be ...

  • 0 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels