Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

my_super_name
by New Contributor II
  • 2747 Views
  • 2 replies
  • 2 kudos

Auto Loader Schema Hint Behavior: Addressing Nested Field Errors

Hello, I'm using Auto Loader to stream a table of data and have added schema hints to specify field values. I've observed that when my initial data file is missing fields specified in the schema hint, Auto Loader correctly identifies this and ad...

Latest Reply
Mathias_Peters
Contributor II
  • 2 kudos

Hi, we are having similar issues with schema hints formulated in fully qualified DDL, e.g. "a STRUCT<b INT>" etc. Did you find a solution? Also, did you specify the schema hint using the dot-notation, e.g. "a.b INT" before ingesting any data or after...
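For reference, a minimal sketch of the two hint styles being compared, assuming a JSON Auto Loader stream (the paths and field names here are hypothetical):

```python
# Hedged sketch: dot-notation vs. DDL-style schema hints for a nested field.
df = (spark.readStream
      .format("cloudFiles")
      .option("cloudFiles.format", "json")
      # Dot-notation hint for the nested field a.b:
      .option("cloudFiles.schemaHints", "a.b INT")
      # Alternatively, the fully qualified DDL form: "a STRUCT<b INT>"
      .option("cloudFiles.schemaLocation", "/tmp/_schemas/events")  # hypothetical path
      .load("/tmp/events"))                                         # hypothetical path
```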

1 More Replies
anshi_t_k
by New Contributor III
  • 1342 Views
  • 4 replies
  • 0 kudos

Practice question for data engineer exam

A data engineer, User A, has promoted a pipeline to production by using the REST API to programmatically create several jobs. A DevOps engineer, User B, has configured an external orchestration tool to trigger job runs through the REST API. Both user...

Latest Reply
rakeshdey
New Contributor II
  • 0 kudos

The answer should be B: when you retrieve job run information, creator_user_email is always populated as the 'Run As' identity in the workflow, i.e. the credential used to trigger the job. If you fetch the workflow info through the REST API, then answer A is correct.
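For illustration, a hedged sketch of pulling both pieces of information with the Databricks Python SDK (the run_id is hypothetical, and field availability can vary by API version):

```python
from databricks.sdk import WorkspaceClient

w = WorkspaceClient()
run = w.jobs.get_run(run_id=1234)      # GET /api/2.1/jobs/runs/get
print(run.creator_user_name)           # who created/triggered the run
job = w.jobs.get(job_id=run.job_id)    # GET /api/2.1/jobs/get
print(job.run_as_user_name)            # the job's "Run As" identity
```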

3 More Replies
Karthik_2
by New Contributor
  • 992 Views
  • 1 reply
  • 0 kudos

Query on SQL Warehouse Concurrency in Azure Databricks

Hi, we are planning to migrate the backend of our web application, currently hosted on App Service with an Azure SQL Database, to Azure Databricks as the data source. For this, we intend to use the SQL Warehouse in Databricks to execute queries and in...

Latest Reply
Walter_C
Databricks Employee
  • 0 kudos

Hello Karthik, many thanks for your question. Databricks SQL Warehouses use dynamic concurrency to handle varying demands. Unlike static-capacity warehouses, Databricks SQL adjusts compute resources in real time to manage concurrent loads and maximiz...

tseader
by New Contributor III
  • 2508 Views
  • 3 replies
  • 1 kudos

Resolved! Python SDK clusters.create_and_wait - Sourcing from cluster-create JSON

I am attempting to create a compute cluster using the Python SDK while sourcing a cluster-create configuration JSON file, which is how it's done for the databricks-cli and what Databricks provides through the GUI. Reading in the JSON as a dict fails...

Latest Reply
tseader
New Contributor III
  • 1 kudos

@Retired_mod The structure of the `cluster-create.json` is perfectly fine. The issue, as stated above, is that the SDK does not allow nested structures from the JSON file to be used directly; instead they need to be cast to spec...
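A sketch of the workaround described above, assuming the databricks-sdk package (the file name and JSON keys are illustrative):

```python
import json

from databricks.sdk import WorkspaceClient
from databricks.sdk.service.compute import AutoScale

w = WorkspaceClient()
with open("cluster-create.json") as f:   # hypothetical file
    cfg = json.load(f)

# Top-level scalars pass straight through; nested sections such as autoscale
# must be cast to their typed dataclasses rather than left as raw dicts.
w.clusters.create_and_wait(
    cluster_name=cfg["cluster_name"],
    spark_version=cfg["spark_version"],
    node_type_id=cfg["node_type_id"],
    autoscale=AutoScale(**cfg["autoscale"]),
)
```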

2 More Replies
praful
by New Contributor II
  • 2968 Views
  • 5 replies
  • 1 kudos

Recover Lost Notebook

Hi Team, I was using Databricks Community Edition for learning purposes. I had an account https://community.cloud.databricks.com/?o=6822095545287159 where I stored all my learning notebooks. Unfortunately, this account suddenly stopped working, and I ...

Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

The workspace ID you have shared seems to belong to a workspace which is still in a running state. If you have lost login access to this workspace, the team you have reached over email will be able to assist. I will add the following doc for s...

4 More Replies
minhhung0507
by Valued Contributor
  • 1086 Views
  • 7 replies
  • 4 kudos

Resolved! How to reduce cost of "Regional Standard Class A Operations"

Hi Databricks experts, we're experiencing unexpectedly high costs from Regional Standard Class A Operations in GCS while running a Databricks pipeline. The costs seem related to frequent metadata queries, possibly tied to Delta table operations. In las...

Latest Reply
VZLA
Databricks Employee
  • 4 kudos

@minhhung0507 it's hard to say without having more direct insight, but generally speaking, many streaming jobs with very frequent intervals will likely contribute; 300 jobs triggered continuously will also contribute depending on the use case of these j...
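As a hedged illustration of that point, lowering trigger frequency is one common way to cut the per-micro-batch list/get calls that count as Class A operations (the table and checkpoint paths are hypothetical):

```python
df = spark.readStream.table("raw.events")            # hypothetical source
(df.writeStream
   .trigger(processingTime="10 minutes")             # fewer micro-batches, fewer GCS calls
   # .trigger(availableNow=True)                     # or: drain available data, then stop
   .option("checkpointLocation", "gs://my-bucket/chk/events")
   .toTable("bronze.events"))
```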

6 More Replies
143260
by New Contributor
  • 10737 Views
  • 2 replies
  • 1 kudos

Convert SQL Query to Dataframe

Hello, being relatively new to the Databricks world, I'm hoping someone can show me how to take a SQL query and put the results into a dataframe. As part of a data validation project, I'd like to cross join two dataframes.

Latest Reply
Antoine_B
Contributor
  • 1 kudos

From a PySpark notebook, you could do: df = spark.sql("SELECT * FROM my_table WHERE ..."). Then you can use this df and cross join it to another DataFrame. If you are new to Databricks, I suggest you follow some of the self-paced lessons in Databric...
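Expanded into a runnable sketch (the table names are hypothetical):

```python
# Run SQL into a DataFrame, then cross join it for the validation step.
df = spark.sql("SELECT * FROM my_table WHERE load_date = '2024-01-01'")
other = spark.table("reference_table")
validated = df.crossJoin(other)   # Cartesian product of the two DataFrames
display(validated)
```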

1 More Replies
zed
by New Contributor III
  • 1254 Views
  • 6 replies
  • 0 kudos

Resolved! ConcurrentAppendException in Feature Engineering write_table

I am using the Feature Engineering client when writing to a time series feature table. I have created two Databricks jobs with the code below. I am running them with different run_dates (e.g. '2016-01-07' and '2016-01-08'). When they run concurrently,...

Latest Reply
VZLA
Databricks Employee
  • 0 kudos

@zed Clustering by your date column can indeed help avoid the ConcurrentAppendException without incurring the strict partitioning constraints that a “time series feature table” normally disallows. Unlike partitioning, CLUSTER BY does not create physi...
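A sketch of that suggestion (table and column names are hypothetical): clustering the feature table by the run date keeps the two jobs' concurrent appends from conflicting.

```python
spark.sql("""
    ALTER TABLE fs.user_features
    CLUSTER BY (run_date)
""")
spark.sql("OPTIMIZE fs.user_features")   # recluster existing files
```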

5 More Replies
Einsatz
by New Contributor II
  • 1392 Views
  • 4 replies
  • 2 kudos

Resolved! Photon-enabled UC cluster has less executor memory (1/4th) compared to a normal cluster.

I have a Unity Catalog enabled cluster with node type Standard_DS4_v2 (28 GB Memory, 8 Cores). When the "Use Photon Acceleration" option is disabled, spark.executor.memory is 18409m. But if I enable Photon Acceleration, it shows spark.executor.memory as 46...

Latest Reply
Walter_C
Databricks Employee
  • 2 kudos

The memory allocated to the Photon engine is not fixed; it is based on a percentage of the node’s total memory. To calculate the value of spark.executor.memory based on a specific node type, you can use the following formula: container_size = (vm_si...
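A quick way to observe the difference from a notebook; the values quoted in the thread (e.g. 18409m) are specific to Standard_DS4_v2:

```python
# Run on two otherwise-identical clusters, Photon off vs. on.
print(spark.conf.get("spark.executor.memory"))
# Photon reserves part of the container for its native engine, so the JVM
# executor heap reported here is smaller on the Photon cluster.
```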

3 More Replies
TejeshS
by New Contributor III
  • 1596 Views
  • 1 reply
  • 1 kudos

How to identify which columns we need to consider for liquid clustering from a table of 200+ columns

In Databricks, when working with a table that has a large number of columns (e.g., 200), it can be challenging to determine which columns are most important for liquid clustering. Objective: The goal is to determine which columns to select based on th...

Latest Reply
Alberto_Umana
Databricks Employee
  • 1 kudos

Hi @TejeshS, Thanks for your post! To determine which columns are most important for liquid clustering in a table with a large number of columns, you should focus on the columns that are most frequently used in query filters and those that can signif...
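Once candidate columns are identified from query filters, a hedged sketch of applying and verifying them (the names are hypothetical):

```python
spark.sql("ALTER TABLE sales.wide_table CLUSTER BY (order_date, customer_id)")
(spark.sql("DESCRIBE DETAIL sales.wide_table")
      .select("clusteringColumns")      # confirm the keys took effect
      .show(truncate=False))
```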

guiferviz
by New Contributor III
  • 2256 Views
  • 7 replies
  • 3 kudos

Resolved! How to Determine if Materialized View is Performing Full or Incremental Refresh?

I'm currently testing materialized views and I need some help understanding the refresh behavior. Specifically, I want to know if my materialized view is querying the full table (performing a full refresh) or just doing an incremental refresh. From so...

Latest Reply
TejeshS
New Contributor III
  • 3 kudos

To validate the status of your materialized view (MV) refresh, run a DESCRIBE EXTENDED command and check the row corresponding to the "last refresh status type". RECOMPUTE indicates a full load execution was completed. NO_OPERATION means no operation w...
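A sketch of that check (the MV name is hypothetical, and the exact row label may vary by release):

```python
rows = spark.sql("DESCRIBE EXTENDED main.my_schema.my_mv").collect()
for r in rows:
    # Surface the refresh-related rows, e.g. the last refresh status type.
    if "refresh" in (r.col_name or "").lower():
        print(r.col_name, "->", r.data_type)
```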

6 More Replies
PiotrM
by New Contributor III
  • 662 Views
  • 2 replies
  • 0 kudos

Canceling long-running queries on UC-enabled all-purpose clusters

Hey, as in the subject. Is it possible to set a timeout for long-running queries on all-purpose clusters that are UC enabled? I know there is such a setting for SQL Warehouses and Workflows, but I was unable to find one for all-purpose clusters. The issu...

Latest Reply
VZLA
Databricks Employee
  • 0 kudos

@PiotrM thanks for your question! Adding to @Alberto_Umana's comment, could you please clarify what you mean by "I tried things like spark.task.reaper.killTimeout, but it seems like UC clusters won't accept it"? Is it throwing an error or is it ...

1 More Replies
berserkersap
by Contributor
  • 6571 Views
  • 4 replies
  • 1 kudos

Speed Up JDBC Write from Databricks Notebook to MS SQL Server

Hello everyone, I have a use case where I need to write a Delta table from Databricks to a SQL Server table using PySpark / Python / Spark SQL. The Delta table I am writing contains around 3 million records and the SQL Server table is neither partitione...

Data Engineering
JDBC
MS SQL Server
pyspark
Table Write
Latest Reply
VZLA
Databricks Employee
  • 1 kudos

@berserkersap have you had time to identify where the bottleneck is? E.g. sequential writes, network latency/throughput, or maybe a connection pool in the target much smaller than the number of connection threads in the source?
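For context, a hedged sketch of the usual levers once the bottleneck is known (the connection details and secret scope are placeholders):

```python
delta_df = spark.table("my_catalog.my_schema.source_delta")   # ~3M rows

(delta_df
   .repartition(8)                      # 8 parallel JDBC connections
   .write
   .format("jdbc")
   .option("url", "jdbc:sqlserver://myhost:1433;databaseName=mydb")
   .option("dbtable", "dbo.target_table")
   .option("user", dbutils.secrets.get("jdbc-scope", "user"))
   .option("password", dbutils.secrets.get("jdbc-scope", "password"))
   .option("batchsize", 10000)          # rows per network round trip
   .mode("append")
   .save())
```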

3 More Replies
guangyi
by Contributor III
  • 795 Views
  • 2 replies
  • 0 kudos

How to identify the mandatory fields of the create clusters API

After several attempts I found some mandatory fields for the cluster creation API: num_workers, spark_version, node_type_id. I'm not finding these fields directly against the API, but via the job cluster definition in the asset bundle YAML file. I ask the Chat...

Latest Reply
VZLA
Databricks Employee
  • 0 kudos

@guangyi thanks for your question! I understand your concerns. Looking through the docs I could only find a few with the "required" metadata tag, while most seem to be implicitly assumed, e.g.: singleNode with num_workers 0, and similar requirements....
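To make the implicit contract concrete, a sketch of a minimal create call using only the fields found above (the name, runtime version, and node type are illustrative):

```python
from databricks.sdk import WorkspaceClient

w = WorkspaceClient()
info = w.clusters.create_and_wait(
    cluster_name="probe-minimal",        # hypothetical
    spark_version="15.4.x-scala2.12",    # assumed available in the workspace
    node_type_id="Standard_DS3_v2",
    num_workers=1,                       # or 0 with single-node settings
)
print(info.cluster_id)
```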

1 More Replies
vivek_cloudde
by New Contributor III
  • 2300 Views
  • 8 replies
  • 2 kudos

Resolved! Issue while creating an on-demand cluster in Azure Databricks using PySpark

Hello, I am trying to create an on-demand cluster in Azure Databricks using the code below, and I am getting the error message {"error_code":"INVALID_PARAMETER_VALUE","message":"Exactly 1 of virtual_cluster_size, num_workers or autoscale must be specified."...

Latest Reply
VZLA
Databricks Employee
  • 2 kudos

@vivek_cloudde I still find it interesting that for all these different misconfigurations or wrong cluster definitions you got the same error message, but anyway, happy to hear it worked! If it helps, next time and to make things simpler, ...
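For future readers, a sketch of the constraint behind that error message (the payloads are illustrative):

```python
# Exactly one of num_workers or autoscale may be set on clusters/create.
base = {"spark_version": "15.4.x-scala2.12", "node_type_id": "Standard_DS3_v2"}
ok_fixed = {**base, "num_workers": 2}
ok_auto  = {**base, "autoscale": {"min_workers": 1, "max_workers": 4}}
bad      = {**base, "num_workers": 2,
            "autoscale": {"min_workers": 1, "max_workers": 4}}
# POSTing `bad` yields INVALID_PARAMETER_VALUE: "Exactly 1 of
# virtual_cluster_size, num_workers or autoscale must be specified."
```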

7 More Replies
