Community Platform Discussions
Connect with fellow community members to discuss general topics related to the Databricks platform, industry trends, and best practices. Share experiences, ask questions, and foster collaboration within the community.

Forum Posts

jenshumrich
by Contributor
  • 1834 Views
  • 2 replies
  • 0 kudos

Long running jobs get lost

Hello, I tried to schedule a long-running job and, surprisingly, it seems neither to terminate (and thus does not let the cluster shut down) nor to continue running, even though the state is still "Running". But the truth is that the job has miserably ...

Latest Reply
Lakshay
Esteemed Contributor
  • 0 kudos

Have you looked at the SQL plan to see what Spark job 72 was doing?
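
A minimal sketch of how to dig into that, assuming df is the DataFrame behind the long-running write (a hypothetical name): the formatted physical plan plus a job description makes the hanging stage easier to spot in the Spark UI.

# Hedged sketch: `df` stands in for the DataFrame driving the stuck job
df.explain(mode="formatted")  # print the formatted physical plan

# Optional: label the work so the job is easy to find in the Spark UI
spark.sparkContext.setJobDescription("nightly-load: final write")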

1 More Replies
chari
by Contributor
  • 1309 Views
  • 3 replies
  • 0 kudos

Reading CSV file with Spark throws [insufficient privilege] error

Hello Community, I have some CSV files saved in the Databricks workspace and want to read them with Spark. I make use of the command df = spark.read.format('csv').load(r'filepath'). However, it throws the error org.apache.spark.SparkSecurityException: [INSU...

Latest Reply
Lakshay
Esteemed Contributor
  • 0 kudos

If this is a UC-enabled workspace, you need to provide the right access.
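
As a hedged sketch of what "the right access" can look like, assuming the files sit in a Unity Catalog volume (all catalog/schema/volume names below are hypothetical):

# Grant read access on the volume holding the CSVs (names are hypothetical)
spark.sql("GRANT READ VOLUME ON VOLUME my_catalog.my_schema.my_volume TO `user@example.com`")

# After the grant, the read should succeed
df = spark.read.format("csv").option("header", "true").load(
    "/Volumes/my_catalog/my_schema/my_volume/data.csv"
)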

2 More Replies
Ajay-Pandey
by Esteemed Contributor III
  • 2553 Views
  • 3 replies
  • 2 kudos

Resolved! Update regarding Community Reward Store

Hi Team, is there any update on the Community Reward Store? It's been discontinued from the old portal, and we still can't see the new portal for it. Is there any expected date when this will be available for community members?

Latest Reply
Ajay-Pandey
Esteemed Contributor III
  • 2 kudos

Thanks for the update.

2 More Replies
anonymous_567
by New Contributor II
  • 1270 Views
  • 3 replies
  • 0 kudos

Autoloader update table when new changes are made

Hello, every day a new file with the same name gets sent to my storage account, with old and new data appended at the end. Columns may also be added during one of these file updates. This file does a complete overwrite of the previous file. Is it possibl...

Latest Reply
data-grassroots
New Contributor III
  • 0 kudos

This may be helpful - the bit on allow overwrite: https://docs.databricks.com/en/ingestion/auto-loader/faq.html
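
A minimal sketch of that option, assuming the file lands at the same path every day (all paths and table names below are hypothetical):

# cloudFiles.allowOverwrites lets Auto Loader re-process a file that is
# overwritten in place; schema/checkpoint locations are hypothetical.
stream = (
    spark.readStream.format("cloudFiles")
    .option("cloudFiles.format", "csv")
    .option("cloudFiles.allowOverwrites", "true")
    .option("cloudFiles.schemaLocation", "/Volumes/cat/sch/vol/_schemas")
    .load("abfss://container@account.dfs.core.windows.net/incoming/")
)

(stream.writeStream
    .option("checkpointLocation", "/Volumes/cat/sch/vol/_checkpoints")
    .option("mergeSchema", "true")  # tolerate newly added columns
    .trigger(availableNow=True)
    .toTable("cat.sch.target"))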

2 More Replies
Alexandru
by New Contributor III
  • 2353 Views
  • 3 replies
  • 0 kudos

Resolved! vscode python project for development

Hi, I'm trying to set up a local development environment using Python / VS Code / Poetry. Also, linting is enabled (Microsoft Pylance extension) and python.analysis.typeCheckingMode is set to strict. We are using Python files for our code (.py) whit...

Latest Reply
artsheiko
Honored Contributor
  • 0 kudos

Hi Alexandru, take a look at the VS Code extension for Databricks: https://marketplace.visualstudio.com/items?itemName=databricks.databricks

2 More Replies
Hogan
by New Contributor II
  • 977 Views
  • 1 reply
  • 0 kudos

Can browse external storage, but cannot create a table from there - VNet, ADLS Gen2

Hi there! Hope somebody here can help me. We have created a new Databricks account on Azure with the ARM template for VNet injection. We have all the subnets etc., Unity Catalog active, and the connector for Databricks. I now want to create my first tab...

Latest Reply
Hogan
New Contributor II
  • 0 kudos

Hi, to solve this problem, the following Microsoft documentation can be used to configure the NCC to enable the connection between the private Azure storage and the serverless resources: https://learn.microsoft.com/en-us/azure/databricks/security/netwo...

sai_sathya
by New Contributor III
  • 2170 Views
  • 6 replies
  • 1 kudos

DataFrame to CSV write has issues due to multiple commas inside a row value

Hi all, I am working on converting data containing JSON fields with embedded commas into CSV format. I am facing challenges due to the commas within the JSON being misinterpreted as column delimiters during the conversion process. I tried several methods to modify...

Latest Reply
artsheiko
Honored Contributor
  • 1 kudos

Hi Sai, I assume the problem comes not from PySpark but from Excel. I tried to reproduce the error and didn't find a way - that's a good thing, right? Please try the following: df.write.format("csv").save("/Volumes/<my_catalog_name>/<m...
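
If Excel still splits the JSON column, a hedged sketch with explicit quoting (the output path is hypothetical); Spark quotes such values by default, but being explicit avoids surprises:

# Quote and escape so embedded commas stay inside a single CSV column
(df.write.format("csv")
    .option("header", "true")
    .option("quote", '"')
    .option("escape", '"')
    .save("/Volumes/my_catalog/my_schema/my_volume/output"))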

5 More Replies
Nithya_r
by New Contributor II
  • 863 Views
  • 1 reply
  • 0 kudos

Access Delta sharing from Azure Data Factory

I recently got access to Delta Sharing and I am looking to access the data from the tables in the share through ADF. I used linked services such as REST API and HTTP and successfully established a connection using the credential file token and HTTP path, h...

Latest Reply
artsheiko
Honored Contributor
  • 0 kudos

Hey, I think you'll need to use a Databricks activity instead of Copy. See: https://learn.microsoft.com/en-us/azure/data-factory/connector-overview#integrate-with-more-data-stores and https://learn.microsoft.com/en-us/azure/data-factory/transform-data-dat...
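
An alternative worth noting: the open-source delta-sharing Python client can read the share directly with the same credential file. A minimal sketch, with hypothetical paths and table coordinates:

import delta_sharing

# Path to the credential file from the share provider (hypothetical)
profile = "/dbfs/FileStore/config.share"

# <share>.<schema>.<table> coordinates are hypothetical
pdf = delta_sharing.load_as_pandas(profile + "#my_share.my_schema.my_table")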

databird
by New Contributor II
  • 1783 Views
  • 4 replies
  • 1 kudos

Redefine ETL strategy with PySpark approach

Hey everyone! I have some previous experience with data engineering, but I'm totally new to Databricks and Delta tables. Starting this thread hoping to ask some questions and ask for help on how to design a process. So I essentially have 2 Delta tables (sa...

Latest Reply
artsheiko
Honored Contributor
  • 1 kudos

Hi @databird, you can review the code of each demo by opening the content via "View the Notebooks" or by exploring the following repo: https://github.com/databricks-demos (you can try searching for "merge" to see all the occurrences, for example) T...
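
For the incremental-load part of the design, a minimal MERGE sketch in the spirit of those demos, assuming a Delta target keyed by id (table and column names are hypothetical):

from delta.tables import DeltaTable

# updates_df is a hypothetical DataFrame of new/changed rows
target = DeltaTable.forName(spark, "cat.sch.target")
(target.alias("t")
    .merge(updates_df.alias("s"), "t.id = s.id")
    .whenMatchedUpdateAll()
    .whenNotMatchedInsertAll()
    .execute())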

3 More Replies
vinay076
by New Contributor III
  • 1229 Views
  • 2 replies
  • 0 kudos

There is no certification number in my Databricks certificate that I received after passing the

I recently enrolled in the Databricks Data Engineer certification, took the exam, and cleared it successfully. I have received the certificate in the form of a PDF file along with a URL where I can see my certificate and ba...

Latest Reply
Cert-Team
Esteemed Contributor
  • 0 kudos

Hi @vinay076 Thanks for asking! Our support team can provide you with a credential ID. Please file a ticket with our support team, give them your email associated with your certification, and they can get you the credential ID.

1 More Replies
VabethRamirez
by New Contributor II
  • 3478 Views
  • 5 replies
  • 4 kudos

Resolved! How to obtain a list of workflows in Databricks?

I need to obtain a list of my Databricks workflows with their job IDs in a Databricks notebook.

Latest Reply
artsheiko
Honored Contributor
  • 4 kudos

Hi @VabethRamirez, also, instead of using the API directly, you can use the Databricks Python SDK:

%pip install databricks-sdk --upgrade
dbutils.library.restartPython()

from databricks.sdk import WorkspaceClient

w = WorkspaceClient()
job_list = w.jobs...
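
A complete, runnable version of that sketch (assuming databricks-sdk is installed as above; fields come from the SDK's job objects):

from databricks.sdk import WorkspaceClient

w = WorkspaceClient()  # picks up notebook credentials automatically
for job in w.jobs.list():
    print(job.job_id, job.settings.name)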

4 More Replies
RahulChaubey
by New Contributor III
  • 1096 Views
  • 2 replies
  • 0 kudos

Can the query history API /api/2.0/sql/history/queries return data older than 30 days?

I am using this API, but it returns data for only the last 30 days. Can this API return data older than 30 days?

Latest Reply
artsheiko
Honored Contributor
  • 0 kudos

Hi @RahulChaubey, The query history system table was announced during the Q1 roadmap webinar (see the recording, 32:25). There is a chance that it will provide data with a horizon beyond 30 days. Meanwhile, you can enable system tables - I hope some ...
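
For the current API, a minimal sketch of calling it from a notebook; it can only return what the service retains (about 30 days). Host and token values below are hypothetical.

import requests

host = "https://<workspace-url>"   # hypothetical
token = "<personal-access-token>"  # hypothetical

resp = requests.get(
    f"{host}/api/2.0/sql/history/queries",
    headers={"Authorization": f"Bearer {token}"},
    params={"max_results": 100},
)
resp.raise_for_status()
queries = resp.json().get("res", [])  # "res" holds the list of queries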

1 More Replies
QPeiran
by New Contributor III
  • 1401 Views
  • 3 replies
  • 0 kudos

Can a Delta table be the source of streaming/Auto Loader?

Hi, since Auto Loader only accepts "append-only" data as the source, I am wondering if a Delta table can also be the source. Is VACUUM (deleting stale files) or _delta_log (creating nested files in a different format than Parquet) going to break A...

Latest Reply
artsheiko
Honored Contributor
  • 0 kudos

Hi @QPeiran, Auto Loader is a feature that lets you ingest files into the data platform. Once your data is stored in a Delta table, you can rely on spark.readStream.table("<my_table_name>") to continuously read from the table. Take a look at ...
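
A minimal sketch of that pattern, assuming the source Delta table may occasionally see updates or deletes (the table name is hypothetical):

# skipChangeCommits tells the stream to ignore commits that update or
# delete existing rows, so the source no longer has to be append-only.
stream = (
    spark.readStream
    .option("skipChangeCommits", "true")
    .table("cat.sch.my_source_table")
)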

2 More Replies
alano
by New Contributor
  • 602 Views
  • 1 reply
  • 0 kudos

Handling large volumes of streamed transactional data using DLT

We have a data stream from Event Hub with approximately 10 million rows per day (into one table) - these records are insert only (no updates). We are trying to find a solution to aggregate / group the data by multiple data points, and our requ...

Latest Reply
artsheiko
Honored Contributor
  • 0 kudos

Hi, please find below a set of resources I believe are relevant for you.

Success stories: you can find the success stories of companies leveraging streaming on Databricks here.

Videos: Introduction to Data Streaming on the Lakehouse : Structured Stream...
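
For the aggregation itself, a minimal DLT sketch for an insert-only stream; the dataset and column names below are hypothetical:

import dlt
from pyspark.sql import functions as F

@dlt.table(name="daily_event_counts")
def daily_event_counts():
    return (
        dlt.read_stream("raw_events")          # hypothetical source dataset
        .withWatermark("event_time", "1 day")  # bound state for the streaming aggregation
        .groupBy(F.window("event_time", "1 day"), "event_type")
        .agg(F.count("*").alias("events"))
    )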

chemajar
by New Contributor III
  • 2534 Views
  • 3 replies
  • 1 kudos

Resolved! Rearrange tasks in databricks workflow

Hello, is there any way to rearrange tasks in a Databricks workflow? I would like the line that joins the two marked tasks not to pass behind the other tasks. Is it possible to route this line to one side? Thanks.

Latest Reply
artsheiko
Honored Contributor
  • 1 kudos

Hi @chemajar, take a look at Databricks Asset Bundles. It allows you to streamline the development of complex workflows using a YAML definition. In case you need to change the task dependencies, you can rearrange the flow as you need; just change the ...

2 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group