Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.

Forum Posts

grazie
by Contributor
  • 1335 Views
  • 0 replies
  • 0 kudos

Run a job as different service principals

We currently have several workflows that are essentially copies; the only difference is that they run as different service principals and so have different permissions and configuration based on who is running them. The way this is managed today is...

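One way to collapse such near-identical workflows into a single definition is to keep one job spec and vary only the `run_as` block per deployment. A hedged sketch in the Jobs API 2.1 job-settings format, where `run_as` accepts a `service_principal_name` (the principal's application ID); the job name, notebook path, and ID below are hypothetical:

```json
{
  "name": "shared-etl",
  "run_as": {
    "service_principal_name": "11111111-2222-3333-4444-555555555555"
  },
  "tasks": [
    {
      "task_key": "main",
      "notebook_task": { "notebook_path": "/Shared/etl_main" }
    }
  ]
}
```

Each deployment then needs only a different `run_as` value rather than a full copy of the workflow.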
reshmir18
by New Contributor II
  • 1645 Views
  • 1 replies
  • 0 kudos

Unable to setCheckpointDir in Unity Catalog-enabled workspace

I have a Unity Catalog-enabled workspace where I am trying to call setCheckpointDir at runtime. The method appears to authenticate using fs.azure.account.key instead of storage credentials. I am using a Databricks access connector which has the "Storage Blob ...

Data Engineering
autoloader
Databricks
storagecredentials
streaming
unitycatalog
Latest Reply
reshmir18
New Contributor II
  • 0 kudos

@Retired_mod I have provided all the necessary permissions and was able to browse through the folders of the container added as an external location. I don't understand why the method setCheckpointDir looks for an account key when the access is already ...

Anup
by New Contributor III
  • 8498 Views
  • 1 replies
  • 1 kudos

Resolved! Copy Into : Pattern for sub-folders

While trying to ingest data from the S3 bucket, we are running into a situation where the data in S3 buckets is in sub-folders of multiple depths. Is there a good way of specifying patterns for this case? We tried using the following for a depth o...

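For context, COPY INTO's PATTERN option takes a glob relative to the FROM path, and a single `*` matches only one path segment, so files two sub-folders deep need a pattern like `*/*/*.csv` (one wildcard per level). A small Python illustration of that segment-by-segment glob behaviour, with an invented directory layout:

```python
import glob
import os
import tempfile

# Illustration only: glob syntax where '*' matches a single path segment,
# so files two sub-folders deep need '*/*/*.csv'. The layout is made up.
root = tempfile.mkdtemp()
for rel in ("2024/01/a.csv", "2024/02/b.csv", "top.csv"):
    full = os.path.join(root, rel)
    os.makedirs(os.path.dirname(full), exist_ok=True)
    open(full, "w").close()

# '*/*/*.csv' matches only the depth-2 files, not top.csv
depth2 = sorted(
    os.path.relpath(p, root) for p in glob.glob(os.path.join(root, "*/*/*.csv"))
)
print(depth2)
```

To cover several depths at once, glob alternation such as `PATTERN => '{*.csv,*/*.csv,*/*/*.csv}'` may be an option, assuming the documented `{a,b}` alternation syntax applies here.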
MinMin
by New Contributor II
  • 3080 Views
  • 3 replies
  • 0 kudos

Extra underscore behind ".xlsm" and ".xlsx" after exporting excel files from Databricks

Hi all, I tried to export several Excel files from Databricks, but there is always one extra underscore after ".xlsm" and ".xlsx" when I export them and try to open the files on my local system. I have to manually remove the underscore from the fil...

Latest Reply
DH_Fable
New Contributor II
  • 0 kudos

Hi, did you find a solution to this? I have the same/similar problem: when I save a dataframe from a Databricks notebook using to_excel() it saves the file with extension ".xlsx_" rather than ".xlsx", meaning to open it I have to manually download and ...

2 More Replies
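A workaround sketch for the trailing-underscore symptom, assuming it comes from writing the Excel file straight to a mounted path: write to local disk first, then copy the finished file. The helper below is illustrative, not a Databricks API; on Databricks the final copy would typically use dbutils.fs.cp.

```python
import os
import shutil
import tempfile

def export_via_local(write_fn, dest_path):
    """Write with write_fn to a local temp file, then copy the finished
    file to dest_path. Sketch: on Databricks, the copy step would instead
    be dbutils.fs.cp(f"file:{tmp}", "dbfs:/mnt/...") to cloud storage."""
    tmp = os.path.join(tempfile.mkdtemp(), os.path.basename(dest_path))
    write_fn(tmp)                # e.g. lambda p: df.to_excel(p, index=False)
    shutil.copy(tmp, dest_path)  # local stand-in for dbutils.fs.cp
    return dest_path
```

Usage would look like `export_via_local(lambda p: df.to_excel(p, index=False), "/dbfs/mnt/reports/report.xlsx")`, with pandas/openpyxl assumed available.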
Kira
by New Contributor
  • 939 Views
  • 0 replies
  • 0 kudos

FeatureStoreClient speed up create_training_set

I am trying to create a training set with 10 Feature Lookups (about 1200 features total): # all args for create_training_set df = fs.create_training_set(args).load_df() I must store this data to a delta table for further analysis. Writing this returned da...

Data Engineering
Feature Store
MachineLearning
williamwjs
by New Contributor II
  • 8274 Views
  • 2 replies
  • 1 kudos

Issue with Could not initialize class $linec4a1686037264c21b0e58b369fab8f2d59.$read$

Our job is written in Scala on Databricks. It used to have the same problem, but we managed to work around it by putting all case classes in a separate cell. However, lately it started to fail again with the same error: Could not initialize class $linec4a1...

Latest Reply
williamwjs
New Contributor II
  • 1 kudos

Hi @Retired_mod , may I ask if there's any updates to this issue? Thank you!

1 More Replies
fijoy
by Contributor
  • 16721 Views
  • 6 replies
  • 11 kudos

How to remove widgets from a notebook dashboard?

I'm creating a dashboard from the output of a notebook cell, but noticing that the dashboard displays the the widgets of the notebook in addition to the cell output. How can I remove the widgets from the dashboard?

Latest Reply
Nico2
New Contributor II
  • 11 kudos

Did you find any solution for this? I am facing a similar issue: I want to create multiple dashboards from a single notebook, where not all widgets are relevant for both dashboards. This makes it difficult for users to understand the dashboard.

5 More Replies
Dp15
by Contributor
  • 8728 Views
  • 1 replies
  • 1 kudos

Schema Deletion -Structured Streaming

Hi, I have a Structured Stream which reads data from my silver layer and creates a gold layer using foreachBatch. The stream has been working fine, but now I have a change where there are deletions to the schema and some of the columns from the silver l...

Latest Reply
Dp15
Contributor
  • 1 kudos

@Retired_mod Thank you so much for the detailed explanation.

Phani1
by Valued Contributor II
  • 12671 Views
  • 2 replies
  • 2 kudos

encryption

Hi Databricks, could you please guide me on the below scenario? Here is the use case we are trying to solve for. Currently the environment is using “Voltage” as an encryption tool for encrypting the data in S3 in conjunction with business-provided ...

Latest Reply
AliaCollier
New Contributor II
  • 2 kudos

To replace "Voltage" with Databricks encryption, follow these steps: set up a Customer Managed Key in AWS, configure the S3 bucket, read data in Databricks, and implement custom UDFs for AES encryption/decryption.

1 More Replies
TiagoMag
by New Contributor III
  • 9636 Views
  • 1 replies
  • 2 kudos

Resolved! DLT pipeline evolution schema error

Hello everyone, I am currently working on my first DLT pipeline, and I stumbled on a problem which I am struggling to solve. I am working on several tables where I have a column called "my_column" with an array of JSON objects with two keys: 1st key: score, 2n...

david3
by New Contributor III
  • 3491 Views
  • 4 replies
  • 3 kudos

Resolved! delta live table udf not known when defined in python module

Hi, I have the problem that my "module" is not known when used in a user-defined function. The precise message is posted below. I have a repo structure as follows:

analytics_pipelines
│ ├── __init__.py
│ ├── coordinate_transformation.py
│ ├── d...

Latest Reply
david3
New Contributor III
  • 3 kudos

Hi, yes, I discovered three working possibilities: define the pandas functions as inline functions, as pointed out above; define the pandas function in the same script that is imported as a "library" in the DLT config ( libraries: - notebook: path: ./pipeline...

3 More Replies
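The "library" route in that reply refers to the DLT pipeline settings' libraries list, which names the notebooks the pipeline loads before resolving imports. A hedged sketch of that settings fragment; the paths are hypothetical:

```yaml
libraries:
  - notebook:
      path: ./pipelines/pipeline_main
  - notebook:
      path: ./pipelines/udf_definitions
```

Defining the UDF in a notebook listed here means it is evaluated in the pipeline's own context, which is what makes the function visible to the pipeline at runtime.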
561064
by New Contributor II
  • 6729 Views
  • 2 replies
  • 0 kudos

Exporting delta table to one CSV

The process to export a delta table is taking ~2 hrs. The delta table has 66 partitions, total size ~6 GB, 4 million rows and 270 columns. I used the command below: df.coalesce(1).write.csv("path") What are my options to reduce the time?

Latest Reply
Dribka
New Contributor III
  • 0 kudos

A very interesting task in front of you.... let me know how you solve it!

1 More Replies
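df.coalesce(1) funnels the whole write through a single task, which is usually why it takes so long. One alternative sketch, with a helper that is illustrative rather than any Databricks API: let Spark write its normal many-part CSV output in parallel, then merge the part-*.csv files into one file afterwards, assuming every part carries the same header line:

```python
import glob
import os
import shutil

def merge_csv_parts(parts_dir, out_path):
    """Concatenate Spark-style part-*.csv files into one CSV, keeping
    the header line from the first part only."""
    parts = sorted(glob.glob(os.path.join(parts_dir, "part-*.csv")))
    with open(out_path, "w") as out:
        for i, part in enumerate(parts):
            with open(part) as f:
                header = f.readline()
                if i == 0:
                    out.write(header)
                shutil.copyfileobj(f, out)
    return out_path
```

On Databricks the part files would come from something like `df.write.csv(path, header=True)`, and the merge can then run on the driver against the local /dbfs view of that directory.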
BenLambert
by Contributor
  • 11174 Views
  • 4 replies
  • 0 kudos

Resolved! Explode is giving unexpected results.

I have a dataframe with a schema similar to the following:

id: string
array_field: array
  element: struct
    field1: string
    field2: string
    array_field2: array
      element: struct
        nested_field: stri...

Latest Reply
BenLambert
Contributor
  • 0 kudos

It turns out that if the exploded fields don't match the schema that was defined when reading the JSON in the first place, all the data that doesn't match is silently dropped. This is not really nice default behaviour.

3 More Replies
afk
by New Contributor III
  • 5101 Views
  • 2 replies
  • 2 kudos

Change data feed from target tables of APPLY CHANGES

Up until yesterday I was (sort of) able to read changes from target tables of apply changes operations (either through tables_changes() or using readChangeFeed). I say sort of because the meta columns (_change_type, _commit_version, _commit_timestamp...

ElaPG
by New Contributor III
  • 2340 Views
  • 1 replies
  • 0 kudos

DLT concurrent pipeline updates.

Hi! Regarding this info "An Azure Databricks workspace is limited to 100 concurrent pipeline updates." (Release 2023.16 - Azure Databricks | Microsoft Learn), what is considered an update: changes in pipeline logic, or each pipeline run?

