Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
I am trying to sign up for the Community Edition (https://databricks.com/try-databricks) for use with a Databricks Academy course. However, I am unable to sign up, and I receive the following error (image attached). On going to the login page (link in ora...
I have a sample set of Power BI (.pbix) reports with all the dropdowns, tables, filters, etc. Now I would like to migrate these reports to Databricks. Whatever visuals were created in Power BI, I would like to create the same in Databricks from scratch. I wou...
First, make sure the Databricks cluster is operational. These are the steps needed to integrate Azure Databricks with Power BI Desktop. 1. Construct the URL for the connection: connect to the cluster, and click the Advanced...
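As an illustration, here is a minimal sketch of the two values the Power BI connector asks for; the hostname and HTTP path below are hypothetical placeholders, and the real ones are found under the cluster's Advanced Options > JDBC/ODBC tab:

# Values copied from the cluster's JDBC/ODBC tab (placeholders shown).
server_hostname = "adb-1234567890123456.7.azuredatabricks.net"
http_path = "sql/protocolv1/o/1234567890123456/0123-456789-abcde1"

# Power BI's Azure Databricks connector prompts for exactly these two strings.
print(server_hostname, http_path)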
@Panda To be precise, the Databricks REST API does exist, but it is not reachable from everywhere (REST stands for Representational State Transfer, not "Ready Everywhere"). You cannot connect to the API of workspace 1 from a notebook in workspace 2 when there is no network path between them. Workspace 1 cannot resolve the hostname for Works...
Access to the Databricks APIs requires the user to authenticate. This usually means creating a PAT (Personal Access Token). Conveniently, a token is readily available to you when you are using a Databricks notebook.

databricksURL = dbutils.notebook....
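Since the snippet above is cut off, here is a minimal sketch of the common pattern for pulling the API URL and an ephemeral token out of the notebook context; note that the entry_point/getContext() chain is an internal, undocumented interface, so treat it as an assumption that may change between runtime versions:

# Grab the API URL and an ephemeral token from the running notebook's context.
# entry_point and getContext() are internal Databricks interfaces.
ctx = dbutils.notebook.entry_point.getDbutils().notebook().getContext()
databricksURL = ctx.apiUrl().getOrElse(None)
databricksToken = ctx.apiToken().getOrElse(None)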
@User16752245312 You can use a Databricks secret scope to manage sensitive data such as personal access tokens (PATs) securely. Storing your token in a secret scope ensures you don't hard-code credentials in your notebook, making it more secure. For mo...
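A minimal sketch of reading a token back out of a secret scope; the scope and key names here are hypothetical placeholders:

# Fetch a PAT stored in a secret scope instead of hard-coding it.
token = dbutils.secrets.get(scope="my-scope", key="databricks-pat")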
Is there a way to get the directory size in ADLS (Gen2) using dbutils in Databricks?
If I run this
dbutils.fs.ls("/mnt/abc/xyz")
I get the file sizes inside the xyz folder (there are about 5,000 files). I want to get the size of the xyz folder itself.
how ca...
File size is only reported for files, so if you specify a directory as your source, you have to iterate through it. The snippet below should work (and should be faster than the other solutions).

import glob
def get_directory_size_in_byt...
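Since the original snippet is cut off, here is a minimal sketch of the same idea using dbutils.fs.ls instead of glob (my assumption: dbutils works against ADLS mounts, where glob on the local filesystem generally does not):

# Recursively sum file sizes under a DBFS/ADLS path.
# FileInfo objects returned by dbutils.fs.ls expose .path, .size, and .isDir().
def get_directory_size_in_bytes(path):
    total = 0
    for item in dbutils.fs.ls(path):
        if item.isDir() and item.path != path:
            total += get_directory_size_in_bytes(item.path)
        else:
            total += item.size
    return total

print(get_directory_size_in_bytes("/mnt/abc/xyz"))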
I want to add custom logs that are redirected to the Spark driver logs. Can I use the existing logger classes to get my application logs or progress messages into the Spark driver logs?
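One common pattern, sketched below under the assumption that you are in a PySpark notebook where sc (the SparkContext) is available: grab the JVM-side log4j logger so your messages land in the driver log alongside Spark's own output. The logger name "my_app" is a placeholder.

# Obtain the JVM log4j logger via py4j; messages appear in the driver log.
log4j = sc._jvm.org.apache.log4j
logger = log4j.LogManager.getLogger("my_app")
logger.info("Pipeline stage 1 complete")
logger.warn("Row count lower than expected")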
1) Is it possible to save all the custom logging to its own file? Currently it is being logged together with all the other cluster logs (see image). 2) Also, it seems like Databricks is creating a lot of blank files for this. Is this a bug? This include...
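On question 1, a minimal sketch using Python's standard logging module to send your messages to a dedicated file. The path and logger name are hypothetical; I write to local /tmp first, since appending to /dbfs paths through the local file API can be unreliable, and copy the file out at the end of the job:

import logging

# Route application messages to a dedicated local file on the driver.
logger = logging.getLogger("my_app")
logger.setLevel(logging.INFO)
handler = logging.FileHandler("/tmp/my_app.log")
handler.setFormatter(logging.Formatter("%(asctime)s %(levelname)s %(message)s"))
logger.addHandler(handler)
logger.info("This goes to /tmp/my_app.log only")

# Optionally persist the file to DBFS when the job finishes:
# dbutils.fs.cp("file:/tmp/my_app.log", "dbfs:/logs/my_app.log")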
Hello, I am doing the Data Science and Machine Learning course.
The Boston housing dataset has unintuitive column names. I want to rename them, e.g. so 'zn' becomes 'Zoning'.
When I run this command:
df_bostonLegible = df_boston.rename({'zn':'Zoning'}, axi...
If df_boston is a DataFrame but you still face issues, try an alternative syntax: df_boston = df_boston.rename(columns={'zn': 'Zoning'}). Make sure df_boston is a proper pandas DataFrame and that you're using a recent version of pandas.
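A minimal self-contained illustration of that syntax; the column values below are made up for the example:

import pandas as pd

# Two of the Boston housing columns with placeholder values.
df_boston = pd.DataFrame({"zn": [18.0, 0.0], "crim": [0.006, 0.027]})
df_bostonLegible = df_boston.rename(columns={"zn": "Zoning"})
print(df_bostonLegible.columns.tolist())  # ['Zoning', 'crim']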
If I want to run a TPC-DS test on Databricks, what are the steps involved? Is the data already available on the Databricks file system, or do I have to download or generate it from somewhere?
Hey there, User16776431030. Great question about those magic commands in Databricks! Let me shed some light on this mystical matter. The %pip and %sh pip commands may seem similar on the surface, but they're quite distinct in their powers. %sh pip is l...
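To make the distinction concrete, a minimal sketch (the package name is arbitrary); remember that a magic command must be the first line of its notebook cell:

%pip install nltk
# installs into the notebook's Python environment and is propagated
# to every node of the cluster for this notebook session

%sh pip install nltk
# runs pip as a plain shell command on the driver node only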
Yes, it is possible to connect Databricks to a kerberized HBase cluster. The attached article explains the steps: setting up a Kerberos client using a keytab on the cluster nodes, installing the hbase-spark integration library, and set...
When I am trying to read a Snowflake table from my Databricks notebook, it is giving an error. The code:

df1.read.format("snowflake") \
    .options(**options) \
    .option("query", "select * from abc") \
    .save()

Getting below error:

java.sql.SQLException: No suitable dri...
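As an aside, independent of the driver error, two things in that snippet look off: reads normally go through spark.read and end with .load(), not .save(). A minimal sketch of the usual Snowflake connector read pattern, assuming an options dict with sfUrl, sfUser, etc. is defined elsewhere:

# Read via the Snowflake Spark connector; .load() executes the query.
df1 = (spark.read.format("snowflake")
       .options(**options)
       .option("query", "select * from abc")
       .load())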
Is this random number not possible to extract from the notebook context? It is available in browser_hash, but that is not populated when running a job. Is this random number static, or does it change over time? If it is static, it can then be hardco...
What are the steps needed to connect to a DB2 AS/400 source to pull data to the lake using Databricks? I believe it requires establishing a JDBC connection, but I could not find much detail online.
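For what it's worth, a minimal sketch of such a JDBC read, assuming the IBM Toolbox for Java (jt400) driver is installed on the cluster; the host, library, table, user, and secret names are all hypothetical:

# Read a DB2 for i (AS/400) table over JDBC with the jt400 driver.
df = (spark.read.format("jdbc")
      .option("url", "jdbc:as400://my-as400-host")
      .option("driver", "com.ibm.as400.access.AS400JDBCDriver")
      .option("dbtable", "MYLIB.MYTABLE")
      .option("user", "my_user")
      .option("password", dbutils.secrets.get("my-scope", "as400-password"))
      .load())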
Hi @Ajay Menon, hope all is well! Just wanted to check in: were you able to resolve your issue, and would you be happy to share the solution or mark an answer as best? Otherwise, please let us know if you need more help. We'd love to hear from you. Thanks!
@keenan_jones7 I had the same problem today. It looks like you've copied and pasted the JSON that Databricks displays in the GUI when you select View JSON from the dropdown menu when viewing a job.In order to use that JSON in a request to the Jobs ...
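To illustrate the difference, a minimal sketch of a create call against the Jobs 2.1 API; the host, token, notebook path, and cluster spec below are hypothetical. The UI's View JSON output wraps these settings under a "settings" key and adds read-only fields such as job_id that don't belong in a create request, so send the bare settings object instead:

import requests

host = "https://<workspace-host>"   # hypothetical workspace URL
token = "<pat>"                     # hypothetical personal access token

resp = requests.post(
    f"{host}/api/2.1/jobs/create",
    headers={"Authorization": f"Bearer {token}"},
    json={
        "name": "example-job",
        "tasks": [{
            "task_key": "main",
            "notebook_task": {"notebook_path": "/Users/me/my_notebook"},
            "new_cluster": {
                "spark_version": "13.3.x-scala2.12",
                "node_type_id": "i3.xlarge",
                "num_workers": 1,
            },
        }],
    },
)
resp.raise_for_status()
print(resp.json())   # {"job_id": ...}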