Topics with Label: Type

Forum Posts

Sorted by:

by cmilligan • Contributor II

11-11-2022 9:09:24 AM

1542 Views
3 replies
2 kudos

Dropdown for parameters in a job

I want to be able to denote the type of run from a predetermined list of values that a user can choose from when kicking off a run using different parameters. Our team does standardized job runs on a weekly cadence but can have timeframes that change...

Data Engineering

1542 Views
3 replies
2 kudos

11-11-2022 9:09:24 AM

View Replies

Latest Reply

dev56
New Contributor II

3 weeks ago

2 kudos

Hi @cmilligan , I have a similar requirement and would really be grateful if you could provide me with any information on how to fix this issue. Thanks a lot!

2 kudos

3 weeks ago

2 More Replies

by mbejarano89 • New Contributor III

06-21-2023 6:13:57 AM

1184 Views
2 replies
0 kudos

Running a K-means (.fit) gives error:Params must be either a param map or a list/tuple of param maps but got %s." % type(params)

am running a k-means algorithm. My feature are DoubleType and have no nulls, but I get : raise TypeError("Params must be either a param map or a list/tuple of param maps but got %s." % type(params). Anyone have any idea how to solve this?File /datab...

Data Engineering

1184 Views
2 replies
0 kudos

06-21-2023 6:13:57 AM

View Replies

Latest Reply

mbejarano89
New Contributor III

06-22-2023 3:29:11 AM

0 kudos

I found the answer just by trying several things, although I do not understand exactly what the problem was. All I had to do was to cache the input data before fitting the model:assemble=VectorAssembler(inputCols=columns_input, outputCol='features')...

0 kudos

06-22-2023 3:29:11 AM

1 More Replies

by kll • New Contributor III

05-01-2023 8:21:36 PM

2714 Views
2 replies
3 kudos

Nested struct type not supported pyspark error

I am attempting to apply a function to a pyspark DataFrame and save the API response to a new column and then parse using `json_normalize`. This works fine in pandas, however, I run into an exception with `pyspark`. import pyspark.pandas as ps i...

Data Engineering

2714 Views
2 replies
3 kudos

05-01-2023 8:21:36 PM

View Replies

Latest Reply

Anonymous
Not applicable

05-18-2023 11:25:47 PM

3 kudos

Hi @Keval Shah Thank you for posting your question in our community! We are happy to assist you.To help us provide you with the most accurate information, could you please take a moment to review the responses and select the one that best answers yo...

3 kudos

05-18-2023 11:25:47 PM

1 More Replies

by PraveenC • New Contributor II

01-02-2023 6:21:03 PM

1634 Views
4 replies
3 kudos

[Databricks][JDBC](10400) Invalid type for data - column: 10, type: Array

Getting below error while mapping an Array Column to String[] entity. Please suggest if Databricks JDBC support entity mapping of Array Values [Worked the same code for below config - H2 DB version - 2.1.214 and org.hibernate.dialect.H2Dialect - ...

Data Engineering

1634 Views
4 replies
3 kudos

01-02-2023 6:21:03 PM

View Replies

Latest Reply

Atanu
Esteemed Contributor

04-12-2023 7:19:11 AM

3 kudos

Hello @Emmanuel Trindade @Praveen C This does not look like coming from Databricks end. Look at the error thread.javax.persistence.PersistenceException: org.hibernate.exception.DataException: Could not read entity state from ResultSet : EntityKey...

3 kudos

04-12-2023 7:19:11 AM

3 More Replies

by Philearner • New Contributor II

03-16-2023 8:15:20 PM

1320 Views
3 replies
3 kudos

Unable to find input by typing input in the Multiselect Widget

In the AWS databricks widgets.multiselect, I'm unable to find input by typing input in the mulitselect bar. It was working before. Although I can find the inputs by scrolling down the list, it's annoying if the list is long.Here's my script:measlis...

Data Engineering

1320 Views
3 replies
3 kudos

03-16-2023 8:15:20 PM

View Replies

Latest Reply

Anonymous
Not applicable

03-17-2023 11:14:06 PM

3 kudos

Hi @Philip Teu Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear from you.Thanks!

3 kudos

03-17-2023 11:14:06 PM

2 More Replies

by MShee • New Contributor II

03-14-2023 1:00:18 AM

752 Views
1 replies
1 kudos

Resolved! Unable to type in dropdown of dbutils.widgets.dropdown() In the AWS databricks widgets.dropdown, I'm unable to type input in the dropdown box like in the below screenshot:

Data Engineering

752 Views
1 replies
1 kudos

03-14-2023 1:00:18 AM

View Replies

Latest Reply

NandiniN
Valued Contributor II

03-14-2023 3:55:07 AM

1 kudos

Hello @M Shee ,In a drop down you can select a value from a list of provided values, not type the values in. What you might be interested in is a combobox - It is combination of text and dropdown. It allows to select a value from a provided list or ...

1 kudos

03-14-2023 3:55:07 AM

by jonathan-dufaul • Valued Contributor

01-25-2023 10:16:34 AM

990 Views
2 replies
2 kudos

Resolved! Does anyone have a single example of a graphframe with two+ types of vertices? (e.g. user and post, not user to user)

I have gone through about 75 pages and every single example has only relationships from one type of object to the same type of object. about 90% have the exact same example of "Alice Bob" "friends."Has anyone ever made a graphframe with two types of ...

Data Engineering

990 Views
2 replies
2 kudos

01-25-2023 10:16:34 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

01-26-2023 4:53:12 AM

2 kudos

I feel your pain,I once tried to use graphframes to flatten a complex tree, ended up using graphX (which is even worse to use but at least it is more flexible).So maybe take a look at graphX? Beware, it is terrible to use.I wonder what happened to m...

2 kudos

01-26-2023 4:53:12 AM

1 More Replies

by Panna • New Contributor II

09-27-2022 6:01:43 PM

963 Views
2 replies
3 kudos

Is there only one element type option for an array?

I'm creating an array which contains both string and double, just wondering if I can have multiple element type options for one array column? Thanks

Data Engineering

963 Views
2 replies
3 kudos

09-27-2022 6:01:43 PM

View Replies

Latest Reply

Kaniz
Community Manager

09-29-2022 3:25:21 AM

3 kudos

Hi @Panna Pan , We haven’t heard from you on the last response from @Debayan Mukherjee, and I was checking back to see if his suggestions helped you. Or else, If you have any solution, please do share that with the community as it can be helpful to...

3 kudos

09-29-2022 3:25:21 AM

1 More Replies

by danny_edm • New Contributor

08-19-2022 9:44:18 PM

352 Views
0 replies
0 kudos

collect_set wired result when Proton enable

Cluster : DBR 10.4 LTS with protonSample schemaseq_no (decimal)type (string)Sample dataseq_no type1 A1 A2 A2 B2 Bcommand : F.size(F.collect_set(F.col("type")).over(Window.partitionBy("seq_no"))...

Data Engineering

352 Views
0 replies
0 kudos

08-19-2022 9:44:18 PM

by cmotla • New Contributor III

03-18-2022 2:58:04 PM

1322 Views
3 replies
8 kudos

Issue with complex json based data frame select

We are getting the below error when trying to select the nested columns (string type in a struct) even though we don't have more than a 1000 records in the data frame. The schema is very complex and has few columns as struct type and few as array typ...

Data Engineering

1322 Views
3 replies
8 kudos

03-18-2022 2:58:04 PM

View Replies

Latest Reply

Kaniz
Community Manager

05-09-2022 6:41:09 AM

8 kudos

Hi @Chaitanya Motla , Just a friendly follow-up. Do you still need help, or did you find the solution? Please let us know.

8 kudos

05-09-2022 6:41:09 AM

2 More Replies

by NAS • New Contributor III

03-07-2022 10:45:53 AM

937 Views
1 replies
1 kudos

Resolved! "import pandas as pd" => [Errno 5]

When I type import pandas as pdfrom a Notebook in a Repo I get:--------------------------------------------------------------------------- AttributeError Traceback (most recent call last) /usr/lib/python3.8/importlib/_boots...

Data Engineering

937 Views
1 replies
1 kudos

03-07-2022 10:45:53 AM

View Replies

Latest Reply

NAS
New Contributor III

03-07-2022 4:33:49 PM

1 kudos

Thanks to Elliott Hertz, I found out that the ML Experiments cannot be stored in the repo. After I moved them to my Workspace everything seems to work.

1 kudos

03-07-2022 4:33:49 PM

by sravan_enukonda • New Contributor II

08-24-2021 8:43:08 AM

1469 Views
3 replies
2 kudos

Resolved! I am looking for best practices in implementing Ranger type of Access control in Databricks ?

Need this to do auditing and numbers of users accessing databases and tables created in databricks

Data Engineering

1469 Views
3 replies
2 kudos

08-24-2021 8:43:08 AM

View Replies

Latest Reply

Kaniz
Community Manager

03-04-2022 6:43:38 AM

2 kudos

Hi @sravankumar enukonda , How are you?

2 kudos

03-04-2022 6:43:38 AM

2 More Replies

by frank26364 • New Contributor III

01-20-2022 5:36:35 AM

6400 Views
4 replies
0 kudos

Resolved! Command prompt won't let me type the Databricks token

Hi, I am trying to set up Databricks CLI using the command prompt on my computer. I downloaded the Python 3.9 app and successfully ran the command pip install databricks-cliWhen I try to set up the Databricks token, I am able to type my Databricks Ho...

Data Engineering

6400 Views
4 replies
0 kudos

01-20-2022 5:36:35 AM

View Replies

Latest Reply

Anonymous
Not applicable

01-21-2022 12:03:47 PM

0 kudos

Hey there! You're on a roll today! Thanks for letting us know.

0 kudos

01-21-2022 12:03:47 PM

3 More Replies

by AzureDatabricks • New Contributor III

11-21-2021 11:18:10 PM

5700 Views
7 replies
2 kudos

Resolved! Can we store 300 million records and what is the preferable compute type and config?

How we can persist 300 million records? What is the best option to persist data databricks hive metastore/Azure storage/Delta table?What is the limitations we have for deltatables of databricks in terms of data?We have usecase where testers should be...

Data Engineering

5700 Views
7 replies
2 kudos

11-21-2021 11:18:10 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

11-21-2021 11:26:42 PM

2 kudos

You can certainly store 300 million records without any problem.The best option kinda depends on the use case. If you want to do a lot of online querying on the table, I suggest using delta lake, which is optimeized (using z-order, bloom filter, par...

2 kudos

11-21-2021 11:26:42 PM

6 More Replies

by Anonymous • Not applicable

06-18-2021 2:15:42 PM

666 Views
1 replies
0 kudos

Resolved! Any recommendations on instance type for z-order / vacuum/ optimize ?

Data Engineering

666 Views
1 replies
0 kudos

06-18-2021 2:15:42 PM

View Replies

Latest Reply

aladda
Honored Contributor II

06-21-2021 1:20:44 PM

0 kudos

For Delta in general having Delta cache accelerates data reads by creating copies of remote files in nodes’ local storage using a fast intermediate data format. The data is cached automatically whenever a file has to be fetched from a remote locatio...

0 kudos

06-21-2021 1:20:44 PM