Topics with Label: Function

Forum Posts

Sorted by:

by alexgv12 • New Contributor III

09-20-2022 3:52:15 PM

1614 Views
2 replies
0 kudos

How can I somehow run spark.something in a worker? - rdd foreach spark.context

i am using rdd to parallelize a function, in this function i format the record i want to save, how can i store from this function the record with a dataframe? because every time i use spark..... an error is generated Caused by: org.apache.spark.api.p...

Data Engineering

1614 Views
2 replies
0 kudos

09-20-2022 3:52:15 PM

View Replies

Latest Reply

Anonymous
Not applicable

10-02-2022 11:15:36 PM

0 kudos

Hi @alexander grajales vanegas Hope all is well! Just wanted to check in if you were able to resolve your issue and would you be happy to share the solution or mark an answer as best? Else please let us know if you need more help. We'd love to hear ...

0 kudos

10-02-2022 11:15:36 PM

1 More Replies

by jay_sharma • New Contributor III

09-13-2022 12:43:56 PM

859 Views
0 replies
4 kudos

Function not found when running from another Notebook using %run command.

Hi all,I'm trying to run some functions from another notebook (data_process_notebook) in my main notebook, using the %run command command. When I run the command: %run ../path/to/data_process_notebook, it is able to complete successfully, no path, pe...

Data Engineering

859 Views
0 replies
4 kudos

09-13-2022 12:43:56 PM

by jakubk • Contributor

08-07-2022 6:01:55 PM

480 Views
0 replies
0 kudos

databricks spark sql Custom table valued function + struct really slow (minutes for a single row)

I'm using azure databricksI have a custom table valued function which takes a URL as a parameter and outputs a single row table with certain elements from the URL extracted/labelled(i get search activity URLs and when in a specific format I can retri...

Data Engineering

480 Views
0 replies
0 kudos

08-07-2022 6:01:55 PM

by Vegard_Stikbakk • New Contributor II

03-25-2022 7:49:21 AM

1362 Views
2 replies
3 kudos

Resolved! External functions on a SQL endpoint

want to create an external function using CREATE FUNCTION (External) and expose it to users of my SQL endpoint. Although this works from a SQL notebook, if I try to use the function from a SQL endpoint, I get "User defined expression is not supporte...

Data Engineering

1362 Views
2 replies
3 kudos

03-25-2022 7:49:21 AM

View Replies

Latest Reply

Kaniz
Community Manager

04-07-2022 2:15:13 PM

3 kudos

Hi @Vegard Stikbakke , Were you able to resolve your problem?

3 kudos

04-07-2022 2:15:13 PM

1 More Replies

by Jeff1 • Contributor II

03-23-2022 8:39:55 AM

920 Views
3 replies
1 kudos

Resolved! Strange object returned using sparklyr

CommunityI'm running a sparklyr "group_by" function and the function returns the following info:# group by event_typeacled_grp_tbl <- acled_tbl %>% group_by("event_type") %>% summary(count = n()) Length Cl...

Data Engineering

920 Views
3 replies
1 kudos

03-23-2022 8:39:55 AM

View Replies

Latest Reply

Jeff1
Contributor II

03-24-2022 5:21:44 AM

1 kudos

I should have deleted the post. While your are correct "event_type" should be without quotes the problem was the Summary function. I was using the wrong function it should have been "summarize."

1 kudos

03-24-2022 5:21:44 AM

2 More Replies

by irfanaziz • Contributor II

01-13-2022 4:39:22 AM

1787 Views
3 replies
1 kudos

Resolved! What is the difference between passing the schema in the options or using the .schema() function in pyspark for a csv file?

I have observed a very strange behavior with some of our integration pipelines. This week one of the csv files was getting broken when read with read function given below.def ReadCSV(files,schema_struct,header,delimiter,timestampformat,encode="utf8...

Data Engineering

1787 Views
3 replies
1 kudos

01-13-2022 4:39:22 AM

View Replies

Latest Reply

jose_gonzalez
Moderator

02-08-2022 4:41:55 PM

1 kudos

Hi @nafri A ,What is the error you are getting, can you share it please? Like @Hubert Dudek mentioned, both will call the same APIs

1 kudos

02-08-2022 4:41:55 PM

2 More Replies

by gbrueckl • Contributor II

10-14-2021 1:12:48 PM

2997 Views
6 replies
4 kudos

Resolved! CREATE FUNCTION from Python file

Is it somehow possible to create an SQL external function using Python code?the examples only show how to use JARshttps://docs.databricks.com/spark/latest/spark-sql/language-manual/sql-ref-syntax-ddl-create-function.htmlsomething like:CREATE TEMPORAR...

Data Engineering

2997 Views
6 replies
4 kudos

10-14-2021 1:12:48 PM

View Replies

Latest Reply

pts
New Contributor II

02-04-2022 6:11:28 PM

4 kudos

As a user of your code, I'd find it a less pleasant API because I'd have to some_module.some_func.some_func() rather than just some_module.some_func()No reason to have "some_func" exist twice in the hierarchy. It's kind of redundant. If some_func is ...

4 kudos

02-04-2022 6:11:28 PM

5 More Replies

by antoooks • New Contributor III

10-25-2021 1:10:39 AM

1603 Views
3 replies
5 kudos

Resolved! display() function always return connection refused on tunneling despite successfully retrieving the schema

Hi everyone,I am using SSH tunnelling with SSHTunnelForwarder to reach a target AWS RDS PostgreSQL database. The connection got through, however when I tried to display the retrieved data frame it always throws "connection refused" error. Please see ...

Data Engineering

1603 Views
3 replies
5 kudos

10-25-2021 1:10:39 AM

View Replies

Latest Reply

jose_gonzalez
Moderator

11-12-2021 4:41:41 PM

5 kudos

hi @Kurnianto Trilaksono Sutjipto ,This seems like a connectivity issue with the url you are trying to connect to. It fails during the display() command because read is a lazy transformation and it will not be executed right away. On the other hand,...

5 kudos

11-12-2021 4:41:41 PM

2 More Replies

by maranBH • New Contributor III

10-19-2021 1:41:18 PM

19867 Views
5 replies
12 kudos

Resolved! How to import a function to another notebook using Repos without %run?

Hi all,I was reading the Repos documentation: https://docs.databricks.com/repos.html#migrate-from-run-commandsIt is explained that, one advantage of Repos is no longer necessary to use %run magic command to make funcions available in one notebook to ...

Data Engineering

19867 Views
5 replies
12 kudos

10-19-2021 1:41:18 PM

View Replies

Latest Reply

maranBH
New Contributor III

10-22-2021 7:25:54 AM

12 kudos

Thank you all for your help! I tried all that was suggested; but I finally realized it was my fault in first place:I was testing Files in Repos with a runtime < 8.4.I was trying to import a file from a DB Notebook instead of a static .py file.Upgradi...

12 kudos

10-22-2021 7:25:54 AM

4 More Replies

by Kaniz • Community Manager

09-21-2021 11:23:47 AM

489 Views
1 replies
0 kudos

What's the difference between a method and a function?

Data Engineering

489 Views
1 replies
0 kudos

09-21-2021 11:23:47 AM

View Replies

Latest Reply

Ryan_Chynoweth
Honored Contributor III

09-22-2021 1:35:00 PM

0 kudos

Typically a method is associated to an object and/or class while a function is not. For example, the following class has a single method called "my_method":class MyClass(): def __init__(self, a): self.a = a def my_method(self): ...

0 kudos

09-22-2021 1:35:00 PM

by User15787040559 • New Contributor III

06-18-2021 9:42:06 AM

2898 Views
2 replies
0 kudos

How to do a unionAll() when the number and the name of columns are different?

Looking at the API for Dataframe.unionAll() when you have 2 different dataframes with different number of columns and names unionAll() doesn't work.How can you do it?One possible solution is using the following function which performs the union of tw...

Data Engineering

2898 Views
2 replies
0 kudos

06-18-2021 9:42:06 AM

View Replies

Latest Reply

sean_owen
Honored Contributor II

06-18-2021 10:11:30 AM

0 kudos

I'm not sure union is the right tool, if the DataFrames have fundamentally different information in them. If the difference is merely column name, yes, rename. If they don't, then the 'union' contemplated here is really a union of columns as well as ...

0 kudos

06-18-2021 10:11:30 AM

1 More Replies

by User16790091296 • Contributor II

05-28-2021 11:40:57 AM

1664 Views
1 replies
0 kudos

Resolved! How can I use a Python function defined in my git-repo module within the DB notebook?

I have a function within a module in my git-repo. I want to import that to my DB notebook - how can I do that?

Data Engineering

1664 Views
1 replies
0 kudos

05-28-2021 11:40:57 AM

View Replies

Latest Reply

aladda
Honored Contributor II

05-28-2021 5:00:00 AM

0 kudos

Databricks Repos allows you to sync your work in Databricks with a remote Git repository. This makes it easier to implement development best practices. Databricks supports integrations with GitHub, Bitbucket, and GitLab. Using Repos you can bring you...

0 kudos

05-28-2021 5:00:00 AM

by Yogi • New Contributor III

04-17-2019 4:50:09 AM

6952 Views
15 replies
0 kudos

Resolved! Can we pass Databricks output to Azure function body?

Hi, Can anyone help me with Databricks and Azure function. I'm trying to pass databricks json output to azure function body in ADF job, is it possible? If yes, How? If No, what other alternative to do the same?

Data Engineering

6952 Views
15 replies
0 kudos

04-17-2019 4:50:09 AM

View Replies

Latest Reply

AbhishekNarain_
New Contributor III

09-10-2019 9:02:02 PM

0 kudos

You can now pass values back to ADF from a notebook.@@Yogi Though there is a size limit, so if you are passing dataset of larger than 2MB then rather write it on storage, and consume it directly with Azure Functions. You can pass the file path/ refe...

0 kudos

09-10-2019 9:02:02 PM

14 More Replies