Data Engineering

Forum Posts

Sorted by:

by HGH_Vallarta • New Contributor

01-27-2022 1:05:46 AM

864 Views
0 replies
0 kudos

Not having a standard height becomes quite a challenge for many growing adolescents, which is said to be a condition of idiopathic short stature. So, ...

Not having a standard height becomes quite a challenge for many growing adolescents, which is said to be a condition of idiopathic short stature. So, while we can think it will be “ok” with time, we can’t ignore the elephant in the room. To diagnose ...

Data Engineering

864 Views
0 replies
0 kudos

01-27-2022 1:05:46 AM

by Cordis_Technolo • New Contributor

01-26-2022 11:54:25 PM

1556 Views
0 replies
0 kudos

Inventory Management System Handling the inventory and keeping track of everything is the most difficult, yet important aspect of leading a business...

Inventory Management System Handling the inventory and keeping track of everything is the most difficult, yet important aspect of leading a business. Inventory never stays the same. Due to sales, return, purchases, etc your business’ inventory is alw...

Data Engineering

1556 Views
0 replies
0 kudos

01-26-2022 11:54:25 PM

by venkyv • New Contributor II

01-26-2022 1:51:22 PM

2821 Views
1 replies
3 kudos

Resolved! Can I use Databricks to join data from S3 and Postgres using SQL?

Hello, I'm very much new to Databricks and I'm finding it hard if it's right solution for our needs.Requirement:We have multiple data sources spread across AWS S3 and Postgres. We need a common SQL endpoint that can be used to write queries to join d...

Data Engineering

2821 Views
1 replies
3 kudos

01-26-2022 1:51:22 PM

View Replies

Latest Reply

Hubert-Dudek
Esteemed Contributor III

01-26-2022 2:46:48 PM

3 kudos

Yes you can. You can ETL to data lake storage register your tables to metastore and register your SELECT with JOINS as VIEW or even better create additionally jobs and store your JOINED table. From BI you can connect to databricks sql or to data lake...

3 kudos

01-26-2022 2:46:48 PM

by bluetail • Contributor

01-21-2022 6:48:29 AM

4316 Views
4 replies
2 kudos

Resolved! Value Labels fail to display in Databricks notebook but they are displayed ok in Jupyter

import matplotlib.pyplot as pltimport seaborn as snsimport pandas as pdimport numpy as npprob = np.random.rand(7) + 0.1prob /= prob.sum()df = pd.DataFrame({'department': np.random.choice(['helium', 'neon', 'argon', 'krypton', 'xenon', 'radon', 'ogane...

Data Engineering

4316 Views
4 replies
2 kudos

01-21-2022 6:48:29 AM

View Replies

Latest Reply

Anonymous
Not applicable

01-26-2022 8:53:48 AM

2 kudos

@Maria Bruevich - Do either of these answers help? If yes, would you be happy to mark one as best so that other members can find the solution more quickly?

2 kudos

01-26-2022 8:53:48 AM

3 More Replies

by guruv • New Contributor III

01-15-2022 1:18:38 AM

7219 Views
4 replies
1 kudos

Resolved! Saprk UI not showing any running tasks

HI,I am running a Notebook job calling a JAR code (application code implmented in C#). in the Spark UI page for almost 2 hrs, it'w not showing any tasks and even the CPU usage is below 20%, memory usage is very small. Before this 2 hr window it shows...

Data Engineering

7219 Views
4 replies
1 kudos

01-15-2022 1:18:38 AM

View Replies

Latest Reply

Atanu
Databricks Employee

01-25-2022 9:03:42 PM

1 kudos

If I understood the issue correctly .

1 kudos

01-25-2022 9:03:42 PM

3 More Replies

by thomasthomas • New Contributor II

01-14-2022 6:36:36 AM

3221 Views
4 replies
0 kudos

Resolved! Customer deployment

Hi,I have a bunch of scripts in Databricks that perform a decent amount of data-wrangling. All of these scripts contain sensitive information and I have no intention of making them public.I would like to provide a service to my customers - so they ca...

Data Engineering

3221 Views
4 replies
0 kudos

01-14-2022 6:36:36 AM

View Replies

Latest Reply

Atanu
Databricks Employee

01-25-2022 7:36:25 PM

0 kudos

@Tamas D I understood your concern. For cluster creation in different subscription I think that's by design at this moment. But I would like to request you to add your use case to https://feedback.azure.com/d365community/forum/2efba7dc-ef24-ec11-b6...

0 kudos

01-25-2022 7:36:25 PM

3 More Replies

by Mateo • New Contributor II

01-24-2022 10:56:10 PM

1789 Views
2 replies
0 kudos

Hi all, I'm having some trouble with my Certification Transcript in the Academy Portal. I've passed "Databricks Certified Associate Devel...

Hi all,I'm having some trouble with my Certification Transcript in the Academy Portal. I've passed "Databricks Certified Associate Developer for Apache Spark 3.0" last year and everything seemed fine (apart from the fact that I've been issued two sep...

Data Engineering

1789 Views
2 replies
0 kudos

01-24-2022 10:56:10 PM

View Replies

Latest Reply

Mateo
New Contributor II

01-25-2022 9:13:26 AM

0 kudos

Hey @Piper Wilson ! Thank you for your response. Unfortunately, I already created a support ticket through the address provided in this post you mentioned. And I got a 'case closed' e-mail after over two weeks with no response and no fix (certificat...

0 kudos

01-25-2022 9:13:26 AM

1 More Replies

by MattM • New Contributor III

01-23-2022 6:51:30 AM

4562 Views
3 replies
2 kudos

Resolved! Pricing Spot Instance vs New Job Cluster

We are running multiple Databricks job via ADF. I was wondering which option out of the below is a cheaper route for databricks notebook processing from ADF. When I create a ADF linked service, which should I use to lower my cost.New Job Cluster opti...

Data Engineering

4562 Views
3 replies
2 kudos

01-23-2022 6:51:30 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

01-25-2022 6:19:25 AM

2 kudos

the instance pool will be cheaper if you use spot instances. But only if you size your instance pool correctly. (number of workers and scale down time)AFAIK you cannot use spot instances for job clusters in ADF

2 kudos

01-25-2022 6:19:25 AM

2 More Replies

by swzzzsw • New Contributor III

01-24-2022 11:34:29 AM

7489 Views
5 replies
2 kudos

Resolved! Pass variable values from one task to another

I created a Databricks job with multiple tasks. Is there a way to pass variable values from one task to another. For example, if I have tasks A and B as Databricks notebooks. Can I create a variable (e.g. x) in notebook A and later use that value in ...

Data Engineering

7489 Views
5 replies
2 kudos

01-24-2022 11:34:29 AM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

01-25-2022 7:26:43 AM

2 kudos

you could also consider using an orchestration tool like Data Factory (Azure) or Glue (AWS). there you can inject and use parameters from notebooks.The job scheduling of databricks also has the possibility to add parameters, but I do not know if yo...

2 kudos

01-25-2022 7:26:43 AM

4 More Replies

by MiguelKulisic • New Contributor II

01-21-2022 1:52:10 PM

9387 Views
2 replies
4 kudos

Resolved! ProtocolChangedException on concurrent blind appends to delta table

Hello, I am developing an application that runs multiple processes that write their results to a common delta table as blind appends. According to the docs I've read online: https://docs.databricks.com/delta/concurrency-control.html#protocolchangedex...

Data Engineering

9387 Views
2 replies
4 kudos

01-21-2022 1:52:10 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

01-25-2022 6:36:33 AM

4 kudos

I think you are right, the mergeSchema will change the schema of the table, but if you both write to that same table with another schema, which one will it be?Can you check if both of you actually write the same schema, or remove the mergeschema?

4 kudos

01-25-2022 6:36:33 AM

1 More Replies

by study_community • New Contributor III

01-22-2022 11:51:16 PM

3932 Views
2 replies
3 kudos

Resolved! Error creating delta table over an existing delta schema

I created a delta table through a cluster over a dbfs location .Schema :create external table tmp_db.delta_data(delta_id int ,delta_name varchar(20) , delta_variation decimal(10,4) ,delta_incoming_timestamp timestamp,delta_date date generated always ...

Data Engineering

3932 Views
2 replies
3 kudos

01-22-2022 11:51:16 PM

View Replies

Latest Reply

-werners-
Esteemed Contributor III

01-25-2022 6:28:51 AM

3 kudos

varchartype is only available as from spark 3.1 I think.https://spark.apache.org/docs/latest/sql-ref-datatypes.htmlThe link is for spark 3.2, and 3.1 also has varchartype. So can you check your spark version?Also if the table definition still exists...

3 kudos

01-25-2022 6:28:51 AM

1 More Replies

by Soma • Valued Contributor

01-11-2022 6:30:19 AM

3744 Views
4 replies
0 kudos

Queries regarding workspace Migration to Premium

We are planning to migrate from standard to premium workspaceWe need to know if below artifacts will be maintainedneed to check on streaming Job DowntimeAccess token DBFS Access Production Cluster /JobsCluster ID Job ID and other properties like URL ...

Data Engineering

3744 Views
4 replies
0 kudos

01-11-2022 6:30:19 AM

View Replies

Latest Reply

Soma
Valued Contributor

01-25-2022 4:36:57 AM

0 kudos

hi @Kaniz Fatma then I can assume there wont be any impact on metastore and all the metadata(table definition,schema ) will be available post upgradation

0 kudos

01-25-2022 4:36:57 AM

3 More Replies

by aksharamaham • New Contributor

01-24-2022 1:50:53 AM

2173 Views
1 replies
0 kudos

Delta Live Table - How to get details of which records were excluded in Quality Checks?

I've been experimenting with DLT and it works well. I'd like to understand where can I see details of which records didn't meet the quality critera?

Data Engineering

2173 Views
1 replies
0 kudos

01-24-2022 1:50:53 AM

View Replies

Latest Reply

Anonymous
Not applicable

01-24-2022 7:45:29 AM

0 kudos

Hello, @Paresh J! Welcome and thank you for asking! My name is Piper, and I'm a moderator for Databricks.Let's give the community some time to help before we circle back to you. Thanks in advance for your patience.

0 kudos

01-24-2022 7:45:29 AM

by Situs_UG300_Off • New Contributor

01-24-2022 4:22:57 AM

670 Views
0 replies
0 kudos

res.cloudinary.com

Link UG300 ada menyediakan depo tipe e- wallet yang dapat dipakai unyuk dapat melaksanakan pembelian ataupun top up saldo ke e- wallet tujuan yang telah ada di dalam web. Adanya berita gembira buat kalian yang tidak mempunyai rekening bank, Jika kali...

Data Engineering

670 Views
0 replies
0 kudos

01-24-2022 4:22:57 AM

by Ravi1979 • New Contributor

01-21-2022 1:08:02 AM

3394 Views
1 replies
0 kudos

Can you share some examples of seldon core batch prediction with mlflow server as backend and airflow as workflow engine?

Data Engineering

3394 Views
1 replies
0 kudos

01-21-2022 1:08:02 AM

View Replies

Latest Reply

Anonymous
Not applicable

01-21-2022 12:16:08 PM

0 kudos

Hello, @Ravi Param - My name is Piper, and I'm one of the moderators here. Thank you for your question! Let's give the community a chance to respond and then we'll circle back if necessary.

0 kudos

01-21-2022 12:16:08 PM

Databricks Community

Forum Posts

Not having a standard height becomes quite a challenge for many growing adolescents, which is said to be a condition of idiopathic short stature. So, ...

Inventory Management System Handling the inventory and keeping track of everything is the most difficult, yet important aspect of leading a business...

Resolved! Can I use Databricks to join data from S3 and Postgres using SQL?

Resolved! Value Labels fail to display in Databricks notebook but they are displayed ok in Jupyter

Resolved! Saprk UI not showing any running tasks

Resolved! Customer deployment

Hi all, I'm having some trouble with my Certification Transcript in the Academy Portal. I've passed "Databricks Certified Associate Devel...

Resolved! Pricing Spot Instance vs New Job Cluster

Resolved! Pass variable values from one task to another

Resolved! ProtocolChangedException on concurrent blind appends to delta table

Resolved! Error creating delta table over an existing delta schema

Queries regarding workspace Migration to Premium

Delta Live Table - How to get details of which records were excluded in Quality Checks?

res.cloudinary.com

Can you share some examples of seldon core batch prediction with mlflow server as backend and airflow as workflow engine?

Join Us as a Local Community Builder!

Issue with Lakebridge transpile installation – SSL...

Spark JDBC Netsuite error - SQLSyntaxErrorExcepti...

Syncing lakebase table to delta table

Online Table Migration

How can I execute a Spark SQL query inside a Unity...