cancel
Showing results for 
Search instead for 
Did you mean: 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

NT911
by New Contributor II
  • 1088 Views
  • 0 replies
  • 0 kudos

how to reduce file size in sedona o/p

I have shape files with polygon/geometry info. I am exporting the file after Sedona integration with Kepler.I o/p file is in .html. I want to reduce the file size.Pls suggest in case any option is available.

  • 1088 Views
  • 0 replies
  • 0 kudos
data-warriors
by New Contributor
  • 1053 Views
  • 0 replies
  • 0 kudos

workspace deletion at Databricks recovery

Hi Team,I accidentally deleted our databricks workspace, which had all our artefacts and control plane, and was the primary resource for our team's working environment.Could anyone please help on priority, regarding the recovery/ restoration mechanis...

  • 1053 Views
  • 0 replies
  • 0 kudos
Mist3
by New Contributor II
  • 6033 Views
  • 9 replies
  • 4 kudos

Dashboard API - Create a dashboard object doesn't work

I am trying to copy a dashboard object from one workspace to another using API. I am using Get dashboard objects (/api/2.0/preview/sql/dashboards GET method), then Retrieve a definition (/api/2.0/preview/sql/dashboards/{dashboard_id} GET method) and ...

  • 6033 Views
  • 9 replies
  • 4 kudos
Latest Reply
eason_gao_db
Databricks Employee
  • 4 kudos

Hi @markusk, unfortunately the issue you're running into is a limitation of the outgoing SQL dashboard APIs. You can create blank dashboards, but you cannot programmatically insert existing definitions. The good news is we're currently running a prev...

  • 4 kudos
8 More Replies
SJR
by New Contributor III
  • 7748 Views
  • 4 replies
  • 2 kudos

Resolved! Problem when updating Databricks Repo through DevOps Pipeline

Hello all!I've been working on integrating a Databricks Repos update API call to a DevOps Pipeline so that the Databricks local repo stays up to date with the remote staging branch (Pipeline executes whenever there's a new commit in to the staging br...

Data Engineering
CICD
Data_Engineering
DevOps
pipelines
repo
  • 7748 Views
  • 4 replies
  • 2 kudos
Latest Reply
SJR
New Contributor III
  • 2 kudos

@BookerE1 I found it!. There was already another thread related to this problem and someone else helped me find the solution (Problem was the pool that I was using for the pipeline)This is the link to the other thread: https://community.databricks.co...

  • 2 kudos
3 More Replies
costi9992
by New Contributor III
  • 1703 Views
  • 2 replies
  • 0 kudos

Pipeline API documentation issue

 In List Pipelines API documentation, at response is specified that response is  statuses Array of objects The list of events matching the request criteria. next_page_token string If present, a token to fetch the next page of events. But if we retri...

  • 1703 Views
  • 2 replies
  • 0 kudos
Latest Reply
arpit
Databricks Employee
  • 0 kudos

@costi9992 This has been documented now: https://docs.databricks.com/api/workspace/jobs/listruns

  • 0 kudos
1 More Replies
Hal
by New Contributor II
  • 1529 Views
  • 1 replies
  • 3 kudos

Connecting Power BI on Azure to Databricks on AWS?

Can someone share with me the proper way to connect Power BI running on Azure to Databricks running on AWS?

  • 1529 Views
  • 1 replies
  • 3 kudos
Latest Reply
bhanadi
New Contributor II
  • 3 kudos

Have the same question. Do we have to take care of any specific tasks to make it work. Anyone who implemented it?

  • 3 kudos
srjchoubey2
by New Contributor
  • 4434 Views
  • 1 replies
  • 0 kudos

How to import excel files xls/xlsx file into Databricks python notebook?

Method 1: Using "com.crealytics.spark.excel" package, how do I import the package?Method 2: Using pandas I tried the possible paths, but file not found it shows, nor while uploading the xls/xlsx file it shows options for importing the dataframe.Help ...

Data Engineering
excel
import
pyspark
python
  • 4434 Views
  • 1 replies
  • 0 kudos
Latest Reply
vishwanath_1
New Contributor III
  • 0 kudos

import pandas as pd ExcelData = pd.read_excel("/dbfs"+FilePath, sheetName) #  make sure you add /dbfs to FilePath 

  • 0 kudos
AurelioGesino
by New Contributor II
  • 2334 Views
  • 1 replies
  • 0 kudos

Bug in unity catalog registering external database that is case sensitive

I successfully registered in my Unity Catalog an external Database ```dwcore``` that is hosted on SQL server.I first added the connection in "External Data": tested the connection and it was successful.I then added the database on top: tested the con...

  • 2334 Views
  • 1 replies
  • 0 kudos
rbauer
by New Contributor
  • 1188 Views
  • 0 replies
  • 0 kudos

Dask-Databricks init script not working

Hello everybody !  I am trying to use the Dask-Databricks distribution (https://github.com/dask-contrib/dask-databricks?tab=readme-ov-file)i set up the required init-script according to the instructions on the Github page and had no problems there, h...

  • 1188 Views
  • 0 replies
  • 0 kudos
MoJaMa
by Databricks Employee
  • 10317 Views
  • 7 replies
  • 2 kudos
  • 10317 Views
  • 7 replies
  • 2 kudos
Latest Reply
User15848365773
New Contributor II
  • 2 kudos

Hi @amitca71 @atanu .. yes you can associate as many vpcs(workspace deployment fundamental) across regions and aws accounts to one single databricks aws account infact its one of the super powers of databricks platform and you can even track all thei...

  • 2 kudos
6 More Replies
alxsbn
by Contributor
  • 1272 Views
  • 0 replies
  • 0 kudos

SELECT issue after an OPTIMIZE operation

I have a strange issue after an OPTIMIZE, no results are returned anymore.I can time travel over the version easily but passed this data nothing when I'm doing a simple SELECT *.But I still got a result when I'm doing a SELECT count(*).How is this po...

  • 1272 Views
  • 0 replies
  • 0 kudos
Bas1
by New Contributor III
  • 15248 Views
  • 16 replies
  • 20 kudos

Resolved! network security for DBFS storage account

In Azure Databricks the DBFS storage account is open to all networks. Changing that to use a private endpoint or minimizing access to selected networks is not allowed.Is there any way to add network security to this storage account? Alternatively, is...

  • 15248 Views
  • 16 replies
  • 20 kudos
Latest Reply
Odee79
New Contributor II
  • 20 kudos

How can we secure the storage account in the managed resource group which holds the DBFS with restricted network access, since access from all networks is blocked by our Azure storage account policy?

  • 20 kudos
15 More Replies
deltax_07
by New Contributor
  • 1601 Views
  • 0 replies
  • 0 kudos

Parse_Syntax_Error Help

i'm getting this error: Exception in thread "main" org.apache.spark.sql.catalyst.parser.ParseException: [PARSE_SYNTAX_ERROR] Syntax error at or near ','.(line 1, pos 18) == SQL == sum(mp4) AS Videos, sum(csv+xlsx) AS Sheets, sum(docx+txt+pdf) AS Docu...

  • 1601 Views
  • 0 replies
  • 0 kudos

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels