cancel
Showing results for 
Search instead for 
Did you mean: 
Community Platform Discussions
Connect with fellow community members to discuss general topics related to the Databricks platform, industry trends, and best practices. Share experiences, ask questions, and foster collaboration within the community.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

AlexG
by New Contributor III
  • 2647 Views
  • 5 replies
  • 1 kudos

Query results in csv file include 'null' string for blank cell

After running a sql script, when downloading the results to a csv file, the file includes a null string for blank cells (see screenshot). Is ther a setting I can change to simply get empty cells instead? 

AlexG_1-1702927614092.png
  • 2647 Views
  • 5 replies
  • 1 kudos
Latest Reply
NandiniN
Databricks Employee
  • 1 kudos

I understand, however this is more on CSV file format.  Save your data in Delta format instead of CSV or text-based formats. Delta tables handle empty strings and NULL values more effectively, ensuring that empty strings are preserved during data ins...

  • 1 kudos
4 More Replies
Sudheer2
by New Contributor III
  • 381 Views
  • 0 replies
  • 0 kudos

How to Fetch Azure OpenAI api_version and engine Dynamically After Resource Creation via Python?

Hello,I am using Python to automate the creation of Azure OpenAI resources via the Azure Management API. I am successfully able to create the resource, but I need to dynamically fetch the following details after the resource is created:API Version (a...

  • 381 Views
  • 0 replies
  • 0 kudos
mrstevegross
by New Contributor III
  • 424 Views
  • 4 replies
  • 0 kudos

Resolved! Is it possible to obtain a job's event log via the REST API?

Currently, to investigate job performance, I can look at a job's information (via the UI) to see the "Event Log" (pictured below):I'd like to obtain this information programmatically, so I can analyze it across jobs. However, the docs for the `get` c...

mrstevegross_0-1736967992555.png
  • 424 Views
  • 4 replies
  • 0 kudos
Latest Reply
mrstevegross
New Contributor III
  • 0 kudos

Sure, I want to assess the overall performance of our Spark jobs, particularly the time between "CREATING" and "RUNNING". It's very time-consuming to gather this data manually via the UI; if there is a way to get it programmatically that would be gre...

  • 0 kudos
3 More Replies
Avvar2022
by Contributor
  • 4636 Views
  • 8 replies
  • 3 kudos

Unity catalog enabled workspace -Is there any way to disable workflow/job creation for certain users

Currently in unity catalog enabled workspace users with "Workspace access" can create workflows/jobs, there is no access control available to restrict users from creating jobs/workflows.Use case: In production there is no need for users, data enginee...

  • 4636 Views
  • 8 replies
  • 3 kudos
Latest Reply
Avvar2022
Contributor
  • 3 kudos

@Lakshay Databricks offers a robust platform with a variety of features, including data ingestion, engineering, science, dashboards, and applications. However, I believe that some features, such as workflow/job creation, alerts, dashboards, and Genie...

  • 3 kudos
7 More Replies
SamGreene
by Contributor II
  • 1700 Views
  • 3 replies
  • 0 kudos

String to date conversion errors

Hi,I am getting data from CDC on SQL Server using Informatica which is writing parquet files to ADLS.  I read the parquet files using DLT and end up with the date data as a string such as this'20240603164746563' I couldn't get this to convert using m...

  • 1700 Views
  • 3 replies
  • 0 kudos
Latest Reply
SamGreene
Contributor II
  • 0 kudos

Checking on my current code, this is what I am using, which works for me because we don't use daylight savings time.  from_utc_timestamp(date_time_utc, 'UTC-7') as date_time_local

  • 0 kudos
2 More Replies
GeKo
by New Contributor III
  • 12276 Views
  • 5 replies
  • 0 kudos

Insufficient privileges:User does not have permission SELECT on any file

Hello,after switching to "shared cluster" usage a python job is failing with error message:  Py4JJavaError: An error occurred while calling o877.load. : org.apache.spark.SparkSecurityException: [INSUFFICIENT_PERMISSIONS] Insufficient privileges: User...

Community Platform Discussions
permissions
privileges
python
  • 12276 Views
  • 5 replies
  • 0 kudos
Latest Reply
Uj337
New Contributor III
  • 0 kudos

Hi @GeKo The checkpoint directory, is that set on cluster level or how do we set that ? Can you please help me with this ?

  • 0 kudos
4 More Replies
RobsonNLPT
by Contributor III
  • 1090 Views
  • 1 replies
  • 0 kudos

Databricks UC Data Lineage Official Limitations

Hi all.I have a huge data migration project using medallion architecture,  UC, notebooks and workflows . One of the relevant requirements we have is to capture all data dependencies (upstreams and downstreams) using data lineage. I've followed all re...

  • 1090 Views
  • 1 replies
  • 0 kudos
Latest Reply
MathieuDB
Databricks Employee
  • 0 kudos

Hello @RobsonNLPT , Yes SQL CTE are supported by the data lineage service. You can track table that were created using CTEs. Here is an example that demonstrate the feature. CREATE TABLE IF NOT EXISTS mpelletier.dbdemos.menu ( recipe_id INT, ...

  • 0 kudos
OlehSemeniuk
by New Contributor II
  • 331 Views
  • 3 replies
  • 1 kudos

Resolved! Ingesting and Transforming NetCDF Data in Delta Table on Databricks Cluster

Hi,I need to ingest and transform historical climate data into a Delta table. The data is stored in .nc format (NetCDF). To work with this format, specific C libraries for Python are required, along with particular versions of Python libraries (e.g.,...

  • 331 Views
  • 3 replies
  • 1 kudos
Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

Great, please let us know in case any assistance is needed

  • 1 kudos
2 More Replies
Brianhourigan
by New Contributor II
  • 613 Views
  • 5 replies
  • 0 kudos

Service Principal Access to Users Directory in Databricks - Creating Git Folders

I am trying to automate the creation of git folders in user workspace directories triggered by GitHub feature branch creation. When developers create feature branches in GitHub, we want a service principal to automatically create corresponding git fo...

  • 613 Views
  • 5 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @Brianhourigan, Can you please DIM your suggestions? I can add it to our internal AHA idea.

  • 0 kudos
4 More Replies
iptkrisna
by New Contributor III
  • 639 Views
  • 5 replies
  • 0 kudos

Restore deleted databricks jobs and job runs

Hi All,Is there a way to restore deleted databricks jobs?Thank you.

Community Platform Discussions
Databricks
job-runs
Workflows
  • 639 Views
  • 5 replies
  • 0 kudos
Latest Reply
hari-prasad
Valued Contributor II
  • 0 kudos

Hi @iptkrisna ,Currently, there is no option to recover deleted items. In architectures, it not necessary to control or manage the final code available in the system. Instead, the focus should be controlling and managing how code and jobs are deploye...

  • 0 kudos
4 More Replies
mrstevegross
by New Contributor III
  • 476 Views
  • 7 replies
  • 0 kudos

Resolved! Tutorial docs for running a job using serverless?

I'm exploring whether serverless (https://docs.databricks.com/en/jobs/run-serverless-jobs.html#create-a-job-using-serverless-compute) could be useful for our use case. I'd like to see an example of using serverless via the API. The docs say "To learn...

  • 476 Views
  • 7 replies
  • 0 kudos
Latest Reply
mrstevegross
New Contributor III
  • 0 kudos

Thanks!

  • 0 kudos
6 More Replies
mrstevegross
by New Contributor III
  • 560 Views
  • 6 replies
  • 0 kudos

preloaded_docker_images: how do they work?

At my org, when we start a databricks cluster, it oftens takes awhile to become available (due to (1) instance provisioning, (2) library loading, and (3) init script execution). I'm exploring whether an instance pool could be a viable strategy for im...

  • 560 Views
  • 6 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Sure, I will inform the team in charge of it to review it.

  • 0 kudos
5 More Replies
aonurdemir
by New Contributor II
  • 283 Views
  • 1 replies
  • 1 kudos

Resolved! Is there a cluster option for dashboards?

Hi everyone,I do not want to use 4 DBU/h XS warehouse since I have very tiny data on the new startup. I want to create a minimal cluster and run it as the underlying SQL engine for my dashboard.Thanks.

  • 283 Views
  • 1 replies
  • 1 kudos
Latest Reply
Walter_C
Databricks Employee
  • 1 kudos

Unfortunately no, as dashboards are part of the SQL service on the platform they are designed to work with SQL warehouses only, you can create Notebook dashboards that will be able to work with regular clusters but functionalities will be limited in ...

  • 1 kudos
h2p5cq8
by New Contributor III
  • 457 Views
  • 5 replies
  • 1 kudos

Resolved! Databricks workflow with sequenced tasks

I have a continuous workflow. It is continuous because I would like it to run every minute and if it has stuff to do the first task will take several minutes. As I understand, continuous workflows won't requeue while a job is currently running, where...

  • 457 Views
  • 5 replies
  • 1 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 1 kudos

Hi @h2p5cq8, No problem! and you can have the queue option disabled to stop it. Go to the Advanced settings in the Job details side panel and toggle off the Queue option to prevent jobs from being queued

  • 1 kudos
4 More Replies

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you won’t want to miss the chance to attend and share knowledge.

If there isn’t a group near you, start one and help create a community that brings people together.

Request a New Group
Top Kudoed Authors