cancel
Showing results for 
Search instead for 
Did you mean: 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results for 
Search instead for 
Did you mean: 

Forum Posts

joseroca99
by New Contributor II
  • 2816 Views
  • 6 replies
  • 0 kudos

Resolved! File found with %fs ls but not with spark.read

Code: wikipediaDF = (spark.read  .option("HEADER", True)  .option("inferSchema", True)  .csv("/databricks-datasets/wikipedia-datasets/data-001/pageviews/raw/pageviews_by_second.tsv"))display(bostonDF) Error: Failed to store the result. Try rerunning ...

  • 2816 Views
  • 6 replies
  • 0 kudos
Latest Reply
xx123
New Contributor III
  • 0 kudos

I have the exact same issue. Seems like limiting the the display() method works as a temporary solution, but I wonder if there's any long term one. The idea would be to have the possibility of displaying larger datasets within a notebook. How to achi...

  • 0 kudos
5 More Replies
j_h_robinson
by New Contributor II
  • 1093 Views
  • 1 replies
  • 1 kudos

Resolved! Spreadsheet-Like UI for Databricks

We are currently entering data into Excel and then uploading it into Databricks.  Is there a built-in spreadsheet-like UI within Databricks that can update data directly in Databricks? 

  • 1093 Views
  • 1 replies
  • 1 kudos
Latest Reply
Advika_
Databricks Employee
  • 1 kudos

Hello, @j_h_robinson! Databricks doesn’t have a built-in spreadsheet-like UI for direct data entry or editing. Are you manually uploading the Excel files or using an ODBC driver setup? If you’re doing it manually, you might find this helpful: Connect...

  • 1 kudos
h_h_ak
by Contributor
  • 6437 Views
  • 5 replies
  • 2 kudos

Resolved! Understanding Autoscaling in Databricks: Under What Conditions Does Spark Add a New Worker Node?

I’m currently working with Databricks autoscaling configurations and trying to better understand how Spark decides when to spin up additional worker nodes. My cluster has a minimum of one worker and can scale up to five. I know that tasks are assigne...

  • 6437 Views
  • 5 replies
  • 2 kudos
Latest Reply
filipniziol
Esteemed Contributor
  • 2 kudos

Hi @h_h_ak ,Short Answer:Autoscaling primarily depends on the number of pending tasks.Workspaces on the Premium plan use optimized autoscaling, while those on the Standard plan use standard autoscaling.Long Answer:Databricks autoscaling responds main...

  • 2 kudos
4 More Replies
Yuki
by Contributor
  • 3341 Views
  • 4 replies
  • 5 kudos

Resolved! Why is the ipynb format recommended?

In this document, https://docs.databricks.com/aws/en/notebooks/notebook-format,Jupyter (.ipynb) format is recommended.> Select File from the workspace menu, select Notebook format, and choose the format you want. You can choose either Jupyter (.ipynb...

  • 3341 Views
  • 4 replies
  • 5 kudos
Latest Reply
Nivethan_Venkat
Contributor III
  • 5 kudos

Hi @Yuki,One other risk that we foresee / encountered recently is how the notebooks will look in your pull requests of external repos (Azure Devops or GitHub). It will be very hard for a pull request reviewer to understand on the code / notebook read...

  • 5 kudos
3 More Replies
Phani1
by Databricks MVP
  • 1762 Views
  • 2 replies
  • 0 kudos

Solace to Azure Data Lake Storage

 Hi Team,What is the most effective method for performing data ingestion from Solace to Azure Data Lake Storage (ADLS) utilizing an Azure Databricks notebook? Any recommendations would be greatly appreciated.Regards,Phani 

  • 1762 Views
  • 2 replies
  • 0 kudos
Latest Reply
SravanThotakura
New Contributor II
  • 0 kudos

Here is the sample script to invoke the connectorval struct_stream = spark.readStream.format("solace").option("host", "").option("vpn", "").option("username", "").option("password", "").option("queue", "").option("connectRetries", 3).option("reconnec...

  • 0 kudos
1 More Replies
EricMa
by New Contributor III
  • 9843 Views
  • 20 replies
  • 4 kudos

Mounting Data IOException

Hello,I am currently taking a course from Coursera for data science using SQL. For one of our assignments we need to mount some data by running a script that has been provided to us by the class. When I run the script I receive the following error. I...

IOException.jpg IOException_Classroom-Setup.jpg
  • 9843 Views
  • 20 replies
  • 4 kudos
Latest Reply
raghdafaris
New Contributor II
  • 4 kudos

Hello all, we came up with a solution: to download the data directly instead of mounting it. The community version is limited, and we don't have access to S3 unless we create our own aws account, load the data there, and then mount our account on dat...

  • 4 kudos
19 More Replies
Shravanshibu
by New Contributor III
  • 4670 Views
  • 6 replies
  • 3 kudos

Unable to install a wheel file which is in my volume to a serverless cluster

I am trying to install a wheel file which is in my volume to a serverless cluster, getting the below error@ken@Retired_mod Note: you may need to restart the kernel using %restart_python or dbutils.library.restartPython() to use updated packages. WARN...

  • 4670 Views
  • 6 replies
  • 3 kudos
Latest Reply
tom-ph
New Contributor II
  • 3 kudos

Same issue here. Any solution?

  • 3 kudos
5 More Replies
Sangamswadik
by New Contributor III
  • 2052 Views
  • 4 replies
  • 1 kudos

Resolved! Agents and Inference table errors

Hi, I'm trying to deploy a rag model from GCP databricks. I've added an external gpt4o endpoint and enabled inference table in settings. But when Im trying to deploy agents I'm still getting the inference table not enabled error. (I've registered the...

Sangamswadik_0-1740977753362.png Sangamswadik_1-1740978040054.png
  • 2052 Views
  • 4 replies
  • 1 kudos
Latest Reply
MariuszK
Valued Contributor III
  • 1 kudos

The Model Serving is supported in your region so it can be another problem or limitation.

  • 1 kudos
3 More Replies
Nik_Vanderhoof
by Contributor
  • 1337 Views
  • 2 replies
  • 0 kudos

Resolved! DatabricksWorkflowTaskGroup

Hello,I recently learned about the DatabricksWorkflowTaskGroup operator for Airflow that allows one to run multiple Notebook tasks on a shared job compute cluster from Airflow.Is a similar feature possible to run multiple non-Notebook tasks from Airf...

Get Started Discussions
Airflow
workflow
  • 1337 Views
  • 2 replies
  • 0 kudos
Latest Reply
Nik_Vanderhoof
Contributor
  • 0 kudos

Thank you!

  • 0 kudos
1 More Replies
anil_reddaboina
by New Contributor II
  • 1149 Views
  • 2 replies
  • 0 kudos

Databricks tasks are not skipping if running tasks using Airflow DatabricksworkflowTaskgroup

Currently we are facing a challenge with below use case:The Airflow DAG has 4 tasks (Task1, Task2, Task3 and Task4) and The dependency is like thisTask 1>> Task2 >> Task3 >> Task4 (All tasks are spark-jar task typesIn Airflow DAG for Task2, there is ...

  • 1149 Views
  • 2 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @anil_reddaboina, Databricks allows you to add control flow logic to tasks based on the success, failure, or completion of their dependencies. This can be achieved using the "Run if" dependencies fiel: https://docs.databricks.com/aws/en/jobs/run-i...

  • 0 kudos
1 More Replies
shkelzeen
by New Contributor II
  • 2837 Views
  • 3 replies
  • 1 kudos

Databricks JDBC driver multi query in one request.

Can I run multi query in one command using databricks JDBC driver and would databricks execute one query faster then running multi queries in one script?  

  • 2837 Views
  • 3 replies
  • 1 kudos
Latest Reply
NandiniN
Databricks Employee
  • 1 kudos

Yes, you can run multiple queries in one command using the Databricks JDBC driver.The results will be displayed in separate tables. When you run the multiple queries, they are all still individual queries. Running multiple queries in a script will no...

  • 1 kudos
2 More Replies
Arindam19
by New Contributor II
  • 942 Views
  • 3 replies
  • 0 kudos

Are row filters and column masks supported on foreign catalogs in Azure Databricks Unity Catalog?

In my solution I am planning to bring in an Azure SQL Database to Azure Databricks Unity Catalog as Foreign Catalog. Are table row filters and column masks supported in my scenario ?

  • 942 Views
  • 3 replies
  • 0 kudos
Latest Reply
Alberto_Umana
Databricks Employee
  • 0 kudos

Hi @Arindam19, Yes. Certain operations, including filtering, can be pushed down from Databricks to SQL Server. This is managed by querying the SQL Server directly via a federated connection, allowing SQL Server to handle the filter criteria and retur...

  • 0 kudos
2 More Replies
KaustubhShah
by New Contributor
  • 753 Views
  • 1 replies
  • 0 kudos

GCP Databricks Spark Connector for Cassandra - Error: com.typesafe.config.impl.ConfigImpl.newSimple

Hello,I am using Databricks runtime 12.2 with the spark connector - com.datastax.spark:spark-cassandra-connector_2.12:3.3.0as runtime 12.2 comes with spark 3.3.2 and scala 2.12. I encounter an issue with conneciting to cassandra DB using the below co...

  • 753 Views
  • 1 replies
  • 0 kudos
Latest Reply
cgrant
Databricks Employee
  • 0 kudos

Try using the assembly version of the jar with 12.2.  https://mvnrepository.com/artifact/com.datastax.spark/spark-cassandra-connector-assembly  If this doesn't work, please paste the full, original stacktrace

  • 0 kudos
mrstevegross
by Contributor III
  • 2973 Views
  • 6 replies
  • 0 kudos

Resolved! Is it possible to obtain a job's event log via the REST API?

Currently, to investigate job performance, I can look at a job's information (via the UI) to see the "Event Log" (pictured below):I'd like to obtain this information programmatically, so I can analyze it across jobs. However, the docs for the `get` c...

mrstevegross_0-1736967992555.png
  • 2973 Views
  • 6 replies
  • 0 kudos
Latest Reply
mrstevegross
Contributor III
  • 0 kudos

I also see there is a "list cluster events" API (https://docs.databricks.com/api/workspace/clusters/events); can I get the event log this way?

  • 0 kudos
5 More Replies
crowley
by New Contributor III
  • 4786 Views
  • 2 replies
  • 1 kudos

Resolved! How are Struct type columns stored/accessed (interested in efficiency)?

Hello, I've searched around for awhile and didn't find a similar question here or elsewhere, so thought I'd ask...I'm assessing the storage/access efficiency of Struct type columns in delta tables.  I want to know more about how Databricks is storing...

  • 4786 Views
  • 2 replies
  • 1 kudos
Latest Reply
crowley
New Contributor III
  • 1 kudos

Thank you very much for the thoughful response.  Please excuse my belated feedback and thanks!

  • 1 kudos
1 More Replies

Join Us as a Local Community Builder!

Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!

Sign Up Now
Labels