cancel
Showing results for 
Search instead for 
Did you mean: 
Discussions
cancel
Showing results for 
Search instead for 
Did you mean: 

Activity in Discussions

Ameshj
by New Contributor II
  • 667 Views
  • 9 replies
  • 0 kudos

Dbfs init script migration

I need help with migrating from dbfs on databricks to workspace. I am new to databricks and am struggling with what is on the links provided.My workspace.yml also has dbfs hard-coded. Included is a full deployment with great expectations.This was don...

Data Engineering
Azure Databricks
dbfs
Great expectations
python
  • 667 Views
  • 9 replies
  • 0 kudos
Latest Reply
NandiniN
Valued Contributor III
  • 0 kudos

#!/bin/bashcurl https://packages.microsoft.com/keys/microsoft.asc | apt-key add -curl https://packages.microsoft.com/config/ubuntu/$(lsb_release -rs)/prod.list > /etc/apt/sources.list.d/mssql-release.listsudo apt-get updatesudo ACCEPT_EULA=Y apt-get ...

  • 0 kudos
8 More Replies
mayank_gupta
by New Contributor
  • 18 Views
  • 0 replies
  • 0 kudos

Not able to use tool type role with DBRX endpoint

Error code: 404 - {'error_code': 'FEATURE_DISABLED', 'message': 'FEATURE_DISABLED: Tool Calling is not enabled for this workspace'}Receiving this error when trying to complete chat using tools. How to enable Features?I am posting for the first time. ...

Generative AI
DBRX
model endpoint
  • 18 Views
  • 0 replies
  • 0 kudos
Agnieszka_1987
by Visitor
  • 28 Views
  • 1 replies
  • 0 kudos

Missing materials for Data Analysis with Databricks SQL ID: E-089Z3V

Hi,In the training Data Analysis with Databricks SQL, ID: E-089Z3V, there is mentioned some workinstruction, there are links where to downlaod sample .csv files.Where I can find this instructions? 

  • 28 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Agnieszka_1987, Thank you for sharing your concern on Community!   To expedite your request, please list your concerns on our ticketing portal. Our support staff would be able to act faster on the resolution (our standard resolution time is 24-48...

  • 0 kudos
orangepepino
by Visitor
  • 40 Views
  • 1 replies
  • 0 kudos

SFTP connection using private key on Azure Databricks

I need to connect to a server to retrieve some files using spark and a private ssh key. However, to manage the private key safely I need to store it as a secret in Azure Key Vault, which means I don't have the key as a file to pass down in the keyFil...

  • 40 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @orangepepino,  Instead of specifying the keyFilePath, you can pass the private key as a PEM string directly. This approach avoids the need for a physical key file.Since you’re already using Azure Key Vault, consider storing the private key as a s...

  • 0 kudos
Harsh-dataB
by Visitor
  • 33 Views
  • 1 replies
  • 0 kudos

Cluster termination using python script, sending return code 1

i have used a cluster termination logic for terminating a cluster , the issue is , the cluster is not terminating gracefully , returns a return/exit code 1The cluster is completing all the spark jobs, but it goes on long running state, hence i create...

  • 33 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Harsh-dataB, First, review your cluster termination logic. Make sure it accounts for all necessary cleanup tasks and allows sufficient time for Spark jobs to complete.If you’re using custom scripts or logic, ensure that it gracefully handles a...

  • 0 kudos
georgeyjy
by Visitor
  • 54 Views
  • 2 replies
  • 0 kudos

Resolved! Why saving pyspark df always converting string field to number?

  import pandas as pd from pyspark.sql.types import StringType, IntegerType from pyspark.sql.functions import col save_path = os.path.join(base_path, stg_dir, "testCsvEncoding") d = [{"code": "00034321"}, {"code": "55964445226"}] df = pd.Data...

  • 54 Views
  • 2 replies
  • 0 kudos
Latest Reply
daniel_sahal
Esteemed Contributor
  • 0 kudos

@georgeyjy Try opening the CSV as text editor. I bet that Excel is automatically trying to detect the schema of CSV thus it thinks that it's an integer.

  • 0 kudos
1 More Replies
Olly
by New Contributor
  • 211 Views
  • 2 replies
  • 1 kudos

Resolved! DBR14.3 Shared Access cluster delta.DeltaTable.toDF() issues

Having issues with the pyspark DataFrames returned by delta.DeltaTable.toDF(), in what I believe is specific to shared access clusters on DBR14.3. Recently created a near identical workflow with the only major difference being that one of the source ...

  • 211 Views
  • 2 replies
  • 1 kudos
Latest Reply
Olly
New Contributor
  • 1 kudos

That works, as mentioned it is easy to work around. as does replacing df = spark.table("test")df.select(df.col)

  • 1 kudos
1 More Replies
StephanKnox
by New Contributor
  • 36 Views
  • 1 replies
  • 0 kudos

Unit Testing with PyTest in Databricks - ModuleNotFoundError

Dear all,I am following the guide in this article: https://docs.databricks.com/en/notebooks/testing.htmlhowever I am unable to run pytest due to the following error: ImportError while importing test module '/Workspace/Users/deadmanhide@gmail.com/test...

  • 36 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @StephanKnox, Ensure that your directory structure is set up correctly. Based on your description, it should look something like this: Workspace/ ├── run_tests.py ├── test_trans.py └── transform/ ├── operations.py └── __init__.py In both ...

  • 0 kudos
Sambit_S
by New Contributor II
  • 41 Views
  • 1 replies
  • 0 kudos

Databricks Autoloader File Notification Not Working As Expected

Hello Everyone,In my project I am using databricks autoloader to incrementally and efficiently processes new data files as they arrive in cloud storage.I am using file notification mode with event grid and queue service setup in azure storage account...

  • 41 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Sambit_S,  cloudFiles.maxFilesPerTrigger: This option specifies the maximum number of files processed in each micro-batch.By default, it’s set to 10001.When you set it to 50000, you expect it to trigger more files per batch, but you’re observing ...

  • 0 kudos
Devsql
by New Contributor II
  • 46 Views
  • 1 replies
  • 0 kudos

Measure size of all tables in Azure databricks

Hi Team,Currently I am trying to find size of all tables in my Azure databricks, as i am trying to get idea of current data loading trends, so i can plan for data forecast ( i.e. Last 2 months, approx 100 GB data came-in, so in next 2-3 months there ...

  • 46 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Devsql,  For delta tables, you can use Apache Sparkâ„¢ SQL commands.To determine the size of non-delta tables, calculate the total sum of the individual files within the underlying directory. Alternatively, you can use queryExecution.analyzed.stats...

  • 0 kudos
thiagoawstest
by Visitor
  • 59 Views
  • 1 replies
  • 0 kudos

Migration Azure to AWS

Hello, today I use Azure Databricks, I want to migrate my wordspaces to AWS Databricks. What is the best practice, which path should I follow?, I didn't find anything in the documentation.thanks.

  • 59 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @thiagoawstest, For detailed guidance, consider reading Databricks’ blog series on deploying Databricks on AWS1. Additionally, explore the official Databricks documentation on migrating data applications to Databricks. Please let me know if you n...

  • 0 kudos
Monsem
by Visitor
  • 55 Views
  • 1 replies
  • 0 kudos

No Course Materials Widget below Lesson

Hello everyone,In my Databricks partner academy account, there is no course material while it should be under the lesson video. How can I resolve this problem? Does anyone else face the same problem? I had submitted a ticket to ask Databricks team bu...

  • 55 Views
  • 1 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi, @Monsem. I'm sorry about the issue with your Databricks Partner Academy account. Since you've already submitted a ticket without a response, please follow up on your ticket or provide the ticket number. If anyone else has faced this issue and has...

  • 0 kudos
Devsql
by New Contributor II
  • 332 Views
  • 2 replies
  • 0 kudos

How to find that given Parquet file got imported into Bronze Layer ?

Hi Team,Recently we had created new Databricks project/solution (based on Medallion architecture) having Bronze-Silver-Gold Layer based tables. So we have created Delta-Live-Table based pipeline for Bronze-Layer implementation. Source files are Parqu...

Data Engineering
Azure Databricks
Bronze Job
Delta Live Table
Delta Live Table Pipeline
  • 332 Views
  • 2 replies
  • 0 kudos
Latest Reply
raphaelblg
New Contributor III
  • 0 kudos

Hello @Devsql , It appears that you are creating DLT bronze tables using a standard spark.read operation. This may explain why the DLT table doesn't include "new files" during a REFRESH operation. For incremental ingestion of bronze layer data into y...

  • 0 kudos
1 More Replies
Madhawa
by New Contributor II
  • 51 Views
  • 2 replies
  • 0 kudos

Resolved! Unable to access AWS S3 - Error : java.nio.file.AccessDeniedException

Reading file like this "Data = spark.sql("SELECT * FROM edge.inv.rm") Getting this error org.apache.spark.SparkException: Job aborted due to stage failure: Task 10 in stage 441.0 failed 4 times, most recent failure: Lost task 10.3 in stage 441.0 (TID...

  • 51 Views
  • 2 replies
  • 0 kudos
Latest Reply
Kaniz
Community Manager
  • 0 kudos

Hi @Madhawa,  Ensure that the AWS credentials (access key and secret key) are correctly configured in your Spark application. You can set them using spark.conf.set("spark.hadoop.fs.s3a.access.key", "your_access_key") and spark.conf.set("spark.hadoop....

  • 0 kudos
1 More Replies
Sudheer2
by New Contributor II
  • 464 Views
  • 3 replies
  • 1 kudos

Updating SQL Warehouse using Terraform

Manual ApproachWe can Update SQL Warehouse manually in Databricks.Click SQL Warehouses in the sidebarIn Advanced optionsWe can find Unity Catalog toggle button there! While Updating Existing SQL Warehouse in Azure to enable unity catalog using terraf...

warehouse error.png
  • 464 Views
  • 3 replies
  • 1 kudos
Latest Reply
Sudheer2
New Contributor II
  • 1 kudos

Hello Raphael,Thank you for the update and for looking into the feature request. I appreciate your efforts in following up on this matter.If possible, could you please provide me with any updates or insights you receive from the Terraform team regard...

  • 1 kudos
2 More Replies
Top Kudoed Authors