- 2293 Views
- 0 replies
- 0 kudos
i have an issue when running the below code using the default dbdemos in the advanced preparation , i have reduced the chunk_size and max_batch_size and running the code in a proper compute resources , could anyone help on that please :(spark.readStr...
- 2293 Views
- 0 replies
- 0 kudos
by
hong
• New Contributor II
- 8112 Views
- 4 replies
- 0 kudos
Hello,I have a quick question. If my source code call pysark collect() or any method related to rdd methods, then pytest on my local PC will report the following error. My local machine doesn't have any specific setting for pyspark and I used findspa...
- 8112 Views
- 4 replies
- 0 kudos
Latest Reply
Thank you very much, brockb. Probably I will try it in databricks. Thanks.
3 More Replies
- 6721 Views
- 0 replies
- 0 kudos
I need to use Databricks SQL Statement Execution API w/ Javascript (see example post )For some reason, Curl Works, Python works, but Javascript fails.This works : (curl)______________________________curl --request POST \https://adb-5750xxxxxxx.azured...
- 6721 Views
- 0 replies
- 0 kudos
- 1723 Views
- 2 replies
- 0 kudos
Hi Team,Today (29th May 2024), I began my Databricks assessment exam, but it was abruptly suspended by the proctor without any explanation. This was my first exam and it has been a disappointing experience.I started the exam calmly, but the proctor w...
- 1723 Views
- 2 replies
- 0 kudos
Latest Reply
Thank you, @Cert-Team , for your quick response.I am looking forward to the resolution of my issue.
1 More Replies
- 1276 Views
- 0 replies
- 0 kudos
Hello,I would like to receive some support in creating a Community User Group in Romania, Cluj-Napoca. Our intention as a company is to become a partner and us to create a community: events, meetups so on.Looking for your help.Thank youNicu
- 1276 Views
- 0 replies
- 0 kudos
by
jvk
• New Contributor III
- 3041 Views
- 3 replies
- 1 kudos
I am getting an "INTERNAL_ERROR" on a databricks job submitted through the API. Which says:"Run result unavailable: run failed with error message All access to AWS S3 resource has been disabled"However, when I click on the notebook created by the job...
- 3041 Views
- 3 replies
- 1 kudos
Latest Reply
@Retired_mod In the s3 logs of the run, I am seeing this:24/05/29 06:39:30 WARN FileSystem: Failed to initialize fileystem dbfs:///: java.io.FileNotFoundException: Bucket user-workspace-s3-bucket does not exist24/05/29 06:39:30 ERROR DbfsHadoop3: Fai...
2 More Replies
by
WWoman
• Databricks Partner
- 3474 Views
- 2 replies
- 0 kudos
Hello all, I am running into a permission issue when running a simple MERGE INTO query from a Notebook: " AnalysisException: [INSUFFICIENT_PERMISSIONS] Insufficient privileges: User does not have USE SCHEMA on Schema 'system.query'."I can run the que...
- 3474 Views
- 2 replies
- 0 kudos
Latest Reply
Are you using an all purpose cluster or a SQL warehouse to run this query in the notebook?
1 More Replies
- 5395 Views
- 3 replies
- 0 kudos
Hello Databricks Community,I am currently working on creating a Terraform script to provision clusters in Databricks. However, I've noticed that by default, the clusters created using Terraform have the policy set to "Unrestricted."I would like to co...
- 5395 Views
- 3 replies
- 0 kudos
Latest Reply
The policy id will persist, it is tied to the configuration you have set, even if changing the configuration of a custom policy it will persist with same policy id
2 More Replies
- 3176 Views
- 1 replies
- 2 kudos
Hello fellow community members,In our organization, we have developed, deployed and utilized an API-based MLOps pipeline using Azure DevOps.The CI/CD pipeline has been developed and refined for about 18 months or so, and I have to say that it is pret...
- 3176 Views
- 1 replies
- 2 kudos
Latest Reply
Hello @ManiMar,
In my opinion it's up to you to choose, and you're in the right path by comparing the pros/cons of each approach.
I'd like to highlight that one of the advantages of the Databricks CLI is being able to use Databricks Asset Bundles. I...
by
avrm91
• Databricks Partner
- 1591 Views
- 1 replies
- 0 kudos
Monitor errorAn error occurred while configuring your monitor for this table:Error while creating dashboard for unity-catalog-xxx: com.databricks.api.base.DatabricksServiceException: INTERNAL_ERROR: An internal error occurredPlease delete and recreat...
- 1591 Views
- 1 replies
- 0 kudos
Latest Reply
If you have too many dashboards, there's a chance that the workspace reached the quota.
I recommend you contacting Databricks Support for a more in-depth analysis.
- 4114 Views
- 2 replies
- 0 kudos
Hi,I am fetching data from unity catalog from notebooks using spark.sql(). The query takes just a few seconds - I am actually trying to retrieving 2 rows - but some operations like count() or toPandas() take forever. I wonder why does it take so long...
- 4114 Views
- 2 replies
- 0 kudos
Latest Reply
Hey @jimcast how are you?
You can check the internals and have a good hint of what's happening using the SparkUI. Filter and select the jobs that are taking the longest and check what is being requested on the SQL/Data Frame tab, as well as their pla...
1 More Replies
by
ck_45
• New Contributor II
- 2595 Views
- 2 replies
- 2 kudos
- 2595 Views
- 2 replies
- 2 kudos
Latest Reply
Yes, storage-partitioned joins can be optimized for data skewness. Techniques like adaptive query processing and dynamic repartitioning help distribute the workload evenly across nodes. clipping path service provider By identifying and addressing dat...
1 More Replies
- 3024 Views
- 3 replies
- 0 kudos
Hello Team, I encountered Pathetic experience while attempting my 1st DataBricks certification. Abruptly, Proctor asked me to show my desk, after showing he/she asked multiple times.. wasted my time and then suspended my exam without giving any reaso...
- 3024 Views
- 3 replies
- 0 kudos
Latest Reply
@Kaniz @Cert-Team @Sujitha I have sent multiple emails to the Support team to reschedule my exam with Date, but I have not received any confirmation from them.Please look into this issue and reschedule the exam as soon as possible. This certification...
2 More Replies
- 1166 Views
- 0 replies
- 0 kudos
Hi Team, My Databricks Certified exam got suspended.I was continuously in front of the camera and an alert appeared and then my exam resumed. Then later a support person asked me to show the entire table and entire room, I have showed around the room...
- 1166 Views
- 0 replies
- 0 kudos
by
pshuk
• New Contributor III
- 6723 Views
- 1 replies
- 1 kudos
Hi,I want to run a python code on databricks notebook and return the value to my local machine. Here is the summary:I upload files to volumes on databricks. I generate a md5 for local file. Once the upload is finished, I create a python script with t...
- 6723 Views
- 1 replies
- 1 kudos
Latest Reply
Hello @pshuk,
You could check the below CLI commands:
get-run-output
Get the output for a single run. This is the REST API reference, which relates to the CLI command: https://docs.databricks.com/api/workspace/jobs/getrunoutput
export-run
There's al...