- 12084 Views
- 1 replies
- 0 kudos
<ItemMaintenance>
<Batch>
<BathInfo>info</BathInfo>
<Item attr1="tekst" attr2="Tekst2">
<ItemId type="Type" id="id"/>
<Dates>
<Start>2023-11-09</Start>
<End>2024-01-02</End>
</Dates>
<MoreData>
More data
</MoreData>
<...
- 11936 Views
- 1 replies
- 0 kudos
Hi everyone, I'm a business analyst currently facing a decision on how best to develop a report. I need to choose between using Databricks or Azure Data Lake Storage (ADLS) as the data source for the transformed data in CSV or Excel file format from d...
Latest Reply
Hey @Learnit I'd be glad to help, thanks for posting your concern. To offer the most effective advice, I might need some additional context about your specific situation, as it looks like your use case is to create reports (dashboards) using local CSV/exc...
by
Nixon
• New Contributor II
- 1120 Views
- 1 replies
- 0 kudos
Hi there, I have a block of code which ran within 20 mins around a month ago. When I came back recently and tried to execute it again, it ran for over 50 minutes and still could not complete (I finally got kicked out). Any advice and hints are appreciated...
Latest Reply
Haha... Should be "unreasonable long"
- 4196 Views
- 3 replies
- 0 kudos
Recently Databricks has started showing "Storing initialization scripts on DBFS is being deprecated. We recommend using a different storage location". Is there a way to keep using DBFS, or do we need to move to ABFSS? I could not find...
Latest Reply
It is suggested that we can use Workspace, but the Databricks CLI does not support importing shell scripts. Can we import shell scripts using the web page?
2 More Replies
- 3906 Views
- 2 replies
- 1 kudos
When you have a job in Workflows with multiple tasks running after one another, there seems to be a consistent 7 seconds delay between execution of the tasks. Or, more precisely, every task has an approximate 7 second overhead before the code actuall...
Latest Reply
JensH
New Contributor III
Hi @bergmaal, I am experiencing the same issue. My Databricks consultant suggested opening a support ticket as this should not be normal behavior. Did you solve this issue yet? We observed these delays do not seem to occur in workflows that use noteboo...
1 More Replies
- 3949 Views
- 1 replies
- 1 kudos
Trying to do a url_decode on a column, which works great in development, but running via DLT fails when trying multiple ways. 1. pyspark.sql.functions.url_decode - This is new as of 3.5.0, but isn't supported using whatever version running a DLT pipel...
Latest Reply
Thanks @Retired_mod, for reference if anybody finds this, the DLT release docs are here: https://docs.databricks.com/en/release-notes/delta-live-tables/index.html This shows which versions are running for CURRENT and PREVIEW channels. In this case, wa...
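For readers on a runtime where `pyspark.sql.functions.url_decode` is not yet available, one possible workaround is a plain-Python decoder built on the standard library, which could then be wrapped in a UDF. This is a minimal sketch and not necessarily what the thread settled on; the helper name `decode_url`, the column names in the comments, and the choice to treat `+` as a space are assumptions.

```python
from typing import Optional
from urllib.parse import unquote_plus


def decode_url(value: Optional[str]) -> Optional[str]:
    """Decode a percent-encoded string; None is passed through unchanged."""
    if value is None:
        return None
    # unquote_plus also turns "+" into a space (an assumption about the data).
    return unquote_plus(value)


# On a cluster the function could be registered as a UDF (not executed here;
# "encoded_col" and "decoded" are hypothetical column names):
#   from pyspark.sql.functions import udf
#   from pyspark.sql.types import StringType
#   decode_url_udf = udf(decode_url, StringType())
#   df = df.withColumn("decoded", decode_url_udf("encoded_col"))
```

A Python UDF is usually slower than a built-in function, so switching back to `url_decode` once the pipeline channel supports it is probably preferable.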
- 14730 Views
- 2 replies
- 0 kudos
Hi Databricks Community, If at all possible, I would like to use a Shared Jobs Cluster with an external orchestrator like Azure Data Factory (ADF) or Synapse Workspace. The main reasons for using a Shared Job cluster are: reduction of start-up time (<1min vs 5 min ...
Latest Reply
Hi Sai Kumar, Many thanks for your response. Unfortunately using analytical clusters is not really an option for me due to cost differences between job clusters and analytical clusters. Job clusters also offer assurance that the latest deployed versi...
1 More Replies
- 13791 Views
- 3 replies
- 1 kudos
Can anyone give me Steps to achieve this.
Latest Reply
Hi, @prasad95 Thank you for sharing your concern here.
In addition to the @Retired_mod comments, you can follow the steps below to capture Change Data (CDC) from DynamoDB Streams and write it into a Delta table in Databricks:
1. Connect to DynamoDB Streams and...
2 More Replies
- 7821 Views
- 3 replies
- 2 kudos
Latest Reply
We are using Azure. I don't see an option for deployment name. Secondly, we have already deployed all our workspaces and want to have user-friendly URLs, for example through changes in the DNS server or proxy URLs.
2 More Replies
by
Kaizen
• Valued Contributor
- 3706 Views
- 2 replies
- 0 kudos
Hi, I am saving some files locally on my cluster and moving them after my job. These are log files of my process so I can't directly reference a DBFS location. However the dbutils.fs.cp command does not work on the shared cluster. This does however wo...
Latest Reply
For reference, when doing this on a single user (personal) cluster, the file is stored in /databricks/driver/logs.txt, which can be accessed and copied to DBFS with the dbutils commands without any issue.
1 More Replies
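The single-user case described in the reply above can be sketched roughly as follows. This is an illustration only: the helper name `copy_driver_file_to_dbfs` and the target directory `dbfs:/tmp/job-logs` are hypothetical, and the sketch does not address the shared-cluster restriction raised in the question. `dbutils` is passed in explicitly so the function can be exercised outside a notebook.

```python
import posixpath


def copy_driver_file_to_dbfs(dbutils, local_path: str, dbfs_dir: str) -> str:
    """Copy a file from the driver's local disk to DBFS and return the target URI."""
    # The "file:" scheme tells dbutils to read from the driver's local filesystem.
    source = "file:" + local_path
    # Keep the original file name under the (hypothetical) target directory.
    target = posixpath.join(dbfs_dir.rstrip("/"), posixpath.basename(local_path))
    dbutils.fs.cp(source, target)
    return target


# In a notebook on a single user cluster (not executed here):
#   copy_driver_file_to_dbfs(dbutils, "/databricks/driver/logs.txt", "dbfs:/tmp/job-logs")
```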
- 10563 Views
- 2 replies
- 1 kudos
Hi, I am trying to install the following four dependency packages in order to install "slu-openGIS/postmastr" directly from GitHub: units, sf, tigris, tidycensus. When attempting to install "units", I received the following configuration error: %r
install.pack...
- 9380 Views
- 5 replies
- 2 kudos
Hi, I am using debezium server to send data from Postgres to a Kafka topic (in fact, Azure EventHub). My question is, what are the best practices and recommendations to save raw data and then implement a medallion architecture? For clarification, I wa...
Latest Reply
Thank you very much @Palash01 ! It has been really helpful!
4 More Replies
- 5489 Views
- 3 replies
- 0 kudos
Flow 'user_silver' has FAILED fatally. An error occurred because we detected an update or delete to one or more rows in the source table. Streaming tables may only use append-only streaming sources. If you expect to delete or update rows to the sourc...
Latest Reply
Hey @rt-slowth Just checking in if the provided solution was helpful to you. If yes, please accept this as a Best Solution so that this thread can be considered closed.
2 More Replies
- 15474 Views
- 5 replies
- 0 kudos
I'm envisioning a live streaming pipeline. The bronze, or data ingestion, is being fetched using the directory listing mode of the autoloader. I'm not using File Notification Mode because I detect about 2-300 data changes per hour. I'm thinking about im...
Latest Reply
Hey @rt-slowth Thank you for sharing the code snippets. The code structure appears to be on the right track, and its dynamic nature is promising. With a few minor adjustments, it should achieve the desired outcome. Also, find the attached code syntax...
4 More Replies
- 12084 Views
- 2 replies
- 0 kudos
Delta creates more small files during merge and update operations.
Latest Reply
Delta addresses the problem of many small files using the operations below, which are available for a Delta table. Optimized writes improve the write operation by adding an additional shuffle step and reducing the number of output files. By defau...
1 More Replies
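As a rough illustration of the small-file discussion above, the sketch below builds two SQL statements: one that enables optimized writes through the `delta.autoOptimize.optimizeWrite` table property, and one that compacts existing files with `OPTIMIZE`. The helper name `small_file_statements` and the table name in the usage comment are hypothetical, and whether either setting is needed depends on the runtime defaults, which the truncated reply does not spell out.

```python
from typing import List


def small_file_statements(table_name: str) -> List[str]:
    """Return SQL statements that target small files in a Delta table."""
    return [
        # Ask Delta to shuffle before writing so that fewer, larger files are produced.
        f"ALTER TABLE {table_name} "
        "SET TBLPROPERTIES ('delta.autoOptimize.optimizeWrite' = 'true')",
        # Compact the small files that already exist in the table.
        f"OPTIMIZE {table_name}",
    ]


# In a notebook (not executed here; the table name is a placeholder):
#   for statement in small_file_statements("main.default.events"):
#       spark.sql(statement)
```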