cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Data Engineering
Join discussions on data engineering best practices, architectures, and optimization strategies within the Databricks Community. Exchange insights and solutions with fellow data engineers.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

How to access final delta table for web application or interface.

User16826994223
Honored Contributor III

I have a final layer of the gold delta table, that has final aggregated data from silver data . I want to access this final layer of data through the WEB interface

I think I need to write a web script that would run the spark SQL behind to get the data. and then i Can write the result set data in soem table like mango db and then show in web ui

Is there a known best practice solution?

5 REPLIES 5

User16826994223
Honored Contributor III

The real answer depends on your requirements regarding latency, the number of data located - HDFS/S3/...), etc. Possible approaches are:

  1. Read data directly using the Delta Standalone Reader library for JVM, or via delta-rs library that works with Rust/Python/Ruby

Altay
New Contributor II

Is there any update on this topic?

Thank you

stefanhieslas11
New Contributor II

Hey everyone ๐Ÿ™‚

I totally get the frustration of dealing with these complex data layers, but don't worry, you're in the right place for some guidance! Accessing that final delta table through a web interface can indeed be a bit tricky, but it's not an unsolvable puzzle.

Your plan to use a web script running Spark SQL sounds pretty solid. This way, you can leverage the power of Spark to handle the heavy lifting and get the aggregated data. Storing the results in a database like MongoDB and then displaying it in the web UI is a sensible approach too, as it can help with faster retrieval and a smoother user experience.

However, if you're looking for a best practice solution, you might want to consider a microservices architecture. This could involve building a dedicated service that handles the interaction between your Spark cluster and the web UI. By decoupling these components, you could achieve better scalability and maintainability.

I am really glad that you are asking such relevant and important questions on this forum. Mobile development is still gaining popularity right now, and I would recommend you check out this resource to better understand your problem: Mobile App Development Industry: Explore Top Challenges in 2023. Let me know if this has helped you in any way.

User16826994223, about your question, kudos for your initiative! If you're going the Spark SQL route, remember to optimize your queries for performance, and perhaps consider caching if applicable. Also, explore visualization libraries to make your web interface more user-friendly.

shadowinc
New Contributor III

@stefanhieslas11 Thanks for your input. However, are there other approaches to avoid staging databases like Cosmos or Mongo DB as input to web applications and directly fetch from delta tables while maintaining the read performance? Much appreciated.

h_h_ak
Contributor

You can also use direct statement execution from databricks: https://docs.databricks.com/api/workspace/statementexecution

Connect with Databricks Users in Your Area

Join a Regional User Group to connect with local Databricks users. Events will be happening in your city, and you wonโ€™t want to miss the chance to attend and share knowledge.

If there isnโ€™t a group near you, start one and help create a community that brings people together.

Request a New Group