cancel
Showing results forย 
Search instead forย 
Did you mean:ย 
Get Started Discussions
Start your journey with Databricks by joining discussions on getting started guides, tutorials, and introductory topics. Connect with beginners and experts alike to kickstart your Databricks experience.
cancel
Showing results forย 
Search instead forย 
Did you mean:ย 

Using Patient Data Analytics for DIEP Flap Breast Reconstruction Outcomes

Rixcyshah
New Contributor

Hi everyone, Iโ€™ve been exploring how patient data can be used to improve outcomes in DIEP flap breast reconstruction procedures. While reading case studies and resources like txdiepflap.com, it seems thereโ€™s a lot of valuable insight that could benefit from large-scale data analysis. With tools like Apache Spark and Databricks, it may be possible to analyze patient recovery timelines, complication rates, and surgical variables more efficiently. Combining structured healthcare data with predictive modeling could help improve decision-making and personalize treatment approaches. Has anyone here worked with healthcare datasets or similar use cases on Databricks? Iโ€™d also be interested in best practices for handling sensitive patient data securely.

1 REPLY 1

Ashwin_DSA
Databricks Employee
Databricks Employee

Hi @Rixcyshah,

Iโ€™ve worked with a healthcare customer on a screening programme where the goal was to identify people eligible for different cancer screening pathways based on their demographic and clinical information. Happy to share some of that experience.

On the data privacy side, "best practice" can vary quite a bit by organisation and jurisdiction, and is usually driven as much by internal governance as by regulation. In my experience...

  1. For analytics and modelling use cases, customers almost always work with anonymised / deโ€‘identified data wherever possible. Many actively avoid pseudonymised data because there is still a realistic reโ€‘identification risk if someone has access to the key or to another linkable dataset.
  2. For operational or clinical workflows where personal identifiable information is unavoidable, access to demographics, clinical, and other sensitive attributes is typically very tightly controlled with strong audit trails, leastโ€‘privilege, roleโ€‘based and sometimes attributeโ€‘based access controls, and clear segregation between operational and analytical environments.
  3. Data modelling matters a lot too... Using established healthcare data models for operational vs. analytical use cases helps separate identifiers from clinical content and makes it easier to expose the minimum data required for each workload. I've worked with OMOP and FHIR data models. 

Because of this, Iโ€™m not sure there is a single "thumb rule" that fits everyone. Each provider usually has their own governance processes and often goes beyond the minimum legal requirements to stay on the safe side.

If you can share a bit more about what you mean by "best practices" (e.g., deโ€‘identification techniques, platform controls, crossโ€‘border data movement, clinical vs. research use, etc.), Iโ€™m happy to map those requirements to concrete patterns and controls we typically see implemented on Databricks.

If this answer resolves your question, could you mark it as โ€œAccept as Solutionโ€? That helps other users quickly find the correct fix.

Regards,
Ashwin | Delivery Solution Architect @ Databricks
Helping you build and scale the Data Intelligence Platform.
***Opinions are my own***