Data silos affect businesses across all industries, not just Manufacturing, where IT/OT convergence is a game-changer. To address this, Databricks has built a Unified Analytics Platform that brings together all types of data: structured, semi-structured, and unstructured. This platform enables organizations to break down data barriers and gain a single, unified view of their operations.
At Databricks, we're passionate about empowering data teams to unlock insights from all their data, regardless of where it resides. In this blog, we'll tackle one of the most complex and intimidating types of data out there: SAP. Yes, we're talking about the behemoth of business software, known for its intricate systems and daunting data structures.
As former SAP insiders, we've lived through the challenges of extracting data from SAP systems, and that hands-on experience has given us a deep understanding of the complexities and nuances involved.
That's why we're excited to share our expertise on the recent changes to SAP's data extraction policy and how it impacts third-party extraction tools. And don't worry, we won't make you learn “SAPanese” (read SAP jargon)! Our goal is to break down the technical barriers and make SAP data extraction accessible to everyone.
SAP has recently implemented stricter controls on data extraction from its applications, particularly affecting third-party extraction tools. This shift has been a topic of significant discussion among customers and partners.
SAP Layered Stack Architecture: A High-Level View of Database, Application, and Extraction Components (source: « AWS for SAP » Blog Post)
| System | Connector | Is Impacted | Comments/Notes |
| --- | --- | --- | --- |
| Azure | SAP Table Connector (legacy) | No | Does not use ODP so not impacted by the SAP Note. |
| Azure | ECC Connector | No | Does not use ODP. |
| Azure | BW Open Hub Connector | No | Does not use ODP. |
| Azure | BW MDX Connector | No | Does not use ODP. |
| Azure | HANA Copy Activity Connector | No | Does not use ODP. |
| Azure | SAP CDC Connector | Yes | Uses ODP RFC API, now unpermitted for third-party tools. |
| Azure | ADF OData CDC Connector | No | Workaround introduced in March 2024, uses ODP OData API, less performant. |
| Fivetran | HVR (LDP) | No | Uses database log replication. For HANA database, log replication is not supported by SAP (Note 2971304). |
| Fivetran | SAP ERP on HANA Connector | No | ABAP add-on, does not use ODP. |
| Fivetran | ODP OData Connector | No | Uses ODP OData API. Released in preview, is compliant. |
| Qlik | Qlik Replicate SAP HANA | No | Uses database log replication. For HANA database, log-based replication is not supported by SAP (Note 2971304) and trigger-based replication requires premium HANA licence for ERP. |
| Qlik | Qlik Replicate ODP Connector | Yes | Uses ODP RFC API, for CDS views/BW Objects/SAP Extractors/HANA Views/SLT, now unpermitted for third-party tools. |
| Qlik | Qlik Replicate ODP OData Connector | No | Uses ODP OData API. Planned for Q4 2024, will be compliant. |
The connectors marked as impacted fall under the SAP ODP Support Note because they rely on the ODP RFC API. Since SAP no longer permits third-party tools to use this API, continuing to rely on these connectors poses a high risk for customers. The list above covers the main SAP connectors from these vendors and may not be exhaustive, especially for connectors that are not impacted by the SAP Note. For the most up-to-date and comprehensive information, consult the specific vendors directly.
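To make the rule behind the table concrete, here is a minimal Python sketch of how a team might audit its own connector inventory. It assumes the simple rule stated above: a connector is impacted only if it uses the ODP RFC API. The `Mechanism` and `Connector` names are hypothetical, introduced purely for illustration, and the sample entries are a subset of the table rather than a complete inventory.

```python
from dataclasses import dataclass
from enum import Enum


class Mechanism(Enum):
    """Extraction mechanisms named in the table (illustrative grouping)."""
    ODP_RFC = "ODP RFC API"
    ODP_ODATA = "ODP OData API"
    DB_LOG = "database log replication"
    NO_ODP = "does not use ODP"


@dataclass(frozen=True)
class Connector:
    vendor: str
    name: str
    mechanism: Mechanism

    @property
    def is_impacted(self) -> bool:
        # Assumption taken from the text: only ODP RFC usage is unpermitted.
        return self.mechanism is Mechanism.ODP_RFC


# A subset of the connectors listed in the table above.
CONNECTORS = [
    Connector("Azure", "SAP Table Connector (legacy)", Mechanism.NO_ODP),
    Connector("Azure", "SAP CDC Connector", Mechanism.ODP_RFC),
    Connector("Azure", "ADF OData CDC Connector", Mechanism.ODP_ODATA),
    Connector("Fivetran", "HVR (LDP)", Mechanism.DB_LOG),
    Connector("Fivetran", "ODP OData Connector", Mechanism.ODP_ODATA),
    Connector("Qlik", "Qlik Replicate ODP Connector", Mechanism.ODP_RFC),
    Connector("Qlik", "Qlik Replicate ODP OData Connector", Mechanism.ODP_ODATA),
]

# Names of connectors that would need a migration plan under this rule.
impacted = [c.name for c in CONNECTORS if c.is_impacted]
print(impacted)
```

A real audit would also need to track the caveats in the Comments/Notes column, such as SAP not supporting log-based replication on HANA, which this sketch deliberately leaves out.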
SAP's Datasphere Replication Flow is now the SAP-recommended tool for data extraction, but it has limitations, including costs associated with data egress and a cap on concurrent replication threads, as specified in the Replication Flow Blog Series Part 4 - Sizing. This cap has raised concerns for high-volume customers who need to replicate numerous source objects simultaneously.
Regarding an OData-based connection, some customers have reported that the overall performance of OData and the additional setup required on the source side act as significant barriers to adoption.
The existing partnership between SAP Datasphere and Databricks continues to drive joint exploration of integration enhancements based on customer requests. This collaboration aims to enable seamless data convergence, allowing businesses to leverage SAP's structured data alongside Databricks' analytics and AI capabilities, thereby unlocking greater value from their combined data assets.
SAP's recent policy changes regarding data extraction have significant implications for customers and third-party vendors. By understanding the SAP ODP support note, organizations can assess the risks and opportunities associated with their non-SAP extraction tools and plan accordingly. Organizations relying on third-party extraction tools must adapt to these new restrictions and consider SAP's own solutions for compliance and support. As organizations navigate these changes, Databricks stands ready to help them integrate SAP systems with its Unified Analytics Platform, with best practices provided by its team of experts.
Ultimately, as both SAP and Databricks continue to innovate, businesses can look forward to more efficient and effective ways to leverage their data across platforms, breaking down silos and unlocking valuable insights for informed decision-making.
Feel free to reach out to your Databricks account team to engage with our SAP team, discuss the complexities of integrating your SAP application data, and explore customer use cases for the SAP Lakehouse powered by Databricks.