yesterday - last edited yesterday
Hello,
I have to push data from Airship and persists it to Delta tables. I think We can used SFTP , May someone please help me how to design the inbound part , it using SFTP on Airship end to push file on ADLS Gen2.
networking and security considerations of how this could work.
yesterday
Inbound Flow Design
Enable SFTP on the ADLS Gen2 (or Azure Blob Storage) account;
Generate and register an SSH public/private key pair with Airship, enter your SFTP endpoint credentials (username, host, port, key) in Airship’s settings to authenticate uploads.
Configure Airship to push files (CSV or other supported formats) to the specific SFTP directory in your ADLS Gen2 account, designated for inbound data.
Trigger Azure Data Factory (ADF) or Databricks jobs using storage events (e.g., SFTP commit events, which ensure files are fully uploaded before processing) to ingest these files, transforming them into Delta tables
yesterday
Inbound Flow Design
Enable SFTP on the ADLS Gen2 (or Azure Blob Storage) account;
Generate and register an SSH public/private key pair with Airship, enter your SFTP endpoint credentials (username, host, port, key) in Airship’s settings to authenticate uploads.
Configure Airship to push files (CSV or other supported formats) to the specific SFTP directory in your ADLS Gen2 account, designated for inbound data.
Trigger Azure Data Factory (ADF) or Databricks jobs using storage events (e.g., SFTP commit events, which ensure files are fully uploaded before processing) to ingest these files, transforming them into Delta tables
12 hours ago
@ManojkMohan : what will be the better approach. to have seperate Landing zone and than Bronze layer or only 1 Bronze layer can be treated as landing zone too. ?
11 hours ago
Hi @Datalight ,
If I were you I would add separate landing zone. In our poject landing zone has been extremely valuable. Among other things it lets you separate concerns in terms of extracting data vs loading/processing it. It also allows you to easily reprocess, for all the reasons that can be needed.
8 hours ago
@szymon_dybczak : Thanks a lot.
What do you think, how much effort would be in devops side from bronze to landing first than bronze, if we already deployed the 4 data pipeline use cases.
9 hours ago
Suggested Design
Passionate about hosting events and connecting people? Help us grow a vibrant local community—sign up today to get started!
Sign Up Now