Every system you run generates a constant stream of signals: traces that show how a request travelled through your service, logs that capture what happened and why, and metrics that measure the overall health. This set of telemetry is among the most valuable operational data your organization produces, yet for most teams it never makes it to the place where they do their best analytical work: their lakehouse.
Today, we're announcing Zerobus Ingest now natively supports the OpenTelemetry Protocol (OTLP), allowing you to bring OpenTelemetry (OTEL) data directly into your lakehouse. Zerobus Ingest OTEL is now available in Beta, which allows you to stream traces, logs, and metrics directly into Unity Catalog Delta tables, using standard OpenTelemetry SDKs and collectors you likely already have in place, with no custom libraries or intermediary pipelines required.
OpenTelemetry is an open source standard for instrumenting applications and collecting telemetry data. It is also vendor and tool agnostic, able to be used with a broad variety of tools. OpenTelemetry defines a common framework and toolkit to facilitate the generation, export, and collection of three types of signals:
OpenTelemetry has become the dominant open source framework for observability. The ecosystem of supported components and integrations is broad, with SDKs for every major language, and a large number of tools, agents, and collectors which already speak the OTLP protocol out-of-the-box.
When you instrument an application with OpenTelemetry, the data typically flows through three layers:
Until now, that backend was almost always a proprietary observability vendor. Zerobus Ingest OTEL is the backend, the endpoint your collectors point to, which will publish data directly in your Delta tables.
Before this integration, getting telemetry data into Databricks required extra steps. The most common patterns were:
All three patterns work, but they share the same underlying problem: operational complexity that has nothing to do with the telemetry itself. You're managing brokers, pipelines, export configurations, and ingestion jobs just to get data into a place where you can finally do something with it.
Zerobus Ingest removes that middle layer entirely. Your collector points directly at the Zerobus Ingest OTEL endpoint, and the data lands in your Delta tables — no broker, no export job, no polling pipeline, no extra monitoring surface.
This might seem like a natural question: observability vendors already provide dashboards, alerting, and querying over this data. So why bring it into the lakehouse?
When telemetry lives in a proprietary observability platform, it lives there on their terms. You're accessing your own data through their interface, subject to their retention policies, their pricing model, and their query language. When that data lands in Unity Catalog, it's in your managed storage. You control it. You can query it with whatever tools you want, keep it as long as you want, and move it if you need to.
Proprietary observability vendors typically store data in high-performance indexed systems optimized for real-time dashboards. That performance comes at a cost. For real-time alerting, that tradeoff makes sense. But for data you want to keep for 6 months or a year — historical analysis, capacity planning, compliance — Delta on object storage is orders of magnitude cheaper. You're paying for bytes in object storage, not seats in a SaaS platform.
Once telemetry is in Delta, it becomes just another table in your lakehouse. That means:
Any OTLP-compatible exporter or collector can send data to Zerobus Ingest. Some common examples:
If your tool speaks OTLP/gRPC, it can send data to Zerobus Ingest.
This feature is in Beta. Here's what you need before you can start sending data:
Point your OTLP exporter at the Zerobus endpoint and include two required headers on every request:
Each signal type is a separate gRPC service path, so you'll configure separate exporters for traces, logs, and metrics — each pointing to its own table.
Note: Currently, only OTLP/gRPC (Protobuf) is supported. HTTP/Protobuf support is on the roadmap.
For complete setup instructions, including Python SDK examples and OpenTelemetry Collector YAML configuration, see the full documentation.
The ability to stream OpenTelemetry data directly into your lakehouse is a small configuration change with a large surface area of impact. Your telemetry is no longer trapped in a proprietary system with a 30-day retention window or locked behind a per-seat bill. It lives in Delta, alongside your business data, queryable by your data teams, trainable by your ML models, and retainable for as long as you need it.
This is now available in Beta, and we're actively developing this feature. If you try it out, we'd love to hear what you build and where you want us to take this project!
Get started: Zerobus Ingest OpenTelemetry documentation
Example: syslog-ng-zerobus
Have questions or feedback? Drop them in the comments below.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.