Databricks has agreed to acquire for over US$100 million Arcion, a Databricks Ventures portfolio company that helps enterprises quickly and reliably replicate data across on-premise, cloud databases and data platforms.
This acquisition will enable Databricks to provide native solutions to ingest data from various databases and SaaS applications into the Databricks Lakehouse Platform.
Data Lakehouse Platforms have emerged as the de facto standard for enterprise data and AI platforms. However, these data platforms are only as valuable as the data in them.
Ingesting data from existing databases and applications remains complicated, fragile, and costly. Troves of important data sit not only in transactional databases such as Oracle, MySQL, and Postgres, but also in SaaS applications such as Salesforce, SAP, and Workday.
This acquisition will enable Databricks to natively provide a scalable, easy-to-use, and cost-effective solution to ingest data from various enterprise data sources.
Building on a scalable change data capture (CDC) engine, Arcion offers connectors for over 20 enterprise databases and data warehouses.
The integration will simplify ingesting such data either continuously or on-demand into the lakehouse, fully integrated with the enterprise security, governance, and compliance capabilities of the Databricks platform.
“To build analytical dashboards, data applications, and AI models, data needs to be replicated from the systems of record like CRM, ERP, and enterprise apps to the Lakehouse,” said Ali Ghodsi, co-Founder and CEO at Databricks.
“Arcion’s highly reliable and easy-to-use solution will enable our customers to make that data available almost instantly for faster and more informed decision-making,” said Ghodsi.
Gary Hagmueller, CEO of Arcion, said Arcion’s real-time, large-scale CDC data pipeline technology extends Databricks’ market-leading ETL solution to include replication of operational data in real-time.