Treeverse is an Israeli startup that has created the lakeFS open-source technology. Treeverse just completed a $15 million Series A funding round led by Dell Technologies Capital, Norwest Venture Partners, and Zeev Ventures. This brings their total funding to date to $23 Million.
Founded in 2020 by Oz Katz and Einat Orr, the Treeverse lakeFS, open-source technology that brings streamlined data lifecycle management and version control to data lakes. With global enterprise users spanning all verticals, lakeFS allows data engineers, DataOps professionals, machine learning developers, and CTOs to run their applications, manage their data, and increase productivity.
“Eventually, moving towards our vision, we want to support data not only in the lake, but also in the warehouse and even in the DB.”
Amazon explains that a data lake is a centralized repository that allows you to store all your structured and unstructured data at any scale. You can store your data as-is, without having to first structure the data, and run different types of analytics—from dashboards and visualizations to big data processing, real-time analytics, and machine learning to guide better decisions.
DataOps is an automated, process-oriented methodology, used by analytic and data teams, to improve the quality and reduce the cycle time of data analytics. DataOps began as a set of best practices. But today it is its own independent approach to data analytics. DataOps applies to the entire data lifecycle, from its preparation to reporting.
“We created lakeFS to solve the pain and frustration we knew firsthand as data engineers ourselves,” commented Treeverse CEO and co-founder Einat Orr. “We’re most proud of how lakeFS elegantly solves the file system challenge for data lakes of exponential size and how the growing community around it is involved in finding new ways to leverage lakeFS to improve their data workflows. We plan to use this round of funding to perfect the open-source core capabilities and build a SaaS offering over it that allows fast creation of predefined workflows that are essential for managing an ever-growing amount of data within every enterprise.”
To mark the company’s first birthday this week, they released a message which told the story of how Treeverse came to be. But the company also talked about what they wish to do in the future.
The company said that its vision is to allow git-like operations over data, no matter where it is stored. “The data pipeline most organizations manage span over several data stores, including an object storage, an analytics DB and maybe a key value store for OLAP operation,” they explained. “We see lakeFS providing consistency and transactionality by treating data spanning several stores as one synchronized repository that can be branched, committed to, and reverted when needed.”