Introduction: In our previous blog post, we compared Delta 1.2.0, Iceberg 0.13.1 and Hudi 0.11.1 and published our findings, only to find out that Onehouse saw...
Welcome to DataBeans
Simplify your data pipelines through simple reusable components
What We Do
Sparser
As a general computing engine, Spark can process data from various data management/storage systems. For flexibility and high throughput, Spark defines the Data Source API, which is an abstraction of the storage layer. Based on this API we have created a Data Source for ASN.1 that gives you the ability to parse multiple encoded files.
DataBlocks
DataBlocks is a data integration platform for data lakes, built for the modern enterprise. It provides features for current and evolving data engineering needs that go beyond traditional ETL. Our graphical editor for Spark lets visual drag-and-drop users and Spark or SQL developers work in the same environment and produce efficient data pipelines.
Our Beans
Sparser API V1 parses and queries ASN.1-encoded data (BER/DER) with Apache Spark, for Spark SQL and DataFrames.
DataBlocks is a big data integration platform for data lakes.
Use DataBlocks to perform big data integration and transformation without writing or maintaining external code.
Our Expertise
Recent Posts
Delta Lake: The Data Engineer’s missing piece
Open sourced in April 2019, Delta Lake is a Databricks project that brings reliability, performance and lifecycle management to data...
Z-ordering: take the Guesswork out (part 1)
“With great power comes great responsibility” - Spider-Man. Introduction: The world generates 2.5 quintillion bytes per day. That’s 2,500 petabytes! So in line with...