LeanBigData targets at building an ultra-scalable and ultra-efficient integrated big data platform addressing important open issues in big data analytics. Current big data infrastructure scale to large amounts of data and system sizes, however, in a very inefficient way consuming disproportionally high resources per data item processed. Furthermore, the lack of integrated big data management technologies to process streaming events and different workloads over stored data results in the complexity to integrate disparate big data systems and the overhead of copying data across systems. What is more, data analysis cycles to refine queries and identify facts of interest take hours, days, or weeks, whereas business processes demand today shorter cycles. LeanBigData will address these issues by:
-
Delivering ultra-scalable big data management systems: NoSQL key-value data store, a distributed CEP system, and a distributed SQL query engine.
-
Providing an integrated big data platform to avoid the inefficiencies and delays introduced by current ETL-based integration approaches of disparate technologies.
-
Supporting an end-to-end big data analytics solution removing the main sources of delays in data analysis cycles
Learn more about LeanBigData in:
-
the White paper
-
the promotional video: