A new addition to the open source Apache Hadoop ecosystem, Apache Kudu completes Hadoop's storage layer to enable fast analytics on fast data.
Currently, a limited-functionality version of Kudu is available as a Beta.Install the Beta Get the Kudu Quickstart VM
Kudu provides a combination of fast inserts/updates and efficient columnar scans to enable multiple real-time analytic workloads across a single storage layer. As a new complement to HDFS and Apache HBase, Kudu gives architects the flexibility to address a wider variety of use cases without exotic workarounds.
Kudu is specifically designed for use cases that require fast analytics on fast (rapidly changing) data. Engineered to take advantage of next-generation hardware and in-memory processing, Kudu lowers query latency significantly for Apache Impala (incubating) and Apache Spark (initially, with other execution engines to come).
Founded by long-time contributors to the Hadoop ecosystem, the Kudu project is released under the Apache 2 license and values community participation as an important ingredient in its long-term success. We appreciate all community contributions to date, and are looking forward to seeing more!