Apache Kylin™ Overview

Apache Kylin™ is an open source Distributed Analytics Engine designed to provide SQL interface and multi-dimensional analysis (OLAP) on Hadoop supporting extremely large datasets, original contributed from eBay Inc.

Apache Kylin™ lets you query big Hive tables at sub-second latency in 3 simple steps.

  1. Identify a set of Hive tables in star schema.
  2. Build a cube from the Hive tables in an offline batch process.
  3. Query the Hive tables using SQL and get results in sub-seconds, via Rest API, ODBC, or JDBC.

What is Kylin?

- Extremely Fast OLAP Engine at Scale:

Kylin is designed to reduce query latency on Hadoop for 10+ billions of rows of data

- ANSI SQL Interface on Hadoop:

Kylin offers ANSI SQL on Hadoop and supports most ANSI SQL query functions

- Interactive Query Capability:

Users can interact with Hadoop data via Kylin at sub-second latency, better than Hive queries for the same dataset

- MOLAP Cube:

User can define a data model and pre-build in Kylin with more than 10+ billions of raw data records

- Seamless Integration with BI Tools:

Kylin currently offers integration capability with BI Tools like Tableau, PowerBI and Excel. Integration with Microstrategy is coming soon

- Other Highlights:

- Job Management and Monitoring
- Compression and Encoding Support
- Incremental Refresh of Cubes
- Leverage HBase Coprocessor for query latency
- Approximate Query Capability for distinct Count (HyperLogLog)
- Easy Web interface to manage, build, monitor and query cubes
- Security capability to set ACL at Cube/Project Level
- Support LDAP Integration

Kylin Ecosystem

Kylin Core: Fundamental framework of Kylin OLAP Engine comprises of Metadata Engine, Query Engine, Job Engine and Storage Engine to run the entire stack. It also includes a REST Server to service client requests

Extensions: Plugins to support additional functions and features

Integration: Lifecycle Management Support to integrate with Job Scheduler, ETL, Monitoring and Alerting Systems

User Interface: Allows third party users to build customized user-interface atop Kylin core

Drivers: ODBC and JDBC drivers to support different tools and products, such as Tableau