Bring OLAP Back to Big Data!
Apache Kylin™ is an open source, distributed Analytical Data Warehouse for Big Data; it was designed to provide OLAP (Online Analytical Processing) capability in the big data era. By renovating the multi-dimensional cube and precalculation technology on Hadoop and Spark, Kylin is able to achieve near constant query speed regardless of the ever-growing data volume. Reducing query latency from minutes to sub-second, Kylin brings online analytics back to big data.
Apache Kylin™ lets you query billions of rows at sub-second latency in 3 steps.
Apache Kylin™ can also integrate with your favorite BI tools like Tableau and PowerBI etc., to enable BI on Hadoop.
👏👏 Kylin 5 is now released, please visit Kylin 5.0 Home page! Some highlight features are: 👈
Why Apache Kylin?
Timely Decision Making on Big Data
BI on Hadoop Accelerated
ANSI SQL Interface for Big Data on Hadoop
Interactive Queries at High Concurrency
Real-time OLAP for Streaming Big Data
MOLAP Cube Precalculation
- Job Management and Monitoring
- Compression and Encoding Support
- Incremental Refresh of Cubes
- Leverage HBase Coprocessor for query latency
- Both approximate and precise Query Capabilities for Distinct Count
- Approximate Top-N Query Capability
- Easy Web interface to manage, build, monitor and query cubes
- Security capability to set ACL at Project/Table Level
- Support LDAP and SAML Integration
Kylin Ecosystem
Kylin Core:
Fundamental framework of Kylin OLAP Engine comprises of Metadata Engine, Query Engine, Job Engine and Storage Engine to run the entire stack. It also includes a REST Server to service client requests
Extensions:
Plugins to support additional functions and features
Integration:
Lifecycle Management Support to integrate with Job Scheduler, ETL, Monitoring and Alerting Systems
User Interface:
Allows third party users to build customized user-interface atop Kylin core
Drivers:
ODBC and JDBC drivers to support different tools and products, such as Tableau