Data Skipping
Starting from Kylin 5.0, the value range (minimum and maximum) of every dimension is calculated when a Segment is built. At query time, Segments whose value ranges cannot satisfy the query's filter conditions are pruned, which reduces the number of Segments scanned and improves the performance of some queries.
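The idea can be illustrated with a minimal sketch. This is not Kylin's implementation: the SegmentStats class, the prune_by_range function, and the sample values are hypothetical and exist only to show how a stored minimum and maximum let a query skip Segments that cannot contain matching rows.

```python
from dataclasses import dataclass


@dataclass
class SegmentStats:
    """Hypothetical per-Segment metadata: min/max of a single dimension."""
    name: str
    min_value: int
    max_value: int


def prune_by_range(segments, low, high):
    """Keep only Segments whose [min, max] overlaps the queried [low, high]."""
    return [s for s in segments if s.max_value >= low and s.min_value <= high]


# Illustrative data: three Segments with non-overlapping dimension ranges.
segments = [
    SegmentStats("seg_1", 1, 100),
    SegmentStats("seg_2", 101, 200),
    SegmentStats("seg_3", 201, 300),
]

# A filter such as "dim >= 150 and dim <= 180" only needs to scan seg_2.
kept = prune_by_range(segments, 150, 180)
print([s.name for s in kept])  # ['seg_2']
```

The overlap test is deliberately conservative: a Segment is dropped only when its range proves that no row can match.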
Configuration
This optimization is enabled by default, and under normal circumstances you do not need to pay attention to it. In some extreme cases, it can be disabled at the system level or the project level.
To disable it at the system level, set the following parameter in $KYLIN_HOME/conf/kylin.properties. To disable it at the project level, add the parameter in Setting -> Advanced Settings -> Custom Project Configuration.
kylin.storage.columnar.dimension-range-filter-enabled=false
Known Limitations
- Currently, only query filter conditions that use =, in, >, >=, <, <=, and, or support Segment pruning. Filters that use not or is null are not supported.
- This optimization slightly increases the build time, but the increase is basically negligible compared to the total build time.
- Historical data that has already been built does not use this optimization. To apply it to historical data, refresh the Segment.
- Multi-level partition models are not supported.
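The operator limitation can be sketched as a small evaluator over a filter tree. This is an illustration under stated assumptions, not Kylin's code: the tuple-based filter format and the may_match function are hypothetical, and treating not and is null as "may match" (so they never cause pruning) is one conservative reading of the limitation. Kylin may handle such filters differently, for example by skipping pruning for the whole query.

```python
# Hypothetical filter nodes, written as tuples:
#   ("=", col, v), ("in", col, [v, ...]), (">", col, v), (">=", col, v),
#   ("<", col, v), ("<=", col, v),
#   ("and", left, right), ("or", left, right),
#   ("not", child), ("is null", col)


def may_match(filt, ranges):
    """Return False only when the Segment provably has no matching rows.

    ranges maps a column name to the (min, max) recorded for one Segment.
    """
    op = filt[0]
    if op == "and":
        return may_match(filt[1], ranges) and may_match(filt[2], ranges)
    if op == "or":
        return may_match(filt[1], ranges) or may_match(filt[2], ranges)
    if op in ("not", "is null"):
        return True  # assumption: unsupported operators never prune
    col, value = filt[1], filt[2]
    if col not in ranges:
        return True  # no statistics for this column, so the Segment is kept
    lo, hi = ranges[col]
    if op == "=":
        return lo <= value <= hi
    if op == "in":
        return any(lo <= v <= hi for v in value)
    if op == ">":
        return hi > value
    if op == ">=":
        return hi >= value
    if op == "<":
        return lo < value
    if op == "<=":
        return lo <= value
    return True  # unknown operator: keep the Segment


# Illustrative Segment whose "price" dimension spans 10..50.
ranges = {"price": (10, 50)}
print(may_match(("=", "price", 60), ranges))           # False: can be pruned
print(may_match(("not", ("=", "price", 60)), ranges))  # True: must be scanned
```

Returning True for anything the evaluator cannot reason about keeps pruning safe: a Segment is skipped only when the supported comparisons rule it out.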