Apache Kylin 2.3.2 released: open source distributed analytics engine

Apache Kylin™ is an open source Distributed Analytics Engine designed to provide a SQL interface and multi-dimensional analysis (OLAP) on Hadoop for extremely large datasets. It was originally contributed by eBay Inc.

Apache Kylin™ lets you query massive datasets with sub-second latency in three steps.

  1. Identify a Star Schema on Hadoop.
  2. Build Cube from the identified tables.
  3. Query with ANSI SQL and get results in under a second, via ODBC, JDBC, or the RESTful API.
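
As a rough illustration of step 3, the sketch below prepares a SQL query for Kylin's REST query endpoint (`POST /kylin/api/query`) using only the Python standard library. The host, port, credentials, project name (`learn_kylin`), and table name (`kylin_sales`) are placeholder assumptions for a local test setup, not values taken from this release. The request is only built, not sent, so the code runs without a Kylin server.

```python
import base64
import json
import urllib.request


def build_query_request(sql, project, host="localhost", port=7070,
                        user="ADMIN", password="KYLIN"):
    """Build (but do not send) an HTTP request for Kylin's query endpoint.

    All defaults here are assumptions for a local sandbox; adjust them
    to match your own deployment.
    """
    url = f"http://{host}:{port}/kylin/api/query"
    payload = {
        "sql": sql,
        "project": project,
        "offset": 0,
        "limit": 100,
        "acceptPartial": False,
    }
    # HTTP Basic authentication: base64("user:password")
    token = base64.b64encode(f"{user}:{password}".encode("utf-8")).decode("ascii")
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Basic {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


# Hypothetical project and table names, used only for illustration.
request = build_query_request(
    "SELECT COUNT(*) FROM kylin_sales", project="learn_kylin"
)
# To actually run the query against a live server:
# with urllib.request.urlopen(request) as response:
#     result = json.load(response)
```

Keeping request construction separate from sending makes it easy to inspect the URL, headers, and body before pointing the code at a real cluster.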
WHAT IS KYLIN?

– Extremely Fast OLAP Engine at Scale: 

Kylin is designed to reduce query latency on Hadoop for datasets of more than 10 billion rows

– ANSI SQL Interface on Hadoop: 

Kylin offers ANSI SQL on Hadoop and supports most ANSI SQL query functions

– Interactive Query Capability: 

Users can interact with Hadoop data via Kylin at sub-second latency, faster than Hive queries on the same dataset

– MOLAP Cube:

Users can define a data model and pre-build cubes in Kylin from more than 10 billion raw data records
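
To make the idea of pre-building concrete, here is a toy sketch of MOLAP-style pre-aggregation. It is a conceptual illustration only, not Kylin's implementation: it sums one measure for every combination of dimensions (each combination is often called a cuboid), so that a later GROUP BY becomes a dictionary lookup instead of a scan of the raw rows. The function names, column names, and sample data are all hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def build_cube(rows, dimensions, measure):
    """Pre-aggregate SUM(measure) for every subset of the dimensions.

    Returns a dict mapping frozenset(dimension names) to a dict of
    {tuple of dimension values: summed measure}. Value tuples follow
    the order in which the dimensions were listed.
    """
    cube = {}
    for size in range(len(dimensions) + 1):
        for dims in combinations(dimensions, size):
            totals = defaultdict(float)
            for row in rows:
                key = tuple(row[d] for d in dims)
                totals[key] += row[measure]
            cube[frozenset(dims)] = dict(totals)
    return cube


def query(cube, group_by):
    """Answer SUM(measure) GROUP BY group_by from the pre-built cube."""
    return cube[frozenset(group_by)]


# Hypothetical fact table with two dimensions and one measure.
sales = [
    {"region": "EU", "year": 2017, "price": 10.0},
    {"region": "EU", "year": 2018, "price": 20.0},
    {"region": "US", "year": 2018, "price": 5.0},
    {"region": "US", "year": 2018, "price": 7.5},
]
sales_cube = build_cube(sales, ["region", "year"], "price")
by_region = query(sales_cube, ["region"])
```

With n dimensions this naive version stores 2^n cuboids, which is why real engines need ways to limit which combinations are materialized.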

– Seamless Integration with BI Tools:

Kylin currently integrates with BI tools such as Tableau, Power BI, and Excel. Integration with MicroStrategy is coming soon

– Other Highlights: 

– Job Management and Monitoring
– Compression and Encoding Support
– Incremental Refresh of Cubes
– Leverages HBase Coprocessor to reduce query latency
– Both approximate and precise Query Capabilities for Distinct Count
– Approximate Top-N Query Capability
– Easy Web interface to manage, build, monitor and query cubes
– Security capability to set ACL at Cube/Project Level
– Supports LDAP and SAML Integration
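
For readers unfamiliar with approximate Top-N, the sketch below shows one well-known technique, the Space-Saving algorithm, which tracks heavy hitters in a stream using a fixed number of counters. It is offered as a generic illustration under that assumption and is not taken from Kylin's source code; the function names and the sample stream are hypothetical.

```python
def space_saving(stream, capacity):
    """Approximate item frequencies using at most `capacity` counters.

    Estimated counts never undercount a tracked item, but may
    overcount items that entered by evicting another one.
    """
    counters = {}
    for item in stream:
        if item in counters:
            counters[item] += 1
        elif len(counters) < capacity:
            counters[item] = 1
        else:
            # Evict the item with the smallest count; the newcomer
            # inherits that count plus one.
            victim = min(counters, key=counters.get)
            counters[item] = counters.pop(victim) + 1
    return counters


def top_n(counters, n):
    """Return the n items with the highest estimated counts."""
    return sorted(counters.items(), key=lambda kv: (-kv[1], kv[0]))[:n]


# Hypothetical stream of seller identifiers.
events = ["a"] * 5 + ["b"] * 3 + ["c", "d", "a"]
estimates = space_saving(events, capacity=3)
leaders = top_n(estimates, 2)
```

The trade-off is memory for accuracy: frequent items are ranked reliably, while counts for rare items can be inflated by evictions.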

Apache Kylin v2.3.2 has been released. The changes are listed below.

Improvement

  • [KYLIN-3345] – Use Apache Parent POM 19
  • [KYLIN-3372] – Upgrade jackson-databind version due to security concerns
  • [KYLIN-3415] – Remove “external” module

Bug

  • [KYLIN-3115] – Incompatible RowKeySplitter initialize between build and merge job
  • [KYLIN-3336] – java.lang.NoSuchMethodException: org.apache.kylin.tool.HBaseUsageExtractor.execute([Ljava.lang.String;)
  • [KYLIN-3348] – “missing LastBuildJobID” error when building new cube segment
  • [KYLIN-3352] – Segment pruning bug, e.g. date_col > “max_date+1”
  • [KYLIN-3363] – Wrong partition condition appended in JDBC Source
  • [KYLIN-3388] – Data may become not correct if mappers fail during the redistribute step, “distribute by rand()”
  • [KYLIN-3400] – WipeCache and createCubeDesc causes deadlock
  • [KYLIN-3401] – The current using zip compress tool has an arbitrary file write vulnerability
  • [KYLIN-3404] – Last optimized time detail was not showing after cube optimization

Download

Reference: kylin.apache.org
