With Hive and Stinger we are focused on enabling the SQL ecosystem and to do that we’ve put Hive on a clear roadmap to SQL compliance.
That includes adding critical datatypes like character and date types as well as implementing common SQL semantics seen in most databases.
checkcast
Tpch query 1 and query 6.
Before:
1Tb of tpc-hdata compreses to 200Gb of ORC data.
30Tb of tpc-ds data compresses to approx ~6Tb of ORC data.