Essentials of Apache Hive
Apache Hive is an open-source data warehouse built on top of Hadoop for analyzing data at scale. It’s easy to use SQL interface has influenced a decade of transformation and with constant re-invention and compelling features. Apache Hive remains a defacto standard to get started in distributed systems. In this session, we explore the internals of Apache Hive to understand various joins & optimizations that can be done to fine-tune operations