Apache Parquet is a popular columnar
storage file format used by tools in the Hadoop ecosystem, such as Pig,
Spark, and Hive. The file format is
language-independent and has a binary representation. Parquet is used to
store large data sets efficiently, and its files use the .parquet extension. This
blog post aims to understand ...