Apache Parquet v1.8.0 Release Notes
Release Date: 2015-07-12 // almost 9 years ago-
๐ Bug
- ๐ PARQUET-151 - Null Pointer exception in parquet.hadoop.ParquetFileWriter.mergeFooters
- ๐ PARQUET-152 - Encoding issue with fixed length byte arrays
- PARQUET-164 - Warn when parquet memory manager kicks in
- PARQUET-199 - Add a callback when the MemoryManager adjusts row group size
- PARQUET-201 - Column with OriginalType INT_8 failed at filtering
- PARQUET-227 - Parquet thrift can write unions that have 0 or more than 1 set value
- PARQUET-246 - ArrayIndexOutOfBoundsException with Parquet write version v2
- PARQUET-251 - Binary column statistics error when reuse byte[] among rows
- ๐ PARQUET-252 - parquet scrooge support should support nested container type
- ๐ป PARQUET-254 - Wrong exception message for unsupported INT96 type
- โช PARQUET-269 - Restore scrooge-maven-plugin to 3.17.0 or greater
- ๐ PARQUET-284 - Should use ConcurrentHashMap instead of HashMap in ParquetMetadataConverter
- PARQUET-285 - Implement nested types write rules in parquet-avro
- PARQUET-287 - Projecting unions in thrift causes TExceptions in deserializatoin
- PARQUET-296 - Set master branch version back to 1.8.0-SNAPSHOT
- PARQUET-297 - created_by in file meta data doesn't contain parquet library version
- PARQUET-314 - Fix broken equals implementation(s)
- PARQUET-316 - Run.sh is broken in parquet-benchmarks
- ๐ PARQUET-317 - writeMetaDataFile crashes when a relative root Path is used
- โช PARQUET-320 - Restore semver checks
- PARQUET-324 - row count incorrect if data file has more than 231 rows
- PARQUET-325 - Do not target row group sizes if padding is set to 0
- PARQUET-329 - ThriftReadSupport#THRIFT_COLUMN_FILTER_KEY was removed (incompatible change)
๐ Improvement
- PARQUET-175 - Allow setting of a custom protobuf class when reading parquet file using parquet-protobuf.
- PARQUET-223 - Add Map and List builiders
- ๐ PARQUET-245 - Travis CI runs tests even if build fails
- PARQUET-248 - Simplify ParquetWriters's constructors
- PARQUET-253 - AvroSchemaConverter has confusing Javadoc
- ๐ท PARQUET-259 - Support Travis CI in parquet-cpp
- โก๏ธ PARQUET-264 - Update README docs for graduation
- ๐ PARQUET-266 - Add support for lists of primitives to Pig schema converter
- โก๏ธ PARQUET-272 - Updates docs decscription to match data model
- โก๏ธ PARQUET-274 - Updates URLs to link against the apache user instead of Parquet on github
- โก๏ธ PARQUET-276 - Updates CONTRIBUTING file with new repo info
- PARQUET-286 - Avro object model should use Utf8
- ๐ PARQUET-288 - Add dictionary support to Avro converters
- ๐ PARQUET-289 - Allow object models to extend the ParquetReader builders
- ๐ PARQUET-290 - Add Avro data model to the reader builder
- PARQUET-306 - Improve alignment between row groups and HDFS blocks
- PARQUET-308 - Add accessor to ParquetWriter to get current data size
- ๐ PARQUET-309 - Remove unnecessary compile dependency on parquet-generator
- 0๏ธโฃ PARQUET-321 - Set the HDFS padding default to 8MB
- PARQUET-327 - Show statistics in the dump output
๐ New Feature
- PARQUET-229 - Make an alternate, stricter thrift column projection API
- ๐ PARQUET-243 - Add avro-reflect support
Task
- ๐ PARQUET-262 - When 1.7.0 is released, restore semver plugin config
- ๐ PARQUET-292 - Release Parquet 1.8.0