Apache Parquet v1.8.0 Release Notes

Release Date: 2015-07-12 // almost 9 years ago
  • ๐Ÿ› Bug

    • ๐Ÿ”€ PARQUET-151 - Null Pointer exception in parquet.hadoop.ParquetFileWriter.mergeFooters
    • ๐Ÿ›  PARQUET-152 - Encoding issue with fixed length byte arrays
    • PARQUET-164 - Warn when parquet memory manager kicks in
    • PARQUET-199 - Add a callback when the MemoryManager adjusts row group size
    • PARQUET-201 - Column with OriginalType INT_8 failed at filtering
    • PARQUET-227 - Parquet thrift can write unions that have 0 or more than 1 set value
    • PARQUET-246 - ArrayIndexOutOfBoundsException with Parquet write version v2
    • PARQUET-251 - Binary column statistics error when reuse byte[] among rows
    • ๐Ÿ‘ PARQUET-252 - parquet scrooge support should support nested container type
    • ๐Ÿ‘ป PARQUET-254 - Wrong exception message for unsupported INT96 type
    • โช PARQUET-269 - Restore scrooge-maven-plugin to 3.17.0 or greater
    • ๐Ÿ“‡ PARQUET-284 - Should use ConcurrentHashMap instead of HashMap in ParquetMetadataConverter
    • PARQUET-285 - Implement nested types write rules in parquet-avro
    • PARQUET-287 - Projecting unions in thrift causes TExceptions in deserializatoin
    • PARQUET-296 - Set master branch version back to 1.8.0-SNAPSHOT
    • PARQUET-297 - created_by in file meta data doesn't contain parquet library version
    • PARQUET-314 - Fix broken equals implementation(s)
    • PARQUET-316 - Run.sh is broken in parquet-benchmarks
    • ๐Ÿ“‡ PARQUET-317 - writeMetaDataFile crashes when a relative root Path is used
    • โช PARQUET-320 - Restore semver checks
    • PARQUET-324 - row count incorrect if data file has more than 231 rows
    • PARQUET-325 - Do not target row group sizes if padding is set to 0
    • PARQUET-329 - ThriftReadSupport#THRIFT_COLUMN_FILTER_KEY was removed (incompatible change)

    ๐Ÿ‘Œ Improvement

    • PARQUET-175 - Allow setting of a custom protobuf class when reading parquet file using parquet-protobuf.
    • PARQUET-223 - Add Map and List builiders
    • ๐Ÿ— PARQUET-245 - Travis CI runs tests even if build fails
    • PARQUET-248 - Simplify ParquetWriters's constructors
    • PARQUET-253 - AvroSchemaConverter has confusing Javadoc
    • ๐Ÿ‘ท PARQUET-259 - Support Travis CI in parquet-cpp
    • โšก๏ธ PARQUET-264 - Update README docs for graduation
    • ๐Ÿ‘ PARQUET-266 - Add support for lists of primitives to Pig schema converter
    • โšก๏ธ PARQUET-272 - Updates docs decscription to match data model
    • โšก๏ธ PARQUET-274 - Updates URLs to link against the apache user instead of Parquet on github
    • โšก๏ธ PARQUET-276 - Updates CONTRIBUTING file with new repo info
    • PARQUET-286 - Avro object model should use Utf8
    • ๐Ÿ‘ PARQUET-288 - Add dictionary support to Avro converters
    • ๐Ÿ— PARQUET-289 - Allow object models to extend the ParquetReader builders
    • ๐Ÿ— PARQUET-290 - Add Avro data model to the reader builder
    • PARQUET-306 - Improve alignment between row groups and HDFS blocks
    • PARQUET-308 - Add accessor to ParquetWriter to get current data size
    • ๐Ÿšš PARQUET-309 - Remove unnecessary compile dependency on parquet-generator
    • 0๏ธโƒฃ PARQUET-321 - Set the HDFS padding default to 8MB
    • PARQUET-327 - Show statistics in the dump output

    ๐Ÿ†• New Feature

    • PARQUET-229 - Make an alternate, stricter thrift column projection API
    • ๐Ÿ‘ PARQUET-243 - Add avro-reflect support

    Task

    • ๐Ÿš€ PARQUET-262 - When 1.7.0 is released, restore semver plugin config
    • ๐Ÿš€ PARQUET-292 - Release Parquet 1.8.0