home.social

#apachehadoop — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #apachehadoop, aggregated by home.social.

  1. A little more color on this announcement..
    fosstodon.org/@cwensel/1105490

    First, removed support, so I had to splice the original source into Cascading. But the ParquetScheme didn't honor type information fully. So there is a new TypedParquetScheme that has native support for JSON and Timestamps.

    Second, Parquet requires the FileSystem, which means we get the wonderful S3A implementation. But we also get a 331MB jar dependency with the aws bundle.

  2. Das neue Release der analytischen Datenbank sieht einige Änderungen bei Authentifizierung und Autorisierung vor, darunter die Integration mit Apache Knox.
    Hadoop: Apache Impala 4.0 mit erweitertem Multithreading