home.social
  1. so my cloud runs seem to initialize the SharedArray object multiple times and is passing different instances to the remote processes. Inits once locally, and historically I don't remember this being an issue in the cloud.

    grafana.com/docs/k6/latest/jav

    I have support and slack questions open, but I find it odd if i'm the only person experiencing this.

  2. For those a little familiar with Cascading, in , it was originally designed to run on , and then , but it also has a local planner.

    This lets developers create non-clustered data applications, without the Hadoop/Tez etc dependencies or runtime.

    I've been using the local planner in production for over 5 years now.

    But Parquet requires Hadoop libraries, and this is ok, there is a shim between the libraries that allow Parquet and S3AFileSystem to be used locally.

  3. A little more color on this announcement..
    fosstodon.org/@cwensel/1105490

    First, removed support, so I had to splice the original source into Cascading. But the ParquetScheme didn't honor type information fully. So there is a new TypedParquetScheme that has native support for JSON and Timestamps.

    Second, Parquet requires the FileSystem, which means we get the wonderful S3A implementation. But we also get a 331MB jar dependency with the aws bundle.