-
so my #Grafana #k6 cloud runs seem to initialize the SharedArray object multiple times and is passing different instances to the remote processes. Inits once locally, and historically I don't remember this being an issue in the cloud.
https://grafana.com/docs/k6/latest/javascript-api/k6-data/sharedarray/
I have support and slack questions open, but I find it odd if i'm the only person experiencing this.
-
For those a little familiar with Cascading, in #java, it was originally designed to run on #ApacheHadoop, and then #ApacheTez, but it also has a local planner.
This lets developers create non-clustered data applications, without the Hadoop/Tez etc dependencies or runtime.
I've been using the local planner in production for over 5 years now.
But Parquet requires Hadoop libraries, and this is ok, there is a shim between the libraries that allow Parquet and S3AFileSystem to be used locally.
-
A little more color on this announcement..
https://fosstodon.org/@cwensel/110549001614086663First, #ApacheParquet removed #Cascading support, so I had to splice the original source into Cascading. But the ParquetScheme didn't honor type information fully. So there is a new TypedParquetScheme that has native support for JSON and Timestamps.
Second, Parquet requires the #ApacheHadoop FileSystem, which means we get the wonderful S3A implementation. But we also get a 331MB jar dependency with the aws bundle.