home.social

#apachebeam — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #apachebeam, aggregated by home.social.

  1. @mrudokas @didier And that's why I avoid #Java. Even #Kotlin and #Scala already has way less boilerplate (but I still dislike it). Part of the problem is the language, other parts is the libraries - and the alternative #JVM languages only change one part of the equation.

    Now you know why I prefer even #C++ over those. In C++ there's already way less boilerplate, and thus also way less efficiency of such tools. In fact I only ever get useful matches at all when writing code for a batch processing framework similar to #ApacheBeam.

    Which has a REALLY nice #Go interface where stateless parallel functions can just be Go functions. No class/struct/whatever around it, no extra methods, no weird context objects, just pure functions from type A to type B. That's how I want things to be.

    #Haskell also is a very good example, and in fact one of the languages with least amount of boilerplate I know.

    Otherwise, the point is "my team does not have the power to change the existing ecosystem". We have projects to deliver, can't spend too much resources in creating better APIs to existing systems. We can do incremental improvements though - in C++ that is possible, in Java it is a lost cause.

  2. I'm not feeling the warm fuzzies about trying to do GCP Dataflow (aka Apache Beam) with Typescript. Doesn't feel fully baked.

    Also doesn't feel like a lot of doc/examples nor many people using it. I don't think our org should be on the bleeding edge of something like this for a core piece of our infrastructure.

    Dunno. Will keep working on it for a bit.

    #GCP #GoogleCloud #Dataflow #ApacheBeam

  3. I'm not feeling the warm fuzzies about trying to do GCP Dataflow (aka Apache Beam) with Typescript. Doesn't feel fully baked.

    Also doesn't feel like a lot of doc/examples nor many people using it. I don't think our org should be on the bleeding edge of something like this for a core piece of our infrastructure.

    Dunno. Will keep working on it for a bit.

    #GCP #GoogleCloud #Dataflow #ApacheBeam

  4. I'm not feeling the warm fuzzies about trying to do GCP Dataflow (aka Apache Beam) with Typescript. Doesn't feel fully baked.

    Also doesn't feel like a lot of doc/examples nor many people using it. I don't think our org should be on the bleeding edge of something like this for a core piece of our infrastructure.

    Dunno. Will keep working on it for a bit.

  5. I'm not feeling the warm fuzzies about trying to do GCP Dataflow (aka Apache Beam) with Typescript. Doesn't feel fully baked.

    Also doesn't feel like a lot of doc/examples nor many people using it. I don't think our org should be on the bleeding edge of something like this for a core piece of our infrastructure.

    Dunno. Will keep working on it for a bit.

    #GCP #GoogleCloud #Dataflow #ApacheBeam

  6. I'm not feeling the warm fuzzies about trying to do GCP Dataflow (aka Apache Beam) with Typescript. Doesn't feel fully baked.

    Also doesn't feel like a lot of doc/examples nor many people using it. I don't think our org should be on the bleeding edge of something like this for a core piece of our infrastructure.

    Dunno. Will keep working on it for a bit.

    #GCP #GoogleCloud #Dataflow #ApacheBeam

  7. #CaseStudy - Discover how #Yelp reworked its data streaming architecture with #ApacheBeam & #ApacheFlink!

    The company replaced a fragmented set of data pipelines for streaming transactional data into its analytical systems, like Amazon Redshift and in-house data lake, using Apache data streaming projects to create a unified and flexible solution.

    Dive into the details: bit.ly/3WgkTL7

    #InfoQ #SoftwareArchitecture #EventDrivenArchitecture #DataPipelines #Streaming

  8. #CaseStudy - Discover how #Yelp reworked its data streaming architecture with #ApacheBeam & #ApacheFlink!

    The company replaced a fragmented set of data pipelines for streaming transactional data into its analytical systems, like Amazon Redshift and in-house data lake, using Apache data streaming projects to create a unified and flexible solution.

    Dive into the details: bit.ly/3WgkTL7

    #InfoQ #SoftwareArchitecture #EventDrivenArchitecture #DataPipelines #Streaming

  9. #CaseStudy - Discover how #Yelp reworked its data streaming architecture with #ApacheBeam & #ApacheFlink!

    The company replaced a fragmented set of data pipelines for streaming transactional data into its analytical systems, like Amazon Redshift and in-house data lake, using Apache data streaming projects to create a unified and flexible solution.

    Dive into the details: bit.ly/3WgkTL7

    #InfoQ #SoftwareArchitecture #EventDrivenArchitecture #DataPipelines #Streaming

  10. #CaseStudy - Discover how #Yelp reworked its data streaming architecture with #ApacheBeam & #ApacheFlink!

    The company replaced a fragmented set of data pipelines for streaming transactional data into its analytical systems, like Amazon Redshift and in-house data lake, using Apache data streaming projects to create a unified and flexible solution.

    Dive into the details: bit.ly/3WgkTL7

    #InfoQ #SoftwareArchitecture #EventDrivenArchitecture #DataPipelines #Streaming

  11. - Discover how reworked its data streaming architecture with & !

    The company replaced a fragmented set of data pipelines for streaming transactional data into its analytical systems, like Amazon Redshift and in-house data lake, using Apache data streaming projects to create a unified and flexible solution.

    Dive into the details: bit.ly/3WgkTL7

  12. Good morning ☕☀️

    I am #recruiting a Data Engineer into my team.

    We're building pipelines in #trivago that analyse, verify and normalise the content coming from partners, to deliver them clean to the rest of the company.

    I am looking for a pragmatic engineer engaged for quality and stability, that takes the challenge to process big volumes of data. We use ATM #Python and #ApacheBeam in #GCP, but you don't need them to apply if you are using other data tooling.

    We offer a competitive salary, a constant challenge, a very enthusiastic team and an authentic atmosphere of multicultural colleagues.

    We're based in #Düsseldorf, and we work in english in an hybrid scheme of 2 days #homeOffice / 3 days in person. Also unlimited vacation days, 20 days per year fully remote, kitchen, coffee, daily fruits...

    Apply now!
    careers.trivago.com/job/r71930

  13. Good morning ☕☀️

    I am #recruiting a Data Engineer into my team.

    We're building pipelines in #trivago that analyse, verify and normalise the content coming from partners, to deliver them clean to the rest of the company.

    I am looking for a pragmatic engineer engaged for quality and stability, that takes the challenge to process big volumes of data. We use ATM #Python and #ApacheBeam in #GCP, but you don't need them to apply if you are using other data tooling.

    We offer a competitive salary, a constant challenge, a very enthusiastic team and an authentic atmosphere of multicultural colleagues.

    We're based in #Düsseldorf, and we work in english in an hybrid scheme of 2 days #homeOffice / 3 days in person. Also unlimited vacation days, 20 days per year fully remote, kitchen, coffee, daily fruits...

    Apply now!
    careers.trivago.com/job/r71930

  14. Good morning ☕☀️

    I am #recruiting a Data Engineer into my team.

    We're building pipelines in #trivago that analyse, verify and normalise the content coming from partners, to deliver them clean to the rest of the company.

    I am looking for a pragmatic engineer engaged for quality and stability, that takes the challenge to process big volumes of data. We use ATM #Python and #ApacheBeam in #GCP, but you don't need them to apply if you are using other data tooling.

    We offer a competitive salary, a constant challenge, a very enthusiastic team and an authentic atmosphere of multicultural colleagues.

    We're based in #Düsseldorf, and we work in english in an hybrid scheme of 2 days #homeOffice / 3 days in person. Also unlimited vacation days, 20 days per year fully remote, kitchen, coffee, daily fruits...

    Apply now!
    careers.trivago.com/job/r71930

  15. Good morning ☕☀️

    I am #recruiting a Data Engineer into my team.

    We're building pipelines in #trivago that analyse, verify and normalise the content coming from partners, to deliver them clean to the rest of the company.

    I am looking for a pragmatic engineer engaged for quality and stability, that takes the challenge to process big volumes of data. We use ATM #Python and #ApacheBeam in #GCP, but you don't need them to apply if you are using other data tooling.

    We offer a competitive salary, a constant challenge, a very enthusiastic team and an authentic atmosphere of multicultural colleagues.

    We're based in #Düsseldorf, and we work in english in an hybrid scheme of 2 days #homeOffice / 3 days in person. Also unlimited vacation days, 20 days per year fully remote, kitchen, coffee, daily fruits...

    Apply now!
    careers.trivago.com/job/r71930

  16. Good morning ☕☀️

    I am #recruiting a Data Engineer into my team.

    We're building pipelines in #trivago that analyse, verify and normalise the content coming from partners, to deliver them clean to the rest of the company.

    I am looking for a pragmatic engineer engaged for quality and stability, that takes the challenge to process big volumes of data. We use ATM #Python and #ApacheBeam in #GCP, but you don't need them to apply if you are using other data tooling.

    We offer a competitive salary, a constant challenge, a very enthusiastic team and an authentic atmosphere of multicultural colleagues.

    We're based in #Düsseldorf, and we work in english in an hybrid scheme of 2 days #homeOffice / 3 days in person. Also unlimited vacation days, 20 days per year fully remote, kitchen, coffee, daily fruits...

    Apply now!
    careers.trivago.com/job/r71930

  17. Quiet before the storm – the "backstage view". 5 minutes later the room was packed! 🤩

    Just done presenting on data pipelines, #apachebeam, #OSS, #Java & #Cloud @ Devfest Bucharest, on the Cloud stage. 💾

    Thanks to the awesome GDGBucharest team for the organization. 👏

  18. Quiet before the storm – the "backstage view". 5 minutes later the room was packed! 🤩

    Just done presenting on data pipelines, #apachebeam, #OSS, #Java & #Cloud @ Devfest Bucharest, on the Cloud stage. 💾

    Thanks to the awesome GDGBucharest team for the organization. 👏

  19. Quiet before the storm – the "backstage view". 5 minutes later the room was packed! 🤩

    Just done presenting on data pipelines, #apachebeam, #OSS, #Java & #Cloud @ Devfest Bucharest, on the Cloud stage. 💾

    Thanks to the awesome GDGBucharest team for the organization. 👏

  20. Quiet before the storm – the "backstage view". 5 minutes later the room was packed! 🤩

    Just done presenting on data pipelines, #apachebeam, #OSS, #Java & #Cloud @ Devfest Bucharest, on the Cloud stage. 💾

    Thanks to the awesome GDGBucharest team for the organization. 👏

  21. Quiet before the storm – the "backstage view". 5 minutes later the room was packed! 🤩

    Just done presenting on data pipelines, #apachebeam, #OSS, #Java & #Cloud @ Devfest Bucharest, on the Cloud stage. 💾

    Thanks to the awesome GDGBucharest team for the organization. 👏

  22. ✍️ blogged about unified batch and stream data processing using mkuthan.github.io/blog/2023/09

  23. Google und OpenMined machen die Vorzüge des differenzierten Datenschutzes auch der Python Developer Community als Open Source zugänglich.
    PipelineDP: Differential Privacy Framework für das Python-Universum
  24. Google und OpenMined machen die Vorzüge des differenzierten Datenschutzes auch der Python Developer Community als Open Source zugänglich.
    PipelineDP: Differential Privacy Framework für das Python-Universum