home.social

#datapipeline - Public Fediverse posts

Live and recent posts from across the Fediverse tagged #datapipeline, aggregated by home.social.

  1. Every data professional should understand these seven core concepts.

    From data warehouses and lakes to pipelines, meshes, and governance, these form the foundation of modern analytics infrastructure.
    Mastering them bridges the gap between raw data and actionable business insights.

    📕 ebokify.com/ai-data-science

    #DataEngineering #DataScience #DataAnalytics #ETL #DataWarehouse #BigData #BusinessIntelligence #DataPipeline #DataGovernance

  2. 🔥 Hot take alert! 🔥 "I would love there to be a SaaS apocalypse" in observability – that's according to Ari Zilka, CEO at #datapipeline engine startup MyDecisive AI, and a former exec at New Relic and #Hortonworks. MyDecisive's #opensourcesoftware targets data management costs from #observability vendors such as Datadog by reducing data at the source; it can also proactively alert on and take action on #OpenTelemetry data as it comes in from enterprise applications.

    If Zilka has his way, #o11y will return to enterprises as an in-house discipline, but without requiring an army of engineers to set it up.

    youtu.be/emk_NMicaI8?si=6KeETc

  3. RE: mastodon.social/@TanjaStrange/

    This #story isn’t really about a typo.

    It’s about:
    - digital identity accuracy
    - metadata pipelines in media distribution
    - algorithmic visibility of artists
    - and a broken chain of responsibility

    #namesmatter #entertainment #news #metadata #technews #ICYMI #respecttheartist #QAFail #datapipeline #retailers #actorsunions #moviestudios #amazon

  4. NASA's #data collection has undergone massive shifts in lifecycle management, #FAIR_data, technological trends, and policy. Evolving from 1980s magnetic tapes to a network of over 30 online repositories, the tech trends are easiest to identify. Adoption of #DataPipeline and #DataStandards is an ongoing focus. #DataGovernance and #DataRescue are emerging. NASA #DataStewardship navigates rapid technology and long-term science.

    Bugbee, K., & Ramachandran, R. (2025) doi.org/10.1029/2025EA004413 #SciLit

  9. Have you ever needed to extract text from images embedded in a ? I can highly recommend the open source tool which is easy to automate in for example a .

    It uses under the hood and has many options to experiment with to get the best possible accuracy for your language and PDF content.

    You can get started with just a few commands:

    samuelplumppu.se/blog/automate

  10. #Coroot is excited to partner with #Glassflow to share how you can optimize data streaming, storage, and observability using a fully-#FOSS stack (including tools like GlassFlow, #Kafka, #Clickhouse and Coroot!): t.ly/oVAOL

    #SRE #DevOps #tech #datapipeline #Python #Apache #linux #observability

  11. Data patterns for community survey data integrated into a data harmonization workflow description: O'Brien et al. (2021) tackles the issue of diverse data harmonization for ecological community surveys. sciencedirect.com/science/arti #DataScience #DataPipeline #DataCuration #SciLit

  12. Update: Since this post is getting attention again, I should probably let everyone know that I got #FediHired by a nice little AI company about two months ago 👩🏻‍🎤 thank you all for helping make that possible, and I'm sorry for blowing up your timelines with this update 🤪

    Well, the company I work(ed) for just folded. Poof! 💩⛈️.
    No notice, no severance, no nothing. 🤬

    So I'm asking for boosts and leads. If anyone needs someone who knows data, please reach out.

    I'm located in Seattle, WA. Remote preferred.

    Last few titles include: CDO, Head of Data, and Sr. Operations Analyst.

    I've worked in gaming, fintech, and B2C/C2C marketplaces most recently.

    I'm proficient in Python, SQL, statistics, team management, (almost) all things data-related, and a host of other stuff.

    I'm opinionated and anti-capitalist, but I also routinely bring in multiple times my department's cost in profits for the companies I work for.

    I'm also hella nice (despite my RBF) and easy to get along with.

    God, pitching myself is so awkward.

    That's the toot. Thanks.

    💜💜💜
    Edit: I wanted to say thank you to everyone who is sharing this and a huge thank you to everyone who has reached out with encouragement and leads. This community is amazing.

    Edit 2: I'm going through all the replies I got, and will reach out to a lot of you soon. I've just finished updating my resume and LinkedIn.

    I realize that I'm going to have to give up some of my anonymity when I respond to people, so I'd really appreciate it if anyone I send my resume or LinkedIn profile to could keep my work and personal life separate. Thank you.
    💜💜💜

    ---

    **Gif is of parent company explaining our off-boarding process**

    #PleaseBoost #Boost #FediHire #LWF #Layoffs #LookingForJob #LookingForWork #Data #DataScience #Python #SQL #Snowflake #ELT #DataPipeline #Analytics #BusinessIntelligence #CDO #HeadOfData #PleaseHelp #Gaming #Fintech #ChiefDataOfficer #HireAlice #ThankYou

  16. Hevo draws in $8 million Series A for its no-code data pipeline service - Hevo founders Manish Jethani and Sourabh Agarwal
    According to data pipeline startup Hevo, many small... more: feedproxy.google.com/~r/Techcr #nocodedatapipeline #fundings&exits #dataanalytics #datapipeline #enterprise #startups #hevodata #nocode #india #asia #hevo #tc