home.social

#analyticsengineering — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #analyticsengineering, aggregated by home.social.

  1. I hosted the PipeRider Community Office Hours yesterday and one cool feature we demoed was this online dbt manifest viewer.

    Generate a Lineage Diff from just your dbt project's manifest files.

    Try it out for yourself here:

    staging.cloud.piperider.io/onl

    Watch it in action on Youtube:

    youtube.com/watch?v=LHuTb3e_4O

    #DataEngineering #DataViz #dbt #DataTools #AnalyticsEngineering #OpenSource #PipeRider

  2. Looking for interesting datasets to practice with?
    (and full end-to-end data pipelines to investigate)

    Here are 5 (and a few extras) from the recent DataTalksClub Data Engineering Zoomcamp:

    medium.com/inthepipeline/5-int

    Datasets Include:

    - Daily data on Berlin bike thefts
    - Air quality
    - SF Eviction data
    - Minneapolis 311
    - NYC Restaurant Inspection

    #DataEngineering #DataOps #Datasets #DataProjects #AnalyticsEngineering #DataPipelines #dbt #elt #DataViz

  3. CW: dbt model materialization patterns

    Do you materialize as table by default? View? How much do you use ephemeral materialization? When is a view better than a table?

    My team has pretty small data and we use tables for everything except event streams (incremental) and some intermediate (ephemeral) models.

    I'm really curious what other teams' materialization patterns are like.

    #datadon #dbt #DataModeling #AnalyticsEngineering #DataWarehousing