home.social

#dataproduct — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #dataproduct, aggregated by home.social.

  1. The arrival of #AI #agents creates urgency around the need to guide and govern them.

    After 15 years of building reliable AI solutions for banks and other enterprises, Jacobus Geluk sees a standards-based #dataProduct marketplace as key to #AIagent success.

    His proposed new #useCaseTree specification articulates the business needs that data products address, complementing his earlier work on the #DPROD data-product description standard.

    knowledgegraphinsights.com/jac

  2. The Treasury Board's policy suite is conceptually a giant graph structure, but is frustratingly resistant to automated analysis.

    Some annoyances:

    The policy suite straddles tbs-sct.gc.ca and canada.ca and policies often draw their authorities from material on laws-lois.justice.gc.ca

    There is frustratingly little common structure you can rely on. If you think you found a structure, you just need to see a few more policies

    Links between policies or to laws rarely link to relevant sections

    Only a few policies have an XML data representation, most are available only as HTML, making web scraping the most reliable approach

    Markers indicating sections, clauses etc. are not consistent across HTML documents making web scraping extremely annoying

    Multiple requirements often occur in a single ("and")

    Enabling programmatic analysis of policy would be broadly valuable both inside and outside government.

    This should be an #opendata #dataproduct but it seems like these documents are largely treated like marketing material: if it looks OK in the browser it's done.

    #gcdigital

  3. Does anyone here know of some literature on how to share entity relations between domains? Let's say Lionel Messi, with id 123 in some sport system start an acting career and gets id 456 in #imdb. How would the sports department communicate this to some other, third domain so they can join and aggregate? How do you deal with deletion of an entity? Just tombstones? I'd love some research on this topic, as it seems to reinvent itself time and time again
    #datamesh #dataproduct #kafka #protobuf

  4. I've been too long without posting. There's been a lot going on in data engineering, the modern data stack, the business(es) of data, … not to mention FTX and ChatGPT (but, of course, you already know about all of that.)
    medium.com/@rhm2k/resonance-ca

    #DataEngineering #DataProduct #DataEconomics

  5. "A Data Product is set of prepared data or information (and hence specifically not raw data) that is ready to be consumed by a wide set of consumers."

    Willem Koenders provides insights into one of the fundamental concepts of contemporary data engineering.

    #DataEngineering
    #DataProduct

    medium.com/@koendit/whats-the-