home.social

Search

103 results for “cwensel”

  1. @seldo I don't know any firsthand,

    but I spent the last couple weeks exploring what a #RAG pipeline would look like so I could write a sample application/pipeline using my #OpenSource #clusterless project

    github.com/ClusterlessHQ

    unfortunately the idea I had wasn't ultimately suitable for RAG and could be a simple BERT/BART summarizer pipeline without having a open/elasticsearch backend or other vector db.

    still looking for a fun RAG based prototype I could build and share.

  2. need to dig into this, but i've been doing replay (redrive) on StepFunctions for years with my pipelines

    aws.amazon.com/blogs/big-data/

    replay is one feature I haven't added back to yet, though all the metadata is there.

    github.com/ClusterlessHQ

  3. need to dig into this, but i've been doing replay (redrive) on #aws StepFunctions for years with my #data pipelines

    aws.amazon.com/blogs/big-data/

    replay is one feature I haven't added back to #Clusterless yet, though all the metadata is there.

    github.com/ClusterlessHQ

  4. need to dig into this, but i've been doing replay (redrive) on #aws StepFunctions for years with my #data pipelines

    aws.amazon.com/blogs/big-data/

    replay is one feature I haven't added back to #Clusterless yet, though all the metadata is there.

    github.com/ClusterlessHQ

  5. need to dig into this, but i've been doing replay (redrive) on #aws StepFunctions for years with my #data pipelines

    aws.amazon.com/blogs/big-data/

    replay is one feature I haven't added back to #Clusterless yet, though all the metadata is there.

    github.com/ClusterlessHQ

  6. need to dig into this, but i've been doing replay (redrive) on #aws StepFunctions for years with my #data pipelines

    aws.amazon.com/blogs/big-data/

    replay is one feature I haven't added back to #Clusterless yet, though all the metadata is there.

    github.com/ClusterlessHQ

  7. @c_chep

    currently all my examples (and scenario tester) use jsonnet, but it's got weak overall support.

    CUE looks interesting, but no Java implementation for embedding (if that was a thing I was considering)

  8. @c_chep

    currently all my #clusterless examples (and scenario tester) use jsonnet, but it's got weak overall support.

    CUE looks interesting, but no Java implementation for embedding (if that was a thing I was considering)

  9. @c_chep

    currently all my #clusterless examples (and scenario tester) use jsonnet, but it's got weak overall support.

    CUE looks interesting, but no Java implementation for embedding (if that was a thing I was considering)

  10. @c_chep

    currently all my #clusterless examples (and scenario tester) use jsonnet, but it's got weak overall support.

    CUE looks interesting, but no Java implementation for embedding (if that was a thing I was considering)

  11. @c_chep

    currently all my #clusterless examples (and scenario tester) use jsonnet, but it's got weak overall support.

    CUE looks interesting, but no Java implementation for embedding (if that was a thing I was considering)

  12. Tessellate is now on Docker Hub

    hub.docker.com/r/clusterless/t

    Tessellate is a command line tool for reading and writing to/from multiple locations and across multiple formats.

  13. Tessellate is now on Docker Hub

    hub.docker.com/r/clusterless/t

    Tessellate is a command line tool for reading and writing #data to/from multiple locations and across multiple formats.

    #clusterless

  14. Tessellate is now on Docker Hub

    hub.docker.com/r/clusterless/t

    Tessellate is a command line tool for reading and writing #data to/from multiple locations and across multiple formats.

    #clusterless

  15. Tessellate is now on Docker Hub

    hub.docker.com/r/clusterless/t

    Tessellate is a command line tool for reading and writing #data to/from multiple locations and across multiple formats.

    #clusterless

  16. Tessellate is now on Docker Hub

    hub.docker.com/r/clusterless/t

    Tessellate is a command line tool for reading and writing #data to/from multiple locations and across multiple formats.

    #clusterless

  17. Automating CloudWatch log export into S3 is no simple task.

    Next release will now have a new Component type called Activity that is simply a scheduled task..

    The first Activity will be function that exports cloud watch logs created within the previous interval.

    As they arrive, any arc can subscribe to the data drop and do things. To simplify that task, I'll update

    The cw log is a delimited text file with two columns, one is json. unlike all the others in aws!

  18. Automating #AWS CloudWatch log export into S3 is no simple task.

    Next #clusterless release will now have a new Component type called Activity that is simply a scheduled task..

    The first Activity will be function that exports cloud watch logs created within the previous interval.

    As they arrive, any arc can subscribe to the data drop and do things. To simplify that task, I'll update #tessellate

    The cw log is a delimited text file with two columns, one is json. unlike all the others in aws!

  19. Automating #AWS CloudWatch log export into S3 is no simple task.

    Next #clusterless release will now have a new Component type called Activity that is simply a scheduled task..

    The first Activity will be function that exports cloud watch logs created within the previous interval.

    As they arrive, any arc can subscribe to the data drop and do things. To simplify that task, I'll update #tessellate

    The cw log is a delimited text file with two columns, one is json. unlike all the others in aws!

  20. Automating #AWS CloudWatch log export into S3 is no simple task.

    Next #clusterless release will now have a new Component type called Activity that is simply a scheduled task..

    The first Activity will be function that exports cloud watch logs created within the previous interval.

    As they arrive, any arc can subscribe to the data drop and do things. To simplify that task, I'll update #tessellate

    The cw log is a delimited text file with two columns, one is json. unlike all the others in aws!

  21. Automating #AWS CloudWatch log export into S3 is no simple task.

    Next #clusterless release will now have a new Component type called Activity that is simply a scheduled task..

    The first Activity will be function that exports cloud watch logs created within the previous interval.

    As they arrive, any arc can subscribe to the data drop and do things. To simplify that task, I'll update #tessellate

    The cw log is a delimited text file with two columns, one is json. unlike all the others in aws!

  22. ok, here's a new one for users.

    would anyone be interested in an automated way to extract CloudWatch logs (continuously) into an s3 bucket.

    and have them converted into (/etc) for downstream custom processing. or simply partitioned with partition updates to AWS Athena/Glue?

    the challenge for users is getting the `detail` json field exposed since it's app specific.

    with devs could then inject custom processing for custom app logs into the pipeline

  23. ok, here's a new one for #aws users.

    would anyone be interested in an automated way to extract CloudWatch logs (continuously) into an s3 bucket.

    and have them converted into #parquet (/etc) for downstream custom processing. or simply partitioned with partition updates to AWS Athena/Glue?

    the challenge for users is getting the `detail` json field exposed since it's app specific.

    with #clusterless devs could then inject custom processing for custom app logs into the #data pipeline

  24. ok, here's a new one for #aws users.

    would anyone be interested in an automated way to extract CloudWatch logs (continuously) into an s3 bucket.

    and have them converted into #parquet (/etc) for downstream custom processing. or simply partitioned with partition updates to AWS Athena/Glue?

    the challenge for users is getting the `detail` json field exposed since it's app specific.

    with #clusterless devs could then inject custom processing for custom app logs into the #data pipeline

  25. ok, here's a new one for #aws users.

    would anyone be interested in an automated way to extract CloudWatch logs (continuously) into an s3 bucket.

    and have them converted into #parquet (/etc) for downstream custom processing. or simply partitioned with partition updates to AWS Athena/Glue?

    the challenge for users is getting the `detail` json field exposed since it's app specific.

    with #clusterless devs could then inject custom processing for custom app logs into the #data pipeline

  26. ok, here's a new one for #aws users.

    would anyone be interested in an automated way to extract CloudWatch logs (continuously) into an s3 bucket.

    and have them converted into #parquet (/etc) for downstream custom processing. or simply partitioned with partition updates to AWS Athena/Glue?

    the challenge for users is getting the `detail` json field exposed since it's app specific.

    with #clusterless devs could then inject custom processing for custom app logs into the #data pipeline