#ai-engineering — Public Fediverse posts on home.social

Antão Almada @[email protected] · 2026-07-03 · 19:31 UTC

Agents do not need RAG or vector databases for most real world work. They need structure and semantics.

Agent Knowledge Graphs turn mixed repositories of code, docs, configs, and PDFs into a connected model that agents can reason over. This often replaces entire retrieval pipelines.

https://antaoalmada.dev/posts/Code-Agent-Knowledge-Graphs/

#AIEngineering #KnowledgeGraphs #CodingAgents #AgentWorkflows #SoftwareArchitecture #Graphify

#aiengineering #knowledgegraphs #codingagents #agentworkflows #softwarearchitecture #graphify

Antão Almada @[email protected] · 2026-07-03 · 19:31 UTC

Agents do not need RAG or vector databases for most real world work. They need structure and semantics.

Agent Knowledge Graphs turn mixed repositories of code, docs, configs, and PDFs into a connected model that agents can reason over. This often replaces entire retrieval pipelines.

https://antaoalmada.dev/posts/Code-Agent-Knowledge-Graphs/

#AIEngineering #KnowledgeGraphs #CodingAgents #AgentWorkflows #SoftwareArchitecture #Graphify

#aiengineering #knowledgegraphs #codingagents #agentworkflows #softwarearchitecture #graphify

Bleme @[email protected] · 2026-07-02 · 17:12 UTC

The Prompt Is Still a Punch Card - Ted Johnson, JoinIn AI

https://video.ut0pia.org/w/hPP3YFomvsDhmxP7m3pVsh

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-07-02 · 17:12 UTC

The Prompt Is Still a Punch Card - Ted Johnson, JoinIn AI

https://video.ut0pia.org/w/hPP3YFomvsDhmxP7m3pVsh

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Brandon H :csharp: :verified: @[email protected] · 2026-07-02 · 16:36 UTC

via #Microsoft : Microsoft Frontier Company: AI engineering that amplifies and protects your intelligence

https://ift.tt/FUC8ITD
#MicrosoftFrontierCompany #AIEngineering #IntelligencePlusTrust # FrontierTransformation #AIPlatform #DataProtection #IPProtection #EnterpriseAI #A…

#microsoft #microsoftfrontiercompany #aiengineering #intelligenceplustrust #aiplatform #dataprotection

Brandon H :csharp: :verified: @[email protected] · 2026-07-02 · 16:36 UTC

via #Microsoft : Microsoft Frontier Company: AI engineering that amplifies and protects your intelligence

https://ift.tt/FUC8ITD
#MicrosoftFrontierCompany #AIEngineering #IntelligencePlusTrust # FrontierTransformation #AIPlatform #DataProtection #IPProtection #EnterpriseAI #A…

#microsoft #microsoftfrontiercompany #aiengineering #intelligenceplustrust #aiplatform #dataprotection

amah_codes @[email protected] · 2026-07-01 · 18:19 UTC

🧠 From Simple Indexing to Semantic Understanding: Why I Layered Both Approaches

Finishing LLM Zoomcamp Module 2 felt like leveling up my RAG system. I was already doing agentic RAG in Module 1, but vector search opened a whole new layer of retrieval flexibility. Here's why the technical decisions matter:

-**Gained exposure to various vector databases including pgvector, sqlitesearch, and minsearch** – Each tool carries distinct tradeoffs: pgvector for PostgreSQL integration, SQLite for lightweight local workloads, minsearch for in-memory prototyping. Knowing which fits where matters more than the technology itself
- **Embedding actual lesson content with ONNX library** - Lightweight CPU inference means this stacks directly on existing infrastructure without needing GPU dependencies or scaling headaches
- **Chunking 72 lesson pages into ~300 chunks with 50% overlap** - Sliding window preserves context across topic boundaries while reducing prompt token usage compared to whole-page indexing
- **Building the same query against both vector and keyword indexes to compare scores** - Quantifies semantic vs lexical retrieval so you can decide when each method adds value
- **Using hybrid search (RRF fusion) to blend vector and keyword search results intelligently** - Captures both conceptual meaning and precise terminology, which matters when queries span multiple technical domains

One thing that stuck: even queries like "How do I store vectors in PostgreSQL?" returned meaningful results because I was comparing semantic similarity, not just matching words. That's the difference lexical vs. semantic search really makes. It shows hybrid search isn't just a nice-to-have, it's practical engineering when you care about retrieval precision and coverage.

Project is live if you're curious to see how the pieces fit together: https://github.com/ammartin8/llm_zoomcamp_portfolio/blob/main/modules/02_vector_search/project_02/project_vector_search_case_study.md

Huge thanks again to Alexey Grigorev for putting this together, open-source learning at this level matters more than most realize. Anyone else finishing up Module 2 or working with hybrid retrieval themselves?

#ai #localai #llm #mastodon #fediverse #buildinpublic #linux #github #aiengineering #DataEngineering #agentic #rag #vector #openai

#ai #localai #llm #mastodon #fediverse #buildinpublic

amah_codes @amah_codes · 2026-07-01 · 18:19 UTC

🧠 From Simple Indexing to Semantic Understanding: Why I Layered Both Approaches

Finishing LLM Zoomcamp Module 2 felt like leveling up my RAG system. I was already doing agentic RAG in Module 1, but vector search opened a whole new layer of retrieval flexibility. Here's why the technical decisions matter:

-**Gained exposure to various vector databases including pgvector, sqlitesearch, and minsearch** – Each tool carries distinct tradeoffs: pgvector for PostgreSQL integration, SQLite for lightweight local workloads, minsearch for in-memory prototyping. Knowing which fits where matters more than the technology itself
- **Embedding actual lesson content with ONNX library** - Lightweight CPU inference means this stacks directly on existing infrastructure without needing GPU dependencies or scaling headaches
- **Chunking 72 lesson pages into ~300 chunks with 50% overlap** - Sliding window preserves context across topic boundaries while reducing prompt token usage compared to whole-page indexing
- **Building the same query against both vector and keyword indexes to compare scores** - Quantifies semantic vs lexical retrieval so you can decide when each method adds value
- **Using hybrid search (RRF fusion) to blend vector and keyword search results intelligently** - Captures both conceptual meaning and precise terminology, which matters when queries span multiple technical domains

One thing that stuck: even queries like "How do I store vectors in PostgreSQL?" returned meaningful results because I was comparing semantic similarity, not just matching words. That's the difference lexical vs. semantic search really makes. It shows hybrid search isn't just a nice-to-have, it's practical engineering when you care about retrieval precision and coverage.

Project is live if you're curious to see how the pieces fit together: https://github.com/ammartin8/llm_zoomcamp_portfolio/blob/main/modules/02_vector_search/project_02/project_vector_search_case_study.md

Huge thanks again to Alexey Grigorev for putting this together, open-source learning at this level matters more than most realize. Anyone else finishing up Module 2 or working with hybrid retrieval themselves?

#ai #localai #llm #mastodon #fediverse #buildinpublic #linux #github #aiengineering #DataEngineering #agentic #rag #vector #openai

#ai #localai #llm #mastodon #fediverse #buildinpublic

Oresztesz Margaritisz @[email protected] · 2026-07-01 · 14:41 UTC

I was working on an early version of harness engineering technology landscape. The landscape is available under: https://dev.to/gitaroktato/harness-engineering-technology-landscape-1d93

A high quality version can be downloaded: https://drive.google.com/file/d/1JWrkw5jupP-YBYPd1PkrL_NtmLKk2aje/view

#aiengineering #genai #llm #harnessengineering

Oresztesz Margaritisz @[email protected] · 2026-07-01 · 14:41 UTC

I was working on an early version of harness engineering technology landscape. The landscape is available under: https://dev.to/gitaroktato/harness-engineering-technology-landscape-1d93

A high quality version can be downloaded: https://drive.google.com/file/d/1JWrkw5jupP-YBYPd1PkrL_NtmLKk2aje/view

#aiengineering #genai #llm #harnessengineering

Oresztesz Margaritisz @[email protected] · 2026-06-29 · 14:07 UTC

I have been following the field of harness engineering for some time now. This article distills the essence of harness engineering from the testimonials and shared experiences of practitioners.

https://dev.to/gitaroktato/harness-engineering-core-principles-1j1f

#aiengineering #genai #llm #harnessengineering

#harnessengineering #aiengineering #genai #llm

Oresztesz Margaritisz @[email protected] · 2026-06-29 · 14:07 UTC

I have been following the field of harness engineering for some time now. This article distills the essence of harness engineering from the testimonials and shared experiences of practitioners.

https://dev.to/gitaroktato/harness-engineering-core-principles-1j1f

#aiengineering #genai #llm #harnessengineering

#harnessengineering #aiengineering #genai #llm

llm-bench@KAPUALabs @[email protected] · 2026-06-29 · 09:45 UTC

Stop measuring AI performance without measuring resilience. High bench scores often mask fragile backend logic that fails silently under pressure.

We break down the invisible machinery: models rerouted from broken providers, responses caught before reaching users, and metrics refusing to penalize failure unfairly. Reliability isn't hoped for; it's engineered. ⚙️

Read the full analysis: https://post.kapualabs.com/yckr6746

#AIEngineering #ModelReliability #TechInfrastructure #LLM

#aiengineering #modelreliability #techinfrastructure #llm

Bleme @[email protected] · 2026-06-29 · 04:16 UTC

The Future Is Domain-Specific Agents - Justin Schroeder, StandardAgents

https://video.ut0pia.org/w/qAj6FTo16d64xiqQ5ayy56

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-29 · 04:16 UTC

The Future Is Domain-Specific Agents - Justin Schroeder, StandardAgents

https://video.ut0pia.org/w/qAj6FTo16d64xiqQ5ayy56

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-29 · 02:18 UTC

The Prompt is the Platform - Dominik Tornow, Resonate HQ

https://video.ut0pia.org/w/pPyDdKY4cXfTK9Fcki9cNR

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-29 · 02:18 UTC

The Prompt is the Platform - Dominik Tornow, Resonate HQ

https://video.ut0pia.org/w/pPyDdKY4cXfTK9Fcki9cNR

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-29 · 01:08 UTC

Your Agent Failed in Prod. Good Luck Reproducing It. - Tisha Chawla & Susheem Koul, Microsoft

https://video.ut0pia.org/w/wNUPQCXVMDqdCgVBXp4xrY

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-29 · 01:08 UTC

Your Agent Failed in Prod. Good Luck Reproducing It. - Tisha Chawla & Susheem Koul, Microsoft

https://video.ut0pia.org/w/wNUPQCXVMDqdCgVBXp4xrY

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-29 · 00:06 UTC

AI-Driven Multi-Document Correlation for Financial Compliance - Varsha Shah, Independent

https://video.ut0pia.org/w/2wEa1u89wu7PeM8nN22rDq

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-29 · 00:06 UTC

AI-Driven Multi-Document Correlation for Financial Compliance - Varsha Shah, Independent

https://video.ut0pia.org/w/2wEa1u89wu7PeM8nN22rDq

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 19:04 UTC

HTML is All You Need (for Agents to Make Graphics) - Amol Kapoor, Nori

https://video.ut0pia.org/w/opTMpy7DdKMvQfrL2vgc3d

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 19:04 UTC

HTML is All You Need (for Agents to Make Graphics) - Amol Kapoor, Nori

https://video.ut0pia.org/w/opTMpy7DdKMvQfrL2vgc3d

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 16:14 UTC

Agents Building Agents - Alfonso Graziano, Nearform

https://video.ut0pia.org/w/aLC51qiB2NvzHPesJ688qK

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 16:14 UTC

Agents Building Agents - Alfonso Graziano, Nearform

https://video.ut0pia.org/w/aLC51qiB2NvzHPesJ688qK

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 16:12 UTC

The 100-Tool Agent Is a Trap - Sohail Shaikh & Ankush Rastogi, Prosodica

https://video.ut0pia.org/w/2qFLJrnSnEAoTWHacSUHHk

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 16:12 UTC

The 100-Tool Agent Is a Trap - Sohail Shaikh & Ankush Rastogi, Prosodica

https://video.ut0pia.org/w/2qFLJrnSnEAoTWHacSUHHk

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 16:05 UTC

Browser Agents Don't Need Better Models. They Need Better Eyes. - Kushan Raj, ARK

https://video.ut0pia.org/w/kXZpSuKM39ZRBqsjwK2CZq

#aiengineer #aiengineering #softwaredevelopment #startups #tech

Bleme @[email protected] · 2026-06-28 · 16:05 UTC

Browser Agents Don't Need Better Models. They Need Better Eyes. - Kushan Raj, ARK

https://video.ut0pia.org/w/kXZpSuKM39ZRBqsjwK2CZq

#aiengineer #aiengineering #softwaredevelopment #startups #tech

pgEdge Postgres @[email protected] · 2026-06-25 · 18:56 UTC

The Unix philosophy for AI agents: each tool does one thing well, then chain them.

Phillip Merrick made this case on TalkDev's Enterprise Unlocked: AI is probabilistic now, not deterministic. 5 agents at 95% each >> 1 monolithic agent doing 5 steps. Probabilities compound - in your favor when you go modular, against you when you don't. That's the design shift.

Shaun Thomas went deeper on this for the pgEdge blog. 📖

https://hubs.la/Q04mJmPW0

#AgenticAI #AIEngineering #Programming #OpenSource

#agenticai #aiengineering #programming #opensource

pgEdge Postgres @[email protected] · 2026-06-25 · 18:56 UTC

The Unix philosophy for AI agents: each tool does one thing well, then chain them.

Phillip Merrick made this case on TalkDev's Enterprise Unlocked: AI is probabilistic now, not deterministic. 5 agents at 95% each >> 1 monolithic agent doing 5 steps. Probabilities compound - in your favor when you go modular, against you when you don't. That's the design shift.

Shaun Thomas went deeper on this for the pgEdge blog. 📖

https://hubs.la/Q04mJmPW0

#AgenticAI #AIEngineering #Programming #OpenSource

#agenticai #aiengineering #programming #opensource

HackerNoon @[email protected] · 2026-06-25 · 00:08 UTC

AI systems fail silently and at massive scale, yet the field building them has no licensure, no inspection, and no shared code of practice. Here is what I think https://hackernoon.com/ai-engineering-still-has-no-building-code-and-that-is-the-real-crisis #aiengineering

#aiengineering

HackerNoon @[email protected] · 2026-06-25 · 00:08 UTC

AI systems fail silently and at massive scale, yet the field building them has no licensure, no inspection, and no shared code of practice. Here is what I think https://hackernoon.com/ai-engineering-still-has-no-building-code-and-that-is-the-real-crisis #aiengineering

#aiengineering

amah_codes @[email protected] · 2026-06-24 · 20:30 UTC

Module 1 of LLM Zoomcamp is done! 🎉

I turned my original RAG pipeline into an Agent!

I spent these last few days diving deep into Agentic RAG. It's been fascinating to build it step by step. Every time I ask the LLM to learn about something new, I see how it naturally figures out which tools to use, when to search, and how many times to gather info before giving me a solid answer.

What exactly is Agentic RAG?
It’s like giving the AI a brain that can actually act. Instead of just retrieving from a fixed knowledge base, the model decides whether it needs external tools first, gathers what it needs, and then answers. It’s pretty interesting to understand how it actually works behind the scenes!

Why does this matter?
A few days ago I asked for a detailed guide on using the OpenAI Python library with the chat.completion API. The Local LLM called web search multiple times until it had enough context and built something useful from those pieces. Now that I am building these systems, I can finally understand why it does what it does.

💡 Insights from this week:
- Building a static pipeline is a great start, but to make something truly flexible, you need function or tool calling. It lets the LLM look at the question first and decide whether it needs to search a knowledge base before answering.
- I used to think "chunking" was just about breaking up text. Turns out it can reduce token input by 3x! 🤯
- You have to learn how to walk before you run. Starting small, understanding each component manually, and seeing how the pieces fit together… it felt slow at first but worth it. Now I’m able to accelerate with agent frameworks like toyaikit, LangChain, PydanticAI, or OpenAI Agents.
- There is definitely a learning curve with the API syntax. Between the new response API and chat completions, tool responses are structured differently and you have to adjust your code accordingly. Frustrating at times, but also a great way to learn!

Quick takeaway:
It is best to start simple, then add complexity only when needed. Sometimes an agent can burn tokens unnecessarily, so only add that layer if your problem really needs it!

Had a lot of fun with this module and I’m already curious about what’s next. If you’re interested in learning along, this is the full free course Alexey at the Data Talks Club: https://github.com/DataTalksClub/llm-zoomcamp/

Anyone else tinkering with LLM agents lately? What kind of projects are you exploring or trying out? Would love to hear where your journey is heading!

#ai #localai #llm #mastodon #fediverse #buildinpublic #linux #github #aiengineering #DataEngineering

#ai #localai #llm #mastodon #fediverse #buildinpublic

amah_codes @amah_codes · 2026-06-24 · 20:30 UTC

Module 1 of LLM Zoomcamp is done! 🎉

I turned my original RAG pipeline into an Agent!

I spent these last few days diving deep into Agentic RAG. It's been fascinating to build it step by step. Every time I ask the LLM to learn about something new, I see how it naturally figures out which tools to use, when to search, and how many times to gather info before giving me a solid answer.

What exactly is Agentic RAG?
It’s like giving the AI a brain that can actually act. Instead of just retrieving from a fixed knowledge base, the model decides whether it needs external tools first, gathers what it needs, and then answers. It’s pretty interesting to understand how it actually works behind the scenes!

Why does this matter?
A few days ago I asked for a detailed guide on using the OpenAI Python library with the chat.completion API. The Local LLM called web search multiple times until it had enough context and built something useful from those pieces. Now that I am building these systems, I can finally understand why it does what it does.

💡 Insights from this week:
- Building a static pipeline is a great start, but to make something truly flexible, you need function or tool calling. It lets the LLM look at the question first and decide whether it needs to search a knowledge base before answering.
- I used to think "chunking" was just about breaking up text. Turns out it can reduce token input by 3x! 🤯
- You have to learn how to walk before you run. Starting small, understanding each component manually, and seeing how the pieces fit together… it felt slow at first but worth it. Now I’m able to accelerate with agent frameworks like toyaikit, LangChain, PydanticAI, or OpenAI Agents.
- There is definitely a learning curve with the API syntax. Between the new response API and chat completions, tool responses are structured differently and you have to adjust your code accordingly. Frustrating at times, but also a great way to learn!

Quick takeaway:
It is best to start simple, then add complexity only when needed. Sometimes an agent can burn tokens unnecessarily, so only add that layer if your problem really needs it!

Had a lot of fun with this module and I’m already curious about what’s next. If you’re interested in learning along, this is the full free course Alexey at the Data Talks Club: https://github.com/DataTalksClub/llm-zoomcamp/

Anyone else tinkering with LLM agents lately? What kind of projects are you exploring or trying out? Would love to hear where your journey is heading!

#ai #localai #llm #mastodon #fediverse #buildinpublic #linux #github #aiengineering #DataEngineering

#ai #localai #llm #mastodon #fediverse #buildinpublic

Upsun @[email protected] · 2026-06-23 · 11:11 UTC

Today, we are introducing Upsun Dispatch. 🚀

AI made engineers faster. It didn't make teams faster. The constraint was never typing; it was everything around it. The SDLC is being rewritten, and we'd like to rewrite it with you.

Upsun Dispatch is our platform for the agentic software development lifecycle, launching in September 2026. Workflow is the primitive, not the agent. 😎

👉 Read the full story: https://upsun.com/blog/introducing-upsun-dispatch/

#SDLC #AIEngineering #EngineeringLeadership #DeveloperTools

#sdlc #aiengineering #engineeringleadership #developertools

Upsun @[email protected] · 2026-06-23 · 11:11 UTC

Today, we are introducing Upsun Dispatch. 🚀

AI made engineers faster. It didn't make teams faster. The constraint was never typing; it was everything around it. The SDLC is being rewritten, and we'd like to rewrite it with you.

Upsun Dispatch is our platform for the agentic software development lifecycle, launching in September 2026. Workflow is the primitive, not the agent. 😎

👉 Read the full story: https://upsun.com/blog/introducing-upsun-dispatch/

#SDLC #AIEngineering #EngineeringLeadership #DeveloperTools

#sdlc #aiengineering #engineeringleadership #developertools