#responsesapi — Public Fediverse posts on home.social

Brandon H :csharp: :verified: @[email protected] · 2026-05-12 · 22:12 UTC

via #AIFoundry : Foundry Local 1.1: Live Transcription, Embeddings, and Responses API

https://ift.tt/O1MorgK
#FoundryLocal #LiveTranscription #EmbeddingsAPI #ResponsesAPI #WebGPU #ONNXRuntime #NodeAPI #PythonSDK #JavaScriptSDK #CSharpSDK #DotNet #NetStandard #WinML #Koffi #No…

#aifoundry #foundrylocal #livetranscription #embeddingsapi #responsesapi #webgpu

Brandon H :csharp: :verified: @[email protected] · 2026-05-12 · 22:12 UTC

via #AIFoundry : Foundry Local 1.1: Live Transcription, Embeddings, and Responses API

https://ift.tt/O1MorgK
#FoundryLocal #LiveTranscription #EmbeddingsAPI #ResponsesAPI #WebGPU #ONNXRuntime #NodeAPI #PythonSDK #JavaScriptSDK #CSharpSDK #DotNet #NetStandard #WinML #Koffi #No…

#aifoundry #foundrylocal #livetranscription #embeddingsapi #responsesapi #webgpu

Brandon H :csharp: :verified: @bc3tech · 2026-05-12 · 22:12 UTC

via #AIFoundry : Foundry Local 1.1: Live Transcription, Embeddings, and Responses API

https://ift.tt/O1MorgK
#FoundryLocal #LiveTranscription #EmbeddingsAPI #ResponsesAPI #WebGPU #ONNXRuntime #NodeAPI #PythonSDK #JavaScriptSDK #CSharpSDK #DotNet #NetStandard #WinML #Koffi #No…

#aifoundry #foundrylocal #livetranscription #embeddingsapi #responsesapi #webgpu

Brandon H :csharp: :verified: @[email protected] · 2026-05-12 · 22:12 UTC

via #AIFoundry : Foundry Local 1.1: Live Transcription, Embeddings, and Responses API

https://ift.tt/O1MorgK
#FoundryLocal #LiveTranscription #EmbeddingsAPI #ResponsesAPI #WebGPU #ONNXRuntime #NodeAPI #PythonSDK #JavaScriptSDK #CSharpSDK #DotNet #NetStandard #WinML #Koffi #No…

#no #koffi #winml #netstandard #dotnet #csharpsdk

Brandon H :csharp: :verified: @[email protected] · 2026-05-12 · 22:12 UTC

via #AIFoundry : Foundry Local 1.1: Live Transcription, Embeddings, and Responses API

https://ift.tt/O1MorgK
#FoundryLocal #LiveTranscription #EmbeddingsAPI #ResponsesAPI #WebGPU #ONNXRuntime #NodeAPI #PythonSDK #JavaScriptSDK #CSharpSDK #DotNet #NetStandard #WinML #Koffi #No…

#aifoundry #foundrylocal #livetranscription #embeddingsapi #responsesapi #webgpu

Andreas Becker @[email protected] · 2026-04-23 · 08:44 UTC

OpenAI integriert WebSockets in die Responses API und senkt die Latenz bei Agenten-Workflows um bis zu 40 Prozent.

Durch persistente Verbindungen und serverseitiges Kontext-Caching entfallen redundante HTTP-Anfragen. Modelle wie GPT-5.3-Codex generieren so auf Cerebras-Hardware bis zu 4.000 Token pro Sekunde. Drittanbieter verzeichnen messbare Leistungssprünge.

#OpenAI #ResponsesAPI #Cerebras #LLM #News
https://www.all-ai.de/news/news26/openai-api-speed

#openai #responsesapi #cerebras #llm #news

Andreas Becker @[email protected] · 2026-04-23 · 08:44 UTC

OpenAI integriert WebSockets in die Responses API und senkt die Latenz bei Agenten-Workflows um bis zu 40 Prozent.

Durch persistente Verbindungen und serverseitiges Kontext-Caching entfallen redundante HTTP-Anfragen. Modelle wie GPT-5.3-Codex generieren so auf Cerebras-Hardware bis zu 4.000 Token pro Sekunde. Drittanbieter verzeichnen messbare Leistungssprünge.

#OpenAI #ResponsesAPI #Cerebras #LLM #News
https://www.all-ai.de/news/news26/openai-api-speed

#openai #responsesapi #cerebras #llm #news

Andreas Becker @[email protected] · 2026-04-23 · 08:44 UTC

OpenAI integriert WebSockets in die Responses API und senkt die Latenz bei Agenten-Workflows um bis zu 40 Prozent.

Durch persistente Verbindungen und serverseitiges Kontext-Caching entfallen redundante HTTP-Anfragen. Modelle wie GPT-5.3-Codex generieren so auf Cerebras-Hardware bis zu 4.000 Token pro Sekunde. Drittanbieter verzeichnen messbare Leistungssprünge.

#OpenAI #ResponsesAPI #Cerebras #LLM #News
https://www.all-ai.de/news/news26/openai-api-speed

#openai #responsesapi #cerebras #llm #news

Andreas Becker @[email protected] · 2026-04-23 · 08:44 UTC

OpenAI integriert WebSockets in die Responses API und senkt die Latenz bei Agenten-Workflows um bis zu 40 Prozent.

Durch persistente Verbindungen und serverseitiges Kontext-Caching entfallen redundante HTTP-Anfragen. Modelle wie GPT-5.3-Codex generieren so auf Cerebras-Hardware bis zu 4.000 Token pro Sekunde. Drittanbieter verzeichnen messbare Leistungssprünge.

#OpenAI #ResponsesAPI #Cerebras #LLM #News
https://www.all-ai.de/news/news26/openai-api-speed

#news #llm #cerebras #responsesapi #openai

Andreas Becker @[email protected] · 2026-04-23 · 08:44 UTC

OpenAI integriert WebSockets in die Responses API und senkt die Latenz bei Agenten-Workflows um bis zu 40 Prozent.

Durch persistente Verbindungen und serverseitiges Kontext-Caching entfallen redundante HTTP-Anfragen. Modelle wie GPT-5.3-Codex generieren so auf Cerebras-Hardware bis zu 4.000 Token pro Sekunde. Drittanbieter verzeichnen messbare Leistungssprünge.

#OpenAI #ResponsesAPI #Cerebras #LLM #News
https://www.all-ai.de/news/news26/openai-api-speed

#openai #responsesapi #cerebras #llm #news

hasamba @[email protected] · 2026-02-12 · 18:29 UTC

----------------

🎯 AI
===================

Executive summary: OpenAI introduces three agentic primitives to make long-horizon, knowledge-work agents practical: reusable Skills (with a SKILL.md manifest), an upgraded Shell execution environment (hosted containers accessible via the Responses API), and server-side compaction for maintaining long conversational context.

Technical details:
• Skills are versioned bundles of files plus a SKILL.md manifest exposing metadata (name, description, path) that the model uses to decide invocation.
• The Shell tool provides a real terminal environment implemented as OpenAI-hosted containers (or local shells with identical semantics) that support stateful runs, tool calls, multi-turn continuations, and artifact creation.
• Compaction operates server-side to compress conversation history when context thresholds are crossed; two modes exist: automatic in-stream compaction and an explicit /responses/compact endpoint for manual control.

How it works conceptually:
• Routing: the model reads skill metadata (name, description, path) and only loads SKILL.md when it decides to invoke the skill, reducing prompt bloat.
• Execution: hosted Shell runs through the Responses API, coupling tool execution and artifact storage with the agent’s conversation state.
• Continuity: compaction preserves semantic continuity by reducing token footprint while retaining essential context for long workflows.

Best practices and patterns observed:
• Skill descriptions should be written as routing logic: concrete “Use when” vs. “Don’t use when,” explicit outputs, and success criteria.
• Include negative examples and edge cases in skill metadata to reduce misfires; Glean reported an initial ~20% drop in correct triggers that recovered after adding negative examples.
• Keep stable procedures in Skills (versioned playbooks) and leave dynamic planning in the model to avoid prompt spaghetti.

Limitations and considerations:
• Skills require careful authoring of routing metadata to avoid misrouting.
• Hosted Shell access implies controlled internet/process boundaries; security and audit controls around hosted execution environments are critical in deployment designs.

🔹 agents #skills #compaction #ResponsesAPI

🔗 Source: https://developers.openai.com/blog/skills-shell-tips/

#responsesapi #compaction #skills

michabbb @[email protected] · 2025-09-23 · 21:19 UTC

🛡️ Enterprise-grade security with proper authentication handling, structured responses containing tool discovery, reasoning steps, and execution results

💰 Simple pricing model - pay only for tokens consumed by selected #GroqCloud model, bring your own #MCP server and API key with third-party fees billed directly

🔗 Works with #ResponsesAPI (native #MCP support) and #ChatCompletions API (retrofitted). #ResponsesAPI recommended for multi-step workflows and approval controls

#groqcloud #mcp #responsesapi #chatcompletions

michabbb @[email protected] · 2025-09-23 · 21:19 UTC

🛡️ Enterprise-grade security with proper authentication handling, structured responses containing tool discovery, reasoning steps, and execution results

💰 Simple pricing model - pay only for tokens consumed by selected #GroqCloud model, bring your own #MCP server and API key with third-party fees billed directly

🔗 Works with #ResponsesAPI (native #MCP support) and #ChatCompletions API (retrofitted). #ResponsesAPI recommended for multi-step workflows and approval controls

#groqcloud #mcp #responsesapi #chatcompletions

michabbb @[email protected] · 2025-09-23 · 21:19 UTC

🛡️ Enterprise-grade security with proper authentication handling, structured responses containing tool discovery, reasoning steps, and execution results

💰 Simple pricing model - pay only for tokens consumed by selected #GroqCloud model, bring your own #MCP server and API key with third-party fees billed directly

🔗 Works with #ResponsesAPI (native #MCP support) and #ChatCompletions API (retrofitted). #ResponsesAPI recommended for multi-step workflows and approval controls

#groqcloud #mcp #responsesapi #chatcompletions

michabbb @[email protected] · 2025-09-23 · 21:19 UTC

🛡️ Enterprise-grade security with proper authentication handling, structured responses containing tool discovery, reasoning steps, and execution results

💰 Simple pricing model - pay only for tokens consumed by selected #GroqCloud model, bring your own #MCP server and API key with third-party fees billed directly

🔗 Works with #ResponsesAPI (native #MCP support) and #ChatCompletions API (retrofitted). #ResponsesAPI recommended for multi-step workflows and approval controls

#chatcompletions #responsesapi #mcp #groqcloud

michabbb @[email protected] · 2025-09-23 · 21:19 UTC

🛡️ Enterprise-grade security with proper authentication handling, structured responses containing tool discovery, reasoning steps, and execution results

💰 Simple pricing model - pay only for tokens consumed by selected #GroqCloud model, bring your own #MCP server and API key with third-party fees billed directly

🔗 Works with #ResponsesAPI (native #MCP support) and #ChatCompletions API (retrofitted). #ResponsesAPI recommended for multi-step workflows and approval controls

#groqcloud #mcp #responsesapi #chatcompletions

tech news ᳇ eicker.news @[email protected] · 2025-05-22 · 18:29 UTC

»#OpenAI updates its new #ResponsesAPI rapidly with #MCP support, #GPT4o native #image gen, and more #enterprisefeatures.« https://venturebeat.com/programming-development/openai-updates-its-new-responses-api-rapidly-with-mcp-support-gpt-4o-native-image-gen-and-more-enterprise-features/?eicker.news #tech #media #news

#openai #responsesapi #mcp #gpt4o #image #enterprisefeatures

tech news ᳇ eicker.news @[email protected] · 2025-05-22 · 18:29 UTC

»#OpenAI updates its new #ResponsesAPI rapidly with #MCP support, #GPT4o native #image gen, and more #enterprisefeatures.« https://venturebeat.com/programming-development/openai-updates-its-new-responses-api-rapidly-with-mcp-support-gpt-4o-native-image-gen-and-more-enterprise-features/?eicker.news #tech #media #news

#openai #responsesapi #mcp #gpt4o #image #enterprisefeatures

tech news ᳇ eicker.news @[email protected] · 2025-05-22 · 18:29 UTC

»#OpenAI updates its new #ResponsesAPI rapidly with #MCP support, #GPT4o native #image gen, and more #enterprisefeatures.« https://venturebeat.com/programming-development/openai-updates-its-new-responses-api-rapidly-with-mcp-support-gpt-4o-native-image-gen-and-more-enterprise-features/?eicker.news #tech #media #news

#openai #responsesapi #mcp #gpt4o #image #enterprisefeatures

tech news ᳇ eicker.news @[email protected] · 2025-05-22 · 18:29 UTC

»#OpenAI updates its new #ResponsesAPI rapidly with #MCP support, #GPT4o native #image gen, and more #enterprisefeatures.« https://venturebeat.com/programming-development/openai-updates-its-new-responses-api-rapidly-with-mcp-support-gpt-4o-native-image-gen-and-more-enterprise-features/?eicker.news #tech #media #news

#news #media #tech #enterprisefeatures #image #gpt4o

tech news ᳇ eicker.news @[email protected] · 2025-05-22 · 18:29 UTC

»#OpenAI updates its new #ResponsesAPI rapidly with #MCP support, #GPT4o native #image gen, and more #enterprisefeatures.« https://venturebeat.com/programming-development/openai-updates-its-new-responses-api-rapidly-with-mcp-support-gpt-4o-native-image-gen-and-more-enterprise-features/?eicker.news #tech #media #news

#openai #responsesapi #mcp #gpt4o #image #enterprisefeatures