30 results for "ai agents"
OpenAI’s Symphony spec pushes coding agents from prompts to orchestration - InfoWorld
Show HN: Delegare – let AI agents pay safely (x402, AP2 – base/USDC and Stripe)
Hi guys, I am building SecureLend.ai, and when working on our underwriting agents (free trial, paid after) I had issues with seamless payment options. Of course I looked at x402 which I believe is a grea…
I Almost Went Broke Letting AI Agents Work for Me
AI agents are powerful, but they can also be expensive in a very quiet way. When I use a normal…
OpenAI releases Symphony, an open-source spec for agent orchestration that turns a project-management board like Linear into a control plane for coding agents (OpenAI)
OpenAI: Six months ago, while working …
AI Identity: Standards, Gaps, and Research Directions for AI Agents
AI agents are now running real transactions, workflows, and sub-agent chains across organizational boundaries without continuous human supervision. This creates a problem no current infrastructure is …
Structural Enforcement of Goal Integrity in AI Agents via Separation-of-Powers Architecture
Recent evidence suggests that frontier AI systems can exhibit agentic misalignment, generating and executing harmful actions derived from internally constructed goals, even without explicit user reque…
MarketBench: Evaluating AI Agents as Market Participants
Markets are a promising way to coordinate AI agent activity for similar reasons to those used to justify markets more broadly. In order to effectively participate in markets, agents need to have infor…
Failure-Centered Runtime Evaluation for Deployed Trilingual Public-Space Agents
This paper presents PSA-Eval, a failure-centered runtime evaluation framework for deployed trilingual public-space agents. The central claim is that, when the evaluation object shifts from a static in…
AgentPulse: A Continuous Multi-Signal Framework for Evaluating AI Agents in Deployment
Static benchmarks measure what AI agents can do at a fixed point in time but not how they are adopted, maintained, or experienced in deployment. We introduce AgentPulse, a continuous evaluation framew…
Governing What You Cannot Observe: Adaptive Runtime Governance for Autonomous AI Agents
Autonomous AI agents can remain fully authorized and still become unsafe as behavior drifts, adversaries adapt, and decision patterns shift without any code change. We propose the Informationa…
77% of IT managers say their AI agents are out of control - 5 ways to rein in yours
The unchecked proliferation of AI agents is leading to a large volume of unsanctioned AI applications.…
Show HN: 49Agents – Infinite canvas IDE for AI agents
Open-source 2D IDE for managing AI agents in native CLIs, terminal, gits, beads issues, and files across multiple projects and machines. Self-host on a single machine via localhost OR host on a clu…
Thinking Outside the Box: New Attack Surfaces in Sandboxed AI Agents
Discover how sandboxed AI agents remain vulnerable to AI-native attacks, enabling data exfiltration and configuration poisoning despite strict policies.…
A new Moore's Law for AI agents
The length of tasks that agents can do is growing exponentially…
Raiders undrafted free agents tracker 2026: Every reported move for Las Vegas
The Las Vegas Raiders have completed the 2026 NFL Draft and will now look to sign players who did not get picked. Follow our tracker for the latest updates on every reported undrafted free-agent signi…
Why Your AI Agents Keep Breaking Your Workflows
Effective Context Engineering for AI Agents: A Developer's Guide
‘You feel radicalized’: A Meta AI exec watched agents beat her top workers. Now she’s built a nonprofit to help Gen Z find jobs before they disappear
AgentCheck – Pytest for AI Agents
OpenAI could be making a phone with AI agents replacing apps
The phone could go into mass production in 2028, an analyst says.…
An OpenAI-linked news outlet appears to be entirely AI-generated - The site uses AI agents to reach out to sources and is openly critical of AI skeptics.
Multi-Agent AI Systems Are Eating Single Agents
Single-agent architectures hit a wall the moment your task needs planning, research, and execution in parallel. Multi-agent systems solve this — but most tutorials skip the hard parts. This guide does…
We ran a small multi-agent sandbox (~20 agents) and started seeing unexpected social behaviors
We’ve been running a small sandbox with fewer than 20 AI agents, each with persistent identity and the ability to post and interact in a shared environment. What’s interesting is that some behaviors s…
Show HN: Minimal Linux sandboxes to manage AI-Generated Code with ease
Minimal Linux sandboxes for running untrusted code. Built for AI agents, build systems, and any scenario where you need to execute code you didn't write.…
Humanitarian aid turns to AI as crises outpace capacity
Purpose-designed AI agents with a focus on safety can provide critical assistance to vulnerable populations.…
Show HN: I built a way to see if your SDK is AI-friendly
Have you ever wondered if your SDK is friendly to agentic AI like Claude Code or Codex? I built an open-source (Apache 2.0) CLI that answers that question for you. With it you can create a test suite ei…
Show HN: The newsroom that runs itself; hiring AI Journalists [TokenToday]
TokenToday is a live news channel where every story is researched, written, and reviewed by AI agents, no human editors. Agents register via API, submit stories in Markdown, go through a multi-agent e…
Mitigating Belief Inertia via Active Intervention in Embodied Agents
Recent advancements in large language models (LLMs) have enabled agents to tackle complex embodied tasks through environmental interaction. However, these agents still make suboptimal decisions and pe…
moved to a new domain, added some tools, and created a category i didn't expect to need
tolop.space got a few updates this week. new domain first, was on a vercel subdomain before, moved to .space because the whole idea of the site is finding tools in a new space. felt like the right fit…
Mark Zuckerberg is building an AI clone of himself. Most people just need help with their inbox
From three-person startups hitting $500K ARR to Meta's CEO avatar project, AI agents are reshaping what a "team" looks like.…