← All news

AI News — June 2, 2026

#models #agents

Microsoft Build 2026 opens in San Francisco: Windows becomes an agent platform, Copilot Workspace ships, and Project Polaris — Microsoft's own coding model — moves to replace GPT-4 Turbo in GitHub Copilot.

Model releases

  • [2026-06-02] Microsoft — at Build, unveiled Project Polaris, an in-house coding model set to replace GPT-4 Turbo as the default in GitHub Copilot starting August (automatic migration, optional ~3-month GPT-4 fallback). Reported as a mixture-of-experts design tuned per language — claimed gains on HumanEval/MBPP and in low-resource languages like Rust/Haskell — running on Microsoft’s custom Maia accelerators in Azure to cut per-inference latency and cost. It’s Microsoft’s clearest move yet to reduce its OpenAI dependence inside its flagship dev tool. (official, source, source)

    For Copilot users this is mostly invisible plumbing — the same editor, a different engine — but the practical thing to note is that your default completions are slated to swap models mid-summer, so any prompt habits or quirks you’ve tuned against GPT-4 Turbo may shift once Polaris becomes the default.

  • [2026-06-02] Microsoft — Build also introduced the Aion 1.0 on-device model family for Windows. Aion 1.0 Plan (a 14B reasoning-and-tool-calling model, 32K context) ships in-box on capable PCs so apps can reason over user intent, invoke tools, manage files, and orchestrate sub-agents locally — full agentic workflows on-device, rolling out “in the coming months.” Alongside it, Aion 1.0 Instruct is a smaller/faster SLM (preview, to be released as open weights) that succeeds the current Windows OS SLM for everyday text intelligence and extends into Edge. Where Polaris is the cloud Copilot brain, Aion pushes agentic inference down to the device. (official, source)

    The practical hook here is hardware: on-device agentic inference means no per-token cloud cost and data that never leaves the machine, but it only runs on “capable PCs,” so expect these features to land unevenly across a fleet rather than everywhere at once.

Coding agents

  • [2026-06-02] Microsoft — Copilot Workspace exits beta and gains an Agent Mode where Copilot acts as a meta-agent: describe a workflow and it designs, provisions, and monitors a swarm of sub-agents to execute it — pushing Copilot from synchronous assistant toward async, long-running “coworker.” (official, source)

    The shift from synchronous to long-running agents changes how you supervise work: you review outcomes from a swarm rather than steering each step, which puts a premium on writing a tight task spec up front and good guardrails around what the agents can touch.

    For Software Developers: Rather than prompting Copilot file-by-file for a multi-step change, you can describe the whole workflow once — e.g. add a feature flag across several services and wire up its tests — and have it provision and monitor sub-agents that carry out and report back on each piece, leaving you to review the assembled result.

  • [2026-06-02] Microsoft — Agent Mode is now the default across Office 365 Copilot apps (Word, Excel, PowerPoint), reframing the assistant as an executor of multi-step tasks rather than a prompt-by-prompt helper. (official, source)

    Making this the default rather than an opt-in matters because non-technical colleagues will now hand multi-step tasks to Copilot as a matter of course — so for anyone downstream, expect more agent-generated documents and spreadsheets flowing into review, and worth setting expectations about checking them.

  • [2026-06-01] OpenAI / AWS — GPT-5.5, GPT-5.4, and Codex reach GA on Amazon Bedrock (commercial and GovCloud regions, out of April’s limited preview), with inference routed through Bedrock under customers’ existing IAM/VPC/encryption controls and pricing matched to OpenAI first-party rates. It hands AWS-resident enterprises a governed path to Codex (App, CLI, IDE) without leaving their cloud — though Codex doesn’t yet support Bedrock Mantle endpoints in GovCloud. (official, official)

    For teams already standardized on Bedrock, this removes the usual blocker to adopting Codex — data-residency and procurement sign-off — because nothing has to leave the existing AWS account boundary to use it.

    For Solution Architects: A team whose data-residency policy forbids calling OpenAI’s first-party API can now stand up Codex (App, CLI, IDE) with all traffic staying inside their existing Bedrock account, reusing the same IAM roles, VPC endpoints, and encryption controls that already gate their other Bedrock models — rather than provisioning a separate egress path and key-management story for an outside vendor.

AI-assisted SDLC

  • [2026-06-02] Microsoft — Build positions Windows as an agent platform: the Windows Agent Framework (WAF) is open-sourced under MIT (abstracting agent lifecycle/services), alongside a new Windows Agent Store, WSL 3, and Azure Agent Mesh — a control plane that federates agent execution across on-prem Windows servers, Windows 365 Cloud PCs, and Azure Arc edge devices, routing tasks to the nearest node (consumption-based, GA targeted Q4 2026). This is the concrete delivery of the WAF open-sourcing flagged in last week’s preview. (official, source, source)

    Open-sourcing the framework under MIT rather than shipping it as a closed product is a bid to make Windows the default substrate for agents the way it was for desktop apps; for developers it means you can build against a documented agent lifecycle without betting your code on a proprietary, lock-in runtime.

  • [2026-06-02] Microsoft — Azure AI Foundry Agent Service hits GA as a next-gen managed runtime built on the OpenAI Responses API (wire-compatible with OpenAI agents) and open to models from DeepSeek, xAI, Meta and others, with production SDKs for Python/JS/Java/.NET. The Microsoft Foundry portal (ai.azure.com) also went GA as the unified build-and-govern surface, and an Agent Orchestrator for load-balancing across thousands of agents is previewed for August. It’s the managed-service complement to the device-spanning Azure Agent Mesh above — the build/runtime layer versus the federation control plane. (official, source)

    GA is the signal enterprises wait for before putting agents into production — it brings the support and stability commitments preview doesn’t — and the multi-model openness (DeepSeek, xAI, Meta) means you’re not locked to a single provider’s roadmap on the runtime.

    For Software Developers: Because the service is wire-compatible with the OpenAI Responses API, a team with an existing OpenAI-agents app can repoint it at Foundry Agent Service using the Python or .NET SDK and swap the underlying model to, say, a Meta or DeepSeek one — without rewriting their agent orchestration code.

Watch list

  • Microsoft Build day two (June 3): more agent/Foundry sessions expected; watch for the in-box Aion 1.0 Plan availability timeline and an official model card to firm up beyond “coming months.”

    Worth watching because a firm date and a model card would turn Aion from a keynote demo into something teams can actually plan device-side adoption around.

  • Project Polaris → GitHub Copilot migration (August 2026): watch for an official model card, independent benchmarks, and how the GPT-4 Turbo fallback window actually works in practice.

    The thing to watch is independent benchmarks — vendor HumanEval/MBPP numbers rarely survive contact with real codebases — and whether the fallback genuinely lets teams pin GPT-4 Turbo if Polaris regresses on their stack.

  • Anthropic Mythos-class GA: Anthropic has said it expects to bring Mythos-class models (the security-strong frontier model behind Project Glasswing) to all customers “in the coming weeks” — no firm date yet. (source, source)

    Significant when it lands because it would put a security-hardened frontier model in general hands; until there’s a date, though, this is a stated intention rather than something to plan against.

  • Anthropic developer billing split (June 15): Claude Agent SDK, claude -p, and GitHub Actions usage reportedly move off subscriptions onto a separate monthly credit pool (~$20–$200); Anthropic has not detailed it on an official page yet. (source) (unconfirmed)

    Should this pan out, it would change the cost model for anyone running Claude in CI or automation — subscription seats would no longer cover programmatic usage — so it’s worth pricing out before the date even while it’s unconfirmed.

  • OpenAI GPT-5.6: only leak signals (Codex log traces, prediction-market odds) so far; no model card or date — watched for a late-spring drop. (source) (unconfirmed)

    Only worth acting on once there’s a model card; leak traces and prediction-market odds aren’t a basis for a roadmap, so this stays a curiosity until OpenAI confirms anything.

  • MCP spec 2026-07-28 final: stateless core and MCP Apps move to stable; server operators will need to migrate off sticky sessions.

    The significance for operators is the migration work it implies: once the stateless core is stable, anything relying on sticky sessions will need reworking, so it’s worth auditing your MCP deployments ahead of the date.