Artificial General Intelligence News — where AI meets human insight. Covering breakthroughs, debates, and the shifting balance between machines and people.

Tech

Technology shaping the modern world.

From the Startup Battlefield stage to the International Space Station: geCKo Materials built a sticky product

This week on Build Mode, Isabelle Johannessen speaks with Capella Kerst, founder and CEO of geCKo Materials.

5 days ago

Netflix co-founder and chair Reed Hastings to leave board

Hastings helped to transform the video rental industry — first with physical, then digital, delivery.

5 days ago

Your agent needs a SOUL.md you can't write from scratch. I built a 45-minute prompt that writes it for you.

The most expensive problem in the agent space right now isn’t infrastructure, model selection, or security. It’s that the people adopting these tools can’t describe what they actually do all day — not in the resolution an agent needs.

5 days ago

You're Spending Six Figures on AI Models. The Bottleneck Is a 4-Minute CI Pipeline

tech

Human

You're Spending Six Figures on AI Models. The Bottleneck Is a 4-Minute CI Pipeline

For fifty years, every piece of software was built for you. The compiler waits because you need time to read the output. The API paginates because that’s what fits on your screen. The test framework takes five seconds to start because you weren’t going to act on the results any faster. Every timeout, every rate limit, every login screen was calibrated to the pace of a brain that processes about 3 bits per second. Nobody decided to make it slow. You were always the slowest thing in the system, and everything else just had to keep up with you. Which wasn’t hard.

6 days ago

Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro

tech

Human

Claude Opus 4.7 leads on SWE-bench and agentic reasoning, beating GPT-5.4 and Gemini 3.1 Pro

In short: Anthropic has released Claude Opus 4.7, its most capable generally available model, with benchmark-leading scores on SWE-bench Pro (64.3% vs GPT-5.4’s 57.7%), multi-agent coordination for hours-long workflows, 3x higher image resolution, and a 14% improvement in multi-step agentic reasoning with a third of the tool errors. Priced at $5/$25 per million tokens, it is available across Claude plans and through Amazon Bedrock, Vertex AI, and Microsoft Foundry.

5 days ago

(2)

UK banks get their Mythos briefing within days

tech

Human

UK banks get their Mythos briefing within days

The Bank of England’s Cross Market Operational Resilience Group will convene within days to brief major UK banks, insurers, and exchanges about Anthropic’s Claude Mythos Preview.

5 days ago

(10)