Articles

Showing posts from July 2025

LangChain’s Align Evals closes the evaluator trust gap with prompt-level calibration

LangChain lets enterprises build and calibrate an evaluator model so that its scores track human preferences more closely.

‘Subliminal learning’: Anthropic uncovers how AI fine-tuning secretly teaches bad habits

A common AI fine-tuning practice could be unintentionally poisoning your models with hidden biases and risks, a new Anthropic study warns.

Shadow AI adds $670K to breach costs while 97% of enterprises skip basic access controls, IBM reports

IBM's 2025 Cost of a Data Breach Report reveals that breaches involving unauthorized AI tools now average $4.63M.

Mark Zuckerberg says ‘developing superintelligence is now in sight,’ shades OpenAI and other firms focused on automating work

So perhaps, these competing visions of superintelligence are actually far more similar than they are opposed.

C8 Health started with an AI that gives anesthesiologists guidance on demand — now it’s targeting whole hospitals

A friendly red panda avatar serves up the knowledge from the organization's own siloed databases, complete with citations.

Google DeepMind says its new AI can map the entire planet with unprecedented accuracy

Google DeepMind unveils AlphaEarth Foundations, an AI system that processes satellite data 16x more efficiently to create detailed Earth maps for tracking deforestation, climate change, and environmental shifts.

Runloop lands $7M to power AI coding agents with cloud-based devboxes

Runloop raises $7M seed funding to solve the "production gap" for AI coding agents, providing enterprise infrastructure that helps companies deploy autonomous coding assistants six months faster than building in-house solutions.

Nightfall launches ‘Nyx,’ an AI that automates data loss prevention at enterprise scale

Nightfall AI launches Nyx, the first autonomous data loss prevention platform using AI to cut false alerts by 90% and protect enterprise data from insider threats and ChatGPT leaks.

How can enterprises keep systems safe as AI agents join human employees? Cyata launches with a new, dedicated solution

The growing use of AI agents isn’t limited to technical teams. While developers were an early audience, Cyata quickly realized adoption was broader.

AI vs. AI: Prophet Security raises $30M to replace human analysts with autonomous defenders

Prophet Security raises $30 million to launch a fully autonomous AI cybersecurity platform that investigates and responds to threats without human intervention, promising 10x faster response times and 96% fewer false positives.

Arcee opens up new enterprise-focused, customizable AI model AFM-4.5B trained on ‘clean, rigorously filtered data’

The model is geared toward Arcee's growing list of enterprise customers and their needs, specifically the need for a model trained without violating IP.

Positron believes it has found the secret to take on Nvidia in AI inference chips — here’s how it could benefit enterprises

The company’s first-generation chips were fabricated in the U.S. using Intel facilities, with final server assembly and integration.

ChatGPT just got smarter: OpenAI’s Study Mode helps students learn step-by-step

OpenAI launches ChatGPT Study Mode, transforming AI from an answer engine into a Socratic tutor that guides students through problems step-by-step rather than providing direct solutions.

Stack Overflow data reveals the hidden productivity tax of ‘almost right’ AI code

Stack Overflow survey shows that as more enterprise developers actually use AI tools, their expectations aren't being met by reality.

Writer launches a ‘super agent’ that actually gets sh*t done, outperforms OpenAI on key benchmarks

Writer launches Action Agent, an autonomous AI that executes complex enterprise tasks across 600+ tools, outperforming OpenAI on key benchmarks in the $114B enterprise AI market.

Sparrow raises $35M Series B to automate the employee leave management nightmare

Sparrow raises $35M Series B to scale its AI-powered employee leave management platform, which has grown 14x since 2021, serving 1,000+ companies and saving $200M in payroll costs.

Chinese startup Z.ai launches powerful open source GLM-4.5 model family with PowerPoint creation

GLM-4.5’s launch gives enterprise teams a viable, high-performing foundation model they can control, adapt, and scale.

No more links, no more scrolling—The browser is becoming an AI Agent

With rumors about a GPT-native browser, search is shifting from finding information to fulfilling tasks. No more links, no more scrolling.

Anthropic throttles Claude rate limits, devs call foul

Blaming users who run Claude Code 24/7, Anthropic instituted weekly rate limits for some Claude users, resulting in backlash on social media.

How E2B became essential to 88% of Fortune 100 companies and raised $21 million

AI infrastructure startup E2B secures $21 million in funding with an 88% Fortune 100 adoption rate, powering secure AI agent deployments at scale.

When progress doesn’t feel like home: Why many are hesitant to join the AI migration

What happens if the AI migration accelerates and sizable portions of the workforce are slow to move out of fear, resistance or inability?

Why AI is making us lose our minds (and not in the way you’d think)

The question isn’t, “will you use AI?” The question is, “what kind of AI user do you want to be: driver or passenger?”

Meta announces its Superintelligence Labs Chief Scientist: former OpenAI GPT-4 co-creator Shengjia Zhao

The move underscores Meta’s strategy of spending aggressively now to secure a dominant position in what it views as the next foundational technology platform.

New AI architecture delivers 100x faster reasoning than LLMs with just 1,000 training examples

Hierarchical Reasoning Models (HRM) tackle complex reasoning tasks while being smaller, faster, and more data-efficient than large AI models.

CoSyn: The open-source tool that’s making GPT-4V-level vision AI accessible to everyone

Researchers at the University of Pennsylvania and the Allen Institute for Artificial Intelligence have developed a groundbreaking tool that allows open-source AI systems to match or surpass the visual understanding capabilities of proprietary models like GPT-4V and Gemini 1.5 Flash, potentially reshaping the competitive landscape between open and c…

It’s Qwen’s summer: new open source Qwen3-235B-A22B-Thinking-2507 tops OpenAI, Gemini reasoning models on key benchmarks

The new Qwen3-Thinking-2507, as we'll call it for short, now leads or closely trails top-performing models across several major benchmarks.

Anthropic unveils ‘auditing agents’ to test for AI misalignment

Anthropic developed its auditing agents while testing Claude Opus 4 for alignment issues.

Freed says 20,000 clinicians are using its medical AI transcription ‘scribe,’ but competition is rising fast

Rather than chase enterprise contracts with large hospital systems, Freed has focused on small clinics and solo practitioners.

White House plan signals “open-weight first” era—and enterprises need new guardrails

Enterprises will not see immediate impact from the AI Action Plan, but it signals wider support for open-source models and evaluations.

SecurityPal combines AI and experts in Nepal to speed enterprise security questionnaires by 87X or more

The Kathmandu center of excellence gives SecurityPal a cost base low enough to keep humans in the loop while staying price-competitive.

Qwen3-Coder-480B-A35B-Instruct launches and it ‘might be the best coding model yet’

Developers can define custom tools and let Qwen3-Coder dynamically invoke them during conversation or code generation tasks.

Early Anthropic hire raises $15M to insure AI agents and help startups deploy safely

Early Anthropic hire raises $15M for AIUC to insure AI agents, helping enterprises deploy artificial intelligence securely with risk coverage and safety standards.

Mixture-of-recursions delivers 2x faster inference—Here’s how to implement it

Mixture-of-Recursions (MoR) is a new AI architecture that promises to cut LLM inference costs and memory use without sacrificing performance.

Anthropic researchers discover the weird AI problem: Why thinking longer makes models dumber

Anthropic research reveals AI models perform worse with extended reasoning time, challenging industry assumptions about test-time compute scaling in enterprise deployments.

Intuit brings agentic AI to the mid-market saving organizations 17 to 20 hours a month

Intuit explains how it is addressing the needs of the mid-market with a new series of agentic AI experiences.

Open-source MCPEval makes protocol-level agent testing plug-and-play

Researchers from Salesforce unveiled MCPEval, a new method to evaluate AI agent performance and tool use within MCP servers.

Alibaba’s new open source Qwen3-235B-A22B-2507 beats Kimi K2 and offers low compute version

Teams can scale Qwen3’s capabilities to single-node GPU instances or local development machines, avoiding the need for massive GPU clusters.

CrowdStrike’s massive cyber outage one year later: lessons enterprises can learn to improve security

The incident's legacy extends far beyond CrowdStrike. Organizations now implement staged rollouts and maintain manual override capabilities.

Google DeepMind makes AI history with gold medal win at world’s toughest math competition

Google DeepMind's Gemini AI won a gold medal at the International Mathematical Olympiad by solving complex math problems using natural language, marking a breakthrough in AI reasoning and human-level performance.

Chinese startup Manus challenges ChatGPT in data visualization: which should enterprises use?

While Manus handles messy data better than ChatGPT, neither tool yet produces boardroom-ready slides.

A ChatGPT ‘router’ that automatically selects the right OpenAI model for your job appears imminent

Like going to the supermarket and staring at aisles of cereal and sauces, the average ChatGPT user is currently faced with an overabundance.

Weaving reality or warping it? The personalization trap in AI systems

Each of our versions of reality is changing with AI. This could erode our ability to agree on basic facts or navigate shared challenges.

5 key questions your developers should be asking about MCP

It’s MCP projects in production, not specification elegance or market buzz, that will determine if MCP (or something else) stays on top.

New embedding model leaderboard shakeup: Google takes #1 while Alibaba’s open source alternative closes gap

Google's new Gemini Embedding model now leads the MTEB benchmark. But it is facing fierce competition from closed and open source rivals.

How OpenAI’s red team made ChatGPT agent into an AI fortress

Discover OpenAI's red team blueprint: How 110 coordinated attacks and 7 exploit fixes created ChatGPT Agent's revolutionary 95% security defense system.

Meet AnyCoder, a new Kimi K2-powered tool for fast prototyping and deploying web apps

For novice developers or even those with expertise who want to spin up a new project fast, AnyCoder seems like a great place to start.

Salesforce used AI to cut support load by 5% — but the real win was teaching bots to say ‘I’m sorry’

Salesforce reached 1 million AI-powered customer conversations, showcasing breakthroughs in enterprise automation, AI empathy, and next-generation customer service.

Mistral’s Le Chat adds deep research agent and voice mode to challenge OpenAI’s enterprise dominance

Mistral added deep research capabilities to its Le Chat platform, bringing it into direct competition with ChatGPT and Gemini.

OpenAI unveils ‘ChatGPT agent’ that gives ChatGPT its own computer to autonomously use your email and web apps, download and create files for you

If a website needs you to log in, you can do that securely through a special browser view, which lets the agent dig deeper and handle more.

Blaxel raises $7.3M seed round to build ‘AWS for AI agents’ after processing billions of agent requests

Blaxel raises $7.3M seed funding to build specialized cloud infrastructure for AI agents, challenging AWS with a purpose-built platform for autonomous AI systems.