Anthropic to disable its most advanced AI models after US order limiting foreign access - The Guardian
"artificial intelligence" - Google News

Anthropic Pulls Its Most Powerful AI Models After U.S. Bars Foreign Access - Time Magazine
Anthropic Blocks Foreigners From Using Mythos and Fable AI - The New York Times

Built to benefit everyone: our plan
OpenAI News

A vision for the future of AI, focusing on access, safety, and shared prosperity as OpenAI works to ensure AGI benefits everyone.

Netflix Adds ChatGPT-Powered AI to Stop You From Scrolling Forever
DailyAI

In a bold move to tackle one of streaming’s biggest frustrations, endless scrolling, Netflix just unveiled a major redesign of its TV and mobile apps featuring a ChatGPT-powered AI chatbot and TikTok-style video reels. You’ll soon be able to tell Netflix in plain language what you’re in the mood for, such as “funny and fast-paced” or “dark thrillers with strong female leads,” and get instant, tailored recommendations. Netflix is partnering with OpenAI to power this feature, part of a broader overhaul aimed at making content discovery faster, more intuitive, and (finally) less painful.

What’s changing

Conversational AI Search: Powered by OpenAI, this

Jeff Bezos’s Prometheus raises $12B to build an ‘artificial general engineer’ for the physical world
AI News & Artificial Intelligence | TechCrunch

The new round values the physical AI startup that aims to automate heavy engineering and drug design at $41 billion.

Railway secures $100 million to challenge AWS with AI-native cloud infrastructure
AI | VentureBeat

Railway, a San Francisco-based cloud platform that has quietly amassed two million developers without spending a dollar on marketing, announced Thursday that it raised $100 million in a Series B funding round, as surging demand for artificial intelligence applications exposes the limitations of legacy cloud infrastructure.

TQ Ventures led the round, with participation from FPV Ventures, Redpoint, and Unusual Ventures. The investment values Railway as one of the most significant infrastructure startups to emerge during the AI boom, capitalizing on developer frustration with the complexity and cost of traditional platforms like Amazon Web Services and Google Cloud.

"As AI models get better at writing code, more and more people are asking the age-old question: where, and how, do I run my applications?" said Jake Cooper, Railway's 28-year-old founder and chief executive, in an exclusive interview with VentureBeat. "The last generation of cloud primitives were slow and outdated, and now with AI moving everything faster, teams simply can't keep up."

The funding is a dramatic acceleration for a company that has charted an unconventional path through the cloud computing industry. Railway raised just $24 million in total before this round, including a $20 million Series A from Redpoint in 2022. The company now processes more than 10 million deployments monthly and handles over one trillion requests through its edge network, metrics that rival far larger and better-funded competitors.

Why three-minute deploy times have become unacceptable in the age of AI coding assistants

Railway's pitch rests on a simple observation: the tools developers use to deploy and manage software were designed for a slower era. A standard build-and-deploy cycle using Terraform, the industry-standard infrastructure tool, takes two to three minutes. That delay, once tolerable, has become a critical bottleneck as AI coding assistants like Claude, ChatGPT, and Cursor can generate working code in seconds.

"When godly intelligence is on tap and can solve any problem in three seconds, those amalgamations of systems become bottlenecks," Cooper told VentureBeat. "What was really cool for humans to deploy in 10 seconds or less is now table stakes for agents."

The company claims its platform delivers deployments in under one second, fast enough to keep pace with AI-generated code. Customers report a tenfold increase in developer velocity and up to 65 percent cost savings compared to traditional cloud providers.

These numbers come directly from enterprise clients, not internal benchmarks. Daniel Lobaton, chief technology officer at G2X, a platform serving 100,000 federal contractors, measured deployment speed improvements of seven times faster and an 87 percent cost reduction after migrating to Railway. His infrastructure bill dropped from $15,000 per month to approximately $1,000.

"The work that used to take me a week on our previous infrastructure, I can do in Railway in like a day," Lobaton said. "If I want to spin up a new service and test different architectures, it would take so long on our old setup. In Railway I can launch six services in two minutes."

Inside the controversial decision to abandon Google Cloud and build data centers from scratch

What distinguishes Railway from competitors like Render and Fly.io is the depth of its vertical integration. In 2024, the company made the unusual decision to abandon Google Cloud entirely and build its own data centers, a move that echoes the famous Alan Kay maxim: "People who are really serious about software should make their own hardware."

"We wanted to design hardware in a way where we could build a differentiated experience," Cooper said. "Having full control over the network, compute, and storage layers lets us do really fast build and deploy loops, the kind that allows us to move at 'agentic speed' while staying 100 percent the smoothest ride in town."

The approach paid dividends during recent widespread outages that affected major cloud providers: Railway remained online throughout.

This soup-to-nuts control enables pricing that undercuts the hyperscalers by roughly 50 percent and newer cloud startups by three to four times. Railway charges by the second for actual compute usage: $0.00000386 per gigabyte-second of memory, $0.00000772 per vCPU-second, and $0.00000006 per gigabyte-second of storage. There are no charges for idle virtual machines, a stark contrast to the traditional cloud model where customers pay for provisioned capacity whether they use it or not.

"The conventional wisdom is that the big guys have economies of scale to offer better pricing," Cooper noted. "But when they're charging for VMs that usually sit idle in the cloud, and we've purpose-built everything to fit much more density on these machines, you have a big opportunity."

How 30 employees built a platform generating tens of millions in annual revenue

Railway has achieved its scale with a team of just 30 employees generating tens of millions in annual revenue, a ratio of revenue per employee that would be exceptional even for established software companies. The company grew revenue 3.5 times last year and continues to expand at 15 percent month-over-month.

Cooper emphasized that the fundraise was strategic rather than necessary. "We're default alive; there's no reason for us to raise money," he said. "We raised because we see a massive opportunity to accelerate, not because we needed to survive."

The company hired its first salesperson only last year and employs just two solutions engineers. Nearly all of Railway's two million users discovered the platform through word of mouth: developers telling other developers about a tool that actually works.

"We basically did the standard engineering thing: if you build it, they will come," Cooper recalled. "And to some degree, they came."

From side projects to Fortune 500 deployments: Railway's unlikely corporate expansion

Despite its grassroots developer community, Railway has made significant inroads into large organizations. The company claims that 31 percent of Fortune 500 companies now use its platform, though deployments range from company-wide infrastructure to individual team projects.

Notable customers include Bilt, the loyalty program company; Intuit's GoCo subsidiary; TripAdvisor's Cruise Critic; and MGM Resorts. Kernel, a Y Combinator-backed startup providing AI infrastructure to over 1,000 companies, runs its entire customer-facing system on Railway for $444 per month.

"At my previous company Clever, which sold for $500 million, I had six full-time engineers just managing AWS," said Rafael Garcia, Kernel's chief technology officer. "Now I have six engineers total, and they all focus on product. Railway is exactly the tool I wish I had in 2012."

For enterprise customers, Railway offers security certifications including SOC 2 Type 2 compliance and HIPAA readiness, with business associate agreements available upon request. The platform provides single sign-on authentication, comprehensive audit logs, and the option to deploy within a customer's existing cloud environment through a "bring your own cloud" configuration.

Enterprise pricing starts at custom levels, with specific add-ons for extended log retention ($200 monthly), HIPAA BAAs ($1,000), enterprise support with SLOs ($2,000), and dedicated virtual machines ($10,000).

The startup's bold strategy to take on Amazon, Google, and a new generation of cloud rivals

Railway enters a crowded market that includes not only the hyperscale cloud providers (Amazon Web Services, Microsoft Azure, and Google Cloud Platform) but also a growing cohort of developer-focused platforms like Vercel, Render, Fly.io, and Heroku.

Cooper argues that Railway's competitors fall into two camps, neither of which has fully committed to the new infrastructure model that AI demands.

"The hyperscalers have two competing systems, and they haven't gone all-in on the new model because their legacy revenue stream is still printing money," he observed. "They have this mammoth pool of cash coming from people who provision a VM, use maybe 10 percent of it, and still pay for the whole thing. To what end are they actually interested in going all the way in on a new experience if they don't really need to?"

Against startup competitors, Railway differentiates by covering the full infrastructure stack. "We're not just containers; we've got VM primitives, stateful storage, virtual private networking, automated load balancing," Cooper said. "And we wrap all of this in an absurdly easy-to-use UI, with agentic primitives so agents can move 1,000 times faster."

The platform supports databases including PostgreSQL, MySQL, MongoDB, and Redis; provides up to 256 terabytes of persistent storage with over 100,000 input/output operations per second; and enables deployment to four global regions spanning the United States, Europe, and Southeast Asia. Enterprise customers can scale to 112 vCPUs and 2 terabytes of RAM per service.

Why investors are betting that AI will create a thousand times more software than exists today

Railway's fundraise reflects broader investor enthusiasm for companies positioned to benefit from the AI coding revolution. As tools like GitHub Copilot, Cursor, and Claude become standard fixtures in developer workflows, the volume of code being written, and the infrastructure needed to run it, is expanding dramatically.

"The amount of software that's going to come online over the next five years is unfathomable compared to what existed before; we're talking a thousand times more software," Cooper predicted. "All of that has to run somewhere."

The company has already integrated directly with AI systems, building what Cooper calls "loops where Claude can hook in, call deployments, and analyze infrastructure automatically." Railway released a Model Context Protocol server in August 2025 that allows AI coding agents to deploy applications and manage infrastructure directly from code editors.

"The notion of a developer is melting before our eyes," Cooper said. "You don't have to be an engineer to engineer things anymore; you just need critical thinking and the ability to analyze things in a systems capacity."

What Railway plans to do with $100 million and zero marketing experience

Railway plans to use the new capital to expand its global data center footprint, grow its team beyond 30 employees, and build what Cooper described as a proper go-to-market operation for the first time in the company's five-year history.

"One of my mentors said you raise money when you can change the trajectory of the business," Cooper explained. "We've built all the required substrate to scale indefinitely; what's been holding us back is simply talking about it. 2026 is the year we play on the world stage."

The company's investor roster reads like a who's who of developer infrastructure. Angel investors include Tom Preston-Werner, co-founder of GitHub; Guillermo Rauch, chief executive of Vercel; Spencer Kimball, chief executive of Cockroach Labs; Olivier Pomel, chief executive of Datadog; and Jori Lallo, co-founder of Linear.

The timing of Railway's expansion coincides with what many in Silicon Valley view as a fundamental shift in how software gets made. Coding assistants are no longer experimental curiosities; they have become essential tools that millions of developers rely on daily. Each line of AI-generated code needs somewhere to run, and the incumbents, by Cooper's telling, are too wedded to their existing business models to fully capitalize on the moment.

Whether Railway can translate developer enthusiasm into sustained enterprise adoption remains an open question. The cloud infrastructure market is littered with promising startups that failed to break the grip of Amazon, Microsoft, and Google. But Cooper, who previously worked as a software engineer at Wolfram Alpha, Bloomberg, and Uber before founding Railway in 2020, seems unfazed by the scale of his ambition.

"In five years, Railway [will be] the place where software gets created and evolved, period," he said. "Deploy instantly, scale infinitely, with zero friction. That's the prize worth playing for, and there's no bigger one on offer."

For a company that built a $100 million business by doing the opposite of what conventional startup wisdom dictates (no marketing, no sales team, no venture hype), the real test begins now. Railway spent five years proving that developers would find a better mousetrap on their own. The next five will determine whether the rest of the world is ready to get on board.
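
To make the per-second rates quoted in the Railway article concrete, here is a rough sketch of how a monthly bill might be estimated from them. Only the three rates come from the article; the function name, the 30-day month, and the assumption that compute is billed only while a service is active while storage is billed around the clock are illustrative guesses, and real invoices may include fees the article does not mention.

```python
# Rates quoted in the article, in US dollars.
USD_PER_GB_SECOND_MEMORY = 0.00000386
USD_PER_VCPU_SECOND = 0.00000772
USD_PER_GB_SECOND_STORAGE = 0.00000006

SECONDS_PER_HOUR = 3600


def estimate_monthly_cost(vcpus, memory_gb, storage_gb,
                          active_hours_per_day, days=30):
    """Hypothetical estimator, not Railway's billing logic.

    Assumes CPU and memory are charged only for active seconds,
    and storage is charged for every second of the period.
    """
    active_seconds = active_hours_per_day * SECONDS_PER_HOUR * days
    stored_seconds = 24 * SECONDS_PER_HOUR * days

    compute_cost = active_seconds * (
        vcpus * USD_PER_VCPU_SECOND
        + memory_gb * USD_PER_GB_SECOND_MEMORY
    )
    storage_cost = stored_seconds * storage_gb * USD_PER_GB_SECOND_STORAGE
    return compute_cost + storage_cost


if __name__ == "__main__":
    # Example workload (made up): 1 vCPU and 2 GB of memory,
    # active 8 hours a day, with 10 GB of storage.
    cost = estimate_monthly_cost(vcpus=1, memory_gb=2, storage_gb=10,
                                 active_hours_per_day=8)
    print(f"Estimated monthly cost: ${cost:.2f}")
```

Under these assumptions the example workload comes to roughly $14.90 for a 30-day month, and a service that is never active and stores nothing costs nothing, which is the "no charges for idle virtual machines" point in numeric form.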

What is Claude Mythos? And how to see it in action with Claude Fable 5
The Zapier Blog

On April 7, 2026, Claude Mythos Preview was officially announced, but it was apparently too dangerous to release. According to Anthropic, Claude Mythos represented a unique cybersecurity threat (they claimed that "the fallout—for economies, public safety, and national security—could be severe.") Instead of releasing Mythos to the general public, they spun up Project Glasswing, a cybersecurity initiative that also involved some big-name companies. The idea was that they'd be able to deploy Mythos

Artificial intelligence is helping Floridians with brain tumors; Here’s how - chronicleonline.com
"artificial intelligence" - Google News

Fluid, natural voice translation with Gemini 3.5 Live Translate
Gemini

Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet.

OpenAI faces investigation from state attorneys general
AI News & Artificial Intelligence | TechCrunch

It's not clear which states are involved, but they're asking about everything from OpenAI's ad policies to its handling of health data.

The 6 best AI governance tools in 2026
The Zapier Blog

I'll never forget the first time my childhood dog betrayed me. Before the incident, she was completely fine alone, knew every trick in the book, and only barked at the mailman and other potential serial killers.  Then came that fateful night. I left for two hours, returning to shredded magazines, ripped couch cushions, destroyed dog toys, and a wagging tail. Let my canine misfortunes be a lesson for your AI endeavors. AI can be useful, fully functional, and your best friend—until the day it isn'

What is generative AI?
The Zapier Blog

If you've tried ChatGPT, Microsoft Copilot, Nano Banana, Grok, or any other AI chatbot or image generator, you've used generative AI (also called GenAI). Over the past few years, huge developments in generative AI and computing power have taken these kinds of tools out of research labs and made them a practical part of everyday life. You've almost definitely used generative AI, but let's dig a little deeper and add some more context. Table of contents: What is generative AI? How does generative

Claude 5: What you need to know about Anthropic's AI models and chatbot
The Zapier Blog

I've been using Claude long enough to remember when the main selling point was that it was a nicer chatbot to talk to than the alternatives. (That's still true, for what it's worth.) But Claude no longer just talks to you about your work; it also does your work for you. You can give Claude a project, head off to make a coffee, and check in occasionally when questions pop up. For enterprises looking to get real productivity gains from AI, Claude has become the default choice. And Claude is equall

Access OpenAI models and Codex through your Oracle cloud commitment
OpenAI News

Access OpenAI models and Codex through Oracle Cloud, using existing commitments to build and deploy AI with enterprise security and governance.

Listen Labs raises $69M after viral billboard hiring stunt to scale AI customer interviews
AI | VentureBeat

Alfred Wahlforss was running out of options. His startup, Listen Labs, needed to hire over 100 engineers, but competing against Mark Zuckerberg's $100 million offers seemed impossible. So he spent $5,000, a fifth of his marketing budget, on a billboard in San Francisco displaying what looked like gibberish: five strings of random numbers.

The numbers were actually AI tokens. Decoded, they led to a coding challenge: build an algorithm to act as a digital bouncer at Berghain, the Berlin nightclub famous for rejecting nearly everyone at the door. Within days, thousands attempted the puzzle. 430 cracked it. Some got hired. The winner flew to Berlin, all expenses paid.

That unconventional approach has now attracted $69 million in Series B funding, led by Ribbit Capital with participation from Evantic and existing investors Sequoia Capital, Conviction, and Pear VC. The round values Listen Labs at $500 million and brings its total capital to $100 million. In nine months since launch, the company has grown annualized revenue by 15x to eight figures and conducted over one million AI-powered interviews.

"When you obsess over customers, everything else follows," Wahlforss said in an interview with VentureBeat. "Teams that use Listen bring the customer into every decision, from marketing to product, and when the customer is delighted, everyone is."

Why traditional market research is broken, and what Listen Labs is building to fix it

Listen's AI researcher finds participants, conducts in-depth interviews, and delivers actionable insights in hours, not weeks. The platform replaces the traditional choice between quantitative surveys, which provide statistical precision but miss nuance, and qualitative interviews, which deliver depth but cannot scale.

Wahlforss explained the limitation of existing approaches: "Essentially surveys give you false precision because people end up answering the same question... You can't get the outliers. People are actually not honest on surveys." The alternative, one-on-one human interviews, "gives you a lot of depth. You can ask follow up questions. You can kind of double check if they actually know what they're talking about. And the problem is you can't scale that."

The platform works in four steps: users create a study with AI assistance, Listen recruits participants from its global network of 30 million people, an AI moderator conducts in-depth interviews with follow-up questions, and results are packaged into executive-ready reports including key themes, highlight reels, and slide decks.

What distinguishes Listen's approach is its use of open-ended video conversations rather than multiple-choice forms. "In a survey, you can kind of guess what you should answer, and you have four options," Wahlforss said. "Oh, they probably want me to buy high income. Let me click on that button versus an open ended response. It just generates much more honesty."

The dirty secret of the $140 billion market research industry: rampant fraud

Listen finds and qualifies the right participants in its global network of 30 million people. But building that panel required confronting what Wahlforss called "one of the most shocking things that we've learned when we entered this industry": rampant fraud.

"Essentially, there's a financial transaction involved, which means there will be bad players," he explained. "We actually had some of the largest companies, some of them have billions in revenue, send us people who claim to be kind of enterprise buyers to our platform and our system immediately detected, like, fraud, fraud, fraud, fraud, fraud."

The company built what it calls a "quality guard" that cross-references LinkedIn profiles with video responses to verify identity, checks consistency across how participants answer questions, and flags suspicious patterns. The result, according to Wahlforss: "People talk three times more. They're much more honest when they talk about sensitive topics like politics and mental health."

Emeritus, an online education company that uses Listen, reported that approximately 20% of survey responses previously fell into the fraudulent or low-quality category. With Listen, they reduced this to almost zero. "We did not have to replace any responses because of fraud or gibberish information," said Gabrielli Tiburi, Assistant Manager of Customer Insights at Emeritus.

How Microsoft, Sweetgreen, and Chubbies are using AI interviews to build better products

The speed advantage has proven central to Listen's pitch. Traditional customer research at Microsoft could take four to six weeks to generate insights. "By the time we get to them, either the decision has been made or we lose out on the opportunity to actually influence it," said Romani Patel, Senior Research Manager at Microsoft. With Listen, Microsoft can now get insights in days, and in many cases, within hours.

The platform has already powered several high-profile initiatives. Microsoft used Listen Labs to collect global customer stories for its 50th anniversary celebration. "We wanted users to share how Copilot is empowering them to bring their best self forward," Patel said, "and we were able to collect those user video stories within a day." Traditionally, that kind of work would have taken six to eight weeks.

Simple Modern, an Oklahoma-based drinkware company, used Listen to test a new product concept. The process took about an hour to write questions, an hour to launch the study, and 2.5 hours to receive feedback from 120 people across the country. "We went from 'Should we even have this product?' to 'How should we launch it?'" said Chris Hoyle, the company's Chief Marketing Officer.

Chubbies, the shorts brand, achieved a 24x increase in youth research participation (growing from 5 to 120 participants) by using Listen to overcome the scheduling challenges of traditional focus groups with children. "There's school, sports, dinner, and homework," explained Lauren Neville, Director of Insights and Innovation. "I had to find a way to hear from them that fit into their schedules."

The company also discovered product issues through AI interviews that might have gone undetected otherwise. Wahlforss described how the AI "through conversations, realized there were like issues with the kids short line, and decided to, like, interview hundreds of kids. And I understand that there were issues in the liner of the shorts and that they were, like, scratchy, quote, unquote, according to the people interviewed." The redesigned product became "a blockbuster hit."

The Jevons paradox explains why cheaper research creates more demand, not less

Listen Labs is entering a massive but fragmented market. Wahlforss cited research from Andreessen Horowitz estimating the market research industry at roughly $140 billion annually, populated by legacy players (some with more than a billion dollars in revenue) that he believes are vulnerable to disruption.

"There are very much existing budget lines that we are replacing," Wahlforss said. "Why we're replacing them is that one, they're super costly. Two, they're kind of stuck in this old paradigm of choosing between a survey or interview, and they also take months to work with."

But the more intriguing dynamic may be that AI-powered research doesn't just replace existing spending; it creates new demand. Wahlforss invoked the Jevons paradox, the economic principle that when technological advances make a resource more efficient to use, overall consumption of that resource tends to rise rather than fall.

"What I've noticed is that as something gets cheaper, you don't need less of it. You want more of it," Wahlforss explained. "There's infinite demand for customer understanding. So the researchers on the team can do an order of magnitude more research, and also other people who weren't researchers before can now do that as part of their job."

Inside the elite engineering team that built Listen Labs before they had a working toilet

Listen Labs traces its origins to a consumer app that Wahlforss and his co-founder built after meeting at Harvard. "We built this consumer app that got 20,000 downloads in one day," Wahlforss recalled. "We had all these users, and we were thinking like, okay, what can we do to get to know them better? And we built this prototype of what Listen is today."

The founding team brings an unusual pedigree. Wahlforss's co-founder "was the national champion in competitive programming in Germany, and he worked at Tesla Autopilot." The company claims that 30% of its engineering team are medalists from the International Olympiad in Informatics, the same competition that produced the founders of Cognition, the AI coding startup.

The Berghain billboard stunt generated approximately 5 million views across social media, according to Wahlforss. It reflected the intensity of the talent war in the Bay Area.

"We had to do these things because some of our, like early employees, joined the company before we had a working toilet," he said. "But now we fixed that situation."

The company grew from 5 to 40 employees in 2024 and plans to reach 150 this year. It hires engineers for non-engineering roles across marketing, growth, and operations, a bet that in the AI era, technical fluency matters everywhere.

Synthetic customers and automated decisions: what Listen Labs is building next

Wahlforss outlined an ambitious product roadmap that pushes into more speculative territory. The company is building "the ability to simulate your customers, so you can take all of those interviews we've done, and then extrapolate based on that and create synthetic users or simulated user voices."

Beyond simulation, Listen aims to enable automated action based on research findings. "Can you not just make recommendations, but also create spawn agents to either change things in code or some customer churns? Can you give them a discount and try to bring them back?"

Wahlforss acknowledged the ethical implications. "Obviously, as you said, there's kind of ethical concerns there. Of like, automated decision making overall can be bad, but we will have considerable guardrails to make sure that the companies are always in the loop."

The company already handles sensitive data with care. "We don't train on any of the data," Wahlforss said. "We will also scrub any sensitive PII automatically so the model can detect that. And there are times when, for example, you work with investors, where if you accidentally mention something that could be material, non public information, the AI can actually detect that and remove any information like that."

How AI could reshape the future of product development

Perhaps the most provocative implication of Listen's model is how it could reshape product development itself. Wahlforss described a customer (an Australian startup) that has adopted what amounts to a continuous feedback loop.

"They're based in Australia, so they're coding during the day, and then in their night, they're releasing a Listen study with an American audience. Listen validates whatever they built during the day, and they get feedback on that. They can then plug that feedback directly into coding tools like Claude Code and iterate."

The vision extends Y Combinator's famous dictum, "write code, talk to users," into an automated cycle. "Write code is now getting automated. And I think like talk to users will be as well, and you'll have this kind of infinite loop where you can start to ship this truly amazing product, almost kind of autonomously."
Whether that vision materializes depends on factors beyond Listen's control — the continued improvement of AI models, enterprise willingness to trust automated research, and whether speed truly correlates with better products. A 2024 MIT study found that 95% of AI pilots fail to move into production, a statistic Wahlforss cited as the reason he emphasizes quality over demos. "I'm constantly have to emphasize like, let's make sure the quality is there and the details are right," he said. But the company's growth suggests appetite for the experiment. Microsoft's Patel said Listen has "removed the drudgery of research and brought the fun and joy back into my work." Chubbies is now pushing its founder to give everyone in the company a login. Sling Money, a stablecoin payments startup, can create a survey in ten minutes and receive results the same day. "It's a total game changer," said Ali Romero, Sling Money's marketing manager. Wahlforss has a different phrase for what he's building. When asked about the tension between speed and rigor — the long-held belief that moving fast means cutting corners — he cited Nat Friedman, the former GitHub CEO and Listen investor, who keeps a list of one-liners on his website. One of them: "Slow is fake." It's an aggressive claim for an industry built on methodological caution. But Listen Labs is betting that in the AI era, the companies that listen fastest will be the ones that win. The only question is whether customers will talk back.

Catch up on 12 major I/O 2026 moments
AI

Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.

9 demos of Gemini Omni and Gemini 3.5 in action
AI

Watch 9 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.

100 things we announced at I/O 2026
AI

We've been busy! Here’s a rundown of the top announcements, launches and demos at I/O 2026.

How to automate Claude with Zapier
The Zapier Blog

Claude has staked its claim in the AI landscape and keeps drawing in new users all the time with its standout writing, knack for coding, and now, a whole new model class: Mythos. There's power in a quick, off-the-cuff prompt to Claude (especially a good one). But you can accomplish a lot more when you use Zapier to connect Claude to the rest of your apps and let automation carry out entire workflows for you. Ready to try it? Then keep scrolling for easy automation ideas with one-click templates

I/O 2026: Welcome to the agentic Gemini era
AI

The latest from Google I/O: See how we’re helping you get more done with Gemini.

The 17 best AI marketing tools in 2026
The Zapier Blog

Marketers wear all the hats. No matter what part of marketing you work in, it's likely you're asked to stretch your skills into another area. But with more AI marketing tools being released every day, this multi-jobbing has gotten a lot easier. The problem is, marketers are drowning in these AI tools. Every app has a copilot, every copilot has a price tag, and the line between "useful" and "expensive novelty" keeps moving. I've spent a lot of time tinkering with these tools. Based on th

Cheaper, faster, and culturally aware, Avataar’s video AI is built for India’s scale
AI News & Artificial Intelligence | TechCrunch

Avataar AI's distilled video model is priced at $0.005 for every second of generation.

BBVA puts AI at the core of banking with OpenAI
OpenAI News

Learn how BBVA scaled ChatGPT Enterprise to 100,000 employees and partnered with OpenAI to accelerate AI-powered banking transformation worldwide.

Anthropic launches Cowork, a Claude Desktop agent that works in your files — no coding required
AI | VentureBeat

Anthropic released Cowork on Monday, a new AI agent capability that extends the power of its wildly successful Claude Code tool to non-technical users — and according to company insiders, the team built the entire feature in approximately a week and a half, largely using Claude Code itself. The launch marks a major inflection point in the race to deliver practical AI agents to mainstream users, positioning Anthropic to compete not just with OpenAI and Google in conversational AI, but with Microsoft's Copilot in the burgeoning market for AI-powered productivity tools. "Cowork lets you complete non-technical tasks much like how developers use Claude Code," the company announced via its official Claude account on X. The feature arrives as a research preview available exclusively to Claude Max subscribers — Anthropic's power-user tier priced between $100 and $200 per month — through the macOS desktop application. For the past year, the industry narrative has focused on large language models that can write poetry or debug code. With Cowork, Anthropic is betting that the real enterprise value lies in an AI that can open a folder, read a messy pile of receipts, and generate a structured expense report without human hand-holding. How developers using a coding tool for vacation research inspired Anthropic's latest product The genesis of Cowork lies in Anthropic's recent success with the developer community. In late 2024, the company released Claude Code, a terminal-based tool that allowed software engineers to automate rote programming tasks. The tool was a hit, but Anthropic noticed a peculiar trend: users were forcing the coding tool to perform non-coding labor. According to Boris Cherny, an engineer at Anthropic, the company observed users deploying the developer tool for an unexpectedly diverse array of tasks. 
"Since we launched Claude Code, we saw people using it for all sorts of non-coding work: doing vacation research, building slide decks, cleaning up your email, cancelling subscriptions, recovering wedding photos from a hard drive, monitoring plant growth, controlling your oven," Cherny wrote on X. "These use cases are diverse and surprising — the reason is that the underlying Claude Agent is the best agent, and Opus 4.5 is the best model." Recognizing this shadow usage, Anthropic effectively stripped the command-line complexity from their developer tool to create a consumer-friendly interface. In its blog post announcing the feature, Anthropic explained that developers "quickly began using it for almost everything else," which "prompted us to build Cowork: a simpler way for anyone — not just developers — to work with Claude in the very same way." Inside the folder-based architecture that lets Claude read, edit, and create files on your computer Unlike a standard chat interface where a user pastes text for analysis, Cowork requires a different level of trust and access. Users designate a specific folder on their local machine that Claude can access. Within that sandbox, the AI agent can read existing files, modify them, or create entirely new ones. Anthropic offers several illustrative examples: reorganizing a cluttered downloads folder by sorting and intelligently renaming each file, generating a spreadsheet of expenses from a collection of receipt screenshots, or drafting a report from scattered notes across multiple documents. "In Cowork, you give Claude access to a folder on your computer. Claude can then read, edit, or create files in that folder," the company explained on X. "Try it to create a spreadsheet from a pile of screenshots, or produce a first draft from scattered notes." The architecture relies on what is known as an "agentic loop." When a user assigns a task, the AI does not merely generate a text response. 
Instead, it formulates a plan, executes steps in parallel, checks its own work, and asks for clarification if it hits a roadblock. Users can queue multiple tasks and let Claude process them simultaneously — a workflow Anthropic describes as feeling "much less like a back-and-forth and much more like leaving messages for a coworker." The system is built on Anthropic's Claude Agent SDK, meaning it shares the same underlying architecture as Claude Code. Anthropic notes that Cowork "can take on many of the same tasks that Claude Code can handle, but in a more approachable form for non-coding tasks." The recursive loop where AI builds AI: Claude Code reportedly wrote much of Claude Cowork Perhaps the most remarkable detail surrounding Cowork's launch is the speed at which the tool was reportedly built — highlighting a recursive feedback loop where AI tools are being used to build better AI tools. During a livestream hosted by Dan Shipper, Felix Rieseberg, an Anthropic employee, confirmed that the team built Cowork in approximately a week and a half. Alex Volkov, who covers AI developments, expressed surprise at the timeline: "Holy shit Anthropic built 'Cowork' in the last... week and a half?!" This prompted immediate speculation about how much of Cowork was itself built by Claude Code. Simon Smith, EVP of Generative AI at Klick Health, put it bluntly on X: "Claude Code wrote all of Claude Cowork. Can we all agree that we're in at least somewhat of a recursive improvement loop here?" The implication is profound: Anthropic's AI coding agent may have substantially contributed to building its own non-technical sibling product. If true, this is one of the most visible examples yet of AI systems being used to accelerate their own development and expansion — a strategy that could widen the gap between AI labs that successfully deploy their own agents internally and those that do not. 
Connectors, browser automation, and skills extend Cowork's reach beyond the local file system Cowork doesn't operate in isolation. The feature integrates with Anthropic's existing ecosystem of connectors — tools that link Claude to external information sources and services such as Asana, Notion, PayPal, and other supported partners. Users who have configured these connections in the standard Claude interface can leverage them within Cowork sessions. Additionally, Cowork can pair with Claude in Chrome, Anthropic's browser extension, to execute tasks requiring web access. This combination allows the agent to navigate websites, click buttons, fill forms, and extract information from the internet — all while operating from the desktop application. "Cowork includes a number of novel UX and safety features that we think make the product really special," Cherny explained, highlighting "a built-in VM [virtual machine] for isolation, out of the box support for browser automation, support for all your claude.ai data connectors, asking you for clarification when it's unsure." Anthropic has also introduced an initial set of "skills" specifically designed for Cowork that enhance Claude's ability to create documents, presentations, and other files. These build on the Skills for Claude framework the company announced in October, which provides specialized instruction sets Claude can load for particular types of tasks. Why Anthropic is warning users that its own AI agent could delete their files The transition from a chatbot that suggests edits to an agent that makes edits introduces significant risk. An AI that can organize files can, theoretically, delete them. In a notable display of transparency, Anthropic devoted considerable space in its announcement to warning users about Cowork's potential dangers — an unusual approach for a product launch. 
The company explicitly acknowledges that Claude "can take potentially destructive actions (such as deleting local files) if it's instructed to." Because Claude might occasionally misinterpret instructions, Anthropic urges users to provide "very clear guidance" about sensitive operations. More concerning is the risk of prompt injection attacks — a technique where malicious actors embed hidden instructions in content Claude might encounter online, potentially causing the agent to bypass safeguards or take harmful actions. "We've built sophisticated defenses against prompt injections," Anthropic wrote, "but agent safety — that is, the task of securing Claude's real-world actions — is still an active area of development in the industry." The company characterized these risks as inherent to the current state of AI agent technology rather than unique to Cowork. "These risks aren't new with Cowork, but it might be the first time you're using a more advanced tool that moves beyond a simple conversation," the announcement notes. Anthropic's desktop agent strategy sets up a direct challenge to Microsoft Copilot The launch of Cowork places Anthropic in direct competition with Microsoft, which has spent years attempting to integrate its Copilot AI into the fabric of the Windows operating system with mixed adoption results. However, Anthropic's approach differs in its isolation. By confining the agent to specific folders and requiring explicit connectors, they are attempting to strike a balance between the utility of an OS-level agent and the security of a sandboxed application. What distinguishes Anthropic's approach is its bottom-up evolution. Rather than designing an AI assistant and retrofitting agent capabilities, Anthropic built a powerful coding agent first — Claude Code — and is now abstracting its capabilities for broader audiences. This technical lineage may give Cowork more robust agentic behavior from the start. 
Claude Code has generated significant enthusiasm among developers since its initial launch as a command-line tool in late 2024. The company expanded access with a web interface in October 2025, followed by a Slack integration in December. Cowork is the next logical step: bringing the same agentic architecture to users who may never touch a terminal. Who can access Cowork now, and what's coming next for Windows and other platforms For now, Cowork remains exclusive to Claude Max subscribers using the macOS desktop application. Users on other subscription tiers — Free, Pro, Team, or Enterprise — can join a waitlist for future access. Anthropic has signaled clear intentions to expand the feature's reach. The blog post explicitly mentions plans to add cross-device sync and bring Cowork to Windows as the company learns from the research preview. Cherny set expectations appropriately, describing the product as "early and raw, similar to what Claude Code felt like when it first launched." To access Cowork, Max subscribers can download or update the Claude macOS app and click on "Cowork" in the sidebar. The real question facing enterprise AI adoption For technical decision-makers, the implications of Cowork extend beyond any single product launch. The bottleneck for AI adoption is shifting — no longer is model intelligence the limiting factor, but rather workflow integration and user trust. Anthropic's goal, as the company puts it, is to make working with Claude feel less like operating a tool and more like delegating to a colleague. Whether mainstream users are ready to hand over folder access to an AI that might misinterpret their instructions remains an open question. But the speed of Cowork's development — a major feature built in ten days, possibly by the company's own AI — previews a future where the capabilities of these systems compound faster than organizations can evaluate them. The chatbot has learned to use a file manager. 
What it learns to use next is anyone's guess.

Bridging three-dimensional molecular structures and artificial intelligence with a conformation description language - Nature
"artificial intelligence" - Google News

OpenAI to acquire Ona
OpenAI News

OpenAI plans to acquire Ona to expand Codex with secure, persistent cloud environments, enabling long-running AI agents across enterprise workflows.

Calendly vs. Google Calendar: Which should you choose? [2026]
The Zapier Blog

Here's Google's simple (but powerful) software playbook: find products people like, make a Google-ized copy, and give it away for free. Love Dropbox and Zoom? Google Drive and Google Meet are solid substitutes, and you won't pay a thing. Calendly is the next app on Google's radar. Google Calendar's appointment scheduling feature, which started off as a barebones alternative, has gotten better over time. It's not as powerful as Calendly, but it's a reliable way for Google users to create booking

AI May Soon Help You Understand What Your Pet Is Trying to Say
DailyAI

Chinese tech powerhouse Baidu has filed a patent for a system that could use AI to decode animal sounds and behaviour, then translate those signals into human language. For the millions of pet owners wondering what their animals are thinking, this could be the first real step toward bridging the communication gap between humans and animals.

The tech

Baidu’s system would collect animal vocalizations, body movements, and biological signals. It would merge that data and feed it into an AI model trained to identify emotional states. These emotional states could then be rendered in human language to boost “cross-species communication”.

5 ways to automate Meta's Conversions API tool with Zapier
The Zapier Blog

You search for trail running shoes once, and suddenly they're everywhere—your feed, your apps, even your email. Spooky? Maybe. But for marketers, that's just smart data at work. But the real magic happens when you close the loop between customer actions and your ad strategy. Every purchase, sign-up, or webinar registration is a signal, and feeding those signals back into Meta helps you double down on what's working and cut what isn't. The catch? Doing this manually across tools is a nightmare. T

The 4 best AI website builders
The Zapier Blog

Building a website is no longer a particularly hard task, but it can be an annoying one. If you look at most sites, there's a fair amount of text, images, and general organization to it all. Even with the best tools, it takes a few hours to put together something good. Wouldn't it be great if you could create a website from scratch in just a few minutes? That's what AI website builders claim to do. The idea is that by using artificial intelligence, AI website builders can streamline everyth

Meta’s months-old AI unit is a soul-crushing gulag, say the engineers stuck inside it
AI News & Artificial Intelligence | TechCrunch

A new report suggests the unit, which employs 6,500 people, is on the verge of revolt.

New OpenAI Academy courses for the next era of work
OpenAI News

OpenAI introduces three Academy courses that help people build practical AI skills, create repeatable workflows, and apply agents in everyday work.

Anthropic suspends top AI models after U.S. export control order - Nextgov/FCW
"artificial intelligence" - Google News

Take our I/O 2026 quiz, vibe coded in Google AI Studio.
AI

We used Google AI Studio to vibe code a quiz about our top I/O 2026 announcements.

How Preply combines AI and human tutors to personalize learning
OpenAI News

Preply uses OpenAI to launch AI-generated lesson summaries, providing personalised feedback and language learning exercises.

Chinese cybercrime operation that used AI to scam ‘hundreds of thousands of victims’ sued by Google
AI News & Artificial Intelligence | TechCrunch

The tech giant said a group called "Outsider Enterprise" used AI to scam hundreds of thousands of victims, sending 2.5 million text messages over a span of two weeks.

Making it easier to understand how content was created and edited
Gemini

We're expanding our tools to help you understand how content was created and edited across the web.

We’re announcing new community investments in Missouri.
AI

We’re helping build the state’s next-generation workforce and investing in energy programs.

‘There’s a huge market demand’: University of Utah approves new bachelor’s degree in artificial intelligence - The Salt Lake Tribune
"artificial intelligence" - Google News

10 top women in AI in 2026
DailyAI

AI is changing our world, but the stories of who builds it often get lost in the noise. Behind the headlines and hype, a group of women are solving AI’s fundamental challenges, despite working in an industry persistently impacted by gender inequality. Women make up just 22% of AI professionals worldwide and only 12% of AI researchers. In academic publishing, female researchers account for just 29% of first authors on AI papers, a number that hasn’t increased since the mid-2000s. This is a story about ten leaders who have influenced AI despite the odds being stacked against them. Their

Derbyshire Police officer accused of using AI to 'create evidence' - BBC
"artificial intelligence" - Google News

Nous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code moment
AI | VentureBeat

Nous Research, the open-source artificial intelligence startup backed by crypto venture firm Paradigm, released a new competitive programming model on Monday that it says matches or exceeds several larger proprietary systems, after training it in just four days using 48 of Nvidia's latest B200 graphics processors.

The model, called NousCoder-14B, is another entry in a crowded field of AI coding assistants, but arrives at a particularly charged moment: Claude Code, the agentic programming tool from rival Anthropic, has dominated social media discussion since New Year's Day, with developers posting breathless testimonials about its capabilities. The simultaneous developments underscore how quickly AI-assisted software development is evolving, and how fiercely companies large and small are competing to capture what many believe will become a foundational technology for how software gets written.

NousCoder-14B achieves a 67.87 percent accuracy rate on LiveCodeBench v6, a standardized evaluation that tests models on competitive programming problems published between August 2024 and May 2025. That figure represents a 7.08 percentage point improvement over the base model it was trained from, Alibaba's Qwen3-14B, according to Nous Research's technical report published alongside the release.

"I gave Claude Code a description of the problem, it generated what we built last year in an hour," wrote Jaana Dogan, a principal engineer at Google responsible for the Gemini API, in a viral post on X last week that captured the prevailing mood around AI coding tools. Dogan was describing a distributed agent orchestration system her team had spent a year developing, a system Claude Code approximated from a three-paragraph prompt.
The juxtaposition is instructive: while Anthropic's Claude Code has captured imaginations with demonstrations of end-to-end software development, Nous Research is betting that open-source alternatives trained on verifiable problems can close the gap, and that transparency in how these models are built matters as much as raw capability.

How Nous Research built an AI coding model that anyone can replicate

What distinguishes the NousCoder-14B release from many competitor announcements is its radical openness. Nous Research published not just the model weights but the complete reinforcement learning environment, benchmark suite, and training harness (built on the company's Atropos framework), enabling any researcher with sufficient compute to reproduce or extend the work. "Open-sourcing the Atropos stack provides the necessary infrastructure for reproducible olympiad-level reasoning research," noted one observer on X, summarizing the significance for the academic and open-source communities.

The model was trained by Joe Li, a researcher in residence at Nous Research and a former competitive programmer himself. Li's technical report reveals an unexpectedly personal dimension: he compared the model's improvement trajectory to his own journey on Codeforces, the competitive programming platform where participants earn ratings based on contest performance. Based on rough estimates mapping LiveCodeBench scores to Codeforces ratings, Li calculated that NousCoder-14B's improvement, from approximately the 1600-1750 rating range to 2100-2200, mirrors a leap that took him nearly two years of sustained practice between ages 14 and 16. The model accomplished the equivalent in four days. "Watching that final training run unfold was quite a surreal experience," Li wrote in the technical report.

But Li was quick to note an important caveat that speaks to broader questions about AI efficiency: he solved roughly 1,000 problems during those two years, while the model required 24,000.
Humans, at least for now, remain dramatically more sample-efficient learners.

Inside the reinforcement learning system that trains on 24,000 competitive programming problems

NousCoder-14B's training process offers a window into the increasingly sophisticated techniques researchers use to improve AI reasoning capabilities through reinforcement learning. The approach relies on what researchers call "verifiable rewards": the model generates code solutions, those solutions are executed against test cases, and the model receives a simple binary signal, correct or incorrect.

This feedback loop, while conceptually straightforward, requires significant infrastructure to execute at scale. Nous Research used Modal, a cloud computing platform, to run sandboxed code execution in parallel. Each of the 24,000 training problems contains hundreds of test cases on average, and the system must verify that generated code produces correct outputs within time and memory constraints of 15 seconds and 4 gigabytes, respectively.

The training employed a technique called DAPO (Dynamic Sampling Policy Optimization), which the researchers found performed slightly better than alternatives in their experiments. A key innovation involves "dynamic sampling": discarding training examples where the model either solves all attempts or fails all attempts, since these provide no useful gradient signal for learning.

The researchers also adopted "iterative context extension," first training the model with a 32,000-token context window before expanding to 40,000 tokens. During evaluation, extending the context further to approximately 80,000 tokens produced the best results, with accuracy reaching 67.87 percent.

Perhaps most significantly, the training pipeline overlaps inference and verification: as soon as the model generates a solution, it begins work on the next problem while the previous solution is being checked.
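The reward, filtering, and overlap steps described above can be illustrated with a small Python sketch. This is not Nous Research's code and does not use the Atropos or Modal APIs; every name here (`binary_reward`, `has_learning_signal`, `score_groups`, `toy_generate`) is hypothetical, "solutions" are plain Python callables rather than sandboxed programs, and the 15-second and 4-gigabyte limits are not enforced. It only shows the shape of the idea: an all-or-nothing reward per attempt, verification running in a worker pool while the next attempt is generated, and a filter that drops problems whose attempts all pass or all fail.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Sequence

# Hypothetical stand-ins: a test case is an (input, expected_output) pair.
TestCase = tuple[int, int]
Solution = Callable[[int], int]


def binary_reward(solution: Solution, tests: Sequence[TestCase]) -> float:
    """Verifiable reward: 1.0 only if every test case passes, else 0.0."""
    try:
        return 1.0 if all(solution(x) == y for x, y in tests) else 0.0
    except Exception:
        return 0.0  # a crashing solution counts as incorrect


def has_learning_signal(rewards: Sequence[float]) -> bool:
    """Dynamic sampling filter: reject all-correct and all-wrong groups."""
    return 0.0 < sum(rewards) < len(rewards)


def score_groups(problems, attempts, generate, max_workers=4):
    """Overlap generation with verification, then keep informative groups."""
    pending = {}
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        for name, tests in problems.items():
            # Each attempt is handed to the pool as soon as it exists, so the
            # main thread can generate the next one while earlier attempts
            # are still being checked.
            pending[name] = [
                pool.submit(binary_reward, generate(name, k), tests)
                for k in range(attempts)
            ]
        results = {n: [f.result() for f in fs] for n, fs in pending.items()}
    return {n: r for n, r in results.items() if has_learning_signal(r)}


def toy_generate(name: str, k: int) -> Solution:
    """Toy 'model': always right on easy, always wrong on hard, mixed otherwise."""
    if name == "easy":
        return lambda x: x * 2
    if name == "hard":
        return lambda x: -1
    return (lambda x: x * 2) if k % 2 == 0 else (lambda x: x + 1)


tests = [(1, 2), (3, 6)]  # the target function doubles its input
problems = {"easy": tests, "hard": tests, "mixed": tests}
kept = score_groups(problems, attempts=4, generate=toy_generate)
print(kept)  # only the "mixed" problem survives the filter
```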
This pipelining, combined with asynchronous training where multiple model instances work in parallel, maximizes hardware utilization on expensive GPU clusters.

The looming data shortage that could slow AI coding model progress

Buried in Li's technical report is a finding with significant implications for the future of AI development: the training dataset for NousCoder-14B encompasses "a significant portion of all readily available, verifiable competitive programming problems in a standardized dataset format."

In other words, for this particular domain, the researchers are approaching the limits of high-quality training data.

"The total number of competitive programming problems on the Internet is roughly the same order of magnitude," Li wrote, referring to the 24,000 problems used for training. "This suggests that within the competitive programming domain, we have approached the limits of high-quality data."

This observation echoes growing concern across the AI industry about data constraints. While compute continues to scale according to well-understood economic and engineering principles, training data is "increasingly finite," as Li put it.

"It appears that some of the most important research that needs to be done in the future will be in the areas of synthetic data generation and data efficient algorithms and architectures," he concluded.

The challenge is particularly acute for competitive programming because the domain requires problems with known correct solutions that can be verified automatically. Unlike natural language tasks where human evaluation or proxy metrics suffice, code either works or it doesn't, which makes synthetic data generation considerably more difficult.

Li identified one potential avenue: training models not just to solve problems but to generate solvable problems, enabling a form of self-play similar to techniques that proved successful in game-playing AI systems.
"Once synthetic problem generation is solved, self-play becomes a very interesting direction," he wrote.

A $65 million bet that open-source AI can compete with Big Tech

Nous Research has carved out a distinctive position in the AI landscape: a company committed to open-source releases that compete with, and sometimes exceed, proprietary alternatives.

The company raised $50 million in April 2025 in a round led by Paradigm, the cryptocurrency-focused venture firm founded by Coinbase co-founder Fred Ehrsam. Total funding reached $65 million, according to some reports. The investment reflected growing interest in decentralized approaches to AI training, an area where Nous Research has developed its Psyche platform.

Previous releases include Hermes 4, a family of models that we reported "outperform ChatGPT without content restrictions," and DeepHermes-3, which the company described as the first "toggle-on reasoning model," allowing users to activate extended thinking capabilities on demand.

The company has cultivated a distinctive aesthetic and community, prompting some skepticism about whether style might overshadow substance. "Ofc i'm gonna believe an anime pfp company. stop benchmarkmaxxing ffs," wrote one critic on X, referring to Nous Research's anime-style branding and the industry practice of optimizing for benchmark performance.

Others raised technical questions. "Based on the benchmark, Nemotron is better," noted one commenter, referring to Nvidia's family of language models. Another asked whether NousCoder-14B is "agentic focused or just 'one shot' coding," a distinction that matters for practical software development, where iterating on feedback typically produces better results than single attempts.

What researchers say must happen next for AI coding tools to keep improving

The release includes several directions for future work that hint at where AI coding research may be heading. Multi-turn reinforcement learning tops the list.
Currently, the model receives only a final binary reward (pass or fail) after generating a solution. But competitive programming problems typically include public test cases that provide intermediate feedback: compilation errors, incorrect outputs, time limit violations. Training models to incorporate this feedback across multiple attempts could significantly improve performance.

Controlling response length also remains a challenge. The researchers found that incorrect solutions tended to be longer than correct ones, and response lengths quickly saturated available context windows during training, a pattern that various algorithmic modifications failed to resolve.

Perhaps most ambitiously, Li proposed "problem generation and self-play": training models to both solve and create programming problems. This would address the data scarcity problem directly by enabling models to generate their own training curricula.

"Humans are great at generating interesting and useful problems for other competitive programmers, but it appears that there still exists a significant gap in LLM capabilities in creative problem generation," Li wrote.

The model is available now on Hugging Face under an Apache 2.0 license. For researchers and developers who want to build on the work, Nous Research has published the complete Atropos training stack alongside it.

What took Li two years of adolescent dedication to achieve, climbing from a 1600-level novice to a 2100-rated competitor on Codeforces, an AI replicated in 96 hours. He needed 1,000 problems. The model needed 24,000. But soon enough, these systems may learn to write their own problems, teach themselves, and leave human benchmarks behind entirely.

The question is no longer whether machines can learn to code. It's whether they'll soon be better teachers than we ever were.
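
For readers who want to see why "dynamic sampling" throws away problems the model always solves or always fails, the idea can be sketched in a few lines of Python. This is a minimal illustration and not Nous Research's code: the function names, the problem ids, and the use of mean-centered, group-relative advantages are assumptions made for this example, and real training would get its pass/fail rewards from sandboxed execution against test cases rather than from a hard-coded batch.

```python
from typing import Dict, List


def centered_advantages(rewards: List[int]) -> List[float]:
    """Mean-centered advantages within one problem's group of attempts.

    Assumption for illustration: each attempt is scored relative to the
    average reward of the other attempts on the same problem. The list
    is assumed to be non-empty.
    """
    mean = sum(rewards) / len(rewards)
    return [r - mean for r in rewards]


def dynamic_sampling_filter(groups: Dict[str, List[int]]) -> Dict[str, List[int]]:
    """Keep only problems whose attempts mix passes (1) and fails (0)."""
    return {
        problem_id: rewards
        for problem_id, rewards in groups.items()
        if 0 < sum(rewards) < len(rewards)
    }


# Hypothetical batch: problem id -> binary rewards for four sampled attempts.
batch = {
    "too_easy": [1, 1, 1, 1],  # every attempt passed
    "too_hard": [0, 0, 0, 0],  # every attempt failed
    "useful": [1, 0, 0, 1],    # mixed outcome
}

kept = dynamic_sampling_filter(batch)
print(kept)                                    # only the "useful" problem survives
print(centered_advantages(batch["too_easy"]))  # all zeros: nothing to learn from
print(centered_advantages(batch["useful"]))    # nonzero: passes up, fails down
```

Under this assumption, a problem where every attempt earns the same reward produces advantages of exactly zero, so it contributes nothing to the update, which is the "no useful gradient signal" case the article describes.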

ChatGPT Is Making People Think They’re Gods and Their Families Are Terrified
DailyAI

ChatGPT, the popular AI chatbot from OpenAI, is unintentionally leading users into full-blown spiritual delusions, and families are sounding the alarm. On Reddit’s r/ChatGPT forum, a chilling thread titled “ChatGPT induced psychosis” is gaining traction. Users are reporting a disturbing pattern: their loved ones are convinced that ChatGPT is a divine being, a spiritual guru, or even a portal to God. Rolling Stone journalist Miles Klee spoke directly with affected individuals. One woman shared how her partner became obsessed after ChatGPT gave him cosmic nicknames like “spiral starchild” and claimed he was on a divine mission. He ultimately told her [...]

How an astrophysicist uses Codex to help simulate black holes
OpenAI News

Discover how astrophysicist Chi-kwan Chan uses Codex to build black hole simulations, helping scientists study extreme physics and test Einstein’s theory of general relativity.

Therapists Too Expensive? Why Thousands of Women Are Spilling Their Deepest Secrets to ChatGPT
DailyAI

More women are turning to ChatGPT for emotional support, using the AI chatbot as a stand-in therapist as mental health systems buckle under pressure. With long wait times and soaring costs, AI is filling a growing gap. Mental health care is harder to access than ever. In the UK, NHS data shows patients are eight times more likely to wait over 18 months for mental health treatment than for physical health. Private therapy isn’t always an option either, with sessions costing £60 or more. In that vacuum, ChatGPT has become a surprising outlet.

Real voices, real feelings

Charly, 29, from [...]

How artificial intelligence got better at building itself - The Economist
"artificial intelligence" - Google News

New ways to create and get things done in Google Workspace
AI

Announcing new voice capabilities in Gmail, Docs and Keep, a new design tool called Google Pics and updates to AI Inbox.

Theker just raised $85M to build the factory robot that doesn’t specialize in anything
AI News & Artificial Intelligence | TechCrunch

Unlike humanoid robots designed around a fixed form — think Boston Dynamics — Theker's machines are built to be reconfigured.

The best customer experience software in 2026
The Zapier Blog

I canceled a subscription recently because I started questioning my experience. I'd been loyal for months, but the customer-facing AI agents felt like they were regressing, it took forever to speak to a human support agent, and all of the branding started targeting a much younger audience. They probably logged it as general churn; that's fine, I'm sure "old man yells at cloud" isn't something they would put into a report. Every single touchpoint you have with a customer shapes their experience,

The latest AI news we announced in May 2026
AI

Here are Google’s latest AI updates from May 2026

Save time and grow your business with new Gemini tools
Gemini

An overview of new features in the Gemini app designed specifically to support businesses and entrepreneurs.