TLDRs: OpenAI and Anthropic jointly tested AI models to identify hallucinations and misalignment risks. The cross-company evaluation revealed blind spots missed by internal safety reviews. Collaboration highlights how rivals balance competition with shared safety responsibilities. Increased scrutiny and lawsuits drive AI firms to adopt external safety evaluations. OpenAI and Anthropic, two of the leading AI [...] The post OpenAI and Anthropic Collaborate to Identify Safety Risks in AI Models appeared first on CoinCentral.TLDRs: OpenAI and Anthropic jointly tested AI models to identify hallucinations and misalignment risks. The cross-company evaluation revealed blind spots missed by internal safety reviews. Collaboration highlights how rivals balance competition with shared safety responsibilities. Increased scrutiny and lawsuits drive AI firms to adopt external safety evaluations. OpenAI and Anthropic, two of the leading AI [...] The post OpenAI and Anthropic Collaborate to Identify Safety Risks in AI Models appeared first on CoinCentral.

OpenAI and Anthropic Collaborate to Identify Safety Risks in AI Models

2025/08/29 01:52
3 min read

TLDRs:

  • OpenAI and Anthropic jointly tested AI models to identify hallucinations and misalignment risks.
  • The cross-company evaluation revealed blind spots missed by internal safety reviews.
  • Collaboration highlights how rivals balance competition with shared safety responsibilities.
  • Increased scrutiny and lawsuits drive AI firms to adopt external safety evaluations.

OpenAI and Anthropic, two of the leading AI companies, have undertaken a joint effort to test each other’s AI models for safety vulnerabilities.

This collaboration aimed to uncover potential risks that might be overlooked during internal evaluations, including hallucinations and misalignment, where the models fail to behave as intended.

The exercise was conducted over the summer, preceding the launch of OpenAI’s GPT-5 and Anthropic’s Claude Opus 4.1 update. Despite their competitive rivalry, the companies recognized that safety concerns transcend market competition and require cooperative solutions.

Testing Beyond Internal Limits

The joint evaluation revealed that even advanced internal testing can miss critical safety issues. Anthropic’s review of OpenAI’s GPT models flagged potential misuse and accuracy concerns, while OpenAI assessed Anthropic’s Claude models for instruction adherence, hallucinations, and susceptibility to manipulation.

Both companies noted strengths and blind spots in each other’s protocols, highlighting the value of external, unbiased assessments.

This approach mirrors practices in other high-stakes industries, such as finance, where third-party audits are standard to uncover vulnerabilities and prevent systemic risks. As AI technologies become increasingly influential in society, these evaluations are likely to become a regular part of responsible AI development.

Competition Meets Cooperation

The collaboration underscores the complex dynamics between AI rivals. Earlier this year, Anthropic temporarily restricted OpenAI’s access to its Claude models after discovering that OpenAI had used them for competitive benchmarking in violation of Anthropic’s terms of service. Yet, both companies maintained limited access for safety testing, demonstrating a selective cooperation strategy.

OpenAI described this initiative as the “first major cross-lab exercise in safety and alignment testing,” emphasizing that even fierce competitors can find common ground when addressing industry-wide safety concerns.

The effort also reflects differing philosophies, Anthropic prioritizes safety through “Constitutional AI,” while OpenAI focuses on rapid innovation and accessibility.

Safety Concerns Drive Industry Standards

The collaboration occurs amid heightened scrutiny of AI safety. Recent incidents, including lawsuits alleging harm linked to AI interactions, have amplified pressure on companies to demonstrate robust risk management.

By testing each other’s models, OpenAI and Anthropic aim to reduce legal, ethical, and reputational risks, while promoting safer AI deployment across the industry.

Experts suggest that cross-company evaluations may soon become standard practice, akin to third-party audits in finance or medical research. Such measures could help ensure AI technologies meet societal safety expectations, even as competition continues to drive innovation and market growth.

Looking Forward

The OpenAI-Anthropic collaboration signals a pivotal moment in AI development, a recognition that safety cannot be addressed in isolation.

While these companies remain market rivals, their shared commitment to responsible AI demonstrates that industry-wide challenges, like hallucinations, misalignment, and misuse, can foster collaboration even among competitors.

The post OpenAI and Anthropic Collaborate to Identify Safety Risks in AI Models appeared first on CoinCentral.

Market Opportunity
CROSS Logo
CROSS Price(CROSS)
$0.10347
$0.10347$0.10347
+0.61%
USD
CROSS (CROSS) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.
Tags:

You May Also Like

XRP Volume Rises 212%, Bitcoin ETFs Back in Demand With $506 Million, Dogecoin Price Reclaims $0.10 — U.Today Crypto Digest

XRP Volume Rises 212%, Bitcoin ETFs Back in Demand With $506 Million, Dogecoin Price Reclaims $0.10 — U.Today Crypto Digest

Crypto news digest: 212% increase was seen in XRP volume; BTC ETFs have recovered from the low capital; DOGE price jumps 8%.
Share
Coinstats2026/02/28 05:27
Facts Vs. Hype: Analyst Examines XRP Supply Shock Theory

Facts Vs. Hype: Analyst Examines XRP Supply Shock Theory

Prominent analyst Cheeky Crypto (203,000 followers on YouTube) set out to verify a fast-spreading claim that XRP’s circulating supply could “vanish overnight,” and his conclusion is more nuanced than the headline suggests: nothing in the ledger disappears, but the amount of XRP that is truly liquid could be far smaller than most dashboards imply—small enough, in his view, to set the stage for an abrupt liquidity squeeze if demand spikes. XRP Supply Shock? The video opens with the host acknowledging his own skepticism—“I woke up to a rumor that XRP supply could vanish overnight. Sounds crazy, right?”—before committing to test the thesis rather than dismiss it. He frames the exercise as an attempt to reconcile a long-standing critique (“XRP’s supply is too large for high prices”) with a rival view taking hold among prominent community voices: that much of the supply counted as “circulating” is effectively unavailable to trade. His first step is a straightforward data check. Pulling public figures, he finds CoinMarketCap showing roughly 59.6 billion XRP as circulating, while XRPScan reports about 64.7 billion. The divergence prompts what becomes the video’s key methodological point: different sources count “circulating” differently. Related Reading: Analyst Sounds Major XRP Warning: Last Chance To Get In As Accumulation Balloons As he explains it, the higher on-ledger number likely includes balances that aggregators exclude or treat as restricted, most notably Ripple’s programmatic escrow. He highlights that Ripple still “holds a chunk of XRP in escrow, about 35.3 billion XRP locked up across multiple wallets, with a nominal schedule of up to 1 billion released per month and unused portions commonly re-escrowed. Those coins exist and are accounted for on-ledger, but “they aren’t actually sitting on exchanges” and are not immediately available to buyers. In his words, “for all intents and purposes, that escrow stash is effectively off of the market.” From there, the analysis moves from headline “circulating supply” to the subtler concept of effective float. Beyond escrow, he argues that large strategic holders—banks, fintechs, or other whales—may sit on material balances without supplying order books. When you strip out escrow and these non-selling stashes, he says, “the effective circulating supply… is actually way smaller than the 59 or even 64 billion figure.” He cites community estimates in the “20 or 30 billion” range for what might be truly liquid at any given moment, while emphasizing that nobody has a precise number. That effective-float framing underpins the crux of his thesis: a potential supply shock if demand accelerates faster than fresh sell-side supply appears. “Price is a dance between supply and demand,” he says; if institutional or sovereign-scale users suddenly need XRP and “the market finds that there isn’t enough XRP readily available,” order books could thin out and prices could “shoot on up, sometimes violently.” His phrase “circulating supply could collapse overnight” is presented not as a claim that tokens are destroyed or removed from the ledger, but as a market-structure scenario in which available inventory to sell dries up quickly because holders won’t part with it. How Could The XRP Supply Shock Happen? On the demand side, he anchors the hypothetical to tokenization. He points to the “very early stages of something huge in finance”—on-chain tokenization of debt, stablecoins, CBDCs and even gold—and argues the XRP Ledger aims to be “the settlement layer” for those assets.He references Ripple CTO David Schwartz’s earlier comments about an XRPL pivot toward tokenized assets and notes that an institutional research shop (Bitwise) has framed XRP as a way to play the tokenization theme. In his construction, if “trillions of dollars in value” begin settling across XRPL rails, working inventories of XRP for bridging, liquidity and settlement could rise sharply, tightening effective float. Related Reading: XRP Bearish Signal: Whales Offload $486 Million In Asset To illustrate, he offers two analogies. First, the “concert tickets” model: you think there are 100,000 tickets (100B supply), but 50,000 are held by the promoter (escrow) and 30,000 by corporate buyers (whales), leaving only 20,000 for the public; if a million people want in, prices explode. Second, a comparison to Bitcoin’s halving: while XRP has no programmatic halving, he proposes that a sudden adoption wave could function like a de facto halving of available supply—“XRP’s version of a halving could actually be the adoption event.” He also updates the narrative context that long dogged XRP. Once derided for “too much supply,” he argues the script has “totally flipped.” He cites the current cycle’s optics—“XRP is sitting above $3 with a market cap north of around $180 billion”—as evidence that raw supply counts did not cap price as tightly as critics claimed, and as a backdrop for why a scarcity narrative is gaining traction. Still, he declines to publish targets or timelines, repeatedly stressing uncertainty and risk. “I’m not a financial adviser… cryptocurrencies are highly volatile,” he reminds viewers, adding that tokenization could take off “on some other platform,” unfold more slowly than enthusiasts expect, or fail to get to “sudden shock” scale. The verdict he offers is deliberately bound. The theory that “XRP supply could vanish overnight” is imprecise on its face; the ledger will not erase coins. But after examining dashboard methodologies, escrow mechanics and the behavior of large holders, he concludes that the effective float could be meaningfully smaller than headline supply figures, and that a fast-developing tokenization use case could, under the right conditions, stress that float. “Overnight is a dramatic way to put it,” he concedes. “The change could actually be very sudden when it comes.” At press time, XRP traded at $3.0198. Featured image created with DALL.E, chart from TradingView.com
Share
NewsBTC2025/09/18 11:00
Pi Network Fast-Track KYC Lets New Users Unlock Mainnet Wallets Early

Pi Network Fast-Track KYC Lets New Users Unlock Mainnet Wallets Early

TLDR: Pi Network introduces AI-powered Fast Track KYC to speed wallet activation for new users and non-users Users can activate Mainnet wallets before 30 mining sessions but cannot migrate mined balances yet Fast Track KYC maintains strict verification standards and may be more conservative than standard KYC Pi Network reports over 14.82M users fully KYC-verified [...] The post Pi Network Fast-Track KYC Lets New Users Unlock Mainnet Wallets Early appeared first on Blockonomi.
Share
Blockonomi2025/09/19 15:48