Built with world-class reverse engineer Michał “Redford” Kowalczyk, this open-source benchmark has sparked excitement among security experts, opening a new frontierBuilt with world-class reverse engineer Michał “Redford” Kowalczyk, this open-source benchmark has sparked excitement among security experts, opening a new frontier

Quesma Explores Novel AI’s Security Capabilities Against Supply-Chain Attacks

2026/02/13 21:00
2 min read
For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Built with world-class reverse engineer Michał “Redford” Kowalczyk, this open-source benchmark has sparked excitement among security experts, opening a new frontier in binary analysis.

WARSAW, Poland–(BUSINESS WIRE)–Quesma, Inc. announced BinaryAudit, the independent benchmark testing whether AI can find hidden threats in software binaries before they cause damage. The results show both promise and limitations: while AI can detect some threats, even the best-performing model, Claude Opus 4.6, succeeded only 49% of the time and frequently flagged safe software as dangerous.

Supply-chain attacks are already causing real-world damage. State-sponsored actors recently hijacked Notepad++, replacing legitimate binaries with infected ones. Shai Hulud 2.0 compromised thousands of organizations, including Fortune 500 companies and governments, stealing credentials. In the XZ Utils case, a long-term contributor legitimately gained ownership access using it to insert malicious code. Security weaknesses can also originate from vendors, including manufacturer-planted code to disable trains and hardcoded credentials in Cisco devices. These public cases are only a fraction of what exists.

Traditional binary reverse engineering is a last-resort method. It’s performed by a small pool of specialists, typically only after a breach or major incident. AI has the potential to transform this reactive approach into a proactive layer of defense, making it feasible to inspect software at any point – before deployment, during updates, before the purchase, or years after release. This could change how organizations approach supply-chain security, turning what was once an emergency response tool into a preventive safeguard.

“We were genuinely surprised that today’s LLMs can detect malicious code at all. At current performance levels, it’s an assistant, not a solution,” said Jacek Migdał, CEO of Quesma. “AI binary analysis could be a new layer of defence in supply-chain security. We hope new AI models released in the next 1-2 years will make binary analysis go mainstream. BinaryAudit helps to track and encourage progress in this field.”

BinaryAudit is available today at https://quesma.com/benchmarks/binaryaudit/.

ABOUT QUESMA:

Quesma is a technological company that evaluates and tests advanced AI models. It creates benchmarks to evaluate how frontier LLMs perform across critical domains, such as DevOps, security, and database migrations. Quesma is backed by Heartcore Capital, Inovo, Firestreak Ventures, and several angels, including Christina Beedgen, co-founder of Sumo Logic. For more information, visit www.quesma.com or follow on LinkedIn.

Contacts

Lucie Šimečková
Marketing

press@quesma.com

Market Opportunity
WorldAssets Logo
WorldAssets Price(INC)
$0.4986
$0.4986$0.4986
+0.02%
USD
WorldAssets (INC) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Aster Genesis Phase 2 will conclude on October 6, with Phase 3 to include spot trading volumes

Aster Genesis Phase 2 will conclude on October 6, with Phase 3 to include spot trading volumes

PANews reported on September 22nd that the decentralized exchange Aster announced that the second phase of Aster Genesis will conclude at 23:59 UTC on October 5th (07:59 Beijing Time on October 6th). With two cycles remaining, users can still trade and earn Rh points—4% of the total ASTER supply has been allocated for Phase 2 rewards. Phase 3 will follow shortly thereafter, incorporating spot trading points and updating the rewards mechanism.
Share
PANews2025/09/22 21:37
Xiaomi Stock: Flagship Phones Launch as Memory Prices Surge 80–90%

Xiaomi Stock: Flagship Phones Launch as Memory Prices Surge 80–90%

TLDR Xiaomi launched the Xiaomi 17 and 17 Ultra globally at Mobile World Congress, priced at 999 euros and 1,499 euros respectively Memory chip prices have surged
Share
Coincentral2026/03/02 18:30
GBP trades firmly against US Dollar

GBP trades firmly against US Dollar

The post GBP trades firmly against US Dollar appeared on BitcoinEthereumNews.com. Pound Sterling trades firmly against US Dollar ahead of Fed’s policy outcome The Pound Sterling (GBP) clings to Tuesday’s gains near 1.3640 against the US Dollar (USD) during the European trading session on Wednesday. The GBP/USD pair holds onto gains as the US Dollar remains on the back foot amid firm expectations that the Federal Reserve (Fed) will cut interest rates in the monetary policy announcement at 18:00 GMT. At the time of writing, the US Dollar Index (DXY), which tracks the Greenback’s value against six major currencies, holds onto losses near a fresh two-month low of 96.60 posted on Tuesday. Read more… UK inflation unchanged at 3.8%, Pound shrugs The British pound is unchanged on Wednesday, trading at 1.3645 in the European session. Today’s inflation report was a dour reminder that UK inflation remains entrenched. CPI for August was unchanged at 3.8% y/y, matching the consensus and its highest level since January 2024. Airfares decreased but this was offset by food and petrol prices. Monthly, CPI rose 0.3%, up from 0.1% in July and matching the consensus. Core CPI, which excludes volatile items such as food and energy, eased to 3.6% from 3.8%. Monthly, core CPI ticked up to 0.3% from 0.2%. The inflation report comes just a day before the Bank of England announces its rate decision. Inflation is almost double the BoE’s target of 2% and today’s release likely means that the BoE will not reduce rates before 2026. Read more… Source: https://www.fxstreet.com/news/pound-sterling-price-news-and-forecast-gbp-trades-firmly-against-us-dollar-ahead-of-feds-policy-outcome-202509171209
Share
BitcoinEthereumNews2025/09/18 01:50