PANews reported on September 26th that OpenAI launched a new evaluation tool, GDPval , which focuses on measuring AI performance on real-world economic value tasks. GDPval covers 44 occupations across the nine industries that contribute most to US GDP . The tasks were designed by industry experts with an average of 14 years of experience. Evaluation results show that nearly half of the outputs of the Claude Opus 4.1 model are comparable to or better than expert performance. OpenAI stated that it will continue to expand the scope and details of GDPval's evaluations in the future.PANews reported on September 26th that OpenAI launched a new evaluation tool, GDPval , which focuses on measuring AI performance on real-world economic value tasks. GDPval covers 44 occupations across the nine industries that contribute most to US GDP . The tasks were designed by industry experts with an average of 14 years of experience. Evaluation results show that nearly half of the outputs of the Claude Opus 4.1 model are comparable to or better than expert performance. OpenAI stated that it will continue to expand the scope and details of GDPval's evaluations in the future.

OpenAI releases GDPval to assess AI's economic value task performance

저자: PANews

출처: PANews

2025/09/26 08:18

1분 읽기

SLEEPLESSAI$0.02156-10.87%

REAL$0.07173-2.32%

이 콘텐츠에 대한 의견이나 우려 사항이 있으시면 crypto.news@mexc.com으로 연락주시기 바랍니다

Don't Miss $200,000 U-Fest

Get mystery boxes, 12% APR & $200 new user gifts!

면책 조항: 본 사이트에 재게시된 글들은 공개 플랫폼에서 가져온 것으로 정보 제공 목적으로만 제공됩니다. 이는 반드시 MEXC의 견해를 반영하는 것은 아닙니다. 모든 권리는 원저자에게 있습니다. 제3자의 권리를 침해하는 콘텐츠가 있다고 판단될 경우, crypto.news@mexc.com으로 연락하여 삭제 요청을 해주시기 바랍니다. MEXC는 콘텐츠의 정확성, 완전성 또는 시의적절성에 대해 어떠한 보증도 하지 않으며, 제공된 정보에 기반하여 취해진 어떠한 조치에 대해서도 책임을 지지 않습니다. 본 콘텐츠는 금융, 법률 또는 기타 전문적인 조언을 구성하지 않으며, MEXC의 추천이나 보증으로 간주되어서는 안 됩니다.