
    U.S. Commerce Sec. Lutnick says American AI dominates DeepSeek, thanks Trump for AI Action Plan — OpenAI and Anthropic beat Chinese models across 19 different benchmarks

    By AI Logic News, October 2, 2025

    The National Institute of Standards and Technology (NIST) has just completed a comprehensive test of Chinese and American AI models, with the results showing that models from OpenAI and Anthropic outperformed DeepSeek across 19 different benchmarks. U.S. Commerce Secretary Howard Lutnick shared the results on X, thanking President Donald Trump for his AI Action Plan, which aims to accelerate American AI innovation and infrastructure while encouraging U.S. allies and friendly nations to adopt it.

    “The report is clear: DeepSeek lags far behind, especially in cyber and software engineering. These weaknesses aren’t just technical. They demonstrate why relying on foreign AI is dangerous and shortsighted,” Sec. Lutnick said in his post. “Allowing our adversaries to control AI poses serious risks to our security. By setting the standards, driving innovation, and keeping America secure, the Department of Commerce is helping ensure continued U.S. leadership in AI.”

    NIST is a federal agency under the Commerce Department that develops standards and supports industry to help keep the U.S. industrially competitive worldwide. It conducted this study through its newly established Center for AI Standards and Innovation (CAISI).


    The tests pitted DeepSeek’s R1, R1-0528, and V3.1 models (crucially, not DeepSeek’s new V3.2, released this week) against OpenAI’s GPT-5, GPT-5-mini, and gpt-oss, and Anthropic’s Opus 4, using 19 different benchmarks. The publicly available tests include SWE-bench Verified and Breakpoint for software engineering; MMLU-Pro and GPQA for general knowledge; the SMT 2025, PUMaC 2024, and OTIS-AIME 2025 math contests for mathematical reasoning; and the AgentDojo framework for resilience to hijacking attacks. In addition, the institution developed its own assessments to test for things like CCP censorship, as there is no standard test for that.

    All the results were outlined in a 69-page document (PDF), with CAISI saying that the OpenAI and Anthropic models outperform DeepSeek in all tests, most notably in software engineering and cyber tasks. The U.S. AI models generally outperform DeepSeek by 20 to 80%, and cost around 35% less to operate. DeepSeek’s models are also easier to hijack and jailbreak, making them more susceptible to acting in unintended ways. The report also said that Chinese models are biased and that they toe the line when it comes to messaging from Beijing, although it’s worth bearing in mind that other AI benchmarking tools exist that might yield different results.

    Despite all this, DeepSeek R1 continues to be adopted, with CAISI saying that the “use of these models may pose a risk to application developers, to consumers, and to U.S. national security.” Beyond that, the Chinese AI company continues to release new models, with DeepSeek-V3.2-Exp arriving earlier this week, possibly rendering some of these tests moot.

