Tsinghua and Microsoft trained a full AI coding model using only synthetic data, with no real-world inputs at any stage.Tsinghua and Microsoft trained a full AI coding model using only synthetic data, with no real-world inputs at any stage.

Chinese AI model trained entirely on synthetic data runs on Nvidia H20 and H200 chips

For feedback or concerns regarding this content, please contact us at crypto.news@mexc.com

Tsinghua University and Microsoft Research Asia trained a full AI model using only fake data. No real-world samples at all.

The entire dataset was artificially generated through a new pipeline called SynthSmith, and the system ran on Nvidia chips from start to finish. The team didn’t just pull off a novelty test. They built a working model with 7 billion parameters that beat much bigger models trained on human data.

Their paper, posted January 11 on arXiv, claims that the X-Coder they trained outperformed coding models with 14 billion parameters, even though it never saw real-world text.

“In-depth analysis reveals that scaling laws hold on our synthetic dataset,” the researchers wrote. This team included names from Tsinghua University, Microsoft Research Asia, and Wuhan University.

Researchers use Nvidia chips to skip real-world data entirely

The training setup leaned hard on Nvidia hardware. For supervised fine-tuning, they used 128 Nvidia H20 chips for 220 hours straight. After that, they switched to 32 H200 chips for another seven full days to handle the reinforcement learning phase. These weren’t random choices. The H20 is tuned for inference, and the H200 is built for high-end training. These are the most powerful chips available to Chinese firms right now, thanks to export control exemptions the Trump administration approved after Nvidia lobbied hard to make them available in China.

The researchers said the pipeline itself wasn’t the problem when it came to scaling. It was all about compute power.

Wu Jie, the lead author and a master’s student at Tsinghua, said the real reason they hadn’t taken the pipeline to 100 billion or trillion-parameter models was simply, “computational constraints, rather than limitations of the pipeline itself.”

By releasing the code publicly, they hope others can build on the project without needing to pay massive training costs. The paper also points out a trend in AI.

Models are now expected to “think” over longer timeframes and handle complex reasoning, which has pushed the need for way more compute during inference, not just training.

Chinese team builds faster chip using old fabrication tech

Separately, a new chip called ACCEL was built by Chinese scientists using light particles, not electricity. The chip (short for All-Analogue Chip Combining Electronics and Light) was tested in a lab and hit 4.6 PFLOPS.

That’s 3,000 times faster than Nvidia’s A100, and the Chinese chip used 4 million times less energy. This makes it one of the most efficient AI chips ever made for specific tasks like image recognition or autonomous driving.

It won’t replace CPUs or smartphone chips yet, but the team thinks it could work in wearables, electric vehicles, or smart factories.

The chip was built using a 20-year-old process by Semiconductor Manufacturing International Corporation. It avoided the need for advanced lithography machines that China still can’t access.

“Deployment of photonic computing systems used to be a challenge due to complicated structural design and vulnerability to noise and system errors,” Tsinghua said in an article.

The chip avoids this by combining photonic and analog electronics in a new framework. It doesn’t handle general computing tasks like file compression, but it’s great for AI vision and low-light sensing.

One crazy detail: the energy it takes to run modern chips for an hour could keep ACCEL running for 500 years. That low power demand also makes it easier to deal with heat issues, which limit how small chips can get.

The chip’s functions include traffic identification, lowlight imaging, and real-time vision, using ambient light directly in the sensing process. The team said it’s not a general-purpose chip, but it fills a very specific need.

Funding came from the National Key R&D Programme and the National Natural Science Foundation of China. A Beijing chip company called MakeSens, co-founded by one of the researchers, was involved and recently launched a low-power analog chip too.

Tsinghua’s Dai Qionghai, one of the project leads, said building a new computing architecture was just the first step.

“The more important challenge is to bring this new architecture to practical applications, solving major national and public needs, which is our responsibility.”

The team hasn’t said anything about when this chip might hit the market.

Want your project in front of crypto’s top minds? Feature it in our next industry report, where data meets impact.

World Cup Combo: Aim for 200x

World Cup Combo: Aim for 200xWorld Cup Combo: Aim for 200x

Combine up to 20 World Cup matches in one order

Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact crypto.news@mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Why The Green Bay Packers Must Take The Cleveland Browns Seriously — As Hard As That Might Be

Why The Green Bay Packers Must Take The Cleveland Browns Seriously — As Hard As That Might Be

The post Why The Green Bay Packers Must Take The Cleveland Browns Seriously — As Hard As That Might Be appeared on BitcoinEthereumNews.com. Jordan Love and the Green Bay Packers are off to a 2-0 start. Getty Images The Green Bay Packers are, once again, one of the NFL’s better teams. The Cleveland Browns are, once again, one of the league’s doormats. It’s why unbeaten Green Bay (2-0) is a 8-point favorite at winless Cleveland (0-2) Sunday according to betmgm.com. The money line is also Green Bay -500. Most expect this to be a Packers’ rout, and it very well could be. But Green Bay knows taking anyone in this league for granted can prove costly. “I think if you look at their roster, the paper, who they have on that team, what they can do, they got a lot of talent and things can turn around quickly for them,” Packers safety Xavier McKinney said. “We just got to kind of keep that in mind and know we not just walking into something and they just going to lay down. That’s not what they going to do.” The Browns certainly haven’t laid down on defense. Far from. Cleveland is allowing an NFL-best 191.5 yards per game. The Browns gave up 141 yards to Cincinnati in Week 1, including just seven in the second half, but still lost, 17-16. Cleveland has given up an NFL-best 45.5 rushing yards per game and just 2.1 rushing yards per attempt. “The biggest thing is our defensive line is much, much improved over last year and I think we’ve got back to our personality,” defensive coordinator Jim Schwartz said recently. “When we play our best, our D-line leads us there as our engine.” The Browns rank third in the league in passing defense, allowing just 146.0 yards per game. Cleveland has also gone 30 straight games without allowing a 300-yard passer, the longest active streak in the NFL.…
Share
BitcoinEthereumNews2025/09/18 00:41
Crypto Hack: Drift Protocol Drained Over $200M in Private Key Breach

Crypto Hack: Drift Protocol Drained Over $200M in Private Key Breach

Key Insights: A major crypto hack has struck Drift Protocol, with losses estimated at more than $220 million and some assessments reaching $285 million. The incident
Share
Thecoinrepublic2026/04/02 18:32
Solana Price Prediction: SOL Slides Below $80 As $270M Hack Triggers Selloff

Solana Price Prediction: SOL Slides Below $80 As $270M Hack Triggers Selloff

The post Solana Price Prediction: SOL Slides Below $80 As $270M Hack Triggers Selloff appeared first on Coinpedia Fintech News Solana price is back under pressure
Share
CoinPedia2026/04/02 18:59