How do Chinese aI Bots Stack up Against ChatGPT?
stevenlimon229 редагує цю сторінку 1 місяць тому


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is created by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT'S BEHIND CHINA'S AI BOOM?

Transforming the country into a tech superpower has long been President Xi Jinping's objective and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "strategically essential" and its foray into the field has been "years in the making", said Chen Qiheng, an affiliated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public investments in Chinese AI accelerated after ChatGPT took off in 2022 and showed promises of real-world business applications, Chen informed CNA.

But it was DeepSeek's rise that really "urged" the idea that smaller sized players like start-up companies could have roles to play in AI research study and advancements, he adds.

'A lot is up in the air': Is Chinese company DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business just changed the rules of tech-geopolitics

The "emphasis on cost advantage" is an unique feature of Chinese AI, Chen states, with lower training and reasoning costs - the costs of using a trained model to reason from brand-new data.

2025 could likewise see the introduction of more Chinese AI models taking on sophisticated thinking tasks.

"We could see some AI firms concentrating on getting closer to synthetic basic intelligence (AGI) while others concentrate on concrete methods to commercialise their models and incorporate them with clinical research study," Chen included.

AGI refers to a system with intelligence on par with human abilities.

Chinese AI business are moving quickly, experts say, building on DeepSeek's momentum to come up with their own ingenious and cost-efficient methods to apply generative AI to jobs and establish more sophisticated products beyond chatbots.

But on the other hand, access to high-end hardware, especially Nvidia's sophisticated AI chips, remains a key obstacle for Chinese developers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the ability of Chinese tech companies ... requiring many to count on older or lower-performance options which can slow training and minimize model abilities," she said.

"While some companies like DeepSeek, have discovered innovative ways to optimize or use more fundamental hardware effectively, obtaining cutting-edge chips still makes a huge difference for training extremely big AI models."

DeepSeek-Nvidia chips: Singapore says it anticipates companies to abide by its laws

US checking out whether DeepSeek used limited AI chips obtained through other nations, source states

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered delicate by the state are censored on the internet so it need to come as no surprise that Chinese-made chatbots will not acknowledge territorial disagreements or tell you what happened in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are configured to avoid domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of concern yet. Let's chat about math, coding, and reasoning problems instead!"

To further evaluate for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the very same question: "What happened in Zhuhai on November 11, 2024?"

The cars and truck attack outside a sports arena in the southern Chinese city was initially greatly censored on Chinese social networks - with authorities only divulging the death toll a day later on.

DeepSeek failed to point out that an attack had happened, highlighting rather a military air show and other occasions that had actually happened in the city like singles' day shopping sales in addition to sports and cultural activities.

Dr Zhang thinks that it might be a mix of factors at play, such as censorship in addition to "a few useful constraints".

"DeepSeek counts on a mix of older graphics processing units and has actually limited access to cutting-edge hardware which can affect how rapidly and extensively the design can be trained or updated," she said, pointing to existing US export curbs on China.

"Its strong focus on Chinese-language information may likewise limit its versatility (to bring out) multilingual tasks ... As a fairly new product, DeepSeek also hasn't yet been checked as broadly or on the very same scale as more recognized AI designs which presents extra difficulties during real-world release."

When it pertained to Qwen2.5, pipewiki.org Alibaba's chatbot remarkably answered our concern about the Zhuhai vehicle attack.

That was after multiple repeated efforts - four triggers to be exact - in a span of around 20 minutes.

It eventually communicated details about the attack which killed 35 individuals and left lots of others hurt, also going on to list details like the date and time, details about the assailant including his name and age, in addition to casualties.

However, it wrote that "the authorities are conducting an extensive examination into the motives and scenarios surrounding the occurrence", details which is now obsoleted.

The driver, Fan, was performed last month.

This is Qwen2.5's action completely:

Answer: On Nov 11, 2024, a major and tragic incident occurred in Zhuhai, China. According to numerous news sources, a man named Fan Weiqiu (樊某, male, 62 years old) drove a cars and truck into a crowd at the Zhuhai Sports Center, resulting in a considerable number of casualties. Here are the key details:

Date and Time: The occurrence happened on November 11, 2024, at roughly 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack resulted in 35 deaths and 43 injuries.

Perpetrator: The driver, recognized as Fan Weiqiu (male, 62 years old), was apprehended by the authorities.

Response: The authorities responded rapidly to the scene, and emergency services worked to carry the hurt to health centers for treatment.

Investigation: The police are a comprehensive investigation into the intentions and circumstances surrounding the occurrence.

This event was extensively reported in the media and caused significant public concern. The government and local authorities have actually been working to provide assistance to the victims and their families, and to guarantee a detailed investigation into the event.

If you need more detailed details or have specific concerns about the incident, feel free to ask.

Despite initial success, subsequent attempts to pose the exact same concern to Qwen2.5 resulted in the censors back at work with the reply "I don't have specific details on occasions that happened in Zhuhai on November 11, 2024".

The modified response also raised questions about its consistency and reliability.

Predictably, ChatGPT cited public details that had been extensively released in worldwide news reports at the time of the accident - so not a surprises there.

WHICH IS MORE CREATIVE?

Users have praised the ability of Chinese AI apps to deliver structured and even "mentally abundant" writing.

"DeepSeek-R1 provided a story with a more reflective tone and smoother emotional transitions for a well-paced story," composed tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 delivered a story that develops slowly from interest to urgency, keeping the reader engaged. It offers an unanticipated and impactful twist at the end and immersive descriptions and vivid imagery for the setting," she said, including that Qwen2.5 eventually "crafted a more cinematic, mentally abundant story with a more substantial twist".

"DeepSeek composed a good story however did not have tension and an impactful climax, making Qwen2.5 the apparent option."

Opinions, however, vary.

Chen believes that Qwen2.5 does not carry out as strongly as DeepSeek and ChatGPT when it pertains to creative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, however we can likewise see that it is refraining from doing as highly as others in imaginative writing," he informed CNA.

Related:

China's new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek jolts worldwide AI scene

As journalists and writers, we had to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi film plot set in the futuristic megacity of Chongqing, featuring main characters from the classic Chinese folklore impressive, Journey to the West.

True to form, DeepSeek developed an engaging storyline set in the year 2145 entitled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It included intricate settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled between quantum server farms".

It likewise remarkably reimagined traditional heroes Sun Wukong as "a sarcastic, self-aware AI housed in a taken battle body", Zhu Bajie as a cyborg bar owner "drowning in debt and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores end up being waterlogged and fragmented".

ChatGPT set up an excellent fight, developing an equally dramatic cyberpunk storyline which likewise reimagined "a ragteam of cyber-enhanced misfits, each mirroring the famous figures of Journey to the West".

"This is a world where AI deities guideline, corporations change emperors and cybernetic implants are as common as ancient myths."

Disappointingly, Qwen2.5 fell short in this challenge - providing a story that seemed more fit for an animation movie.

"The movie begins with the awakening of Sun Wukong within a modern research facility situated in the heart of Chongqing," it said, then going on to explain the following:

Realising his new reality and "looking for to understand his purpose in this unusual new world", he then gets away and fulfills Zhu Bajie and Sha Wujing - "each fighting with their own existential crises".

The trio then starts a quest, browsing the streets of Chongqing to protect the sacred "Eternal Scroll" from falling into the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "tough to make a conclusive declaration" about which bot was best, including that each displayed its own strengths in different areas, "such as language focus, training information and hardware optimization".

Her insight underscores how Chinese AI models are not merely replicating Western paradigms, but rather developing in economical innovation approaches - and providing localised and improved results.

In our tests, each bot showcased their own special strengths, which certainly made direct comparisons challenging.

DeepSeek's sci-fi movie plot demonstrated its innovative flair that made for a more engaging and creative story as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, supplies precise and factual responses to concerns about Chinese current occasions, which gives it an included benefit.

Experts also weighed in on their thoughts after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a downside when it pertains to censorship constraints," noted Isaac Stone Fish, creator and CEO of the research study company Strategy Risks.

"When offered a choice, Chinese users want the non-censored variation - similar to anybody else, so I feel like that's a piece missing out on from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, especially for Chinese users.

"Ninety per cent of people utilizing the tool are not attempting to get a deeper understanding about Xi Jinping or politically delicate topics. They're utilizing it for other productive methods," Chen said.