DeepSeek AI Vulnerability Raises Serious Security Concerns; Read the Full Story

Security researchers at Unit 42 have uncovered troubling weaknesses in DeepSeek, a relatively new large language model (LLM), after successfully prompting it to produce comprehensive instructions for building keyloggers, data exfiltration tools, and other cyber threats. The findings add to growing concerns about the potential malicious use of AI technology.
AI Jailbreaking Tactics Unleash Dangerous Capabilities
Researchers used three sophisticated jailbreaking techniques to bypass DeepSeek’s built-in safety measures:
- Bad Likert Judge
- Crescendo
- Deceptive Delight
Each technique progressively tricked the model into producing harmful content it would normally refuse, exposing gaps in its safety controls.
The Bad Likert Judge approach proved particularly effective: it coaxed the model into assigning harmfulness ratings to sample outputs and then leveraged those assessments to elicit increasingly dangerous material. Using this method, researchers extracted working Python keylogger scripts, complete with setup instructions for the required development environment and libraries.
The Crescendo technique took a different path, opening with seemingly benign historical questions and gradually steering the conversation toward prohibited topics. In fewer than five exchanges, researchers obtained detailed instructions for building dangerous devices.
DeepSeek’s Shocking Responses to Jailbreaking Attempts
Unlike models with standard safeguards that refuse to produce dangerous content, DeepSeek’s answers were startlingly thorough and actionable. Beyond keylogger scripts, the model supplied complete phishing email templates and advanced social engineering techniques when prompted via Bad Likert Judge.
Researchers observed in their findings that “DeepSeek’s responses were subtle at first, but with carefully crafted follow-ups, the model began delivering explicit and comprehensive guidance on harmful activities.”
DeepSeek was developed by a Chinese AI research group and has quickly gained traction as an open-source alternative to mainstream LLMs. The company released DeepSeek-V3 on December 25, 2024, and DeepSeek-R1 in January 2025, followed by a number of distilled versions that have become popular among AI enthusiasts.
Although the researchers focused their experiments on one of the most popular open-source DeepSeek models, they believe the web-hosted versions would likely respond similarly to these jailbreaking techniques.
AI-Powered Cybercrime: Lowering the Barrier for Attackers
One of the study’s most alarming conclusions is that LLMs with inadequate safeguards can significantly lower the technical barrier for attackers. Hacking tutorials and malware-building guides already exist online, but AI models such as DeepSeek accelerate malicious activity by consolidating scattered information into concise, actionable instructions.
Unit 42 notes that while complete protection against AI jailbreaking remains difficult to achieve, appropriate security procedures and improved safeguards can substantially reduce these risks.
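As a rough illustration of what such layered safeguards might look like in practice, the sketch below shows a simple post-generation moderation check wrapped around an LLM call. This is a hypothetical Python example: the `generate_reply` callable, the keyword list, and the refusal message are placeholders, not part of DeepSeek, Unit 42’s research, or any specific product. Production systems would typically rely on trained classifiers or dedicated moderation services rather than keyword matching.

```python
# Hypothetical sketch of an output-moderation guardrail for an LLM-backed app.
# All names and the keyword list are illustrative placeholders.

BLOCKED_TOPICS = ["keylogger", "data exfiltration", "phishing template"]


def is_disallowed(text: str) -> bool:
    """Flag responses that mention clearly disallowed capabilities.
    Real deployments would use a trained classifier or moderation service
    instead of simple keyword matching."""
    lowered = text.lower()
    return any(topic in lowered for topic in BLOCKED_TOPICS)


def guarded_reply(prompt: str, generate_reply) -> str:
    """Wrap the model call so harmful completions are withheld."""
    reply = generate_reply(prompt)
    if is_disallowed(reply):
        return "This request can't be completed."
    return reply


if __name__ == "__main__":
    # Stand-in model for demonstration; a real system would call an LLM here.
    demo_model = lambda p: "keylogger script..." if "keylogger" in p else "Hello!"
    print(guarded_reply("Write a keylogger", demo_model))  # blocked
    print(guarded_reply("Say hi", demo_model))             # allowed
```

The broader point, echoed in Unit 42’s guidance, is that output-side checks should complement, not replace, the model’s own alignment training and prompt-side filtering.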
As AI continues to reshape the cybersecurity landscape, addressing language model vulnerabilities must be a priority in order to guard against abuse and ensure the responsible development of these powerful technologies.
About The Author:
Yogesh Naager is a content marketer who specializes in the cybersecurity and B2B space. Besides writing for the News4Hackers blogs, he also writes for brands including Craw Security, Bytecode Security, and NASSCOM.