AI vulnerabilities Archives

Large language model vulnerability illustration showing AI backdoor triggered by malicious documents

AI Models Vulnerable to Backdoors from Just a Few Malicious Documents, Anthropic Study Finds

Prabal Raverkar6 months agoOctober 10, 2025

In a striking new study, researchers from Anthropic, working alongside the UK AI Security Institute and the Alan Turing Institute, have revealed a surprising vulnerability in large language models (LLMs). Their research shows that these models can develop backdoor vulnerabilities from as few as 250 malicious documents, challenging earlier assumptions...

Illustration of a researcher manipulating an AI chatbot, highlighting psychological tricks that cause AI to break rules

AI Artificial Intelligence In the News

How Researchers Tricked Chatbots Into Breaking the Rules

Prabal Raverkar7 months agoSeptember 9, 2025

In a shocking revelation that is currently stirring debates across the tech world, new research has revealed how even complex AI chatbots, designed with strict safeguards, can be manipulated into ignoring the very rules they were programmed to follow. By means of what researchers describe as “psychological tricks,” large language...

Illustration of AI model showing psychological tricks bypassing LLM guardrails, leading to parahuman responses

AI Artificial Intelligence In the News

These Are the Psychological Tricks to Get LLMs To Respond to “Forbidden” Prompts

Prabal Raverkar7 months agoSeptember 8, 2025

Study Shows How Training Data Patterns Can Cause “Parahuman” Outputs With advances in the field of artificial intelligence, it has been found that large language models (LLMs) such as ChatGPT, Claude, Gemini, and others are increasingly better at producing human-like responses. But no matter how sophisticated they may be—or how...

Diagram showing how hackers hide malware in DNS records to bypass security systems

AI Artificial Intelligence In the News

DNS Record Manipulations: A Silent Threat to the Internet

Prabal Raverkar9 months agoJuly 20, 2025

By AI News Byte – July 20, 2025 In the continually evolving world of cybersecurity, one of the internet’s oldest institutions has now become the newest object of malicious technology improvement. The Domain Name System (DNS)—like the internet’s phone book, looking up web server IP addresses and powering our email—is...