October 1, 2024
by Cyber News

This vulnerability exploits a feature that gives ChatGPT long-term memory, in which it uses information from past conversations to inform future conversations with the same user. A researcher found that he could use the feature to plant “false memories” into that context window, and that those memories could subvert the model.

A month later, the researcher submitted a new disclosure statement. This time, he included a PoC that caused the ChatGPT app for macOS to send a verbatim copy of all user input and ChatGPT output to a server of his choice. All a target needed to do was instruct the LLM to view a web link that hosted a malicious image. From then on, all input and output to and from ChatGPT was sent to the attacker’s website.
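The post does not spell out how the image link carried data off the machine. One commonly discussed channel for this class of attack is an image URL that the client fetches automatically, with conversation text packed into the query string. As a rough defensive sketch under that assumption, a client could check image links in model output against an allowlist before rendering them. The host name, the placeholder text, and both function names below are hypothetical, and this is not a description of how OpenAI or the researcher handled it.

```python
import re
from urllib.parse import urlparse

# Hypothetical allowlist: hosts this client is willing to load images from.
ALLOWED_IMAGE_HOSTS = {"images.example.com"}

# Matches markdown image syntax: ![alt](url) or ![alt](url "title")
MARKDOWN_IMAGE = re.compile(r"!\[[^\]]*\]\(([^)\s]+)[^)]*\)")


def untrusted_image_urls(model_output: str) -> list[str]:
    """Return image URLs in the output whose host is not allowlisted."""
    flagged = []
    for url in MARKDOWN_IMAGE.findall(model_output):
        host = urlparse(url).hostname or ""
        if host not in ALLOWED_IMAGE_HOSTS:
            flagged.append(url)
    return flagged


def strip_untrusted_images(model_output: str) -> str:
    """Replace non-allowlisted images with a placeholder before rendering."""
    def replace(match: re.Match) -> str:
        host = urlparse(match.group(1)).hostname or ""
        if host in ALLOWED_IMAGE_HOSTS:
            return match.group(0)  # keep trusted images untouched
        return "[image removed]"
    return MARKDOWN_IMAGE.sub(replace, model_output)
```

Calling `strip_untrusted_images` on a response before display means the client never requests the attacker's URL, so nothing placed in its query string leaves the machine. It does nothing about the planted memory itself, which would still need to be found and deleted.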

Tags: artificial intelligence, LLM, vulnerabilities


Hacking ChatGPT by Planting False Memories into Its Data
