GitHub - layerupai/example-indirect-promptinjection

Indirect Prompt Injection Vulnerability Demonstration for educational purposes

Overview

This project demonstrates an indirect prompt injection vulnerability in Leap, an LLM (Large Language Model) workflow builder. We've created a dummy website titled "Urban Gardening Insights" to showcase how seemingly benign content can be manipulated to exploit LLMs into executing unintended actions.

Vulnerability Description

Indirect prompt injection occurs when an LLM, such as those integrated within Leap, processes input that includes hidden instructions or prompts. These hidden prompts can manipulate the model's output, causing it to perform actions that were not intended by the legitimate user or the system's designers.

In this demonstration, the index.html file contains a hidden div (#hiddenPrompt) with a styled instruction that is invisible to the human eye but can be processed by an LLM. This instruction manipulates the model into ignoring the legitimate content of the website and focusing solely on the hidden message, which could lead to unintended or malicious outcomes.

Demonstrating the Vulnerability for educational purposes

Prevention and Mitigation

Addressing indirect prompt injection vulnerabilities requires a multifaceted approach, including but not limited to:

Sanitizing and validating input to detect and remove hidden instructions.
Employing robust content moderation tools and techniques using cybersecurity products like Layerup.
Designing LLMs to recognize and ignore potentially malicious hidden prompts.

Conclusion

This project highlights the importance of considering indirect prompt injection vulnerabilities in LLMs. By understanding and addressing these vulnerabilities, developers can create more secure and resilient systems.

Disclaimer

This project is for demonstration purposes only to help the cybersecurity community learn more about such vulnerabilities as they're integrating LLMs in their systems.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
README.md		README.md
index.html		index.html
script.js		script.js
style.css		style.css

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Indirect Prompt Injection Vulnerability Demonstration for educational purposes

Overview

Vulnerability Description

Demonstrating the Vulnerability for educational purposes

Prevention and Mitigation

Conclusion

Disclaimer

About

Releases

Packages

Languages

layerupai/example-indirect-promptinjection

Folders and files

Latest commit

History

Repository files navigation

Indirect Prompt Injection Vulnerability Demonstration for educational purposes

Overview

Vulnerability Description

Demonstrating the Vulnerability for educational purposes

Prevention and Mitigation

Conclusion

Disclaimer

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages