liuyaojialiuyaojia

liuyaojialiuyaojia

Highlights

Awesome-LLM-Security-Paper Awesome-LLM-Security-Paper Public

Your best llm security paper library

5 1
lm-evaluation-harness lm-evaluation-harness Public

Forked from EleutherAI/lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python
axolotl axolotl Public

Forked from axolotl-ai-cloud/axolotl

Go ahead and axolotl questions

Python
Yunji-v1 Yunji-v1 Public
refusal_direction refusal_direction Public

Forked from andyrdt/refusal_direction

Paper1实验 —— 复现Refusal in Language Models Is Mediated by a Single Direction

Python