SAE Evolver

This project uses Sparse Autoencoders (SAE) and Evolutionary Algorithms to evolve selections of SAE that align towards objectives. It leverages the Groq API and implements custom evolution strategies to optimize chatbot behavior based on specified criteria.

About

This is an open research project originally created in the AI Tinkerers group in Denver. The goal of this project is to use combinations of SAE to get desired behaviors from base models, which can be specified through data - see the examples/*.yaml

Examples

Create a chatbot from a base model without RLHF
Create a model that only outputs JSON without strict grammar rules
Anything else you can imagine - see the examples/*.yaml

Features

Evolutionary algorithm for LLM SAE optimization
Integration with Groq API
Customizable criteria for SAE candidate evaluation
Population-based approach with elite selection
Configurable evolution parameters

Prerequisites

Python 3.7+
Groq API key

Installation

Clone this repository:

git clone https://github.com/255BITS/sae-evolver.git
cd sae-evolver

Install the required dependencies:
```
pip install -r requirements.txt
```
Set up your Groq API key as an environment variable:
```
export GROQ_API_KEY=your_api_key_here
```

Usage

Run the main script sae_search.py with desired parameters. Here's an example:

python sae_search.py --cycles 10 --elite 5 --population 20 --initial-population 10 --criteria examples/happy-chatbot.yaml --model llama3-70b-8192 --coeff-start 30 --coeff-end 120

Parameters

--cycles: Number of evolution cycles (default: 10)
--elite: Number of elite candidates to preserve (default: 5)
--population: Total population size (default: 15)
--initial-population: Initial population size (default: 2)
--criteria: YAML file containing evolution criteria (default: "examples/sports_coach.yaml")
--model: Groq model to use (default: "llama-3.1-70b-versatile")
--coeff-start: Start of coefficient range (default: 40)
--coeff-end: End of coefficient range (default: 200)
--seed: Random seed for reproducibility (optional)

Project Structure

sae_search.py: Main script for running the evolution process
sae_evolution.py: Contains evolution-related functions (Candidate, breed, mutation, crossover, etc.)
examples/: Directory containing sample criteria YAML files
results/: Directory where evolution results are stored

How It Works

The script initializes a population of chatbot candidates with random SAE configurations.
For each evolution cycle:
- Candidates are compared based on the specified criteria using the Groq API.
- The best-performing candidates are selected as elites.
- New candidates are generated through breeding, mutation, and crossover.
The process repeats for the specified number of cycles.
The final population is saved, and the best candidate is rendered in HTML.

Extras

Create an Example:

python3 metaprompt.py 'Expand a given word with vivid imagery'

Run Command Line REPL:

python3 steer_gamma_repl.py candidate examples/buick.yml

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Are you a researcher? We could use some help with studying this technique. It's available for anyone to build on. If you want to be nice reference this github repo.

Citation

  255labs.xyz · sae-evolver, 2024 · GitHub repository: https://github.com/255BITS/sae-evolver

License

MIT

References

Groq https://groq.com/
JumpRELU https://arxiv.org/abs/2407.14435
GemmaScope https://huggingface.co/google/gemma-scope

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
examples		examples
gemma		gemma
.gptignore		.gptignore
LICENSE		LICENSE
README.md		README.md
gemma_test.py		gemma_test.py
logo.svg		logo.svg
metaprompt.py		metaprompt.py
metaprompt.txt		metaprompt.txt
sae_evolution.py		sae_evolution.py
sae_search.py		sae_search.py
steer_gamma_repl.py		steer_gamma_repl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAE Evolver

About

Examples

Features

Prerequisites

Installation

Usage

Parameters

Project Structure

How It Works

Extras

Create an Example:

Run Command Line REPL:

Contributing

Citation

License

References

About

Releases

Packages

Contributors 2

Languages

License

255BITS/sae-evolver

Folders and files

Latest commit

History

Repository files navigation

SAE Evolver

About

Examples

Features

Prerequisites

Installation

Usage

Parameters

Project Structure

How It Works

Extras

Create an Example:

Run Command Line REPL:

Contributing

Citation

License

References

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages