Adaptive Coder

Adaptive Coder transforms digital data into ATCG sequences for DNA storage at high logical density, while the output sequences comply with arbitrary user-defined constraints.
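To make "logical density" and "constraints" concrete, here is a minimal sketch. It is not the Adaptive Coder method: it uses a fixed 2-bits-per-base mapping and two commonly cited constraints (GC content and homopolymer run length). All function names and the default thresholds are assumptions chosen for illustration.

```python
# Illustrative only: a fixed 2-bit mapping and two example constraint checks.
# None of these names come from the Adaptive Coder code base.
BASES = "ACGT"


def bytes_to_dna(data: bytes) -> str:
    """Map each byte to four bases (2 bits per base, high bits first)."""
    return "".join(
        BASES[(byte >> shift) & 0b11] for byte in data for shift in (6, 4, 2, 0)
    )


def dna_to_bytes(seq: str) -> bytes:
    """Invert bytes_to_dna; the sequence length must be a multiple of 4."""
    if len(seq) % 4:
        raise ValueError("sequence length must be a multiple of 4")
    out = bytearray()
    for i in range(0, len(seq), 4):
        value = 0
        for base in seq[i:i + 4]:
            value = (value << 2) | BASES.index(base)
        out.append(value)
    return bytes(out)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    return sum(base in "GC" for base in seq) / len(seq) if seq else 0.0


def max_homopolymer(seq: str) -> int:
    """Length of the longest run of one repeated base."""
    longest = run = 0
    previous = ""
    for base in seq:
        run = run + 1 if base == previous else 1
        longest = max(longest, run)
        previous = base
    return longest


def satisfies(seq: str, gc_range=(0.4, 0.6), max_run=3) -> bool:
    """Check the two example constraints; thresholds are assumed defaults."""
    low, high = gc_range
    return low <= gc_content(seq) <= high and max_homopolymer(seq) <= max_run
```

The fixed mapping reaches 2 bits per nucleotide but cannot guarantee any constraint: a zero byte maps to `AAAA`, which fails the GC-content check. Meeting constraints without giving up much density is the problem Adaptive Coder addresses.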

First time setup

The following steps are required in order to run Adaptive Coder:

- Install Docker.
  - Install NVIDIA Container Toolkit for GPU support.
  - Set up running Docker as a non-root user.
- Check that GPUs are available by running:
```
docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi
```
The output of this command should show a list of your GPUs.
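If you want to run this check from a setup script, the sketch below wraps the same command in Python and reports only whether it succeeded. The function name `gpus_visible` and the timeout value are assumptions for illustration. The first run may need to pull the image, hence the generous timeout.

```python
import shutil
import subprocess

# The same command as above, split into arguments.
GPU_CHECK = [
    "docker", "run", "--rm", "--gpus", "all",
    "nvidia/cuda:11.0-base", "nvidia-smi",
]


def gpus_visible(command=GPU_CHECK, timeout=300):
    """Return True if the command exits with status 0, False otherwise."""
    # Bail out early if the executable (docker by default) is not on PATH.
    if shutil.which(command[0]) is None:
        return False
    try:
        result = subprocess.run(
            command, capture_output=True, text=True, timeout=timeout
        )
    except (OSError, subprocess.TimeoutExpired):
        return False
    return result.returncode == 0


if __name__ == "__main__":
    print("GPUs visible to Docker:", gpus_visible())
```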

Running Adaptive Coder

The simplest way to run Adaptive Coder is using the provided Docker script. The setup was tested on a machine with 20 vCPUs, 64 GB of RAM, and a 3090 GPU.

Launch the NVIDIA-maintained container by running:

docker run --gpus all -it --rm nvcr.io/nvidia/tensorflow:xx.xx-tf1-py3

Here xx.xx is the container version, for example 21.12.
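As a small sketch of how the version placeholder fits into the image name, the helper below validates a version string and builds the full `docker run` argument list. The function names are hypothetical. The check covers only the `xx.xx` format, not whether that tag actually exists in the registry.

```python
import re


def ngc_tensorflow_tf1_image(version: str) -> str:
    """Build the image name for a container version such as '21.12'."""
    if not re.fullmatch(r"\d{2}\.\d{2}", version):
        raise ValueError(f"expected a version like '21.12', got {version!r}")
    return f"nvcr.io/nvidia/tensorflow:{version}-tf1-py3"


def docker_run_command(version: str) -> list:
    """Argument list equivalent to the docker run command shown above."""
    image = ngc_tensorflow_tf1_image(version)
    return ["docker", "run", "--gpus", "all", "-it", "--rm", image]


print(" ".join(docker_run_command("21.12")))
```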

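Install the bert4keras dependency in the running container, then commit the container as a new image for later use.
```
# Inside the running container:
pip install bert4keras
# In a second terminal on the host (find the container ID with `docker ps`):
docker commit <CONTAINER ID> adaptive-coder:1.0
```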

Clone this repository to your machine and cd into it.

git clone https://github.com/chill868686/adaptive-coder.git

Install the run_docker.py dependencies. Note: You can create a new environment with Conda or virtualenv to prevent conflicts with your system's Python environment.
```
pip3 install -r docker/requirements.txt
```

Run run_docker.py, pointing it to a file containing the digital data or DNA sequences that you wish to transform. You can optionally pass parameters to control the coding:

   python docker/run_docker.py --file_path=(file_path) [OPTIONS]
   OPTIONS (defaults):
     --log=running.log \
     --model=best_model.weights \
     --docker_image_name=adaptive-coder:1.0 \
     --coding_type=en_decoding|encoding|decoding|training
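The option list above can be sketched as a small command builder. This is an illustration under stated assumptions, not part of the repository: `build_command` is a hypothetical helper, and it passes the documented defaults explicitly, which is assumed to be equivalent to omitting them. It leaves `--coding_type` off unless requested, because the README does not state the default.

```python
# Documented defaults from the option list; build_command itself is hypothetical.
DEFAULTS = {
    "log": "running.log",
    "model": "best_model.weights",
    "docker_image_name": "adaptive-coder:1.0",
}
CODING_TYPES = ("en_decoding", "encoding", "decoding", "training")


def build_command(file_path, coding_type=None, **options):
    """Return the run_docker.py invocation as an argument list."""
    unknown = set(options) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown options: {sorted(unknown)}")
    if coding_type is not None and coding_type not in CODING_TYPES:
        raise ValueError(f"coding_type must be one of {CODING_TYPES}")
    merged = {**DEFAULTS, **options}  # user-supplied values override defaults
    command = ["python", "docker/run_docker.py", f"--file_path={file_path}"]
    command += [f"--{name}={value}" for name, value in merged.items()]
    if coding_type is not None:
        command.append(f"--coding_type={coding_type}")
    return command


print(" ".join(build_command("mutimedias/poetry.txt", coding_type="encoding")))
```

The resulting list can be passed to `subprocess.run` from the repository root.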

The following usage patterns are supported:

DNA encoding and decoding:

 python docker/run_docker.py --file_path=mutimedias/poetry.txt

DNA encoding:

 python docker/run_docker.py --file_path=mutimedias/poetry.txt --coding_type=encoding

DNA decoding:

 python docker/run_docker.py --file_path=results/encodes/poetry.txt.dna --coding_type=decoding

Model training:

 python docker/run_docker.py --file_path=datasets/seq_good_256_m.txt --coding_type=training
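The separate encoding and decoding commands can be chained into a round trip. The sketch below assumes that encoding writes its output to `results/encodes/<input file name>.dna`. That layout is inferred only from the decoding example above and is not confirmed elsewhere in this README. The helper names are hypothetical.

```python
from pathlib import PurePosixPath

# Assumption: output location inferred from the decoding example above.
ENCODE_DIR = PurePosixPath("results/encodes")


def encoded_path(input_path: str) -> str:
    """Guess where the encoded file for input_path is written."""
    return str(ENCODE_DIR / (PurePosixPath(input_path).name + ".dna"))


def round_trip_commands(input_path: str) -> list:
    """Return the encode command followed by the matching decode command."""
    script = ["python", "docker/run_docker.py"]
    return [
        script + [f"--file_path={input_path}", "--coding_type=encoding"],
        script + [f"--file_path={encoded_path(input_path)}", "--coding_type=decoding"],
    ]


for command in round_trip_commands("mutimedias/poetry.txt"):
    print(" ".join(command))
```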
