AminoExtract is a tool for translating and extracting amino acid sequences from nucleotide-sequences based on a GFF input

RIVM-bioinformatics/AminoExtract

AminoExtract

AminoExtract is a CLI tool and library to work with GFF files, and extract amino acid sequences from nucleotide sequences based on a GFF input file.

AminoExtract can filter the genomic features in the input files so that your output makes sense. It writes the resulting amino acid sequences either to a single file or to individual files for every feature, depending on your inputs.
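To make the idea concrete, here is a minimal sketch of what "extracting and translating a feature" means. It is not AminoExtract's implementation: the function names `extract_feature` and `translate` are hypothetical, it uses only the Python standard library, and it assumes the standard genetic code and GFF's 1-based, inclusive coordinates.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def extract_feature(sequence: str, start: int, end: int, strand: str) -> str:
    """Return the nucleotides of one feature (hypothetical helper).

    GFF coordinates are 1-based and inclusive, so `start - 1` converts
    them to a Python slice. Features on the '-' strand are
    reverse-complemented.
    """
    sub = sequence.upper()[start - 1:end]
    if strand == "-":
        sub = sub.translate(COMPLEMENT)[::-1]
    return sub


def translate(nucleotides: str) -> str:
    """Translate complete codons; unknown codons become 'X', stops '*'."""
    usable = len(nucleotides) - len(nucleotides) % 3
    return "".join(
        CODON_TABLE.get(nucleotides[i:i + 3], "X")
        for i in range(0, usable, 3)
    )


# A toy reference with a CDS at positions 3..14 on the forward strand.
reference = "CCATGGCTAAATGACC"
protein = translate(extract_feature(reference, 3, 14, "+"))
print(protein)  # MAK*
```

In a real workflow the coordinates and strand would come from the GFF file rather than being typed in by hand.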

Additionally, AminoExtract allows you to use GFF data in Python by making it accessible as a pandas DataFrame.
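As an illustration of what GFF data looks like in a DataFrame, the sketch below reads the nine tab-separated GFF columns with plain pandas. It does not use AminoExtract's own API: `read_gff` and `GFF_COLUMNS` are hypothetical names, and the example records are made up.

```python
import io

import pandas as pd

# The nine columns defined by the GFF format.
GFF_COLUMNS = [
    "seqid", "source", "type", "start", "end",
    "score", "strand", "phase", "attributes",
]


def read_gff(handle) -> pd.DataFrame:
    """Load GFF records into a DataFrame (hypothetical helper).

    Lines starting with '#' (directives and comments) are skipped.
    Note that `comment="#"` also truncates any data line that contains
    a literal '#', which is acceptable for this sketch.
    """
    return pd.read_csv(
        handle,
        sep="\t",
        comment="#",
        header=None,
        names=GFF_COLUMNS,
    )


# Made-up example records, for illustration only.
example = io.StringIO(
    "##gff-version 3\n"
    "ref1\tsketch\tgene\t1\t30\t.\t+\t.\tID=gene1\n"
    "ref1\tsketch\tCDS\t3\t14\t.\t+\t0\tID=cds1;Name=demo\n"
)
gff = read_gff(example)

# Ordinary pandas filtering then selects the features of interest.
cds = gff[gff["type"] == "CDS"]
print(cds[["seqid", "start", "end", "strand"]])
```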

Why this tool?

Because sometimes you just want a dedicated tool to do a mundane task. And sometimes a dedicated tool is simply necessary, for example to ensure reproducibility and portability, or to keep larger projects maintainable in the long term.

Instead of copying this one script used for translating and writing amino acids across all projects, we can now point to AminoExtract for just that.

Installation requirements

AminoExtract requires Python 3.10 or later.

Dependencies such as Pandas, Biopython and python-magic are installed during the installation procedure.

Installation instructions

AminoExtract can be installed easily with conda or pip.
Installation through conda is recommended.

Installation with conda

conda install -c bioconda -c conda-forge aminoextract

Installation with pip

pip install AminoExtract

Installation from source

  1. Download the latest version of AminoExtract by cloning this repository, then navigate to the newly created directory.
    Copy and paste the snippet below to do so.
git clone https://github.com/RIVM-bioinformatics/AminoExtract.git && cd AminoExtract/
  2. If necessary, create a conda environment and install the required dependencies.
    Copy and paste the snippet below to do so.
mamba env create -f env.yml && conda activate AminoExtract
  3. Now install AminoExtract into the conda environment with the following:
pip install .

AminoExtract is now installed!
You can use AminoExtract from anywhere on your system as long as the conda environment in which it is installed is active.
You can test whether the installation was successful by typing AminoExtract -v, which should display the installed version.
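If you prefer to check the installation from within Python, the sketch below uses only the standard library. It assumes the installed distribution is registered under the name used in the pip command above (`AminoExtract`); the helper names `installed_version` and `python_is_supported` are hypothetical.

```python
import sys
from importlib import metadata


def installed_version(dist_name: str = "AminoExtract"):
    """Return the installed version string, or None if not installed.

    The default distribution name is an assumption based on the
    `pip install AminoExtract` command.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def python_is_supported(minimum=(3, 10)) -> bool:
    """Check the interpreter against the stated minimum Python version."""
    return sys.version_info[:2] >= minimum


if not python_is_supported():
    print("Python 3.10 or later is required.")

version = installed_version()
if version is None:
    print("AminoExtract is not installed in this environment.")
else:
    print(f"AminoExtract {version} is installed.")
```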
