OOM Error #898

Open
angelapredolac opened this issue Oct 25, 2024 · 2 comments


@angelapredolac

Expected Behavior

I tried using easy-cluster to cluster Uniref50 into smaller representative subsets (Uniref40, Uniref30, Uniref20, etc.).

Current Behavior

The clustering dies after a few steps with an Out of Memory Error.

Steps to Reproduce (for bugs)

Please make sure to execute the reproduction steps with newly recreated and empty tmp folders.

MMseqs Output (for bugs)

Please make sure to also post the complete output of MMseqs. You can use gist.github.com for large output.
tmp/5521603387764364218/clu_tmp/3834389364932800683/cascaded_clustering.sh: line 102: 208445 Killed $RUNNER "$MMSEQS" prefilter "$INPUT" "$INPUT" "${TMP_PATH}/pref_step$STEP" ${TMP}
slurmstepd: error: Detected 1 oom_kill event in StepId=16746209.batch. Some of the step tasks have been OOM Killed.

Context

Providing context helps us come up with a solution and improve our documentation for the future.
#SBATCH --output=slurm_mlm.out
#SBATCH -e slurm_mlm.err
#SBATCH -p scavenger-gpu
#SBATCH --gres=gpu:1
#SBATCH --mem=200G
#SBATCH -c 32
mmseqs easy-cluster /hpc/group/naderilab/eleanor/prose/data/uniref50.fasta uniref40 tmp --min-seq-id 0.4 -c 0.8 --cov-mode 1 --split-memory-limit 160G --threads 12
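
For anyone reproducing this, one way to keep the memory cap and thread count in step with the Slurm allocation is to derive them in the job script instead of hard-coding them. The following is only a sketch under stated assumptions, not a fix confirmed by the maintainers: the helper name `split_limit_gb` and the 80% headroom factor (chosen because it matches 160G out of 200G above) are hypothetical, while `SLURM_MEM_PER_NODE` (in MB) and `SLURM_CPUS_PER_TASK` are the environment variables Slurm sets for `--mem` and `-c`. The command is echoed rather than executed so it can be inspected first.

```shell
#!/bin/bash
# Hypothetical helper: given the allocated memory in MB,
# print roughly 80% of it in whole GB (integer arithmetic).
split_limit_gb() {
    local mem_mb=$1
    echo $(( mem_mb * 80 / 100 / 1024 ))
}

# Fall back to the values from the job above when not running under Slurm.
MEM_MB="${SLURM_MEM_PER_NODE:-204800}"   # 200G expressed in MB
THREADS="${SLURM_CPUS_PER_TASK:-12}"
LIMIT="$(split_limit_gb "$MEM_MB")G"

# Same flags as in the report; only the limit and thread count are derived.
# Remove the leading "echo" to actually run the clustering.
echo mmseqs easy-cluster /hpc/group/naderilab/eleanor/prose/data/uniref50.fasta uniref40 tmp \
    --min-seq-id 0.4 -c 0.8 --cov-mode 1 \
    --split-memory-limit "$LIMIT" --threads "$THREADS"
```

Note that the job in this report was killed despite having this much headroom, so the sketch only illustrates the bookkeeping; it does not by itself avoid the OOM.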

Your Environment

Include as many relevant details about the environment you experienced the bug in.

  • Git commit used (The string after "MMseqs Version:" when you execute MMseqs without any parameters):
  • Which MMseqs version was used (Statically-compiled, self-compiled, Homebrew, etc.):
  • For self-compiled and Homebrew: Compiler and Cmake versions used and their invocation:
  • Server specifications (especially CPU support for AVX2/SSE and amount of system memory):
  • Operating system and version:
@milot-mirdita
Member

The excessive memory consumption should be fixed in the current git version of mmseqs, and the fix will be part of release 16, which we are going to release in the next few days. It would be great if you could test whether this crash still happens; you can use the precompiled binaries at https://mmseqs.com/latest/

@milot-mirdita
Member

Please check if this error still occurs in release 16.
