Bootleg takes way too much space and memory #156
Labels
bootleg
Issues with NER and Bootleg
bug
Something isn't working
P2
We need to fix it (backlog)
performance
Performance issues: resource usage, scaling, GPU efficiency
server
Issues with serving and dynamic inference-time
I had to bump up the ephemeral storage of BART models with Bootleg to 55G. It takes ~20 minutes for a model to start for inference, as it downloads and then loads in memory gigs and gigs of "stuff". This is simply not feasible.
The text was updated successfully, but these errors were encountered: