Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Peking University and UC Berkeley.
-
Updated
Oct 22, 2024 - Python
Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference" proposed by Peking University and UC Berkeley.
Add a description, image, and links to the sparsevlm topic page so that developers can more easily learn about it.
To associate your repository with the sparsevlm topic, visit your repo's landing page and select "manage topics."