instant-id: added deleting of full precision pipeline before quantization #2539

Collaborator

nikita-savelyevv commented Nov 19, 2024

Changes:

  • To reduce the peak memory footprint, the full-precision OV pipeline is now deleted right after the calibration dataset is collected, so it no longer occupies memory during quantization.
  • For the same reason, comparing the inference speed of the original and optimized pipelines is now optional (disabled by default).

After applying the changes and updating to openvino-nightly, peak memory was observed to drop from 120 GB to 60 GB.
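
The ordering described above can be illustrated with a minimal sketch. It is not the notebook's code: `FullPrecisionPipeline`, `collect_calibration_data`, `quantize`, and `optimize` are hypothetical stand-ins, and the weak reference is there only to show that the full-precision object is really gone before the quantization step starts.

```python
import gc
import weakref


class FullPrecisionPipeline:
    """Hypothetical stand-in for a large full-precision pipeline."""

    def __init__(self, size_bytes):
        self.weights = bytearray(size_bytes)  # placeholder for model weights

    def __call__(self, prompt):
        return len(prompt)  # placeholder for a real inference output


def collect_calibration_data(pipeline, prompts):
    """Run the pipeline once per prompt and keep only the outputs."""
    return [pipeline(prompt) for prompt in prompts]


def quantize(calibration_data):
    """Placeholder quantization step; it needs only the collected data."""
    return {"num_calibration_samples": len(calibration_data)}


def optimize(prompts):
    pipeline = FullPrecisionPipeline(size_bytes=1_000_000)
    pipeline_ref = weakref.ref(pipeline)  # used only to check deallocation

    calibration_data = collect_calibration_data(pipeline, prompts)

    # Drop the only strong reference before the memory-hungry step.
    del pipeline
    gc.collect()

    freed = pipeline_ref() is None
    quantized = quantize(calibration_data)
    return quantized, freed
```

The deletion only helps if no other variable still references the pipeline or its submodels, which is why the sketch keeps the pipeline local to one function and passes on nothing but the calibration data.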

Related ticket: 146016

@nikita-savelyevv
Collaborator Author

@l-bat, it may be beneficial to consider logic like this when implementing quantization of large pipelines in the future.

@eaidova
Collaborator

eaidova commented Nov 20, 2024

@nikita-savelyevv please fix code formatting

@eaidova eaidova merged commit 8d79475 into openvinotoolkit:latest Nov 20, 2024
16 checks passed