I'd "yes, and" this. It would be better to treat each save as a checkpoint so that, if indexing fails, it can be restarted from the last checkpoint. I've had cases where a field I was trying to build a vector index on contained non-string values, and the run broke after many hours. It would be great to be able to identify the error, correct the data, and resume indexing from the last checkpoint position.
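As a rough illustration of the checkpoint idea, here is a minimal sketch of how a restartable run could look today in user code. It is not an existing library feature. It assumes the `embeddings` object has `load()`, `upsert()` and `save()` methods, that `upsert()` also works on an empty index, and that `documents` yields the same items in the same order on every run. The `resume_index` helper and the `checkpoint_path` sidecar file are hypothetical names introduced only for this example.

```python
import json
import os
from itertools import islice


def resume_index(embeddings, documents, save_path, checkpoint_path, batch_size=1000):
    """Index `documents` in batches, recording progress after each batch.

    Hypothetical helper: assumes `embeddings` exposes load(), upsert() and
    save(), and that `documents` is replayed in the same order on each run.
    Returns the total number of documents indexed so far.
    """
    done = 0
    if os.path.exists(checkpoint_path):
        # A previous run got this far: restore its index and its position.
        with open(checkpoint_path) as f:
            done = json.load(f)["done"]
        embeddings.load(save_path)

    # Skip the documents that the saved index already contains.
    remaining = islice(documents, done, None)
    while batch := list(islice(remaining, batch_size)):
        embeddings.upsert(batch)    # raises here if the batch has bad data
        embeddings.save(save_path)  # persist the index first ...
        done += len(batch)
        with open(checkpoint_path, "w") as f:
            json.dump({"done": done}, f)  # ... then record the position
    return done
```

If a batch fails, the checkpoint file still points at the last batch that was saved, so after correcting the data the same call picks up from there. Saving after every batch is the simplest choice but can be slow for large indexes; saving every N batches trades some rework for speed.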
davidmezzetti changed the title from "[Feature Request]: Auto-save during indexing" to "Feature Request: Auto-save during indexing" on Jun 26, 2024
Feature Description
I suggest implementing an autosave feature:
Something like this should save the index every `interval` seconds to `save_path`.
Reason
When one processes a large dataset, indexing can take a long time. The naive approach:
embeddings.index(...)
takes a long time. When working with a database or other network-dependent data retrieval, something may go wrong during the index call, which means the save call is never reached and all progress is lost.
Value of Feature
It would make it possible to resume an indexing run that failed for reasons other than bad data.
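Until something like this exists, the autosave behaviour described under Feature Description can be approximated in user code. The sketch below is one possible shape, not the proposed API. It assumes the `embeddings` object has `index()`, `upsert()` and `save()` methods, and that `upsert()` adds documents to an index built by an earlier `index()` call. The `index_with_autosave` and `batched` helpers and the `batch_size` parameter are hypothetical; `interval` and `save_path` follow the names used in the request.

```python
import time
from itertools import islice


def batched(iterable, size):
    """Yield successive lists of at most `size` items."""
    iterator = iter(iterable)
    while batch := list(islice(iterator, size)):
        yield batch


def index_with_autosave(embeddings, documents, save_path, interval=600, batch_size=1000):
    """Index `documents` in batches, saving roughly every `interval` seconds.

    Hypothetical helper: assumes `embeddings` exposes index(), upsert()
    and save(). Returns the number of documents indexed.
    """
    last_save = time.monotonic()
    total = 0
    for batch in batched(documents, batch_size):
        if total == 0:
            embeddings.index(batch)   # first batch builds the index
        else:
            embeddings.upsert(batch)  # later batches are added incrementally
        total += len(batch)
        # Save between batches once enough time has passed.
        if time.monotonic() - last_save >= interval:
            embeddings.save(save_path)
            last_save = time.monotonic()
    if total:
        embeddings.save(save_path)  # final save so the last batches are kept
    return total
```

Saves only happen between batches, so the effective interval is `interval` rounded up to the next batch boundary. A failure mid-batch loses at most the work since the last save rather than the whole run.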