new: Added jina clip text embedding #408

hh-space-invader · 2024-11-19T11:30:21Z

Adding jinaai/jina-clip-v1
They provided two examples, the first one works and the second one complains about missing jinaai/jina-clip-v1/sentence_xlnet_config.json. The output of the first one seems to have small numbers, like they are normallized but its not mentioned so not sure tbh.

Update:
The text model needs pooling and normalizing
The image model needs the image to be square

All Submissions:

Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

Does your submission pass the existing tests?
Have you added tests for your feature?
Have you installed pre-commit with pip3 install pre-commit and set up hooks with pre-commit install?

New models submission:

Have you added an explanation of why it's important to include this model?
Have you added tests for the new model? Were canonical values for tests computed via the original model?
Have you added the code snippet for how canonical values were computed?
Have you successfully ran tests with your changes locally?

new: added resize2square

joein · 2024-11-21T10:49:32Z

fastembed/image/transform/functional.py

+def resize2square(
+    image: Image.Image,
+    size: int,
+    fill_color: Optional[Union[str, int, tuple[int, ...]]] = None,
+    resample: Union[Image.Resampling, int] = Image.Resampling.BICUBIC,
+) -> Image.Image:
+    resized_image = resize(image=image, size=size, resample=resample)
+
+    new_image = Image.new(mode="RGB", size=(size, size), color=fill_color)
+    left = (size - resized_image.size[0]) // 2
+    top = (size - resized_image.size[1]) // 2
+    new_image.paste(resized_image, (left, top))
+    return new_image


we already have resize, let's not introduce new functions, we can add an additional parameter like preserve_aspect_ratio or something like this to resize.
If it's false then we should just resize image to the required size (preserving aspect ratio is useful when later we have crop)

joein · 2024-11-21T10:54:53Z

fastembed/image/transform/operators.py

+        if config.get("do_normalize", False) or ("mean" in config and "std" in config):
+            transforms.append(
+                Normalize(
+                    mean=config.get("image_mean", config.get("mean")),
+                    std=config.get("image_std", config.get("std")),
+                )
+            )


Suggested change

if config.get("do_normalize", False) or ("mean" in config and "std" in config):

transforms.append(

Normalize(

mean=config.get("image_mean", config.get("mean")),

std=config.get("image_std", config.get("std")),

)

)

if config.get("do_normalize", False):

transforms.append(Normalize(mean=config["image_mean"], std=config["image_std"]))

elif "mean" in config and "std" in config:

transforms.append(Normalize(mean=config["mean"], std=config["std"]))

hh-space-invader requested review from I8dNLo and joein November 19, 2024 11:30

hh-space-invader added 9 commits November 21, 2024 02:04

WIP: Added jina clip text embedding

6318f3d

WIP: Added preprocess for jina clip

d02517e

WIP: Added jina clip vision (not sure if it works yet)

baeaab8

improve: Improved mean pooling if the output doesnt have seq length

49d2d67

fix: Fixed jina clip text

30d814b

nit

d25fa1c

fix: Fixed jina clip image preprocessor

52ced38

fix: Fix type hints

b43aeef

new: added resize2square

tests: Add jina clip vision test case

82e2d4b

hh-space-invader force-pushed the support-jina-clip branch from 86287ef to 82e2d4b Compare November 21, 2024 00:06

hh-space-invader changed the title ~~WIP: Added jina clip text embedding~~ new: Added jina clip text embedding Nov 21, 2024

nit

414a4fe

joein requested changes Nov 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

new: Added jina clip text embedding #408

new: Added jina clip text embedding #408

hh-space-invader commented Nov 19, 2024 •

edited

Loading

joein Nov 21, 2024

joein Nov 21, 2024

new: Added jina clip text embedding #408

Are you sure you want to change the base?

new: Added jina clip text embedding #408

Conversation

hh-space-invader commented Nov 19, 2024 • edited Loading

All Submissions:

New Feature Submissions:

New models submission:

joein Nov 21, 2024

Choose a reason for hiding this comment

joein Nov 21, 2024

Choose a reason for hiding this comment

hh-space-invader commented Nov 19, 2024 •

edited

Loading