clip

Here are 705 public repositories matching this topic...

marqo-ai / marqo

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Updated Dec 13, 2024
Python

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

nlp computer-vision deep-learning transformers pytorch chinese pretrained-models multi-modal clip coreml-models contrastive-loss vision-language multi-modal-learning image-text-retrieval vision-and-language-pre-training

Updated Aug 6, 2024
Python

easychen / pushdeer

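Open-source, app-free push notification service; on iOS 14+ just scan a code to start using it. Also supports Quick Apps, iOS and Mac clients, an Android client, and self-built devices.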

app notification-service push clip

Updated Feb 26, 2024
C

CVHub520 / X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

deep-learning sam pytorch yolo resnet deeplearning clip paddle labeling-tool onnx llm

Updated Dec 11, 2024
Python

open-mmlab / mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

deep-learning pytorch image-classification resnet pretrained-models clip mae mobilenet moco multimodal self-supervised-learning constrastive-learning beit vision-transformer swin-transformer masked-image-modeling convnext

Updated Nov 1, 2024
Python

yuanzhoulvpi2017 / zero_nlp

Chinese NLP solutions (large models, data, models, training, inference)

nlp transformers text-generation pytorch llama gpt clip bert gpt2 huggingface-transformers llava chatglm-6b llama2

Updated Dec 10, 2024
Jupyter Notebook

pharmapsychotic / clip-interrogator

Image to prompt with BLIP and CLIP

pytorch clip

Updated May 15, 2024
Python

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

computer-vision deep-learning survey transfer-learning clip knowledge-distillation vision-language-model multi-modal-model

Updated Dec 3, 2024

rom1504 / clip-retrieval

Easily compute CLIP embeddings and build a CLIP retrieval system with them

ai deep-learning clip knn semantic-search multimodal

Updated Apr 15, 2024
Jupyter Notebook

RuffianZhong / RWidgetHelper

Rapid Android UI development that tames the many quirks of stubborn native widgets

Updated Feb 21, 2024
Java

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

computer-vision chatbot representation-learning clip dino large-language-models llms instruction-tuning mllm multimodal-large-language-models

Updated Oct 30, 2024
Python

roboflow / awesome-openai-vision-api-experiments

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

computer-vision openai classification clip zero-shot chatgpt segment-anything open-vocabulary-detection open-vocabulary-segmentation grounding-dino

Updated Nov 26, 2024
Python

open-compass / VLMEvalKit

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks

computer-vision evaluation pytorch gemini openai vqa vit gpt multi-modal clip claude openai-api gpt4 large-language-models llm chatgpt llava qwen gpt-4v

Updated Dec 12, 2024
Python

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

chatbot llama clip mulit-modal vision-language vicuna gpt-4 vision-language-pretraining llava video-chatboat video-conversation

Updated Aug 27, 2024
Python

yzhuoning / Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

clip pre-training contrastive-learning

Updated Jun 28, 2024

unum-cloud / uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️