#

clip

Here are 690 public repositories matching this topic...

easychen / pushdeer

开放源码的无App推送服务，iOS14+扫码即用。亦支持快应用/iOS和Mac客户端、Android客户端、自制设备

app notification-service push clip

Updated Feb 26, 2024
C

marqo

marqo-ai / marqo

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai

Updated Nov 15, 2024
Python

OFA-Sys / Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

nlp computer-vision deep-learning transformers pytorch chinese pretrained-models multi-modal clip coreml-models contrastive-loss vision-language multi-modal-learning image-text-retrieval vision-and-language-pre-training

Updated Aug 6, 2024
Python

CVHub520 / X-AnyLabeling

Effortless data labeling with AI support from Segment Anything and other awesome models.

deep-learning sam pytorch yolo resnet deeplearning clip paddle labeling-tool onnx llm

Updated Nov 15, 2024
Python

open-mmlab / mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

deep-learning pytorch image-classification resnet pretrained-models clip mae mobilenet moco multimodal self-supervised-learning constrastive-learning beit vision-transformer swin-transformer masked-image-modeling convnext

Updated Nov 1, 2024
Python

yuanzhoulvpi2017 / zero_nlp

中文nlp解决方案(大模型、数据、模型、训练、推理)

nlp bloom pipeline transformers text-generation pytorch falcon gpt clip bert dolly gpt2 huggingface-transformers gpt-neox chatglm-6b llama2

Updated Oct 29, 2024
Jupyter Notebook

pharmapsychotic / clip-interrogator

Image to prompt with BLIP and CLIP

Updated May 15, 2024
Python

jingyi0000 / VLM_survey

Collection of AWESOME vision-language models for vision tasks

computer-vision deep-learning survey transfer-learning clip knowledge-distillation vision-language-model multi-modal-model

Updated Nov 3, 2024

rom1504 / clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

ai deep-learning clip knn semantic-search multimodal

Updated Apr 15, 2024
Jupyter Notebook

RuffianZhong / RWidgetHelper

Android UI 快速开发，专治原生控件各种不服

Updated Feb 21, 2024
Java

cambrian-mllm / cambrian

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

computer-vision chatbot representation-learning clip dino large-language-models llms instruction-tuning mllm multimodal-large-language-models

Updated Oct 30, 2024
Python

roboflow / awesome-openai-vision-api-experiments

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

computer-vision openai classification clip zero-shot chatgpt segment-anything open-vocabulary-detection open-vocabulary-segmentation grounding-dino

Updated Feb 22, 2024
Python

open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

computer-vision evaluation pytorch gemini openai vqa vit gpt multi-modal clip claude openai-api gpt4 large-language-models llm chatgpt llava qwen gpt-4v

Updated Nov 17, 2024
Python

mbzuai-oryx / Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

chatbot llama clip mulit-modal vision-language vicuna gpt-4 vision-language-pretraining llava video-chatboat video-conversation

Updated Aug 27, 2024
Python

yzhuoning / Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).

clip pre-training contrastive-learning

Updated Jun 28, 2024

uform

unum-cloud / uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

Updated Oct 1, 2024
Python

EdVince / Stable-Diffusion-NCNN

Stable Diffusion in NCNN with c++, supported txt2img and img2img

android cpp executable clip diffusion tensorrt mnn ncnn onnx img2img tnn txt2img stable-diffusion

Updated Jul 3, 2023
C++

natural-language-image-search

haltakov / natural-language-image-search

Search photos on Unsplash using natural language

photos machine-learning computer-vision unsplash image-search clip

Updated Oct 13, 2022
Jupyter Notebook

natural-language-youtube-search

haltakov / natural-language-youtube-search

Search inside YouTube videos using natural language

search machine-learning youtube computer-vision clip

Updated Oct 15, 2021
Jupyter Notebook

ArrowLuo / CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

search retrieval ranking clip multimodality multimodal-learning multimodal activitynet retrieval-model msvd msrvtt video-text-retrieval lsmdc didemo video-clip-retrieval

Updated Apr 12, 2024
Python

Improve this page

Add a description, image, and links to the clip topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the clip topic, visit your repo's landing page and select "manage topics."