-
GRASP Lab, University of Pennsylvania
- Philadelphia, United States
-
06:40
(UTC -04:00) - https://sites.google.com/seas.upenn.edu/bowenjiang/
- https://orcid.org/0009-0005-0414-0435
- @laurenbjiang
- https://scholar.google.com/citations?user=_6AHV9QAAAAJ&hl=en
-
AnyText Public
Forked from tyxsspa/AnyTextOfficial implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>
Python Apache License 2.0 UpdatedSep 27, 2024 -
Multi-Agent-VQA Public
[CVPR 2024 CVinW] Multi-Agent VQA: Exploring Multi-Agent Foundation Models on Zero-Shot Visual Question Answering
-
llm_token_bias Public
[EMNLP 2024] This is the official implementation of the paper "A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners" in PyTorch.
-
Awesome-LLM-Reasoning Public
Forked from atfortes/Awesome-LLM-ReasoningReasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
MIT License UpdatedSep 20, 2024 -
scene_graph_commonsense Public
This is the official implementation of the paper "Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense Knowledge" in PyTorch.
-
MMMA_Rationality Public
This is the official repository of the paper "Multi-Modal and Multi-Agent Systems Meet Rationality: A Survey"
-
SeeAct Public
Forked from OSU-NLP-Group/SeeAct[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).
Python Other UpdatedJun 2, 2024 -
CCD Public
Forked from TongkunGuan/CCD[ICCV2023] Self-supervised Character-to-Character Distillation for Text Recognition
Python UpdatedApr 20, 2024 -
unilm Public
Forked from microsoft/unilmLarge-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Python MIT License UpdatedMar 4, 2024 -
VLSAT Public
Forked from wz7in/CVPR2023-VLSATCVPR2023 : VL-SAT: Visual-Linguistic Semantics Assisted Training for 3D Semantic Scene Graph Prediction in Point Cloud
Python UpdatedFeb 6, 2024 -
Rethinking-Text-Segmentation Public
Forked from SHI-Labs/Rethinking-Text-Segmentation[CVPR 2021] Rethinking Text Segmentation: A Novel Dataset and A Text-Specific Refinement Approach
Python UpdatedDec 2, 2023 -
CFR_VQA Public
Forked from aioz-ai/CFR_VQACoarse-to-Fine Reasoning for Visual Question Answering (CVPRW'22)
Python MIT License UpdatedNov 3, 2022