HomeExplorevision-language-learning

Vision Language Learning Collection

Repositories tagged with "vision-language-learning"

RARE

TCG-style cards with ATK/DEF/SPD stats

UNCOMMON

⭐1.5kHP

◆

🔮Psychic

★★

Ovis

AIDC-AI

Pythonchatbotllama3

“A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.”

★

1.5k

83 forks

ATK

DEF

SPD

GitPedia #130

2/5

View wiki →𝕏

GitPedia

Repository Card

UNCOMMON

★

1.5k

COMMON

⭐455HP

◆

🔮Psychic

★

RLAIF-V

RLHF-V

Pythonchatbotcvpr2025

“[CVPR'25 highlight] RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness”

★

455

20 forks

ATK

DEF

SPD

GitPedia #400

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

455

COMMON

⭐410HP

◆

🔮Psychic

★

OPERA

shikiw

Pythonchatbotchatgpt

“[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation”

★

410

33 forks

ATK

DEF

SPD

GitPedia #713

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

410

COMMON

⭐112HP

◆

🔮Psychic

★

Modality-Integration-Rate

shikiw

Pythonchatbotgpt-4o

“[ICCV 2025] The official code of the paper "Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate".”

★

112

2 forks

ATK

DEF

SPD

GitPedia #623

1/5

View wiki →𝕏

GitPedia

Repository Card

COMMON

★

112