Meta VL-JEPA - Vision-Language Prediction Models
Collection: Meta VL-JEPA • Vision-Language Joint Embedding Predictive Architecture for video understanding • 6 items • Updated about 1 month ago • 7
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 Text Generation • 32B • Updated 11 days ago • 64.8k • 104
nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16 Image-Text-to-Text • 13B • Updated Dec 2, 2025 • 84.7k • 74