Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Ruohong Zhang's picture
3 6 11

Ruohong Zhang

ruohongz
21world's profile picture Sicong's profile picture
·
  • RifleZhang

AI & ML interests

LM pre-training

Organizations

Sotopia's profile picture ShareGPTVideo's profile picture MultiVLM's profile picture Share4oReasoning's profile picture Carnegie Mellon University's profile picture

upvoted an article over 1 year ago
view article
Article

SigLIP 2: A better multilingual vision language encoder

  • +1
ariG23498, merve, qubvel-hf
•
Feb 21, 2025
• 216
upvoted 3 papers over 1 year ago

Scalable Ranked Preference Optimization for Text-to-Image Generation

Paper • 2410.18013 • Published Oct 23, 2024 • 14

Improve Vision Language Model Chain-of-thought Reasoning

Paper • 2410.16198 • Published Oct 21, 2024 • 26

Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems

Paper • 2408.16293 • Published Aug 29, 2024 • 27
upvoted a paper almost 2 years ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 95
upvoted a paper about 2 years ago

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Paper • 2404.01258 • Published Apr 1, 2024 • 12
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs