AI & ML interests

LLM

Recent Activity

sinwangΒ  submitted a paper about 2 hours ago
Multi-hop Reasoning via Early Knowledge Alignment
AuraithmΒ  updated a model about 3 hours ago
OpenMOSS-Team/DiRL-8B-Instruct
lkdhyΒ  updated a dataset 5 days ago
OpenMOSS-Team/VideoThinkBench
View all activity

OpenMOSS-Team 's collections 10

MHA2MLA
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"