LLLeo Li's picture

1 17 7

LLLeo Li

LLLeo612

·

AI & ML interests

None yet

Recent Activity

liked a dataset 15 days ago

JingkunAn/TraceSpatial-Bench

updated a model 24 days ago

LLLeo612/MyAwesomeModel-TestRepo

published a model 24 days ago

LLLeo612/MyAwesomeModel-TestRepo

View all activity

Organizations

liked a dataset 15 days ago

JingkunAn/TraceSpatial-Bench

Viewer • Updated 5 days ago • 100 • 242 • 3

updated a model 24 days ago

LLLeo612/MyAwesomeModel-TestRepo

Feature Extraction • Updated 24 days ago • 18

published a model 24 days ago

LLLeo612/MyAwesomeModel-TestRepo

Feature Extraction • Updated 24 days ago • 18

upvoted a paper 26 days ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 26 days ago • 74

upvoted a paper 3 months ago

LLMs Learn to Deceive Unintentionally: Emergent Misalignment in Dishonesty from Misaligned Samples to Biased Human-AI Interactions

Paper • 2510.08211 • Published Oct 9, 2025 • 22

reacted to AdinaY's post with 🔥 3 months ago

Post

3535

BAAI has released ROME🔥 evaluating 30+ large reasoning models on text & visual reasoning

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions (2509.17177)

✨Tests visual reasoning, not just recognition
✨Covers capability × alignment × safety × efficiency
✨More transparent & reliable (less data contamination)
✨Helps make real-world deployment choices

upvoted a paper 3 months ago

FlagEval Findings Report: A Preliminary Evaluation of Large Reasoning Models on Automatically Verifiable Textual and Visual Questions

Paper • 2509.17177 • Published Sep 21, 2025 • 13

liked a dataset 6 months ago

JingkunAn/RefSpatial

Viewer • Updated Jul 20, 2025 • 800 • 2.79k • 20

upvoted 2 papers 7 months ago

Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5, 2025 • 77

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4, 2025 • 43

upvoted a paper 9 months ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26, 2025 • 168

upvoted a paper 10 months ago

Personalize Anything for Free with Diffusion Transformer

Paper • 2503.12590 • Published Mar 16, 2025 • 44

authored a paper 10 months ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published Feb 24, 2025 • 6

upvoted 2 papers 10 months ago

AISafetyLab: A Comprehensive Framework for AI Safety Evaluation and Improvement

Paper • 2502.16776 • Published Feb 24, 2025 • 6

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24, 2025 • 73

liked a model 12 months ago

Goodfire/Llama-3.1-8B-Instruct-SAE-l19

Updated Jan 11, 2025 • 76 • 42

New activity in SafeMTData/SafeMTData about 1 year ago

[bot] Conversion to Parquet

#1 opened about 1 year ago by

parquet-converter

authored a paper about 1 year ago

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Paper • 2411.19939 • Published Nov 29, 2024 • 10

upvoted 2 papers about 1 year ago

VLSBench: Unveiling Visual Leakage in Multimodal Safety

Paper • 2411.19939 • Published Nov 29, 2024 • 10

Multimodal Situational Safety

Paper • 2410.06172 • Published Oct 8, 2024 • 12