TimeLordRaps's picture

TimeLordRaps

TimeLordRaps

·

TimeLordRaps

AI & ML interests

Music, Methods, and Madness

Recent Activity

upvoted a paper 2 days ago

Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

upvoted a paper 2 days ago

FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

upvoted a paper 2 days ago

Self-Distilled Agentic Reinforcement Learning

View all activity

Organizations

None yet

upvoted 5 papers 2 days ago

Learning to Build the Environment: Self-Evolving Reasoning RL via Verifiable Environment Synthesis

Paper • 2605.14392 • Published 7 days ago • 7

FrontierSmith: Synthesizing Open-Ended Coding Problems at Scale

Paper • 2605.14445 • Published 7 days ago • 19

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 7 days ago • 103

Learning POMDP World Models from Observations with Language-Model Priors

Paper • 2605.13740 • Published 8 days ago • 4

Agentic Discovery of Neural Architectures: AIRA-Compose and AIRA-Design

Paper • 2605.15871 • Published 6 days ago • 13

upvoted 2 papers 14 days ago

Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO

Paper • 2604.27488 • Published 21 days ago • 6

StateSMix: Online Lossless Compression via Mamba State Space Models and Sparse N-gram Context Mixing

Paper • 2605.02904 • Published Apr 5 • 8

upvoted a paper 23 days ago

Sessa: Selective State Space Attention

Paper • 2604.18580 • Published 30 days ago • 13

upvoted 3 papers 27 days ago

Diverse Dictionary Learning

Paper • 2604.17568 • Published Apr 19 • 3

Kronos: A Foundation Model for the Language of Financial Markets

Paper • 2508.02739 • Published Aug 2, 2025 • 35

SWE-chat: Coding Agent Interactions From Real Users in the Wild

Paper • 2604.20779 • Published 29 days ago • 15

upvoted 2 papers 3 months ago

Neural Additive Experts: Context-Gated Experts for Controllable Model Additivity

Paper • 2602.10585 • Published Feb 11 • 2

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published Feb 12 • 93

upvoted 3 papers 4 months ago

Routing the Lottery: Adaptive Subnetworks for Heterogeneous Data

Paper • 2601.22141 • Published Jan 29 • 4

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published Jan 30 • 35

ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas

Paper • 2601.21558 • Published Jan 29 • 61

upvoted a paper 5 months ago

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published Dec 8, 2025 • 80

upvoted 2 papers 9 months ago

BeyondWeb: Lessons from Scaling Synthetic Data for Trillion-scale Pretraining

Paper • 2508.10975 • Published Aug 14, 2025 • 60

Technical Report: Full-Stack Fine-Tuning for the Q Programming Language

Paper • 2508.06813 • Published Aug 9, 2025 • 6

upvoted a paper 10 months ago

Mercury: Ultra-Fast Language Models Based on Diffusion

Paper • 2506.17298 • Published Jun 17, 2025 • 10