Dev Mode Explorers

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Prabhjotschugh authored a paper about 5 hours ago

Not Truly Multilingual: Script Consistency as a Missing Dimension in VLM Evaluation

Prabhjotschugh authored a paper about 5 hours ago

FirstPass: Grounding AI Scientific Judgment in Multi-Round Editorial Outcomes

Prabhjotschugh authored a paper about 5 hours ago

Beyond 'One Language, One Script': Quantifying Orthographic Bias in Multilingual VLMs with PuMVR

View all activity

Prabhjotschugh

authored 4 papers about 5 hours ago

Not Truly Multilingual: Script Consistency as a Missing Dimension in VLM Evaluation

Paper • 2606.17188 • Published 13 days ago • 1

FirstPass: Grounding AI Scientific Judgment in Multi-Round Editorial Outcomes

Paper • 2606.20769 • Published 12 days ago • 1

Beyond 'One Language, One Script': Quantifying Orthographic Bias in Multilingual VLMs with PuMVR

Paper • 2606.20770 • Published 12 days ago • 1

AirCast-SR: A Foundation Model for Kilometer-Scale Atmospheric Super-Resolution via Latent Consistency Diffusion

Paper • 2605.26130 • Published May 20 • 1

eienmojiki

posted an update 4 days ago

Post

105

Hi everyone,

I've created a Gradio space for embedding and extracting invisible watermarks in images:
👉 eienmojiki/blind-watermark-studio

It supports hiding text, images, and bit arrays using the DWT-DCT-SVD algorithm.

Credits:
- Original library: https://github.com/guofei9987/blind_watermark
- Author: Guo Fei

:).

KingNish

posted an update 13 days ago

Post

4301

We trained an open-source Mythos like cybersecurity LLM for the Build Small Hackathon meet OpenMythos

Trained in two stages: SFT on ~1.84K filtered ArXiv cs.CR papers + real CVE data, then RLVR using paired with past vulnerabilities GitHub repos with a verifier model checking outputs against ground truth.

Trained on: H100s from Modal

The RLVR stage made the biggest difference responses got more precise and less prone to confusing similar vulnerability classes.

Everything is open:
🤖 Demo → build-small-hackathon/OpenMythos
🧠 Model → build-small-hackathon/OpenMythos
📦 CVE Dataset → build-small-hackathon/CVE_Vulnerailities_Detailed
📄 ArXiv Dataset → himanshu17HF/ArvixImport-Filtered-Final

Try it out and let us know where it breaks 🙏

2 replies

Abhaykoul

posted an update 13 days ago

Post

223

Shipped v0.1.2 of vtx — a minimalist coding agent for the terminal.

Most agentic CLIs ship 10k+ token system prompts. Vtx is ~2,200. Less prompt overhead means more room for your code in the model's context window.

Vtx is a from-scratch Python implementation of the design philosophy behind pi-mono — same principles, pure Python, no transpiled runtime.

What ships out of the box:

→ Textual TUI + headless CLI (vtx -p "fix the failing test")
→ 49 LLM provider gateways, all declared in a single provider.yaml
→ 5 core tools (read / edit / write / bash / find) plus web search and fetch
→ Session tree with compaction, handoff, and resume
→ AGENTS.md / CLAUDE.md auto-discovery
→ Skills system — drop SKILL.md files in .agents/skills/ and they become slash commands
→ Two OAuth flows (GitHub Copilot device flow, OpenAI Codex PKCE)
→ Two-mode permissions: prompt (default) or auto, with a safe-command allowlist

This release adds a proper extension system. Register new LLM-callable tools, intercept tool calls, hook lifecycle events, and add slash commands from a single register(api) function in a Python file under ~/.vtx/agent/extensions/. Extensions can override built-in tools by name and chain handler logic across subscribers.

Apache 2.0. uv tool install vtx-coding-agent and you're running.

GitHub: https://github.com/OEvortex/vtx-coding-agent
PyPI: https://pypi.org/project/vtx-coding-agent

Built in the open. Feedback, extensions, and PRs welcome.

DongfuJiang

authored 3 papers 24 days ago

RewardHarness: Self-Evolving Agentic Post-Training

Paper • 2605.08703 • Published May 9 • 10

Cosmos 3: Omnimodal World Models for Physical AI

Paper • 2606.02800 • Published 29 days ago • 136

AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks?

Paper • 2606.05080 • Published 27 days ago • 30

alielfilali01

posted an update about 1 month ago

Post

563

Plans in HTML > Plans in Markdown

Tonic

posted an update about 2 months ago

Post

2972

🙋🏻‍♂️ Hey there folks ,

Turns out : if we predict 🌏 earth we can save a lot of time looking for interesting things and less time looking at things that we expect to see.

Sentinel-2 imagery 🛰️basically takes a long time to download towards earth. so our "near real time" systems are quite far from that in practical terms.

meanwhile , if we "predict" what we will see , based on what we do see , we can send down much less data in a timely way , and prioritize 📡earth-bound response .

I'm talking about illegal fishing , logging , mining or building in nature reserves , the more of that we predict early the more we're able to stop it on time.

At least that's the concept !

check out the blog : https://huggingface.co/blog/Tonic/save-patagonia-by-predicting-earth

- Collection: https://huggingface.co/collections/NuTonic/earth-observation-with-temporal-and-general-understanding
- Code: https://github.com/Josephrp/Nutonic
- Dataset: NuTonic/sat-vl-sft-training-ready-v1
- Model: NuTonic/lspace
- Training: NuTonic/lspace-trackio
- Evals: NuTonic/Patagonia_Eval

2 replies

DongfuJiang

authored 4 papers about 2 months ago

Watch Before You Answer: Learning from Visually Grounded Post-Training

Paper • 2604.05117 • Published Apr 6 • 36

ClawBench: Can AI Agents Complete Everyday Online Tasks?

Paper • 2604.08523 • Published Apr 9 • 265

Nemotron 3 Super: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning

Paper • 2604.12374 • Published Apr 14 • 37

Beyond Semantic Similarity: Rethinking Retrieval for Agentic Search via Direct Corpus Interaction

Paper • 2605.05242 • Published May 3 • 126

Prabhjotschugh

authored a paper about 2 months ago

When Less Is More: Simplicity Beats Complexity for Physics-Constrained InSAR Phase Unwrapping

Paper • 2605.00896 • Published Apr 28

Tonic

posted an update 2 months ago

Post

4351

🙋🏻‍♂️ Hey there folks,

since everyone liked my previous announcement post ( https://huggingface.co/posts/Tonic/338509028435394 ) so much , i'm back with more high quality proceedural datasets in the Geospacial domain for SFT training !

Check this one out :
NuTonic/sat-bbox-metadata-sft-v1

the goal is to be able to train vision models on multiple images for remote sensing analysis with one shot .

hope you like it ! 🚀

2 replies

Tonic

posted an update 2 months ago

Post

3681

🙋🏻‍♂️ Hey there folks ,

I'm sharing huggingface's largest dataset of annotated statelite images today.

check it out here : NuTonic/sat-image-boundingbox-sft-full

I hope you like it , the idea is to be able to use this with small vision models 🚀

1aurent

authored a paper 3 months ago

Voxtral TTS

Paper • 2603.25551 • Published Mar 26 • 63

AI & ML interests

Recent Activity

Team members 144

dev-mode-explorers's activity