We trained an open-source Mythos like cybersecurity LLM for the Build Small Hackathon meet OpenMythos
Trained in two stages: SFT on ~1.84K filtered ArXiv cs.CR papers + real CVE data, then RLVR using paired with past vulnerabilities GitHub repos with a verifier model checking outputs against ground truth.
Trained on: H100s from Modal
The RLVR stage made the biggest difference responses got more precise and less prone to confusing similar vulnerability classes.
Shipped v0.1.2 of vtx โ a minimalist coding agent for the terminal.
Most agentic CLIs ship 10k+ token system prompts. Vtx is ~2,200. Less prompt overhead means more room for your code in the model's context window.
Vtx is a from-scratch Python implementation of the design philosophy behind pi-mono โ same principles, pure Python, no transpiled runtime.
What ships out of the box:
โ Textual TUI + headless CLI (vtx -p "fix the failing test") โ 49 LLM provider gateways, all declared in a single provider.yaml โ 5 core tools (read / edit / write / bash / find) plus web search and fetch โ Session tree with compaction, handoff, and resume โ AGENTS.md / CLAUDE.md auto-discovery โ Skills system โ drop SKILL.md files in .agents/skills/ and they become slash commands โ Two OAuth flows (GitHub Copilot device flow, OpenAI Codex PKCE) โ Two-mode permissions: prompt (default) or auto, with a safe-command allowlist
This release adds a proper extension system. Register new LLM-callable tools, intercept tool calls, hook lifecycle events, and add slash commands from a single register(api) function in a Python file under ~/.vtx/agent/extensions/. Extensions can override built-in tools by name and chain handler logic across subscribers.
Apache 2.0. uv tool install vtx-coding-agent and you're running.
Turns out : if we predict ๐ earth we can save a lot of time looking for interesting things and less time looking at things that we expect to see.
Sentinel-2 imagery ๐ฐ๏ธbasically takes a long time to download towards earth. so our "near real time" systems are quite far from that in practical terms.
meanwhile , if we "predict" what we will see , based on what we do see , we can send down much less data in a timely way , and prioritize ๐กearth-bound response .
I'm talking about illegal fishing , logging , mining or building in nature reserves , the more of that we predict early the more we're able to stop it on time.
since everyone liked my previous announcement post ( https://huggingface.co/posts/Tonic/338509028435394 ) so much , i'm back with more high quality proceedural datasets in the Geospacial domain for SFT training !