Trainable selective sampling and sparse attention kernels are indispensable in the era of context engineering. We hope our work will be helpful to everyone! π€
@SmallDoge SmallTalks(SmallDoge/SmallTalks) is a synthetic dataset designed for supervised fine-tuning of language models. The dataset covers a variety of conversational content, including daily conversations, tool usage, Python programming, encyclopedia Q&A, exam problem-solving, logical reasoning, and more. Each task is provided in both English and Chinese versions.
Welcome to the Doge Face Open Source Community! π Our goal is to explore the foundation of embodied intelligence for the next two years, which is indispensable β small language models. π¬ We aim to open-source code and documentation to give everyone more time to slack off while working or studying! π€ π Repository name on Github: https://github.com/SmallDoges/small-doge π Organization name on Hugging Face: