Under writing

Swallow Code & Swallow Math: Open Datasets for Code and Mathematical Reasoning LLMs
High-quality, Apache-2.0-licensed datasets for code and math pre-training, refined through large-scale LLM rewriting and ablation experiments.