Skip to content

Selected Experiment Reports

This is a semi-automatically generated list of experiment issues and reports. It is periodically updated by a script, and then curated by hand.

This page includes only experiments that have at least one run or report.

Marin 8B Base

Cooldowns

Big Runs

Modeling

Training and Performance

Data Experiments

Datashop

High Quality Data Ablations

Data Filtering

  • Stack Exchange Quality Classifier #596
    • GitHub Issue #596
    • WandB Report
    • Conclusion: Seems to lead to better loss than using Reddit ELI5 or OpenHermes.
    • NOTE: this seems like a loose end, we should pursue this further.

Text Extraction and Formatting

Supervised Fine Tuning

Scaling Laws

Baselines and Reproductions

Other Projects

Compel

Uncategorized

(This is for experiments that have been added via the script but have not yet been curated.)