qmd/finetune/data
Tobias Lütke 1fb2e2819e Merge origin/main into feat/ast-aware-chunking
Resolve conflicts: combine AST chunking args (filepath, chunkStrategy)
with abort signal parameter from #458.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-28 20:00:49 -04:00
..
train Add wall-clock checkpoints and full eval defaults 2026-02-22 15:02:02 -05:00
fix_hyde_checkpoint.json Merge origin/main into feat/ast-aware-chunking 2026-03-28 20:00:49 -04:00
qmd_expansion_balanced_deduped.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00
qmd_expansion_diverse_addon.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00
qmd_expansion_handcrafted_only.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00
qmd_expansion_handcrafted.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00
qmd_expansion_lex_phrases_negation.jsonl finetune: quoted phrases, negation, and entity preservation (#247) 2026-02-22 13:38:59 -04:00
qmd_expansion_locations.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00
qmd_expansion_people.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00
qmd_expansion_personal_entities.jsonl finetune: quoted phrases, negation, and entity preservation (#247) 2026-02-22 13:38:59 -04:00
qmd_expansion_short_nontech.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00
qmd_expansion_sports.jsonl data: add 48 sports acronym training examples 2026-02-22 09:37:25 -05:00
qmd_expansion_v3_structured.jsonl finetune: strict Pydantic schema, one canonical data format 2026-02-22 13:39:00 -04:00
qmd_only_sampled.jsonl lots of training stuff 2026-01-31 23:02:23 +00:00