ai-workspace-services/qmd

Author	SHA1	Message	Date
Tobias Lütke	f75c668e46	Merge pull request #310 from giladgd/nodeLlamaCppUseBuildAutoAttempt feat: use `build: "autoAttempt"` on `getLlama`	2026-03-07 14:24:52 -04:00
Tobias Lütke	6934c464db	Merge pull request #311 from gi11es/patch-1 Fix claude plugin setup syntax	2026-03-07 14:24:49 -04:00
Gilad S.	607ab7a402	fix: remove unused config	2026-03-07 07:48:13 +02:00
Tobi Lutke	2ae1baba2f	release: v1.1.1	2026-03-06 14:48:47 -04:00
Tobi Lutke	60721658c0	docs: write changelog for 1.1.1	2026-03-06 14:48:35 -04:00
Gilles Dubuc	7f8e33e0a9	Fix plugin install syntax	2026-03-06 12:14:16 +01:00
Gilles Dubuc	75589d77f3	Fix claude marketplace syntax	2026-03-06 12:12:14 +01:00
Gilad S.	3095041e0f	feat: use `build: "autoAttempt"` on `getLlama`	2026-03-06 07:02:50 +00:00
Tobias Lütke	40610c3aa6	Merge pull request #256 from rkbadhan/reward-design fix(reward): tighten entity detection, filler penalty, stricter diversity	2026-02-26 06:15:49 -05:00
rkbadhan	4511b9bd4d	fix(reward): tighten entity detection, add filler penalty, stricter diversity - Compound entity chaining now stops one level deep. Previously "TDS motorsports team history" would inflate the expected entity set with "team" and "history", causing false-positive entity-preservation penalties during GRPO. Now only {tds, motorsports} are detected. - Add INTERIOR_FILLER_WORDS penalty (-3/line): lex lines containing "overview" or "basics" absent from the original query are penalised. Targets template-generator noise, e.g. "ancient overview rome timeline". - Raise is_diverse threshold 2→3: requires 3 unique words between lex lines before they count as diverse. Reduces reward for near-duplicate pairs like "auth setup" / "auth configuration". - Broaden quoted-phrase bonus: was gated on named entities existing; now any multi-word query earns +3 for using quotes in lex lines. Better incentivises BM25-aware syntax like "memory leak" python. Fixes scoring noise identified while working on issue #247.	2026-02-24 19:46:23 +05:30
Tobi Lütke	d6f3688d91	Remove grpo command from default train entrypoint	2026-02-22 15:29:09 -05:00
Tobi Lütke	189916d6fb	Move GRPO training out of default finetune pipeline	2026-02-22 15:26:23 -05:00
Tobi Lütke	cbeeb1f89b	Add wall-clock checkpoints and full eval defaults	2026-02-22 15:02:02 -05:00
Tobi Lütke	5233e676d9	fix(rerank): truncate documents exceeding 2048-token context size node-llama-cpp throws a hard error when any document + query + template overhead exceeds the ranking context size. Truncate oversized documents using the rerank model's tokenizer before passing them to rankAll(). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-22 12:41:59 -05:00
Tobi Lutke	1d7d167b29	finetune: strict Pydantic schema, one canonical data format Replace ad-hoc JSON parsing with a strict Pydantic model (TrainingExample with typed OutputPair). All data loading goes through load_examples() which fails loudly on invalid data. - Convert v3_structured.jsonl from "searches" to "output" format - Rewrite all consumer scripts (prepare, validate, score, analyze) to load through the Pydantic schema - Prepared train/val files are ephemeral build artifacts - Restore LFM2 and GEPA experiments under experiments/ - Add pydantic>=2.0 to dependencies	2026-02-22 13:39:00 -04:00
Tobi Lutke	3950055708	finetune: quoted phrases, negation, and entity preservation (#247 ) Training data: - Expand lex phrases/negation examples from 12 to 74 with intent field - Add 50 personal entity examples (meetings, emails, projects with names) Reward function: - Detect entities at position 0 (fixes "Bob asked about deploy") - Per-entity coverage penalty: -20 per entity absent from all lex+vec - Phrase quoting bonus: +3 when lex uses quotes for multi-word terms - Expanded stopwords to reduce false positive entity detection Eval queries: add 21 test queries for personal entities, quoted phrases, and negation/disambiguation scenarios.	2026-02-22 13:38:59 -04:00
Tobi Lutke	599935754b	finetune: remove orphaned files and abandoned experiments Remove one-off data generator/fix scripts, superseded data files (v2, v3 replaced by v3_structured), LFM2 experiment, GEPA directory, duplicate job scripts, and historical docs. Clean up Justfile. These are restored under experiments/ in a later commit.	2026-02-22 13:38:59 -04:00
Tobi Lutke	64ef25e1f6	Document query grammar and add skill helpers	2026-02-22 13:36:08 -04:00
Tobi Lutke	0e0feb6f2b	release: v1.1.0	2026-02-22 11:09:36 -04:00
Tobi Lutke	60564b34f8	chore: gitignore package-lock.json	2026-02-22 11:09:36 -04:00
Tobi Lutke	1765b6870d	docs: write changelog for 1.1.0 Query document format, lex phrase/negation syntax, standard node shebang, collection management commands, and formal SYNTAX.md spec.	2026-02-22 11:09:36 -04:00
Tobi Lutke	c7e8ea02a5	test: restructure container smoke tests for interactive use Replaces the inner test script with an outer driver that runs individual podman/docker commands against a pre-built image. Tests sqlite-vec loading and store unit tests under both node and bun runtimes. Supports --build (image only), --shell (interactive), and -- CMD (arbitrary command) for debugging install issues in isolation.	2026-02-22 11:09:36 -04:00
Tobi Lutke	0b57711d32	refactor: replace bash wrapper with standard #!/usr/bin/env node shebang The qmd bin was a custom bash script that discovered node via hardcoded fallback paths (mise, asdf, nvm, homebrew). This was nonstandard and caused ABI mismatches when installed via bun (native modules compiled for bun but executed with node). Now uses the standard npm bin convention: dist/qmd.js with a node shebang, added by the build script. The isMain guard resolves symlinks so it works when npm/bun create symlinked bin entries. Also converts all dynamic require() calls in tests to ESM imports, and adds container-based smoke tests (test/smoke-install.sh) that verify install + run under both node and bun via mise in a Debian container.	2026-02-22 11:09:36 -04:00
Tobias Lütke	a815b52ede	Merge pull request #214 from pcasaretto/fix/nix-build-deps	2026-02-22 10:42:57 -04:00
Tobi Lütke	8d73eda4de	data: add 48 sports acronym training examples Covers UFC, NFL, NBA, NHL, MLB, F1, MLS, IMSA, WEC, NASCAR, PGA, ATP, WTA, FIFA. Fixes query expansion failures like UFC → 'united fighting club'.	2026-02-22 09:37:25 -05:00
Tobi Lütke	6ac7c6837e	chore: release 1.0.8	2026-02-19 06:56:17 -05:00
Tobi Lütke	3b87e3e224	feat: query document format, lex phrase/negation syntax, training data The 'query document' is now a first-class concept in QMD: a structured document with typed sub-queries that combine for best recall. ## Query types - lex: BM25 keyword search with phrase and negation syntax - vec: Semantic vector search (natural language questions) - hyde: Hypothetical document (write the expected answer) - expand: Auto-expand via local LLM (max 1, default for plain queries) ## Lex syntax Full BM25 operator support: "exact phrase" verbatim match, no prefix -term exclude documents containing term -"exact phrase" exclude documents containing phrase Examples: "C++ performance" optimization -sports -athlete "connection pool" timeout -redis "machine learning" -sports -athlete ## MCP tool description rewritten The 'query' tool description now fully teaches AI agents the query document format, lex syntax, and strategy for combining types. Includes worked examples including intent-aware lex (C++ performance, not sports) which is critical for disambiguation in dense corpora. ## Unit tests 11 new lex parser tests covering: - plain terms, quoted phrases, negation, combined - intent-aware disambiguation (performance -sports -athlete) - only-negation returns null (FTS5 constraint) - empty/whitespace handling ## Training data 12 new intent-aware examples for next model training round: - Real technical topics with lex phrase+negation combinations - Covers: C++ perf, Python memory, DB connections, rate limiting, SQL optimization, ML overfitting, Docker, JWT, async/await, git conflicts, Kubernetes, React state - Each shows how context/intent shapes lex query construction (e.g. performance with C++ context → -sports -athlete exclusions)	2026-02-19 06:52:58 -05:00
Tobi Lütke	2cb7b59152	fix(cli): print timing inline after each step, not on separate line	2026-02-19 06:11:21 -05:00
Tobi Lütke	24ffa4876b	feat(cli): show timing for expand, embed, and rerank steps Each LLM invocation now prints elapsed time: Expanding query... (4.2s) → searching 6 queries Embedding 4 queries... (605ms) Reranking 40 chunks... (3.2s)	2026-02-19 06:08:02 -05:00
Tobi Lütke	a81f013126	fix(status): improve tips with subfolder context example and better update cmd	2026-02-19 06:01:53 -05:00
Tobi Lütke	bdf0c205a9	feat(status): add tips for missing context and update commands	2026-02-19 05:54:18 -05:00
Tobi Lütke	262c7d351c	refactor: remove context check command	2026-02-19 05:53:13 -05:00
Tobi Lütke	2b94e097d2	feat(cli): add collection management commands New collection subcommands: - show <name> Show collection details - update-cmd <name> [cmd] Set pre-update command (runs before indexing) - include <name> Include in default queries (default) - exclude <name> Exclude from default queries Collections with includeByDefault=false are skipped unless explicitly named with -c flag. CLI improvements: - 'qmd collection' shows help instead of error - 'qmd collection list' shows [excluded] tag - Better command descriptions and examples	2026-02-19 05:46:07 -05:00
Tobi Lütke	07421de55f	feat(http): keep /search as alias for /query	2026-02-19 05:30:27 -05:00
Tobi Lütke	4649069e62	feat: add expand: type, rename to query, document syntax BREAKING CHANGES: - MCP tool renamed: structured_search → query - HTTP endpoint renamed: /search → /query New features: - expand: type auto-expands via local LLM (max 1 per query) - docs/SYNTAX.md formal grammar for query documents - lex syntax: "phrase", -negation documented Query types: lex, vec, hyde, expand Default (no prefix) = expand (backwards compatible)	2026-02-18 22:22:50 -05:00
Tobi Lütke	de3a83a553	refactor: remove OR operator from lex queries Simplify to just: terms, "phrases", and -negation	2026-02-18 22:17:52 -05:00
Tobi Lütke	77e4d8f378	refactor: remove single collection param, use collections array only BREAKING: collection param removed from structured_search. Use collections: ['name'] for single collection filter.	2026-02-18 22:16:15 -05:00
Tobi Lütke	efb39616e6	feat(lex): add query syntax for exact phrases, negation, and OR Lex queries now support: - "exact phrase" - quoted exact matching (no prefix) - -term or -"phrase" - exclude from results - term1 OR term2 - match either term Semantic queries (vec/hyde) validate and reject these operators with helpful error messages. Examples: performance -sports → matches "performance" excluding "sports" "machine learning" → exact phrase match auth OR authentication → matches either term	2026-02-18 22:14:09 -05:00
Tobi Lütke	d1ec31eab8	feat: add collections array filter + improve query writing docs - structured_search now accepts collections[] for OR filtering - Updated skill docs with detailed query writing guidance - lex: 2-5 keywords, include synonyms, exact names - vec: full natural language questions with context - hyde: 50-100 word hypothetical answer passages	2026-02-18 22:09:24 -05:00
Tobi Lütke	6d6bdff09c	docs: simplify skill documentation	2026-02-18 22:00:24 -05:00
Tobi Lütke	19284ddb80	refactor(mcp): remove deprecated search tools, keep only structured_search BREAKING CHANGE: MCP tools search, vector_search, deep_search removed. Use structured_search with lex/vec/hyde queries instead. - Remove search, vector_search, deep_search MCP tool registrations - Update MCP instructions to focus on structured_search - Update skill docs to reflect simplified API - Rename test describes to reflect they test store functions - CLI commands (qmd search, vsearch, query) unchanged for backwards compat	2026-02-18 21:50:25 -05:00
Tobi Lütke	db44e1a5bc	test: add comprehensive tests for structured search 32 tests covering: - parseStructuredQuery parser (24 tests) - plain queries returning null - single/multiple prefixed queries - mixed plain + prefixed lines - error on multiple plain lines - whitespace handling - edge cases (colons in text, etc.) - StructuredSubSearch type validation (3 tests) - structuredSearch function basics (5 tests) - empty searches - no matches - limit/minScore options	2026-02-18 21:39:40 -05:00
Tobias Lütke	222dde153d	Merge pull request #220 from tobi/feat/structured-search feat: add structured_search for LLM-provided query expansions	2026-02-18 22:36:20 -04:00
Tobi Lütke	8ebe5ab31b	feat(cli): add structured query syntax to qmd query Lines prefixed with lex:, vec:, or hyde: route directly to structured search, skipping automatic query expansion. Examples: qmd query 'lex: CAP theorem' qmd query $'lex: keywords\nvec: natural language question' qmd query $'lex: terms\nvec: question\nhyde: hypothetical answer...' Plain queries (single line, no prefix) still use automatic expansion. Multiple plain lines without prefixes error with helpful message. This lets CLI users leverage the same structured search as MCP, useful when piping from scripts or when you know exactly what query variations you want.	2026-02-18 19:58:41 -05:00
Tobi Lütke	bdec84a3e9	docs: use npm package name, clarify vec/hyde both use vector similarity	2026-02-18 19:34:08 -05:00
Tobi Lütke	0201710c2b	feat: add structured_search for LLM-provided query expansions - New MCP tool: structured_search - lets capable LLMs provide their own lex/vec/hyde query variations instead of using local expansion model - New REST endpoint: POST /search - same functionality without MCP protocol - Updated skill docs to prioritize structured_search for LLM callers - Added installation instructions for Claude Code, Desktop, and OpenClaw Pipeline: lex→FTS, vec/hyde→batch embed, RRF fusion (first query 2x weight), chunk + rerank, position-aware blending, dedup. This is the recommended endpoint for capable LLMs - they generate better query variations than the small local model, especially for domain-specific or nuanced queries.	2026-02-18 19:05:49 -05:00
Tobi Lutke	648779a04d	fix(test): reset currentIndexName between test files collections-config.test.ts set currentIndexName to "myindex" in its last test but only restored env vars in afterEach — not the module variable. Under bun test (single process), this leaked into mcp.test.ts, causing it to look for myindex.yml instead of index.yml. Fix: reset setConfigIndexName("index") in afterEach, and add defensive reset in mcp.test.ts beforeAll.	2026-02-18 15:53:58 -04:00
Tobi Lutke	0697bc59d6	release: v1.0.7	2026-02-18 15:36:54 -04:00
Tobi Lutke	8cddbcb247	changelog: write 1.0.7 unreleased notes LFM2 query expansion model, multiple collection filters, XDG_CONFIG_HOME support, and JSON output fixes.	2026-02-18 15:36:31 -04:00
Paulo Casaretto	1033fffca6	fix: add python3 and cctools to nix build dependencies node-gyp needs python3 to compile better-sqlite3, and on macOS it also needs libtool from cctools to create static libraries. Without these, `nix build` fails in the sandbox.	2026-02-18 10:22:32 -03:00

1 2 3 4 5 ...

288 Commits