Refactor completion suggestions engine to use declarative rules by rolandwalker · Pull Request #1761 · dbcli/mycli

rolandwalker · 2026-03-31T19:09:39Z

Description

Refactor completion suggestions engine to use declarative rules, breaking up suggest_based_on_last_token() into many small functions.

This is not perfect, as some functions such as _emit_lparen() remain large, and others such as suggest_special() are untouched. The rules are still ordered as a list, which could be finicky for future changes. An alternative is including a priority rank in the SuggestRule dataclass.

A risk is that the declarative rules make ample use of lambdas, which might impose a performance penalty.

Motivation: making the rules easier to understand and modify, and making rules easier to migrate to sqlglot, which may be more reliable and performant.

Further work could include turning the return values from the _emit functions into a list of Suggestion instances, instead of using dicts. It could also be nice to migrate the rules in SUGGEST_BASED_ON_LAST_TOKEN_RULES into a separate file completion_rules.py, in which perhaps all rules are not based on the last token.

The PR consists of four commits, which may make it easier to review

tests
the primary refactor to review
tests
make SQL parsing lazy for performance

Checklist

I added this contribution to the changelog.md file.
I added my name to the AUTHORS file (or it's already there).

To lint and format the code, I ran

uv run ruff check && uv run ruff format && uv run mypy --install-types .

github-actions · 2026-03-31T19:18:37Z

No blocking findings on correctness or security in the PR diff (completion_engine refactor appears behavior-preserving, with targeted tests for most branches).
Test gap: add a non-progressing rewind test for a binary operator token (not just comma), so the recursion guard in completion_engine.py is explicitly covered for both paths.
Current coverage hits comma-only cases in test_completion_engine.py and test_completion_engine.py.

Validation note: I couldn’t run the suite in this environment because uv/sqlparse are unavailable (uv: command not found, ModuleNotFoundError: sqlparse).

github-actions · 2026-03-31T19:29:57Z

Codex Review:

Medium: autocomplete performance regression from eager parsing
- In completion_engine.py, suggest_based_on_last_token() now always builds SuggestContext, which always parses SQL via _build_suggest_context() -> _parse_suggestion_statement() (line 103).
- Previously, cheap guards (word starts with digit, quote, inside quotes) returned early before parsing. That optimization is now gone, so parsing runs even in cases that immediately return [].
- Impact: higher per-keystroke latency in common typing paths (especially quoted strings), i.e. a user-visible regression in completion responsiveness.
- Action: restore fast-path short-circuiting before parse, or make parse/tokens lazy so rules that don’t need parse data don’t trigger it.
Missing regression test for the above fast-path behavior
- Current tests validate output correctness but not that parse is skipped for guard cases.
- Action: add a test that monkeypatches _parse_suggestion_statement and asserts it is not called when word_before_cursor starts with digit/quote or cursor is inside '/" strings.

No security issues stood out in this PR.

I couldn’t run tests in this environment because uv/pytest are unavailable here.

github-actions · 2026-03-31T19:53:16Z

Codex Review

No blocking findings in the PR diff (eb1dd47c...abdf19b) for correctness or security.

I checked the refactor in completion_engine.py against the prior branch logic and didn’t find behavior regressions in rule ordering or fallback semantics; the declarative rules preserve the old branches.

Residual risk / test gap:

I could not execute the test suite in this environment because uv/pytest are unavailable (command not found), so this review is static-analysis only. Running uv run -- pytest test/pytests/test_completion_engine.py in CI/local is still important for confidence.

in preparation for a refactor

breaking up suggest_based_on_last_token() into many small functions. This is not perfect, as some functions such as _emit_lparen() remain large, and others such as suggest_special() are untouched. The rules are still ordered as a list, which could be finicky for future changes. An alternative is including a priority rank in the SuggestRule dataclass. A risk is that the declarative rules make ample use of lambdas, which might impose a performance penalty. Motivation: making the rules easier to understand and modify, and making rules easier to migrate to sqlglot, which may be more reliable and performant. Further work could include turning the return values from the _emit functions into a list of Suggestion instances, instead of using dicts. It could also be nice to migrate the rules in SUGGEST_BASED_ON_LAST_TOKEN_RULES into a separate file completion_rules.py, in which perhaps all rules are not based on the last token.

adding some todo comments in completion_engine.py, related to tests which are xfailed here.

so that guards such as "is inside string" can run without full parsing, for performance.

rolandwalker self-assigned this Mar 31, 2026

rolandwalker force-pushed the RW/add-completion-engine-tests branch from 132f539 to 4427504 Compare March 31, 2026 19:10

rolandwalker added the codex request Codex review for an updated PR label Mar 31, 2026

rolandwalker force-pushed the RW/add-completion-engine-tests branch from 4427504 to 9accb87 Compare March 31, 2026 19:27

rolandwalker added codex request Codex review for an updated PR and removed codex request Codex review for an updated PR labels Mar 31, 2026

rolandwalker force-pushed the RW/add-completion-engine-tests branch from 9accb87 to abdf19b Compare March 31, 2026 19:47

rolandwalker added codex request Codex review for an updated PR and removed codex request Codex review for an updated PR labels Mar 31, 2026

rolandwalker requested review from amjith and scottnemes March 31, 2026 20:27

rolandwalker removed the codex request Codex review for an updated PR label Mar 31, 2026

rolandwalker added 4 commits April 1, 2026 07:35

add completion_engine regression tests

1c2a5fa

in preparation for a refactor

add tests for declarative suggestion functions

ca472ba

adding some todo comments in completion_engine.py, related to tests which are xfailed here.

make completion-engine SQL parsing lazy

bd6183c

so that guards such as "is inside string" can run without full parsing, for performance.

rolandwalker force-pushed the RW/add-completion-engine-tests branch from 823ea55 to bd6183c Compare April 1, 2026 11:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor completion suggestions engine to use declarative rules#1761

Refactor completion suggestions engine to use declarative rules#1761
rolandwalker wants to merge 4 commits intomainfrom
RW/add-completion-engine-tests

rolandwalker commented Mar 31, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 31, 2026

Uh oh!

github-actions bot commented Mar 31, 2026 •

edited by rolandwalker

Loading

Uh oh!

github-actions bot commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rolandwalker commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Uh oh!

github-actions bot commented Mar 31, 2026

Uh oh!

github-actions bot commented Mar 31, 2026 • edited by rolandwalker Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codex Review:

Uh oh!

github-actions bot commented Mar 31, 2026

Codex Review

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rolandwalker commented Mar 31, 2026 •

edited

Loading

github-actions bot commented Mar 31, 2026 •

edited by rolandwalker

Loading