You may read my paper and cite like this:
WANG, Z. (2025, May 12). Scheme-langserver: Treat Scheme Code Editing as the First-Class Concern. The 18th European Lisp Symposium (ELS`25), Zurich. https://doi.org/10.5281/zenodo.15384882
Due to occasional GitHub access restrictions from China, this repository is also mirrored on Codeberg and Gitee. I collaborate with XmacsLabs; a fork is available here.
default.mp4
VSCode is now supported! See the setup guide.
Note: Auto-generated type information is available here. It is mainly used for downstream development and debugging.
Implementing IDE features like autocomplete, goto definition, and hover documentation is a significant effort. Compared to languages like Java, Python, JavaScript, or C, language server implementations for Lisp dialects are still scarce. Existing tools such as Geiser, racket langserver, and swish-lint rely primarily on a REPL or keyword tokenization rather than static program analysis.
For example, when editing an incomplete project whose code is not yet fully executable, Geiser can only complete top-level bindings listed by environment-symbols (on Chez Scheme) or raw symbols—not true identifiers. This means local bindings and unfinished code receive no help in recognizing valid identifier scopes. The same limitation applies to goto definition and other core IDE features.
The root cause is that Scheme and other Lisp dialects present a formidable challenge for program analysis: their rich data structures, flexible control flow, and especially macros make static reasoning difficult. But this does not mean Scheme is only for geniuses and meta-programming. With a better editing environment, Scheme can be accessible and productive for everyone.
scheme-langserver is a Language Server Protocol (LSP) implementation for Scheme that provides completion, goto definition, hover, and type inference through static code analysis based on the R6RS standard. It handles incomplete code gracefully and is published via Akku, a Scheme package manager.
The server has been tested on Chez Scheme 9.4, 9.5, and 10.x.
See the setup guide.
scheme-langserver can persist the analyzed workspace state to a Chez FASL cache so
that subsequent restarts skip the expensive init-references phase. Enable it
with the --cache-path option:
./run --cache-path ~/.cache/scheme-langserverThe cache file is written to <cache-path>/workspace.fasl. It is saved when the
server receives a normal LSP exit or shutdown request; if the process
crashes or is killed, the previously saved cache remains usable. The cache is
keyed by a manifest that includes the langserver version, Chez version, machine
type, and record-layout fingerprint; any mismatch falls back to a cold start.
Because Chez FASL is platform-specific, do not share the cache file across
machines or Chez versions. When only a few files have changed since the cache
was saved, the server performs an incremental refresh and preserves the analysis
results for unchanged files.
Typical speedups:
| Fixture | Cold startup | Cached startup | Speedup |
|---|---|---|---|
| simple-lib | ~31 ms | ~1.3 ms | ~24x |
| Synthetic 200-file fixture | ~2484 ms | ~49 ms | ~50x |
scheme-langserver itself (128 .sls files) |
~55,790 ms | ~1750 ms | ~32x |
| scheme-langserver, one file changed | ~58,846 ms | ~1900 ms | ~31x |
See doc/analysis/workspace.md §8 and AGENTS.md §11 for implementation details.
For troubleshooting tips, see debugging.md.
Active development is focused on bug fixes, performance profiling, and expanding the type inference system. The 2.1.3 release adds a Chez FASL workspace cache that skips the expensive init-references phase on restart. The 2.1.2 release restores bracket-mismatch diagnostics in the fault-tolerant tokenizer. The 2.1.1 release fixes several crashes and diagnostics issues, and adds R7RS/S7 tokenizer compatibility. The 2.1.0 release brings major improvements to diagnostics, macro auto-resolution, and LSP protocol robustness. Planned features include a dedicated VSCode plugin and data-flow analysis.
2.1.3 — Feature release adding a Chez FASL workspace cache with incremental refresh.
- Workspace cache: New
--cache-path <dir>CLI option persists the analyzed workspace state to a Chez FASL file. On restart, unchanged files skip the expensiveinit-referencesphase; changed/added/deleted files are refreshed incrementally. Typical speedups range from ~20x on small fixtures to ~30–50x on larger projects.
Older releases are documented in doc/release-history.md.
- Tokenizer:
- Restore clear diagnostics for unmatched parentheses and brackets (e.g.
unclosed parenthesis,unexpected close bracket) during fault-tolerant parsing, while keeping the R7RS/S7 compatibility fixes from 2.1.1.
- Restore clear diagnostics for unmatched parentheses and brackets (e.g.
- Completion for top-level and local identifier bindings.

- Goto definition.

- Compatible with package manager: Akku.
- File-change synchronization with corresponding index updates.
- Hover.
- References and document highlights (document-scoped).

- Document symbol.

- Workspace symbol search (
workspace/symbol). - Catching local identifier bindings in
define-syntax,let-syntax, and other macro forms via hand-written rules. - Automatic macro resolution (experimental). The generic expander for
syntax-rules,syntax-case,let-syntax, andletrec-syntax—plus multi-layer macro cascade propagation—is functionally correct but not enabled in production because it is too slow for real-world projects (it triggers heavy macro expansion and cross-document reference back-propagation for every macro use site). The routing code inanalysis/identifier/self-defined-rules/router.slscurrently falls back to hand-written rules such asmatch-processforufo-match. If you are interested in pushing this research forward—e.g. via lazy expansion, incremental caching, or selective rule generation—contributions and discussions are very welcome! - Cross-platform parallel indexing.
- Custom source-code annotator compatible with
.spsfiles. - Peephole optimization for API requests using suspendable tasks.
- Type inference via a homemade DSL interpreter, now integrated into auto-completion. Parameters whose types match the expected signature are ranked higher, as shown below where
length-aandlength-b(bothinteger?) appear first because they match the parameter type required by<=.
- Supports R6RS, R7RS, and S7 by switching top environments.
send-message
2023 11 21 11 26 41 967266866
{"jsonrpc":"2.0","id":"3","result":[{"label":"length-a"},{"label":"length-b"},{"label":"lambda"},{"label":"latin-1-codec"},{"label":"lcm"},{"label":"least-fixnum"},{"label":"length"},{"label":"let"},{"label":"let*"},{"label":"let*-values"},{"label":"let-syntax"},{"label":"let-values"},{"label":"letrec"},{"label":"letrec*"},{"label":"letrec-syntax"},{"label":"lexical-violation?"},{"label":"list"},{"label":"list->string"},{"label":"list->vector"},{"label":"list-ref"},{"label":"list-sort"},{"label":"list-tail"},{"label":"list?"},{"label":"log"},{"label":"lookahead-char"},{"label":"lookahead-u8"}]}- Abstract interpreter that resolves identifiers across multiple file extensions:
.scm,.ss,.sps,.sls,.sld. - Code diagnostics with LSP-standard
sourceandcodefields. Detects library-not-found, duplicate identifiers in binding forms (e.g.(lambda (x x) ...)), unused imports (e.g.(only (rnrs) car)wherecaris never referenced), and tokenizer syntax errors.
- Renaming support (
textDocument/rename+prepareRename). - Formatting (
textDocument/formatting). - Signature help (
textDocument/signatureHelp). - Code actions (
textDocument/codeAction) — e.g. "Remove unused import", "Organize imports". - Full R6RS compatibility.
- Step-by-step macro expander for self-defined macros.
- Code evaluation within the language server.
- Cross-language semantic support via AST transformers.
- Extract expression/statement into a procedure (refactoring).
Pull requests are welcome! Please see AGENTS.md for project conventions, build steps, and coding style before opening a PR.
Since mid-2025, active development on this project has been assisted by KIMI (Moonshot AI) in a vibe-coding workflow: the maintainer describes intent in natural language, KIMI explores the codebase, proposes changes, and iterates with tests. If you notice commits authored or co-authored by kimi, that is the AI agent trail. Human review and final approval always remain with the maintainer.
Almost all key procedures and APIs are covered by tests. Run the full suite with:
bash test.shFor faster feedback during development, run a single test file:
source .akku/bin/activate
scheme --script tests/protocol/apis/test-definition.spsNote: Tests currently focus on single-threaded execution.
Script-Fu is based on Scheme. Using this example, you can apply scheme-langserver to .scm files in GIMP.
Possible future targets include OMN (Opusmodus Notation) and AutoLisp.
find . -name "*.sls" ! -path "./.akku/*" |xargs wc -l- Catching identifier bindings — how the abstract interpreter resolves
define,lambda,let,define-record-type, etc. - Macro auto-resolution — generic
syntax-rulesexpansion vs hand-written rules - Type system & inference — complete type-inference pipeline and DSL
- Workspace lifecycle — initialization, incremental updates, and refresh batches
- File dependency graph — topological sorting and linkage matrix
- API request scheduling — request queue, engine time-slicing, cancellation, and document-sync protection
- Diagnostic publication — how diagnostics are generated, accumulated, and sent
- Debugging guide — enable logs, replay logs, and iterative printf debugging
- Development guide (中文) / English version
- AGENTS.md — build steps, testing conventions, coding style, and common traps for contributors
- Scheme-langserver paper (ELS'25) — academic paper on static analysis for Scheme
- Macro resolution notes — debugging notes for macro identifier capture
- Syntax candy DSL — pattern matcher for type-rule authoring
- Record type inference analysis —
define-record-typein the type system - Type inference benchmark — performance measurement methodology
- DeepWiki
