VSAVM

Question compilation pipeline

This page is a theory note. It expands the topic in short chapters and defines terminology without duplicating the formal specification documents.

The diagram has a transparent background and is intended to be read together with the caption and the sections below.

Related wiki pages: VM, event stream, VSA, bounded closure, consistency contract, query compiler, schema.

Related specs: DS003.

Overview

A question is treated as a request to produce an executable query program. The pipeline is explicit to support audit and control: normalization creates a structured span, retrieval proposes candidate schemas, slot filling binds discrete values, and compilation emits a program in the VM instruction set. This transformation operates through a learned pipeline rather than hand-coded rules.

Natural language to query compilation

The compilation process follows explicit stages:

Slot filling and program instantiation

Slot filling binds entities, roles, and references using discrete matching and coreference heuristics, augmented by associative retrieval:

Complex queries may require multiple schemas combined or nested. The composition system maintains explicit data flow graphs tracking information movement through composite reasoning.

Program instantiation translates filled schemas into executable VM instruction sequences, including optimization steps: common subexpression identification, redundant operation elimination, and operation reordering for cache locality.

Program search and selection

The search process explores the space of possible reasoning strategies:

Schema learning and consolidation

The schema learning process discovers recurring patterns in query-program relationships:

Multimodal query processing

Queries spanning multiple input modalities require sophisticated coordination:

Managing ambiguity

Instead of forcing a single interpretation, VSAVM carries multiple candidate programs in a beam. Candidates are evaluated by explanatory fit and by early closure checks that detect contradictions. This makes uncertainty explicit and supports conditional outputs when necessary.

Engineering implications

Because compilation is explicit, it is testable. You can measure how often a schema is retrieved, how often slot filling is ambiguous, and how often a candidate fails under closure. These metrics can guide consolidation and improve robustness over time.

question-compilation diagram
Questions are compiled into executable programs through explicit stages, with ambiguity managed by beam evaluation and consistency checks. Schema learning and multimodal coordination extend the core pipeline.

References

Program synthesis (Wikipedia) Beam search (Wikipedia) Information retrieval (Wikipedia) Minimum description length (Wikipedia)