The law in force should be knowable.

LawVM is a deterministic replay compiler for legislation. It reconstructs point-in-time statutory text from amendment streams, making legal state reproducible, inspectable, and auditable — and exposing where official consolidated surfaces diverge from enacted law.

Reported findings

High-confidence meaningful candidate findings reported to Finlex and Riigi Teataja for external review. Broader divergence evidence remains under active classification.

Publisher correction

Riigi Teataja confirmed and corrected a LawVM-reported Estonian consolidation omission in the Auditors Activities Act. Read the case study.

Construction proof

0.65%

Mean text distance against the archived Finlex comparison surface (frozen 2026-04-16). Residuals are classified separately: replay defects, source gaps, editorial artifacts, and candidate publisher issues.

Benchmark snapshot: 2026-04-16. Reported finding count: 2026-04-27. Estonia correction status: 2026-04-27. Corpus: Finnish alpha replay corpus curated from a larger amended-statute universe.

Research software. Not an official legal consolidation or legal advice. Finland frontend active; other frontends experimental.

See Finland evidence Compare jurisdictions Carry it forward Watch the compiler Read the architecture Get started GitHub

amendment act → parse → typed ops → resolve targets → replay → PIT state → compare → classify

Deterministic by design: conventional grammars and symbolic replay.

Why this exists

Legal text is an incrementally maintained system: thousands of amendments applied over decades by distributed authors with partial authority. The law at any moment is the accumulated result.

The usual assumption is that executable law requires formalized drafting or post-hoc rules-as-code. LawVM shows a different path: human-written amendment law is already executable. Compile the amendment stream already published by the legal system.

The immediate purpose is zero-to-one: prove high-coverage replay on a real legal system. Finland is the proving ground because its amendment stream is difficult enough to matter and open enough to audit.

Most legal databases present an editorial consolidation: a human-curated snapshot. LawVM replays the amendment chain from primary sources, preserves what happened, and makes the result auditable.

That means typed operations instead of string patches. Point-in-time materialization from explicit timelines. Findings and source-pathology traces instead of silent cleanup. A way to separate replay defects from editorial artifacts.

LawVM v0.1 is a construction proof and a handoff artifact. It shows that the substrate can be built, that ordinary law is already executable at the text-state layer, and that the remaining work is concrete. Researchers, public institutions, legal publishers, and civic infrastructure teams can start from a working compiler substrate.

Read The Law in Force Should Be Knowable for the civic principle. Read Why Law Is Law-Shaped for the structural analysis.

Carry this forward

LawVM v0.1 is a handoff point. The next stage belongs in real legal information infrastructure.

Finland shows that the construction is technically possible. Durable infrastructure needs maintained jurisdiction frontends, public-sector pilots, legal publisher audit workflows, and research replication.

Useful adoption signals include legal publisher audit workflows, public-sector pilots, official source expertise, and concrete plans to make LawVM part of legal information infrastructure. See the handoff map.

Current state

Finland is the active reference frontend and the main proving ground. It is where the replay model, normalization rules, editorial adjudication, and residual-review workflow are exercised against a real corpus. See the Finland showcase.

Estonia is the second mature lane: strong consistency-checking telemetry and one Riigi Teataja-confirmed correction from a LawVM report. The UK is the third most substantial frontend: a maintained legislation.gov.uk workbench whose 2026-06-03 wide run covered 12,393 rows with 0 errors, 12,089 status-OK rows, and 10 source-unavailable rows. Norway and Sweden remain experimental source-frontier probes. See the jurisdiction status and benchmark caveats. Read the Estonia correction case study.

If the compiler model is still too abstract, start with the interactive explainer: it walks through a synthetic amendment from source text to typed operation to document-tree mutation. Open the replay explainer.

Layer 0

LawVM is deliberately narrow. It computes what the legal text says at a point in time. Interpretation, application, and cost analysis are higher layers that build on a correct text-state substrate.

The design principle: keep the text-state kernel narrow, explicit, stable, and anchor-rich. That is what makes it useful as infrastructure for everything above.

Get started

git clone https://github.com/eliask/lawvm.git
cd lawvm
uv sync

# Import the public Finlex source archives (~5 GB after ingestion)
uv run lawvm import-zip \
  --statute-zip https://www.finlex.fi/api/assets/open-data/archives/statute.zip \
  --consolidated-zip https://www.finlex.fi/api/assets/open-data/archives/statute-consolidated.zip

# Replay, diff, and explain
uv run lawvm replay 2002/738 --as-of 2024-01-01
uv run lawvm diff 2002/738
uv run lawvm explain 2002/738

Full getting-started guide · GitHub repository