OriGen Architecture — Top-Level Overview¶
Purpose of This Document¶
This document defines the conceptual foundation of OriGen:
- What the system is and is not
- Core architectural philosophy
- The high-level model that drives all design decisions
- The separation of planning vs execution
- Why determinism is a first-class constraint
- The metaphors that guide OriGen’s mental model
- Architectural invariants that should not drift
This is the root of the architecture series and should remain mostly stable after finalization.
1. What OriGen Is¶
OriGen is a deterministic workflow compiler.
OriGen does not execute tools directly. It:
- loads Maps
- loads Navigators
- resolves dependencies
- normalizes the workflow
- selects Backpacks
- validates structure
- produces a deterministic Route (IR)
- Guide translates Route into native platform formats
Execution is delegated to interchangeable backend drivers.
In summary:
OriGen decides what should happen, not how it runs.
This keeps the system platform-agnostic, reproducible, and maintainable.
2. The Compass, Maps, Navigators, Backpacks, Routes, Guides¶
OriGen uses a consistent metaphor to describe its components.
Compass (Core Planner)¶
- Loads Maps
- Loads Navigators
- Resolves Backpacks
- Compiles everything into a Route The Compass is the heart of OriGen’s planning model.
Maps (Declarative Workflows)¶
YAML workflows describing intent, not logic:
- steps
- tools
- modes
- inputs/outputs
- order of execution
Maps contain no imperative code.
Navigators (Tool Definitions)¶
Navigator files define:
- tool containers (by OCI digest, not tag)
- entrypoints
- argument templates
- modes
- backpack requirements
They isolate tooling changes from workflows.
Backpacks¶
Backpacks are immutable, versioned, read-only resource bundles carried into execution.
- typesetting engines (LaTeX, typst, HTML→PDF engines, etc.)
- ICC profiles
- font packs
- metadata processors
- templates
- validators
- any future deterministic resource
Routes¶
Intermediate Representation (IR): the deterministic, platform-neutral execution graph OriGen compiles from a Map. A Route defines how a workflow should run, independent of any backend system.
Guides¶
Translates a Route into native backend execution formats (e.g., GitHub Actions, GitLab CI, Argo, Kubernetes, or local scripts). Guides perform translation only — OriGen does not run workflows.
3. Why This Architecture Exists¶
Modern workflows — across software, data, research, infrastructure, and automation — are built from heterogeneous, independently evolving components:
- containerized tools and CLIs
- domain-specific engines (compilers, renderers, transformers)
- data processors and validators
- environment managers and package ecosystems
- runtime platforms (CI systems, orchestrators, schedulers)
- OS and container environments
- internal scripts and glue code
Teams combine these pieces in different ways. Defaults drift. Tools silently upgrade. Environment assumptions accumulate. And because each component changes at its own pace, the workflow itself becomes unstable.
The result is the same everywhere:
identical inputs frequently fail to produce identical outcomes.
What Knuth solved for typesetting — the same input produces the same output — generalizes to far more than documents. In any domain, stability requires an architecture where intent is explicit, toolchains are frozen, and the plan cannot mutate downstream.
OriGen extends that principle from a single tool to entire multi-step pipelines, no matter what they contain.
OriGen is explicitly not tied to containers, CI systems, or cloud platforms. It applies equally well to workflows that run in VMs, on bare metal, inside embedded devices, across HPC clusters, or within domain-specific appliances. Whether a toolchain lives in a container, a runtime VM, an FPGA build environment, a LaTeX engine, a compiler suite, or an ML training stack, OriGen can model it deterministically — as long as its behavior can be declared through Maps, Navigators, and Backpacks.
It can describe deterministic workflows built from:
- containerized tools and VM-hosted tools
- embedded, on-device, or appliance-bound execution environments
- data and ML pipelines
- infrastructure and platform automation
- DevOps build/scan/publish chains
- document, publishing, and rendering engines
- scientific and research reproducibility environments
- firmware, simulation, and hardware-adjacent toolchains
- any collection of tools declared via Maps + Navigators + Backpacks
OriGen is about workflow determinism, not tool or platform choice. It provides a stable planning layer regardless of the technologies involved.
4. Planning vs Execution (Foundational Separation)¶
OriGen performs planning. Backend drivers perform execution.
OriGen does not:
- run containers
- manage schedulers
- handle resource allocation
- replace CI/CD systems
- embed a runtime for tools
Instead:
- OriGen outputs a deterministic IR
- A backend driver interprets that IR and performs execution
- Local development and one-click desktop modes use built-in backend drivers
- CI/CD, serverless, and enterprise environments implement their own backend drivers externally
Backend drivers are peers, not a hierarchy.
5. Execution Flow (High-Level)¶
OriGen produces a deterministic IR. Execution is delegated to a backend driver chosen by the user or environment.
All backend drivers are peers. No backend derives from any other.
About the Route. The Route is a structured, machine-readable execution plan describing steps, dependencies, required tools, and deterministic parameters. It is versioned, schema-validated, and serializable. The IR does not encode execution strategy — only intent and structure — ensuring stability across platforms and backends.

Key points¶
- The IR is universal across all execution environments.
- Backend drivers are interchangeable adapters that accept IR.
- Local container and VM drivers are included for development and desktop use; all others are external.
- OriGen stays independent of container engines, orchestration systems, and toolchains.
This architecture ensures OriGen is reproducible, portable, and flexible.
6. Backend Landscape¶
OriGen ships with built-in backend drivers for:
Local Container Backend (Developer Mode)¶
Uses whatever OCI-compatible container engine the host provides.
Local Runtime VM Backend (One-Click Mode)¶
A sealed VM appliance containing a minimal container engine and backpacks.
External Backend Drivers¶
Implemented by CI/CD systems, enterprises, or cloud environments:
- CI/CD
- serverless execution
- hardened enterprise runtimes
- future custom backends
These exist outside OriGen Core and consume IR directly.
7. Determinism¶
Determinism is a first-class architectural constraint.
Determinism applies to both successful execution and failure states: identical inputs must produce identical outputs and identical errors, enabling reproducible debugging and CI behavior.
OriGen enforces:
- identical outputs for identical inputs
- identical failure states for identical inputs
- immutable sidecars
- pinned tool containers (OCI digests)
- controlled mount and directory structure
- stable IR schema
- execution isolation from the host system
If a design conflicts with determinism, the design is rejected.
8. Non-Goals¶
OriGen does not attempt to:
- be a container engine
- serve as an orchestrator
- replace CI/CD systems
- mandate a publishing toolchain
- enforce LaTeX-only workflows
- become a monolithic “all-in-one” tool
OriGen focuses on:
- planning
- determinism
- stability
- clarity
- portability
- reproducibility
Execution is always delegated.
9. Stability and Invariants¶
This document and 01-contracts.md define architectural principles that must remain stable:
- planning/execution separation
- backend-agnostic
- toolchain neutrality
- immutability of sidecars
- determinism as a contract
- IR as the canonical output of planning
Future components must conform to these invariants.
10. Reading Order¶
Continue with:
- 01-contracts.md — foundational invariants and system guarantees
- 02-execution-model.md — the system lifecycle
- 03-ir-specification.md — the core data structure
…and proceed numerically.