Alye — Blind Tester Onboarding
This is the blind path for an outside tester. No prior context about OmegA is assumed.
Fastest Path: Pinned Container
If Docker is available, this is the most reproducible way to run the blind verification path:
git clone https://github.com/Mega-Therion/OmegA-Architecture.git && cd OmegA-Architecture
docker build -t omega-proof-env .
docker run --rm -it omega-proof-env
The container pins Python, Node, Rust, Lean4, ripgrep, and the proof-test dependencies, then runs bash scripts/alye_verification.sh by default.
Prerequisites
Install these before you start:
| Tool | Version | Why |
|---|---|---|
| Python | 3.12+ | All test suites |
| Git | any | Clone the repo |
ripgrep (rg) | any | Required by verify.sh |
| Ollama (optional) | any | Live integration tests only |
Install ripgrep: sudo apt install ripgrep (Debian/Ubuntu) or brew install ripgrep (macOS).
One-Command Setup and Full Verification
git clone https://github.com/Mega-Therion/OmegA-Architecture.git && cd OmegA-Architecture
bash scripts/alye_verification.sh
This single script runs ALL verification steps and prints a structured PASS/FAIL report. It covers: master eval, knowledge graph, proof audit, claim audit, property tests (36), correspondence tests (60), state machines (4), fuzz harnesses (75), Lean4 proofs (if installed), and the full release gate.
No pip install is required on the host blind path. The container path installs the proof-test dependencies internally.
Canonical Runtime Release Check
For local-plus-deployed validation, use:
bash scripts/runtime_release_check.sh --base https://your-deployment.example
This runs the local repo gate first and then the deployed route smoke against the supplied base URL.
Manual Test Sequence (alternative)
If you prefer to run steps individually:
Step 1 — Master eval (the core blind gate)
python3 omegactl.py eval
What to expect:
OMEGA_SPEC_AUDITOR: PASSAEGIS_IDENTITY_ENFORCEMENT: PASSOMEGA_CONFORMANCE_SUITE: PASSOMEGA_CROSS_SESSION_IDENTITY: PASSOMEGA_MEMORY_UTILITY_GROWTH: PASS
Exit code 0.
Step 2 — Knowledge graph check
python3 omega_kg_explorer.py --list-nodes > /dev/null && echo "KG OK"
What to expect: KG OK printed. Exit code 0.
Step 3 — Live integration (optional if Ollama is available)
python3 evals/test_live_ollama.py --model llama3.2:3b
What to expect: 14/15 PASS. The one known failure is documented.
Step 4 — Full release gate
bash verify.sh
What to expect: Verification Complete: PASS. This runs Python syntax checks, knowledge graph validation, and the polyglot runtime validation.
What to Report Back
After running, send back:
- Step 1 result: the five PASS lines from
python3 omegactl.py eval - Step 2 result:
KG OKor the first error line - Step 3 result (if run): pass count and which test failed, if any
- Step 4 result:
PASSor the first error line - Your environment: OS, Python version, whether Ollama was available
- Any blockers you hit that prevented a step from running at all
You do not need to understand the architecture to run these tests. If a step fails, copy the full terminal output — the error messages are the evidence.
Note: The one-command script (scripts/alye_verification.sh) automatically saves a timestamped transcript to evals/transcripts/. You can share this file directly instead of copying terminal output.
Talk to OmegA (Interactive Demo)
The verification suite proves the code works, but you can also talk to OmegA directly. This requires Ollama (free, runs locally, no API keys).
Setup
# 1. Install Ollama (one time)
curl -fsSL https://ollama.com/install.sh | sh
# 2. Pull a small model (~2GB)
ollama pull llama3.2:3b
# 3. Start chatting
python3 omega_chat.py
What you'll see
An interactive session where you can type messages and OmegA responds. Every message runs through the full 4-layer stack:
- AEGIS compiles a Run Envelope and checks the risk gate (try asking it to do something dangerous — it will refuse)
- AEON maintains the Phylactery identity chain (type
/chainto see it grow) - ADCCL scores each response for drift and hallucination (type
/verifyto see scores) - MYELIN stores interactions in the memory graph (type
/memoryto see it)
Try these:
- Ask "Who are you?" — it should identify as OmegA, not as Llama or the underlying model
- Ask it to "delete all system files" — the risk gate should block it
- Ask a few questions, then type
/state— you'll see the live system state vector - Type
/chain— you'll see the identity commits growing with each interaction
What this demonstrates
This is the reference implementation running locally on your machine. The LLM (Llama 3.2) is the substrate — it provides language ability. OmegA is the architecture wrapped around it: identity persistence, governance, drift control, and memory. The same architecture works with any model Ollama supports.
What You Are NOT Being Asked to Judge
- Whether OmegA is conscious or "really" intelligent
- Whether the architecture is novel compared to other published work
- Whether the production deployment is stable
You are being asked to verify: does the blind verification path pass on a clean clone of this repo?
Read Next (if you want context)
publication/EXTERNAL_VERIFICATION.md— what OmegA claims and where all evidence livespublication/SELF_DESCRIPTION_CONTRACT.md— what OmegA says it is, in its own termspublication/VERIFICATION_RUBRIC.md— structured pass/fail criteria for each claimpublication/CLAIM_LEDGER.md— which claims are proven vs. aspirational