Add hackathon demo script #305

Open
topherchris420 wants to merge 2 commits into main from add/hackathon-demo-script

topherchris420 (Owner) commented Apr 17, 2026

R.A.I.N. Lab demo: Watch AI fight my own research paper

  • James (Lead Scientist) defends Christopher Woodyard's paper
  • Elena (Quantum Information Theorist) attacks with formal critique
  • Uses MiniMax M2.7 via Ollama
  • ~80 second runtime for live demos

Summary

Describe this PR in 2-5 bullets:

  • Base branch target (main for all contributions):
  • Problem:
  • Why it matters:
  • What changed:
  • What did not change (scope boundary):

Label Snapshot (required)

  • Risk label (risk: low|medium|high):
  • Size label (size: XS|S|M|L|XL, auto-managed/read-only):
  • Scope labels (core|agent|channel|config|cron|daemon|doctor|gateway|health|heartbeat|integration|memory|observability|onboard|provider|runtime|security|service|skillforge|skills|tool|tunnel|docs|dependencies|ci|tests|scripts|dev, comma-separated):
  • Module labels (<module>: <component>, for example channel: telegram, provider: kimi, tool: shell):
  • Contributor tier label (trusted contributor|experienced contributor|principal contributor|distinguished contributor, auto-managed/read-only; author merged PRs >=5/10/20/50):
  • If any auto-label is incorrect, note requested correction:

Change Metadata

  • Change type (bug|feature|refactor|docs|security|chore):
  • Primary scope (runtime|provider|channel|memory|security|ci|docs|multi):

Linked Issue

  • Closes #
  • Related #
  • Depends on # (if stacked)
  • Supersedes # (if replacing older PR)

Supersede Attribution (required when Supersedes # is used)

  • Superseded PRs + authors (#<pr> by @<author>, one per line):
  • Integrated scope by source PR (what was materially carried forward):
  • Co-authored-by trailers added for materially incorporated contributors? (Yes/No)
  • If No, explain why (for example: inspiration-only, no direct code/design carry-over):
  • Trailer format check (separate lines, no escaped \n): (Pass/Fail)

Validation Evidence (required)

Commands and result summary:

cargo fmt --all -- --check
cargo clippy --all-targets -- -D warnings
cargo test
  • Evidence provided (test/log/trace/screenshot/perf):
  • If any command is intentionally skipped, explain why:

Quality Delta (required for medium/high risk changes)

  • Quality delta required? (Yes/No):
  • Expected panic count impact (production path):
  • Expected unwrap count impact (production path):
  • Expected flaky test rate impact:
  • Expected mean PR size impact (if stacked/split work):
  • Expected critical-path test coverage impact:
  • If any metric regresses, justify why and note containment/rollback:

Security Impact (required)

  • New permissions/capabilities? (Yes/No)
  • New external network calls? (Yes/No)
  • Secrets/tokens handling changed? (Yes/No)
  • File system access scope changed? (Yes/No)
  • If any Yes, describe risk and mitigation:

Privacy and Data Hygiene (required)

  • Data-hygiene status (pass|needs-follow-up):
  • Redaction/anonymization notes:
  • Neutral wording confirmation (use R.A.I.N./project-native labels if identity-like wording is needed):

Compatibility / Migration

  • Backward compatible? (Yes/No)
  • Config/env changes? (Yes/No)
  • Migration needed? (Yes/No)
  • If yes, exact upgrade steps:

i18n Follow-Through (required when docs or user-facing wording changes)

  • i18n follow-through triggered? (Yes/No)
  • If Yes, locale navigation parity updated in README*, docs/README*, and docs/SUMMARY.md for supported locales (en, zh-CN, ja, ru, fr, vi)? (Yes/No)
  • If Yes, localized runtime-contract docs updated where equivalents exist (minimum for fr/vi: commands-reference, config-reference, troubleshooting)? (Yes/No/N.A.)
  • If Yes, Vietnamese canonical docs under docs/i18n/vi/** synced and compatibility shims under docs/*.vi.md validated? (Yes/No/N.A.)
  • If any No/N.A., link follow-up issue/PR and explain scope decision:

Human Verification (required)

What was personally validated beyond CI:

  • Verified scenarios:
  • Edge cases checked:
  • What was not verified:

Side Effects / Blast Radius (required)

  • Affected subsystems/workflows:
  • Potential unintended effects:
  • Guardrails/monitoring for early detection:

Agent Collaboration Notes (recommended)

  • Agent tools used (if any):
  • Workflow/plan summary (if any):
  • Verification focus:
  • Confirmation: naming + architecture boundaries followed (AGENTS.md + CONTRIBUTING.md):

Rollback Plan (required)

  • Fast rollback command/path:
  • Feature flags or config toggles (if any):
  • Observable failure symptoms:

Risks and Mitigations

List real risks in this PR (or write None).

  • Risk:
    • Mitigation:

github-actions bot added the scripts label (Auto scope: scripts/** changed.) Apr 17, 2026
@github-actions

Thanks for contributing to R.A.I.N.

For faster review, please ensure:

  • PR template sections are fully completed
  • cargo fmt --all -- --check, cargo clippy --all-targets -- -D warnings, and cargo test are included
  • If automation/agents were used heavily, add brief workflow notes
  • Scope is focused (prefer one concern per PR)

See CONTRIBUTING.md and docs/pr-workflow.md for full collaboration rules.

github-actions bot commented Apr 17, 2026

PR intake checks found warnings (non-blocking)

Fast safe checks found advisory issues. CI lint/test/build gates still enforce merge quality.

  • Incomplete required PR template fields: security risk/mitigation

Action items:

  1. Complete required PR template sections/fields.
  2. Remove tabs, trailing whitespace, and merge conflict markers from added lines.
  3. Re-run local checks before pushing:
    • ./scripts/ci/rust_quality_gate.sh
    • ./scripts/ci/rust_strict_delta_gate.sh
    • ./scripts/ci/docs_quality_gate.sh
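
As a rough local pre-check for action item 2, a file can be scanned for the three kinds of line issues before running the gate scripts. This is only a sketch: `find_line_issues` is a hypothetical helper that is not part of the repository, and it scans a whole file rather than only the added lines that the intake check inspects.

```shell
# Hypothetical helper: list lines in a file that contain tabs, trailing
# whitespace, or merge conflict markers. Follows grep's exit convention:
# 0 when at least one issue is found, 1 when the file is clean.
find_line_issues() {
  tab=$(printf '\t')
  grep -nE "${tab}|[[:space:]]+\$|^(<<<<<<<|=======|>>>>>>>)" -- "$1"
}
```

For example, `find_line_issues scripts/hackathon-demo.sh` prints each offending line with its line number and prints nothing for a clean file.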

Run logs: https://github.com/topherchris420/james_library/actions/runs/24611469499

Detected blocking line issues (sample):

  • none

Detected advisory line issues (sample):

  • none

github-actions bot added the labels size: XS (Auto size: <=80 non-doc changed lines.), risk: low (Auto risk: docs/chore-only paths.), and distinguished contributor (Contributor with 50+ merged PRs.) Apr 17, 2026

@devin-ai-integration devin-ai-integration bot left a comment

Devin Review found 3 potential issues.

Comment thread scripts/hackathon-demo.sh Outdated
Comment thread scripts/hackathon-demo.sh
Comment thread scripts/hackathon-demo.sh
@@ -0,0 +1,54 @@
#!/bin/bash

🟡 Script has shebang but no execute permission (mode 644)

The file is committed with mode 100644 (not executable), but has #!/bin/bash on line 1. Running ./scripts/hackathon-demo.sh will fail with "Permission denied". Users must use bash scripts/hackathon-demo.sh instead, which defeats the purpose of the shebang.

Prompt for agents
The file scripts/hackathon-demo.sh was committed as 100644 (not executable). Since it has a #!/bin/bash shebang and is intended to be run directly, it should be committed with the executable bit set. Fix with: git update-index --chmod=+x scripts/hackathon-demo.sh and recommit.
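
A minimal sketch of how the permission problem can be checked and fixed in a working tree. The helper names `can_run_directly` and `mark_executable` are hypothetical and only illustrate the idea; the `git update-index` command quoted above is what records the mode change in the index.

```shell
# Return 0 when the path is a regular file that can be run as ./path.
can_run_directly() {
  [ -f "$1" ] && [ -x "$1" ]
}

# Set the executable bit on the working-tree copy of the file.
# This changes the file on disk only; Git still needs the mode change
# staged (for example with the git update-index command from the review).
mark_executable() {
  chmod +x "$1"
}
```

A typical sequence would be `mark_executable scripts/hackathon-demo.sh`, then `git update-index --chmod=+x scripts/hackathon-demo.sh`, then a new commit.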

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 32c1fddad8

Comment thread scripts/hackathon-demo.sh Outdated
echo ""

echo "Checking Ollama connection..."
curl -s -o /dev/null -w "Status: %{http_code}\n" "$OLLAMA_HOST/api/tags" || {

P1: Fail fast on HTTP errors in Ollama connectivity check

The connectivity gate only checks curl transport failure via ||, but this invocation does not fail on HTTP error status codes, so 404/500 responses still pass and the script proceeds as if Ollama were healthy. In environments where the daemon is up but unhealthy (or the endpoint is proxied/misconfigured), this causes the demo flow to continue into misleading downstream failures instead of stopping at startup.
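
One possible shape for a stricter gate, offered as a sketch rather than the script's actual fix: capture the status code that curl reports and stop unless it is 2xx. The function names `is_success_status` and `check_ollama` and the 5-second timeout are assumptions; the `/api/tags` endpoint and the default host come from the quoted script.

```shell
# Return 0 only for 2xx HTTP status codes.
is_success_status() {
  case "$1" in
    2[0-9][0-9]) return 0 ;;
    *) return 1 ;;
  esac
}

# Hypothetical gate: fail when Ollama is unreachable or answers non-2xx.
check_ollama() {
  host="${1:-http://127.0.0.1:11434}"
  # With -w '%{http_code}', curl reports 000 when no HTTP response arrived.
  status=$(curl -s -o /dev/null -w '%{http_code}' --max-time 5 "$host/api/tags") || status="000"
  echo "Status: $status"
  if ! is_success_status "$status"; then
    echo "Ollama check failed at $host (HTTP $status)" >&2
    return 1
  fi
}
```

The script could then call `check_ollama "$OLLAMA_HOST" || exit 1` at startup. Passing `--fail` to curl is a shorter alternative when the status code does not need to be printed.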

Comment thread scripts/hackathon-demo.sh
echo "--- JAMES (Lead Scientist - DEFENDS paper) ---"
RESPONSE=$(curl -s -X POST "$OLLAMA_HOST/api/generate" \
-d "{\"model\": \"$MODEL\", \"prompt\": \"You are James, Lead Scientist at Vers3Dynamics R.A.I.N. Lab. DEFEND the paper '$PAPER_TITLE' by $AUTHOR. The core claim: physical reality consists of a discrete state space updated by 5 fundamental geometric rules from which relativistic quantum fields and spacetime emerge. Make a strong case citing: (1) emergence of Dirac equation from chirality propagation on a 5-edge lattice, (2) testable predictions: gamma-ray burst dispersion bounds, 450 qubit decoherence threshold, (3) falsifiability. Be confident, specific, punchy. Under 150 words.\", \"stream\": false}")
echo "$RESPONSE" | python3 -c "import sys,json; print(json.load(sys.stdin).get('response','ERROR'))"

P2: Exit non-zero when /api/generate returns an error payload

This parser treats missing response as a printable fallback ("ERROR") and still exits successfully, so when Ollama returns an error object (for example model-not-found), run_james reports success and all mode continues. That masks failed generations in automation/demo runs and makes it harder to detect broken model setup; the function should fail when error is present or response is absent.
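
A sketch of a stricter parser, assuming the error payload is a JSON object with an `error` field as the comment describes. `extract_response` is a hypothetical helper name; it keeps the script's `python3` approach but exits non-zero instead of printing a fallback string.

```shell
# Hypothetical helper: read an /api/generate reply on stdin, print the
# generated text, and exit non-zero when the reply is unusable.
extract_response() {
  python3 -c '
import json
import sys

try:
    payload = json.load(sys.stdin)
except json.JSONDecodeError as exc:
    sys.exit("invalid JSON from Ollama: " + str(exc))

if not isinstance(payload, dict):
    sys.exit("unexpected payload type from Ollama")
if payload.get("error"):
    sys.exit("Ollama error: " + str(payload["error"]))

text = payload.get("response")
if not text:
    sys.exit("Ollama reply has no response text")
print(text)
'
}
```

Inside `run_james`, the last line could become `echo "$RESPONSE" | extract_response || return 1`, so a failed generation stops the `all` mode instead of printing "ERROR" and continuing.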

Comment thread scripts/hackathon-demo.sh Outdated
OLLAMA_HOST="http://127.0.0.1:11434"

PAPER_TITLE="Reality Built on a Finite Set of Geometric Instructions"
AUTHOR="Christopher Woodyard"

P1: Replace hard-coded real-person identifier in demo script

The repository policy in AGENTS.md (section 9.1, Privacy/Sensitive Data and Neutral Wording) explicitly disallows committing real personal identifiers in code/docs/examples, but this line hard-codes a full personal name into runtime prompts. That creates avoidable privacy/compliance risk because the value is emitted in generated outputs and demo logs/screenshots.
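
One way to address this, sketched under assumptions: read the author label from the environment and fall back to a neutral, project-native placeholder. The variable name `DEMO_AUTHOR`, the helper `resolve_author`, and the placeholder text are all hypothetical and not taken from the PR or from AGENTS.md.

```shell
# Hypothetical helper: print the author label used in prompts.
# DEMO_AUTHOR is an assumed environment variable; when it is unset or
# empty, a neutral project-native label is used instead of a real name.
resolve_author() {
  printf '%s\n' "${DEMO_AUTHOR:-a R.A.I.N. Lab author}"
}

AUTHOR="$(resolve_author)"
```

With this in place, nothing identifying is committed, and a presenter can still run `DEMO_AUTHOR="..." ./scripts/hackathon-demo.sh` to supply a name at demo time.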

Labels

  • distinguished contributor (Contributor with 50+ merged PRs.)
  • risk: low (Auto risk: docs/chore-only paths.)
  • scripts (Auto scope: scripts/** changed.)
  • size: XS (Auto size: <=80 non-doc changed lines.)
