Add learning exploit guardrails
This commit is contained in:
@@ -888,7 +888,7 @@ Target deliverable: A small group can join a server, spawn into one biome, gathe
|
|||||||
- [x] Define the first subject content format: topic, concepts, difficulty tier, prerequisite concepts, in-game effect, practice action, and source note. Added a compact subject content schema to `Docs/KnowledgeAndSkillFoundation.md` with `topic`, `concepts`, `difficulty_tier`, `prerequisite_concepts`, `in_game_effect`, `practice_action`, and `source_note`, plus a fire-clearance example suitable for later data assets.
|
- [x] Define the first subject content format: topic, concepts, difficulty tier, prerequisite concepts, in-game effect, practice action, and source note. Added a compact subject content schema to `Docs/KnowledgeAndSkillFoundation.md` with `topic`, `concepts`, `difficulty_tier`, `prerequisite_concepts`, `in_game_effect`, `practice_action`, and `source_note`, plus a fire-clearance example suitable for later data assets.
|
||||||
- [x] Add a small MVP question bank for elementary survival knowledge. Added six elementary survival question records to `Docs/KnowledgeAndSkillFoundation.md` for fire clearance, potable water, cold exposure, shelter drainage, bleeding care, and fiber identification, each with answer options, correct answer, and practical feedback.
|
- [x] Add a small MVP question bank for elementary survival knowledge. Added six elementary survival question records to `Docs/KnowledgeAndSkillFoundation.md` for fire clearance, potable water, cold exposure, shelter drainage, bleeding care, and fiber identification, each with answer options, correct answer, and practical feedback.
|
||||||
- [x] Define when deeper questions should matter: quality improvements, safer work, complex crafting, teaching others, and advanced branches. Added deeper-question timing rules to `Docs/KnowledgeAndSkillFoundation.md`, keeping advanced checks out of the first survival loop while tying them to quality improvements, safer work, complex crafting, teaching others, and advanced branches.
|
- [x] Define when deeper questions should matter: quality improvements, safer work, complex crafting, teaching others, and advanced branches. Added deeper-question timing rules to `Docs/KnowledgeAndSkillFoundation.md`, keeping advanced checks out of the first survival loop while tying them to quality improvements, safer work, complex crafting, teaching others, and advanced branches.
|
||||||
- [ ] Add design notes for avoiding exploit farming and rote memorization.
|
- [x] Add design notes for avoiding exploit farming and rote memorization. Added exploit and rote-memorization guardrails to `Docs/KnowledgeAndSkillFoundation.md`, covering diminishing returns, meaningful action context, repeated prompt suppression, recovery-only credit, learned-concept separation, and varied wording to reward decisions and practice instead of low-cost repetition.
|
||||||
- [ ] Add persistence requirements for knowledge, skill experience, learned concepts, failed attempts, and tutorial state.
|
- [ ] Add persistence requirements for knowledge, skill experience, learned concepts, failed attempts, and tutorial state.
|
||||||
- [ ] Add multiplayer rules for teaching, observation, shared work, and group skill benefits.
|
- [ ] Add multiplayer rules for teaching, observation, shared work, and group skill benefits.
|
||||||
|
|
||||||
|
|||||||
@@ -509,3 +509,30 @@ Advanced branches:
|
|||||||
Depth rule: deeper questions should improve mastery and responsibility. They
|
Depth rule: deeper questions should improve mastery and responsibility. They
|
||||||
should not be busywork for actions the character has already demonstrated
|
should not be busywork for actions the character has already demonstrated
|
||||||
through repeated safe practice.
|
through repeated safe practice.
|
||||||
|
|
||||||
|
## Exploit And Rote-Memorization Guardrails
|
||||||
|
|
||||||
|
The learning system should reward understanding and practice, not repetitive
|
||||||
|
input farming or memorizing answer order.
|
||||||
|
|
||||||
|
Exploit farming risks:
|
||||||
|
|
||||||
|
- Repeating the same safe action for unlimited experience.
|
||||||
|
- Creating harmless failures just to farm mistake feedback.
|
||||||
|
- Spamming prompts or questions for repeated rewards.
|
||||||
|
- Using alternate accounts or group work to duplicate teaching credit.
|
||||||
|
- Performing actions with no meaningful cost, context, or outcome.
|
||||||
|
|
||||||
|
Guardrails:
|
||||||
|
|
||||||
|
- Apply diminishing returns to identical repeated actions.
|
||||||
|
- Require meaningful context for experience: time, resource cost, risk,
|
||||||
|
environmental variation, or consequence.
|
||||||
|
- Track recent prompt/question exposure and suppress repeated rewards.
|
||||||
|
- Give recovery credit only when the player takes a useful corrective action.
|
||||||
|
- Separate "learned concept" from "perfect mastery" so one answer does not solve
|
||||||
|
every future situation.
|
||||||
|
- Prefer concept families and varied wording over fixed answer-order memorizing.
|
||||||
|
|
||||||
|
Anti-exploit rule: rewards should come from meaningful decisions, varied
|
||||||
|
practice, and good recovery, not from low-cost repetition.
|
||||||
|
|||||||
@@ -0,0 +1,40 @@
|
|||||||
|
#!/usr/bin/env python3
|
||||||
|
"""Verify exploit-farming and rote-memorization guardrails are documented."""
|
||||||
|
|
||||||
|
from pathlib import Path
|
||||||
|
|
||||||
|
|
||||||
|
ROOT = Path(__file__).resolve().parents[1]
|
||||||
|
DOC = ROOT / "Docs" / "KnowledgeAndSkillFoundation.md"
|
||||||
|
ROADMAP = ROOT / "AGRARIAN_DEVELOPMENT_ROADMAP.md"
|
||||||
|
|
||||||
|
REQUIRED = {
|
||||||
|
DOC: [
|
||||||
|
"## Exploit And Rote-Memorization Guardrails",
|
||||||
|
"Exploit farming risks:",
|
||||||
|
"Guardrails:",
|
||||||
|
"diminishing returns",
|
||||||
|
"meaningful context",
|
||||||
|
"varied wording",
|
||||||
|
"Anti-exploit rule:",
|
||||||
|
],
|
||||||
|
ROADMAP: [
|
||||||
|
"[x] Add design notes for avoiding exploit farming and rote memorization.",
|
||||||
|
],
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
|
def main() -> None:
|
||||||
|
missing: list[str] = []
|
||||||
|
for path, snippets in REQUIRED.items():
|
||||||
|
text = path.read_text(encoding="utf-8")
|
||||||
|
for snippet in snippets:
|
||||||
|
if snippet not in text:
|
||||||
|
missing.append(f"{path.relative_to(ROOT)} missing {snippet!r}")
|
||||||
|
if missing:
|
||||||
|
raise SystemExit("FAILED: " + "; ".join(missing))
|
||||||
|
print("OK: learning exploit guardrails are documented.")
|
||||||
|
|
||||||
|
|
||||||
|
if __name__ == "__main__":
|
||||||
|
main()
|
||||||
Reference in New Issue
Block a user