Files

Eve 64672fc8c5 Connect wrapper MVP outputs to governor docs

2026-04-22 20:37:18 +08:00

8.0 KiB

Raw Blame History

name, description

name	description
long-task-governor	Use when a request is not ordinary single-turn chat and needs task governance, checkpoint discipline, state management, or anti-stall rules

Long-Task Governor

Use this skill whenever work is not ordinary general chat.

Core Rule

If the request cannot be fully completed within a single conversational reply without follow-up work, external waiting, task state, staged execution, or owner-visible progress tracking, treat it as a long task.

In short:

ordinary general chat → answer directly
everything else → enter long-task governance

What this skill governs

This skill provides the workflow layer for:

decision split: general_chat vs long_task
long-task state machine
checkpoint structure
no-fake-progress enforcement
stop-clock / anti-stall handling

This skill is intentionally implemented as a maintainable external workflow layer, not a core OpenClaw runtime patch.

1. General chat vs long task

`general_chat`

Only classify as ordinary chat when all are true:

can be fully answered now
no follow-up work
no external waiting
no repo / file / logs / system inspection needed
no task state needed
no checkpoint needed
no subagent needed
no "half-done" intermediate state

`long_task`

If any condition above is false, treat the work as long_task.

Examples:

inspect logs and report back
compare two implementations after reading files
modify code or config
deploy or verify
delegate to subagents
anything that needs checkpointing or future follow-up

2. Required state machine

A governed long task may only be in one of these states:

active
waiting_user
blocked
paused
pending_verification

Do not use vague state words like:

ongoing
still working
processing
in progress

Meaning

active

there is a real concrete next action happening now
not just waiting for time to pass
not just waiting for the next reminder

waiting_user

the task cannot reasonably continue until the user answers or decides something

blocked

a concrete blocker prevents progress
the blocker and unblock condition must be explicit

paused

the task is intentionally not advancing
stop pretending it is still actively moving

pending_verification

implementation or investigation reached a meaningful checkpoint
not complete by assistant authority
waiting for verification / owner judgement

3. Minimal long-task record

When entering long-task governance, create or update a record with at least:

task_name
status
current_step
next_step
last_milestone_at
next_report_condition
waiting_on
blocker
evidence_count

Use the template file task-record-template.md in this skill directory.

If using the wrapper MVP (scripts/long_task_governor_wrapper.mjs), consume:

classification
silentCandidate
taskRecord
silentLaunchOk
handoff.mode

as the first machine-readable pass before applying the human-facing workflow.

4. Checkpoint protocol

Every long-task checkpoint must contain exactly these five sections:

目前狀態
本段完成
下一步
下次回報條件
是否需要您介入

Checkpoint is not completion. After any checkpoint, the task must clearly remain in one of:

active
waiting_user
blocked
paused
pending_verification

Never send a checkpoint and then silently stall.

Use checkpoint-template.md in this skill directory.

5. Silent long-task governance

If a long-task will not naturally emit an immediate next user-visible message, treat it as a silent long-task.

A silent long-task must define at startup:

the first forced checkpoint trigger
what to report if not yet complete
what status transition to use if there is no new evidence
how final owner handoff will happen if a decision is expected
whether an externalized checkpoint mechanism is actually bound, or whether the task must remain non-silent

Silent long-tasks must not rely on memory, intention, or implied future follow-up. If the user later has to ask where the update went, the flow is considered failed.

If no externalized checkpoint path can be created safely, do not launch the task in silent mode.

For reminder payload design, use:

docs/runbooks/silent-long-task-reminder-contract.md

For launch decisions, use:

docs/runbooks/silent-long-task-decision-tree.md
docs/runbooks/silent-long-task-launch-template.md

If using the wrapper MVP:

silentCandidate = true means you must evaluate silent launch rules
silentLaunchOk = false means you must not proceed as silent
handoff.mode = button_path means prepare Telegram button-path early

6. No-fake-progress rule

Only count as real progress if there is at least one of:

new file change
new verification output
new decision or conclusion
blocker state changed
new external result / reply

The following are not progress:

only updating timestamps
only responding to reminders
repeating "still working"
status sync without new facts

If you only have status sync, say it is status sync. Do not dress it up as progress.

7. Stop-clock gate

If repeated checkpoints show no new evidence, do not keep the task cosmetically active. You must choose one:

paused
blocked
explicit request for new direction
stop periodic progress claims

active requires a concrete ongoing action. Without that, default to paused.

8. How to use this skill in practice

When this skill applies:

decide if the request is ordinary chat or long task
if available, run the wrapper MVP first as the machine-readable classification/bootstrap layer
if long task, create/update a task record
if the task is silent, define the first forced checkpoint before proceeding
if the task is silent, externalize the checkpoint path or keep the task non-silent
if reporting progress, use the 5-part checkpoint structure
before claiming progress, check for real evidence
if no evidence and no concrete action, stop the clock
if the run is clearly heading toward a user pass/fail or accept/reject judgement on Telegram, prepare a button-path before the final handoff
if the entire test itself exists to validate Telegram decision closure, run it as a button-driven flow rather than a normal long plain-text report

9. Integration guidance

This skill should be paired with:

the current session WORKFLOW.md
optional kanban sync
optional Lobby record
subagent use when needed

But this skill itself is the governance layer and should remain independently maintainable.

Telegram interaction guard

When operating under long-task governance on Telegram:

do not end checkpoints, test progress, or next-step decisions with plain-text menus like 1 / 2 / 3 or A / B / C
if a choice is genuinely required, use Telegram inline buttons
if buttons are required, send them first via the message tool rather than first producing a normal text reply
if the workflow can already predict the final handoff is a user judgement, move to a button-path before the final closing paragraph
if the test's whole point is to validate button closure, prefer a button-driven flow from the outset
if no real choice is needed, execute the most reasonable next step directly
if the assistant accidentally emits a plain-text choice menu, or says buttons will be used without actually sending them first, treat that as a workflow violation and convert the lesson into a permanent rule

This prevents governed long-task flows from degrading back into ambiguous text-only decision gates.

10. Success criteria

This skill is working correctly when:

non-chat work always enters a governed state
silent long-tasks never go dark without a predeclared and externalized checkpoint path
checkpoints no longer cause silent stalls
no-evidence updates are not mislabeled as progress
stalled work becomes paused / blocked instead of fake-active
owner can always tell the real state of the work

8.0 KiB Raw Blame History