
About
Forensic root cause analyzer for Antigravity sessions. Classifies scope deltas, rework patterns, root causes, hotspots, and auto-improves prompts/health.
name: analyze-project description: Forensic root cause analyzer for Antigravity sessions. Classifies scope deltas, rework patterns, root causes, hotspots, and auto-improves prompts/health. risk: unknown source: community version: "1.0" tags: [analysis, diagnostics, meta, root-cause, project-health, session-review]
/analyze-project — Root Cause Analyst Workflow
Analyze AI-assisted coding sessions in ~/.gemini/antigravity/brain/ and produce a report that explains not just what happened, but why it happened, who/what caused it, and what should change next time.
Goal
For each session, determine:
- What changed from the initial ask to the final executed work
- Whether the main cause was:
- user/spec
- agent
- repo/codebase
- validation/testing
- legitimate task complexity
- Whether the opening prompt was sufficient
- Which files/subsystems repeatedly correlate with struggle
- What changes would most improve future sessions
When to Use
- You need a postmortem on AI-assisted coding sessions, especially when scope drift or repeated rework occurred.
- You want root-cause analysis that separates user/spec issues from agent mistakes, repo friction, or validation gaps.
- You need evidence-backed recommendations for improving future prompts, repo health, or delivery workflows.
Global Rules
- Treat
.resolved.Ncounts as iteration signals, not proof of failure - Separate human-added scope, necessary discovered scope, and agent-introduced scope
- Separate agent error from repo friction
- Every diagnosis must include evidence and confidence
- Confidence levels:
- High = direct artifact/timestamp evidence
- Medium = multiple supporting signals
- Low = plausible inference, not directly proven
- Evidence precedence:
- artifact contents > timestamps > metadata summaries > inference
- If evidence is weak, say so
Step 0.5: Session Intent Classification
Classify the primary session intent from objective + artifacts:
DELIVERYDEBUGGINGREFACTORRESEARCHEXPLORATIONAUDIT_ANALYSIS
Record:
session_intentsession_intent_confidence
Use intent to contextualize severity and rework shape. Do not judge exploratory or research sessions by the same standards as narrow delivery sessions.
Step 1: Discover Conversations
- Read available conversation summaries from system context
- List conversation folders in the user’s Antigravity
brain/directory - Build a conversation index with:
conversation_idtitleobjectivecreatedlast_modified
- If the user supplied a keyword/path, filter to matching conversations; otherwise analyze all
Output: indexed list of conversations to analyze.
Step 2: Extract Session Evidence
For each conversation, read if present:
Core artifacts
task.mdimplementation_plan.mdwalkthrough.md
Metadata
*.metadata.json
Version snapshots
task.md.resolved.0 ... Nimplementation_plan.md.resolved.0 ... Nwalkthrough.md.resolved.0 ... N
Additional signals
- other
.mdartifacts - timestamps across artifact updates
- file/folder/subsystem names mentioned in plans/walkthroughs
- validation/testing language
- explicit acceptance criteria, constraints, non-goals, and file targets
Record per conversation:
Lifecycle
has_taskhas_planhas_walkthroughis_completedis_abandoned_candidate= task exists but no walkthrough
Revision / change volume
task_versionsplan_versionswalkthrough_versionsextra_artifacts
Scope
task_items_initialtask_items_finaltask_completed_pctscope_delta_rawscope_creep_pct_raw
Timing
created_atcompleted_atduration_minutes
Content / quality
objective_textinitial_plan_summaryfinal_plan_summaryinitial_task_excerptfinal_task_excerptwalkthrough_summarymentioned_files_or_subsystemsvalidation_requirements_presentacceptance_criteria_presentnon_goals_presentscope_boundaries_presentfile_targets_presentconstraints_present
Step 3: Prompt Sufficiency
Score the opening request on a 0–2 scale for:
- Clarity
- Boundedness
- Testability
- Architectural specificity
- Constraint awareness
- Dependency awareness
Create:
prompt_sufficiency_scoreprompt_sufficiency_band= High / Medium / Low
Then note which missing prompt ingredients likely contributed to later friction.
Do not punish short prompts by default; a narrow, obvious task can still have high sufficiency.
Step 4: Scope Change Classification
Classify scope change into:
- Human-added scope — new asks beyond the original task
- Necessary discovered scope — work required to complete the original task correctly
- Agent-introduced scope — likely unnecessary work introduced by the agent