extraction-form

Name: extraction-form
Rating: 70
Author: WILLOSCAR

by WILLOSCAR

Research pipelines as semantic execution units: each skill declares inputs/outputs, acceptance criteria, and guardrails. Evidence-first methodology prevents hollow writing through structured intermediate artifacts.

⭐ 83🍴 10📅 Jan 24, 2026

claude claude-code codex gpt pipeline research research-paper research-project

View on GitHub Run in Manus

SKILL.md

name: extraction-form description: | Extract study data into a structured table (`papers/extraction_table.csv`) using the protocol’s extraction schema. Trigger: extraction form, extraction table, data extraction, 信息提取, 提取表. Use when: systematic review 在 screening 后进入 extraction（C3），需要把纳入论文按字段落到 CSV 以支持后续 synthesis。 Skip if: 还没有 `papers/screening_log.csv` 或 protocol 未锁定。 Network: none. Guardrail: 严格按 schema 填字段；不要在此阶段写 narrative synthesis（那是 `synthesis-writer`）。

Extraction Form (systematic review)

Goal: create a consistent, analysis-ready extraction table that is directly grounded in the protocol.

Inputs

Required:

papers/screening_log.csv
output/PROTOCOL.md

Optional:

papers/paper_notes.jsonl (if you already have structured notes)

Outputs

papers/extraction_table.csv

Workflow

Determine the included set
- From papers/screening_log.csv, collect all rows with decision=include.
Build/confirm the schema
- Use the extraction schema defined in output/PROTOCOL.md.
- If the protocol does not define fields yet, stop and update output/PROTOCOL.md first.
Populate papers/extraction_table.csv
- One row per included paper.
- If papers/paper_notes.jsonl exists, use it as a structured source for values/provenance (but keep the table schema governed by output/PROTOCOL.md).
- Always include provenance columns:
  - paper_id, title, year, url
- For each protocol-defined field:
  - fill concrete values (units explicit)
  - use an explicit sentinel for unknowns (recommended: empty cell + notes)
Keep it auditable
- If a value is inferred (not directly stated), mark it in a notes column.
- Do not write synthesis; only extraction.
Quick QA
- Ensure 1:1 coverage: included papers == extraction rows.
- Spot-check a few rows against the paper text/notes.

Definition of Done

papers/extraction_table.csv exists.
Every included paper from papers/screening_log.csv has exactly one extraction row.
Column meanings match output/PROTOCOL.md (no ad-hoc columns without updating the protocol).

Troubleshooting

Issue: the protocol does not specify extraction fields

Fix:

Update output/PROTOCOL.md (extraction schema section) and re-run extraction.

Issue: extraction table mixes narrative text with fields

Fix:

Move narrative into a notes column and keep the rest as atomic values (numbers/enums/short strings).

Score

Total Score

70/100

Based on repository quality metrics

✓SKILL.md

SKILL.mdファイルが含まれている

+20

○LICENSE

ライセンスが設定されている

0/10

✓説明文

100文字以上の説明がある

+10

○人気

GitHub Stars 100以上

0/15

○最近の活動

3ヶ月以内に更新がある

0/10

✓フォーク

10回以上フォークされている

✓Issue管理

オープンIssueが50未満

✓言語

プログラミング言語が設定されている

✓タグ

1つ以上のタグが設定されている

Reviews

💬

Reviews coming soon

extraction-form

SKILL.md

Extraction Form (systematic review)

Inputs

Outputs

Workflow

Definition of Done

Troubleshooting

Issue: the protocol does not specify extraction fields

Issue: extraction table mixes narrative text with fields

Score

Reviews

prompt-lookup

skill-lookup

changelog-automation

web-component-design

dbt-transformation-patterns

market-sizing-analysis

extraction-form

SKILL.md

Extraction Form (systematic review)

Inputs

Outputs

Workflow

Definition of Done

Troubleshooting

Issue: the protocol does not specify extraction fields

Issue: extraction table mixes narrative text with fields

Score

Reviews

Related

Related Skills

prompt-lookup

skill-lookup

changelog-automation

web-component-design

dbt-transformation-patterns

market-sizing-analysis