Validierungsprotokoll

Verständliche Validierungslage für Reviewerinnen und Teilnehmende (gleiches Dokument wie im öffentlichen Repository).

# PCMS validation protocol

**Audience:** researchers, institutional reviewers, and funders evaluating the Perceptual & Cognitive Mapping System (PCMS).

This document states what psychometric and cross-cultural evidence **exists today**, what **does not**, and what a **minimum credible path** to publication-grade validity looks like. It is intended to sit alongside technical specifications (e.g. confidence model, item banks) without overstating claims.

For a complementary technical roadmap (IRT, CFA, larger samples, item-level calibration), see [`VALIDATION_ROADMAP.md`](./VALIDATION_ROADMAP.md).

---

## Current status (be honest)

### Scoring weights

Dimension scores and routing weights are **derived from literature and expert design**, not from **empirical calibration** on a dedicated PCMS response corpus. That is a normal starting point for a new dimensional instrument, but it must be stated plainly: **weights are hypotheses to be tested**, not parameters estimated from this instrument’s own normative data.

### Cultural adaptation

The Ghana question bank was developed with **cultural input** in item wording and framing. It has **not** been validated against a **local normative sample** (Ghana or West Africa) with reported reliability, structure, and score comparability. Until such work exists, Ghana deployments should be described as **culturally informed**, not **locally normed**.

### Reliability

**Internal consistency** (e.g. Cronbach’s α, McDonald’s ω) has **not** been computed from **real PCMS response data** at scale and reported per dimension for each locale. Session-level “confidence” in the app is **not** a substitute for sample-based reliability reporting.

### Predictive validity

**Predictive validity** (outcomes, behaviors, or external criteria beyond self-report) has **not** been tested in peer-reviewed or pre-registered PCMS studies. Any informal correlations users draw are **exploratory**, not validated claims of the system.

---

## Validation roadmap (Phase 1 — minimum for publication)

These steps describe a **Phase 1** bar that many journals and funders would still consider incomplete for high-stakes use, but that moves PCMS from “prototype” toward **evidence-backed reporting**.

1. **Pilot study:** **N = 150 adults per locale** (English, German, Ghana-oriented deployment), **convenience sampling**, with documented inclusion criteria, consent, and data handling.
2. **Reliability:** Compute **Cronbach’s α** and **McDonald’s ω** per **dimension** (and report confidence intervals where appropriate).
3. **Test–retest:** **Two-week** interval, **N = 40 per locale**, with clear instructions not to rehearse items; report stability coefficients and interpret attrition.
4. **Convergent validity:** Correlate PCMS dimension scores with **established instruments** where theory predicts overlap — e.g. **BIS/BAS** (or related) for social/approach–avoidance facets linked to the **E** dimension, **HSPS** (or validated short forms) for **S** (sensory sensitivity), plus additional constructs mapped a priori. Pre-register hypotheses and comparison rules.
5. **Local norms:** Publish **mean and SD per dimension per locale** (and ideally percentiles once N permits), with explicit warnings if variances differ across locales.

Phase 1 does **not** by itself justify cross-locale score interchangeability; that requires **measurement invariance** work (beyond this checklist).

---

## What the system CANNOT currently claim

- That scores **identify children who would benefit from support** (clinical, educational, or therapeutic decisions).
- That **Ghana scores are directly comparable** to German or English scores without invariance and local norm evidence.
- That the **adaptive stopping rule** is **calibrated** to this population’s information rates or decision thresholds (it is engineered for coverage and efficiency under stated assumptions, not empirically tuned to validated stopping rules for each locale).

---

## What a teacher or facilitator needs that the current UI does not provide

PCMS is oriented toward **self-understanding and research**, not classroom placement. Responsible facilitation would still require:

- **A translation layer:** dimension score → **plain-language observation guide** (what might show up in behavior or preference, without labels or “diagnosis”).
- **A threshold guide:** explicit, evidence-backed **decision rules** only after norms exist — e.g. *“If S and R exceed local T90, consider environmental adjustments such as …”* — with **uncertainty** and **local validation** stated; the product must not imply precision it does not have.
- **An observer instrument for children under ~12:** **parent or teacher rating** (or structured observation), not sole child self-report, aligned to the same dimensions where feasible, with its own pilot and reliability.

---

## Revision

This protocol should be updated when each roadmap milestone is completed, with version, date, and a short evidence summary (sample, locale, analysis, and limitations).