Conformance

Conformance snapshot, methodology, and current upstream-derived coverage for Ferrocat.

Conformance

Ferrocat's conformance work answers a simple product question: when a catalog uses real-world PO or ICU behavior, do we match the ecosystem expectations that users already depend on?

The repository carries a hermetic conformance snapshot under repository conformance snapshot, with case definitions in case definitions.

Phase 1 intentionally excludes GNU gettext. The current snapshot uses:

izimobil/polib as the primary PO edge-case baseline
rubenv/pofile as a secondary JS-oriented PO cross-check
Babel as a targeted PO supplement
the official ICU MessageFormat tests as the parser reference for ferrocat-icu
FormatJS/messageformat syntax examples as representative ICU ecosystem cases

Current Counts

Current snapshot totals as of 2026-05-12:

60 source-attributed conformance cases
454 concrete assertions checked by the harness
55 expected passes
5 expected rejects
0 documented known_gap cases

Per suite:

po-pofile: 30 cases / 301 assertions
po-polib: 12 cases / 88 assertions
po-babel: 5 cases / 32 assertions
icu-official: 11 cases / 29 assertions
icu-ecosystem: 2 cases / 4 assertions

The case count tracks individually addressable upstream-derived scenarios. The assertion count tracks the concrete field- and structure-level comparisons performed by the harness, which is the better number to use when communicating weight and breadth.

Small structured expectations now live inline in the Rust case definitions next to each case. External files are kept mainly for realistic upstream inputs and full rendered outputs such as roundtrip or merge snapshots.

Snapshot Scope

po-polib: comment ordering, UTF-8 BOM handling, strict invalid quoting rejects, wrapping, merge semantics, and merge output parsing
po-pofile: multiline values, structured references, comments, contexts, obsolete entries, C-string escapes, normalized headerless roundtrip behavior, and Plural-Forms
po-babel: unknown locale roundtrip, irregular multiline msgstr, and enclosed location parsing with structured references
icu-official: simple arguments, plural/selectordinal, nested tags, skeleton formatters, nested select/plural, apostrophe escaping, and parser-visible failure cases
icu-ecosystem: rich text with nested placeholders and select messages with nested formatters

Local Coverage Mapping

Existing local tests still provide broad regression coverage in:

parse, serialize, merge, and api behavior inside ferrocat-po
parser and utility behavior inside ferrocat-icu

The conformance layer is intentionally narrower and source-attributed. It exists to answer a different question: whether Ferrocat matches independently maintained reference behavior on representative upstream cases.

Scoreboard

Use:

cargo test --workspace
cargo run -p ferrocat-bench -- conformance-report

The report prints totals per suite and capability, broken down into pass, reject, and known_gap.

It also prints assertion totals, so we can talk about both "how many source-attributed cases" and "how many concrete checks" without inflating fixture counts.

Known gaps are counted and documented, but they do not fail CI. The current snapshot has 0.

Headerless PO files are not treated as a gap. ferrocat-po intentionally normalizes them on write by emitting an explicit empty header entry.

Not every upstream-derived behavior is treated as a desired future target. previous_msgid history from traditional gettext merge workflows is intentionally out of scope and therefore not counted as a known_gap.

Phase 1 Exclusion

GNU gettext is not part of the phase 1 scoreboard. The main reason is repository hygiene: its tests are powerful, but much harder to adopt hermetically without either GPL test vendoring or a much heavier adaptation layer. The current snapshot is intentionally built from MIT/BSD/Unicode-licensed sources first.