NEWS


effectcheck 0.6.2

Exact binomial test reported with Cohen's h. New test_type = "binomial" matched via pat_binom_h, anchored on a "binomial p [op] " clause followed (within ~80 non-period chars) by "Cohen('s)? h = ". When a " out of " clause is present in the same verbatim, N is recovered (N_source = "binom_n_out_of_N") and check.R re-computes the two-sided binomial p via stats::binom.test() assuming p_null = 0.5 (the most common null in binomial-vs-chance reporting); the recomputed vs reported delta appears in uncertainty_reasons. When N isn't recoverable, status routes to NOTE -- the Cohen's h is accepted as reported.

Surfaced by the 2026-05-25 escicheck-iterate corpus expansion against the CRSP decoy-effect papers (Xiao/Zeng/Feldman 2021 et al), where 2-5 binomial-with-h rows previously fell through to WEAK_GOLD or OUT_OF_SCOPE. The NOTE-only template (LESSONS.md "NOTE-only test_type template") was extended cleanly: parse layer adds the pattern + dispatch branch, check.R adds a tt == "binomial" branch with conditional recompute. A v0.6.3 follow-up could detect a stated null proportion ("vs 1/3 chance" etc.) to replace the p_null = 0.5 default.

Regression tests in tests/testthat/test-v062-binomial-h.R (7 cases: full CRSP verbatim with N recovery, bare binomial+h with N=NA NOTE, 80-char-lookahead far-apart rejection, "h" without "binomial p" anchor guard, chisq+h still routes to chisq, lowercase "cohen h" form, and uncertainty-message contents when N is recovered).

effectcheck 0.6.1

Bare t = X, p [op] Y (no df) extraction. Surfaced by the Lee-Feldman 2025 RSOS Newman-2014 RR replication during the 2026-05-25 escicheck-iterate corpus expansion (24 occurrences in one paper's Tables 10-15: compact <label> M = m (sd), t = X, p < .001 form where df lives only in the table header, not the immediate sentence). Before v0.6.1 such reports returned 0 rows from check_text().

A new pat_t_p_nodf pattern matches t = X followed within ~80 chars by a p [<=>] clause; (?<![a-zA-Z]) keeps dt =, pt =, etc. from false-positive matching, and the 80-char lookahead bound prevents a stray t = X from being yoked to an unrelated downstream p = in long prose. df1 stays NA — check.R routes to status NOTE because the exact p-check needs df. Dispatch position: AFTER pat_t_nodf (t = X, df = Y form keeps priority and yields status=OK with full verification when df is present).

Regression tests in tests/testthat/test-v061-bare-t-p-nodf.R.

effectcheck 0.6.0

Clinical-trial RR / rdpct / md_hl independent verification, completing the v0.5.16/17/18 PROSECCO-trial test-type set. Closes the deferred v0.6.x follow-through promised in the v0.5.16-18 NEWS entries.

Verification (the v0.5.x NOTE rows now compute a comparison)

Regression tests in tests/testthat/test-v060-rr-rdpct-mdhl-verification.R. Closes the 2026-05-25-v06x-clinical-trial-compute-branches handoff.

effectcheck 0.5.18

Median-difference (Hodges-Lehmann) with IQR + CI (escicheck-iterate cycle 8). Completes the PLOS Med PROSECCO-trial PARSE-MISS punch-list opened in cycle 1.

New test type

effectcheck 0.5.17

Risk-difference percent with CI (escicheck-iterate cycle 7).

New test type

effectcheck 0.5.16

Clinical-trial risk ratio with two-proportion slash counts (escicheck-iterate cycle 7).

New test type

effectcheck 0.5.15

Cochran Q meta-analytic heterogeneity test (escicheck-iterate cycle-5, after user scope decision 2026-05-24 to bring Q in-scope).

New test type

effectcheck 0.5.14

Two narrow parse fixes from the 2026-05-24 escicheck-iterate cycle-4 validation against the Collabra canary.

Parse fixes

effectcheck 0.5.12

Recall fix for the Collabra / APA partial-eta-squared convention.

Parse fixes

effectcheck 0.5.11

Documentation-only release. The design_ambiguous output flag has always combined two semantically distinct cases under one name; this release makes the distinction explicit and parseable without changing behaviour.

Documentation / output-string clarifications

effectcheck 0.5.10

Bare r = with a confidence interval — a parse fix found by escicheck-iterate.

Bug fixes

effectcheck 0.5.9

Chi-square chi^2 caret token — a parse fix found by escicheck-iterate.

Bug fixes

effectcheck 0.5.8

Chi-square bare-n sample size — a parse fix found by escicheck-iterate.

Bug fixes

effectcheck 0.5.7

DSCF (Dwass-Steel-Critchlow-Fligner) post-hoc W — a parse + categorisation fix found by escicheck-iterate.

Bug fixes

effectcheck 0.5.6

Bare regression-coefficient lines — a parse fix found by escicheck-iterate.

Bug fixes

effectcheck 0.5.5

JASP "nobs" sample-size token — a parse fix found by escicheck-iterate running effectcheck against the real-article AI gold corpus.

Bug fixes

effectcheck 0.5.4

Regression-coefficient handling — a categorisation fix found by escicheck-iterate running effectcheck against the real-article AI gold corpus.

Bug fixes

effectcheck 0.5.3

Scientific-notation p-values — a parse fix found by escicheck-iterate running effectcheck against the real-article AI gold corpus.

Bug fixes

effectcheck 0.5.2

Subscripted chi-square notation — a parse fix found by escicheck-iterate running effectcheck against the real-article AI gold corpus.

Bug fixes

effectcheck 0.5.1

Stage 1 validation fixes — four gaps found by validating the v0.5.0 Stage 1 coverage against six real articles (AI gold generated via the article-finder skill).

Bug fixes

effectcheck 0.5.0

Coverage Stage 1 — closes effect-size / test-type gaps from the 2026-05-16 coverage roadmap (P1, P2, P3, P6, P7).

New features

Internal

effectcheck 0.4.2

Bug fixes

effectcheck 0.4.1

Bug fixes

effectcheck 0.4.0

Breaking changes — extraction layer removed

All file-input functions are now .Defunct() and emit an error directing callers to extract via docpluck and pass the resulting text to check_text():

The pure-text-analysis API (check_text(), compute_and_compare_one(), the parsing layer, all effect-size and CI computations, and every output column) is unchanged.

The package no longer requires poppler-utils, tesseract, magick, or qpdf system dependencies. SystemRequirements field removed from DESCRIPTION; corresponding entries removed from Suggests.

Migration: see https://docpluck.app/api-docs for the API contract. Working R reference implementation in the ESCImate web-app repo at tests/scripts/docpluck_shootout.R.

New features (carried over from 0.3.6 deception-detection work)

API documentation

effectcheck 0.3.5

Addresses MetaESCI v0.3.5 request: CI-audit feature pack. Adds CI computation coverage for previously-uncomputable effect-size families (OR, R², standardized β, partial r, semi-partial r) and new per-row metadata for characterizing CI reporting quality at scale (precision tracking, completeness flags, level mismatch, bounded-parameter clipping, symmetry classification).

Purely additive — no v0.3.4 behavior changes.

Compute: CI computation coverage gaps closed

Parse: decimal-place precision tracking

Check: CI audit metadata (Phase 6)

Frontend (escimate.app)

effectcheck 0.3.4

Addresses MetaESCI v0.3.4 request: 42 Category A ERROR false positives where reported eta2/etap2 was cross-matched to cohens_f/cohens_f2 without detection.

Check: Phase 8D Signal 14 — eta/f cross-family detection (E11)

effectcheck 0.3.3

Follow-up to 0.3.2 addressing MetaESCI v0.3.3 request: the E8 pre-strip was a no-op on real docpluck output.

Parse: thousand-sep comma strip now handles spaces after comma (E8 follow-up)

effectcheck 0.3.2

Follow-up to 0.3.1 addressing MetaESCI requests E8 and E10.

Parse: thousand-separator commas in test-statistic parens (E8, HIGH)

Compute: Cohen's dz CI uses noncentral-t inversion (E10, MEDIUM)

Parse: decimal-comma no longer corrupts author affiliation markers

E9 — Smaller parse.R gaps (deferred, needs repro bundle)

effectcheck 0.3.1

This is a housekeeping release packaging the v0.3.0f → v0.3.0n bug-fix wave with a stable CRAN-style version number, batch-stdout hygiene, a schema stability test, and a new decision_error_reason diagnostic column. Addresses MetaESCI requests E1–E4 and E7.

DESCRIPTION version sync (E2)

Batch stdout: noncentral-t overflow spam silenced (E1)

Schema stability test (E3)

New column: decision_error_reason (E7)

Expected row-count delta vs v0.3.0f (E4 — MetaESCI batch guidance)

On the MetaESCI metaesci_regression 200-PDF frozen benchmark (seed 42), comparing v0.3.0f (last full batch) to v0.3.0n / 0.3.1:

| subset | v0.3.0f rows | v0.3.0n rows | delta | v0.3.0f ERRORs | v0.3.0n ERRORs | |----------------------|-------------:|-------------:|---------------:|---------------:|---------------:| | meta_psychology (139)| 464 | 464 | 0 | 0 | 0 | | metaesci_regression | 2,209 | 3,385 | +1,176 (+53%) | 13 | 0 |

The +53% row-count delta on metaesci_regression is driven by parser gains, not a config-default change (plausibility_filter and try_tables defaults are unchanged). The new rows come from:

Downstream consumers must re-derive all aggregate numbers from a fresh v0.3.1 batch — old v0.3.0f aggregates are not directly comparable. The 13 → 0 ERROR reduction on metaesci_regression is real (v0.3.0n's F ≈ 0 crash fix + multi-predictor-beta fix), not artefactual.

No columns were added or removed vs v0.3.0n other than the new decision_error_reason column described above.

effectcheck 0.3.0

Bug fixes (MetaESCI v0.3.0m batch deep-dive)

Bug fixes (MetaESCI batch validation)

Enhancements

Bug fixes

Bug fixes

Bug fixes and cleanup

Bug fixes and cleanup

Bug fixes and new features

Bug fixes

Parser fixes and artifact detection

Addresses 13 false positive ERRORs from MetaESCI v0.3.0c validation (132,537 results, 24 ERRORs). Expected: 24 -> ~10 ERRORs.

Bug fixes

New features

Tests


effectcheck 0.2.8

Design ambiguity improvements

Addresses 399 remaining ERRORs from MetaESCI v0.2.7 audit (132,499 results). Philosophy: compute ALL plausible alternatives under different design assumptions; if ANY alternative matches, downgrade severity.

New features

Bug fixes

Internal

effectcheck 0.2.7

Bug fixes and API improvements

Bug fixes

Documentation


effectcheck 0.2.6

Design ambiguity + decision error fixes

Based on MetaESCI analysis of 132,499 results from 8,415 articles. These changes reduce the ERROR false positive rate from ~3.9% to ~0.8%.

Design-ambiguous t-test downgrade (check.R)

Decision error requires reported p-value (check.R)

r-test global N guard (check.R)

API changes

effectcheck 0.2.5

PDF extraction quality improvements

Based on MetaESCI extraction analysis of 121,040 results from 8,415 PDFs across 7 journals. These changes reduce PDF extraction artifacts affecting statistical parsing from ~6.5% to ~0.6%.

Header/footer stripping (utils-pdf.R)

Dropped decimal recovery (parse.R)

General line-break joining (parse.R)

Standalone page number removal (parse.R)

Computation-guided decimal recovery (check.R — Phase 5B)

New columns

Tests

effectcheck 0.2.4

Validation-driven improvements

Based on comprehensive validation of 19,690 results across 7 journals (MetaESCI).

Bug fixes (Category A — 673 results)

Extraction guards (Category B — 41 PDF extraction artifacts)

New features

effectcheck 0.2.3 (2026-03-25)

New features

Bug fixes

API changes

effectcheck 0.2.2

Bug fixes

New features

effectcheck 0.2.1

Bug fixes

Improvements

CRAN-related

effectcheck 0.2.0

New features

Bug fixes

Parser improvements

effectcheck 0.1.0