How to Use ESCIMate
A guide to using the tool and interpreting results
Quick Start
Upload a PDF, HTML, or DOCX file — or paste text directly.
Click RUN AUDIT and wait for analysis. First request may take ~90s (server cold start).
Green = consistent, amber = review needed, red = discrepancy found.
Status Definitions
We recomputed this statistic and it matches what the paper reports. The effect size and p-value are consistent with each other.
No action needed. The numbers are consistent.
No effect size was reported for this test, so we could only check the p-value. It's consistent with the test statistic.
No action needed. P-value is consistent (no effect size to compare).
The result appears consistent, but we couldn't fully verify it — for example, the study design was unclear, which affects the exact effect size calculation.
Review the caveat if it matters for your analysis.
There is a moderate discrepancy between the reported and computed values, or the significance conclusion may change. This doesn't necessarily mean there's an error, but it deserves a closer look.
Worth double-checking the original calculation in the paper.
There is a large discrepancy between the reported and computed values. This could indicate a typo, copy-paste error, or calculation mistake in the paper.
Investigate the discrepancy — could be a typo, rounding, or calculation error.
This test statistic was found in the text, but no p-value or effect size was reported alongside it, so nothing could be checked.
Not verifiable. Consider reporting original values more completely.
Supported Test Types
tt-test: Compares the means of two groups to see if they differ significantly.FF-test (ANOVA): Tests whether the means of two or more groups are significantly different.rCorrelation: Measures the strength and direction of a linear relationship between two variables.chisqChi-square test: Tests whether observed counts differ from what you'd expect by chance.chi2Chi-square test: Tests whether observed counts differ from what you'd expect by chance.zz-test: Similar to a t-test, used with large samples or known population variance.UMann-Whitney U: Compares two groups without assuming data is normally distributed.WWilcoxon W: Compares paired observations without assuming normal distribution.HKruskal-Wallis H: Compares three or more groups without assuming normal distribution.regressionRegression: Tests whether a predictor variable is significantly related to an outcome.Effect Size Glossary
Known Limitations
- • Table-format statistics (not parsed from table structures)
- • Sign errors not detected (absolute value comparison by design)
- • Some multi-stat sentences: only first statistic captured (~2% of detections)
- • eta² from F actually computes partial eta² (total eta² requires SS, not available from F/df alone)
- • Welch t-test N estimation is approximate (depends on 4 unknowns)
- • Design-ambiguous t-tests may show WARN when paired/independent is unclear
Frequently Asked Questions
Reporting Bugs
Found an issue? Please report it on GitHub Issues.
Include: the input text or PDF, the expected result, and what ESCIMate reported.