PREreview of EpiLink: a simulation-based compatibility model for genomic transmission clustering in infectious disease surveillance
- Published
- DOI: 10.5281/zenodo.20785870
- License: CC BY 4.0
Major Issues
The “threshold-free” framing needs qualification. EpiLink avoids fixed SNP-distance thresholds, but the final clusters still depend on a sparsification threshold and Leiden resolution parameter. The authors should rephrase this as “not dependent on fixed genetic-distance thresholds” and provide clearer guidance on how users should choose or vary graph parameters in real surveillance analyses.
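To illustrate why the graph parameters matter, here is a minimal sketch of a sparsification sweep. It is not EpiLink's implementation: connected components (via union-find) stand in for Leiden community detection so the example needs only the standard library, and the function name, pair scores, and cut-offs are all hypothetical.

```python
def components_after_sparsification(n_cases, scored_pairs, min_score):
    """Count connected components after dropping edges that score below min_score."""
    parent = list(range(n_cases))

    def find(i):
        # Follow parent pointers to the root, shortening the path as we go.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j, score in scored_pairs:
        if score >= min_score:
            parent[find(i)] = find(j)  # merge the two components

    return len({find(i) for i in range(n_cases)})


# Hypothetical compatibility scores for five cases (illustrative values only).
scored_pairs = [(0, 1, 0.9), (1, 2, 0.6), (2, 3, 0.3), (3, 4, 0.8)]

for min_score in (0.2, 0.5, 0.7):
    n_clusters = components_after_sparsification(5, scored_pairs, min_score)
    print(f"min_score={min_score}: {n_clusters} cluster(s)")
```

With these toy scores the same five cases form one, two, or three clusters depending on the cut-off. A comparable table for the real data, across the sparsification threshold and the Leiden resolution, would help readers judge the "threshold-free" claim.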
The default target scenario is narrow. The main analysis uses direct transmission and co-primary infection as the target set. This is interpretable, but many surveillance questions involve short chains with unsampled intermediates. The manuscript should more clearly explain when this default is appropriate, when users should include hidden intermediates, and how conclusions change as the scenario set expands.
Synthetic validation may be optimistic because the evaluation model resembles the inference model. The synthetic data are generated using assumptions close to EpiLink’s own natural-history and mutation model. This is useful for controlled benchmarking, but it may overstate performance under real-world model misspecification. The authors should add or discuss simulations with different generation-time distributions, sampling fractions, sequencing error, heterogeneous ascertainment, within-host diversity, and mutation processes not matched to the EpiLink assumptions.
Comparator framing could be stronger. Logistic regression trained on 10% of all pair labels is a useful upper benchmark, but it may not represent a realistic outbreak-response setting where labels are scarce and future cases are unseen. A temporal train/test split or low-label regime would make the comparison more practical. The authors could also include simpler baselines, such as a fixed SNP-distance threshold combined with a sampling-time window, since these are common in applied surveillance.
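The two suggestions above can be sketched briefly. This is a minimal illustration under stated assumptions: the function names, the 2-SNP and 14-day cut-offs, and the case records are hypothetical and are not taken from the manuscript.

```python
from datetime import date


def threshold_link(snp_distance, date_a, date_b, max_snps=2, max_days=14):
    """Flag a pair as linked if both the SNP distance and the sampling gap are small.

    The default cut-offs are placeholders, not recommended values.
    """
    gap_days = abs((date_a - date_b).days)
    return snp_distance <= max_snps and gap_days <= max_days


def temporal_split(cases, cutoff):
    """Split cases into those sampled before the cutoff (train) and on or after it (test)."""
    train = [c for c in cases if c["sampled"] < cutoff]
    test = [c for c in cases if c["sampled"] >= cutoff]
    return train, test


# Hypothetical case records (illustrative only).
cases = [
    {"id": "A", "sampled": date(2020, 3, 1)},
    {"id": "B", "sampled": date(2020, 3, 9)},
    {"id": "C", "sampled": date(2020, 4, 20)},
]

train, test = temporal_split(cases, cutoff=date(2020, 4, 1))
print([c["id"] for c in train], [c["id"] for c in test])
print(threshold_link(1, cases[0]["sampled"], cases[1]["sampled"]))
```

Training any supervised comparator only on pairs within the earlier period and evaluating on pairs involving later cases would be closer to how labels become available during an outbreak.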
The cluster ground truth needs more justification. The reference cluster definition appears to emphasize direct infectees and close transmission neighborhoods, while positive pairs also include sibling infections. The authors should justify why this target best matches EpiLink’s intended use and report sensitivity to alternative definitions, such as same transmission chain within one or two generations or same superspreading event.
The Boston validation is suggestive but not definitive. Enrichment for known exposure categories is encouraging, but it does not prove that inferred clusters correspond to transmission clusters. The authors should compare more directly with published outbreak labels, sampling dates, known epidemiological links, and cluster fragmentation/merging patterns. Reporting the robustness of the Boston clusters across Leiden resolution and sparsification settings would also improve confidence.
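One way to quantify robustness across settings is the adjusted Rand index (ARI) between the cluster assignments produced by two parameter choices. The sketch below is a standard-library implementation of the usual ARI formula; the function name and the two example assignments are hypothetical, and the manuscript may prefer a different agreement measure.

```python
from collections import Counter
from math import comb


def adjusted_rand_index(labels_a, labels_b):
    """Adjusted Rand index between two cluster assignments of the same cases."""
    if len(labels_a) != len(labels_b):
        raise ValueError("both assignments must cover the same cases")

    total_pairs = comb(len(labels_a), 2)
    if total_pairs == 0:
        return 1.0  # zero or one case: nothing to disagree about

    # Pairs placed together in both assignments, in A alone, and in B alone.
    together_both = sum(comb(c, 2) for c in Counter(zip(labels_a, labels_b)).values())
    together_a = sum(comb(c, 2) for c in Counter(labels_a).values())
    together_b = sum(comb(c, 2) for c in Counter(labels_b).values())

    expected = together_a * together_b / total_pairs
    maximum = (together_a + together_b) / 2
    if maximum == expected:
        return 1.0  # both assignments are all singletons or one single cluster
    return (together_both - expected) / (maximum - expected)


# Hypothetical assignments of six cases under two graph settings.
low_resolution = [0, 0, 0, 0, 1, 1]
high_resolution = [0, 0, 1, 1, 2, 2]
print(round(adjusted_rand_index(low_resolution, high_resolution), 3))
```

An ARI of 1 means the two settings give identical clusters, while values near 0 indicate agreement no better than chance. A small matrix of ARI values across the settings explored would make cluster stability easy to assess.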
Score interpretation needs stronger user-facing guidance. The compatibility score is not a posterior probability of transmission. This is stated, but readers may still treat high scores as probabilities. The authors should include calibration-style diagnostics or examples showing how scores should and should not be interpreted in public health decision-making.
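A calibration-style diagnostic of the kind suggested here could be a simple reliability table on the synthetic data, where true links are known. The sketch below assumes scores lie in [0, 1]; that assumption, the function name, and the example values are hypothetical.

```python
def reliability_table(scores, is_linked, n_bins=5):
    """Bin scores in [0, 1] and report the observed fraction of true links per bin.

    Returns rows of (bin_low, bin_high, n_pairs, mean_score, observed_fraction),
    skipping empty bins.
    """
    bins = [[] for _ in range(n_bins)]
    for score, linked in zip(scores, is_linked):
        index = min(int(score * n_bins), n_bins - 1)  # a score of 1.0 goes in the last bin
        bins[index].append((score, linked))

    rows = []
    for index, members in enumerate(bins):
        if not members:
            continue
        mean_score = sum(s for s, _ in members) / len(members)
        observed = sum(1 for _, linked in members if linked) / len(members)
        rows.append((index / n_bins, (index + 1) / n_bins, len(members), mean_score, observed))
    return rows


# Hypothetical scores and ground-truth labels (illustrative only).
scores = [0.10, 0.15, 0.90, 0.95]
is_linked = [False, False, True, False]

for low, high, n, mean_score, observed in reliability_table(scores, is_linked, n_bins=2):
    print(f"[{low:.1f}, {high:.1f}): n={n}, mean score={mean_score:.3f}, observed={observed:.2f}")
```

If the mean score in a bin differs markedly from the observed fraction of true links, that gap is a concrete way to show readers that the score should not be read as a probability.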
Minor Issues
The acronyms EDD, EDS, ESD, and ESS are hard to remember. A small table defining each variant near first use would help.
The use of a 5,000 bp synthetic genome should be justified more clearly, especially because SARS-CoV-2 has a much larger genome. Readers may wonder how this choice affects genetic resolution and transferability.
Some figures and tables are dense. Figure captions are informative, but the manuscript would benefit from slightly more visual explanation of what each panel demonstrates.
The authors should report computational scaling more explicitly for larger datasets, since pairwise methods can become expensive as surveillance datasets grow.
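For context on the scaling concern, the number of unordered case pairs grows quadratically, as n(n - 1)/2. The short sketch below only counts pairs; it says nothing about EpiLink's actual runtime per pair, and the dataset sizes are arbitrary examples.

```python
def n_pairs(n_cases):
    """Number of unordered pairs among n_cases cases: n * (n - 1) / 2."""
    return n_cases * (n_cases - 1) // 2


for n in (1_000, 10_000, 100_000):
    print(f"{n:>7,} cases -> {n_pairs(n):>13,} pairs")
```

A tenfold increase in cases gives roughly a hundredfold increase in pairs, so reporting runtime and memory at a few dataset sizes would help readers plan analyses.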
A short practical workflow box would be helpful: choose pathogen parameters, choose target scenarios, run compatibility scoring, vary graph settings, interpret clusters cautiously.
There are minor formatting issues in captions and supplementary references, such as awkward “Fig. S1 Fig” phrasing and occasional spacing problems.
Competing interests
The author declares that they have no competing interests.
Use of Artificial Intelligence (AI)
The author declares that they did not use generative AI to come up with new ideas for their review.