Write a PREreview

Generating Structurally Diverse Therapeutic Peptides with GFlowNet

by Edward Wijaya

Posted: January 7, 2026
Server: bioRxiv
DOI: 10.64898/2026.01.05.697258

Reinforcement learning approaches for therapeutic peptide generation suffer from mode collapse, converging to narrow regions of sequence space even when explicit diversity penalties are applied. Fine-grained analysis reveals persistent mode-seeking behavior invisible to standard diversity metrics.

We propose GFlowNet for peptide generation, which samples sequences proportionally to reward rather than maximizing expected reward. This objective provides diversity through proportional sampling without requiring explicit output diversity penalties. Comparing against GRPO with explicit diversity enforcement, GFlowNet achieves substantially more uniform sequence sampling and fewer repetitive motifs. Critically, when diversity mechanisms are removed from the reward, GRPO collapses completely while GFlowNet maintains natural diversity. These results demonstrate that proportional sampling is inherently robust to reward function design, offering a key advantage for drug discovery pipelines requiring diverse candidates.

You can write a PREreview of Generating Structurally Diverse Therapeutic Peptides with GFlowNet. A PREreview is a review of a preprint and can vary from a few sentences to a lengthy report, similar to a journal-organized peer-review report.

Before you start

We will ask you to log in with your ORCID iD. If you don’t have an iD, you can create one.