Skip to main content

Write a PREreview

From Entropy and Beyond: A Comprehensive Survey of Probability-Space Unsupervised Objectives

Posted
Server
Preprints.org
DOI
10.20944/preprints202606.0394.v1

In an era where compute resources are rapidly advancing with better algorithms and larger clusters, the growth of labeled data, the fossil fuel of AI, has not kept pace. This disparity has spurred a growing interest in learning paradigms that rely solely on unlabeled data. A class of these paradigms employ unsupervised learning objectives that operate directly in the probability or prediction space, with Shannon entropy being one common example among many. Such objectives leverage unlabeled domain data to enable diverse tasks within the target domain. Yet, these methods remain scattered across the literature, with no systematic overview to guide their comparison or use. This work addresses that gap by providing a high-level compilation of the designs, implementations, and applications of 17 such unsupervised loss functions, focusing on their roles in common learning applications while also exploring their broader potential. By presenting their theoretical underpinnings, practical applications, and small-scale yet extensive experiments, this study aims to shape future research by addressing data scarcity, reducing dependence on labeled annotations, and enabling the unsupervised optimization of increasingly large models.

You can write a PREreview of From Entropy and Beyond: A Comprehensive Survey of Probability-Space Unsupervised Objectives. A PREreview is a review of a preprint and can vary from a few sentences to a lengthy report, similar to a journal-organized peer-review report.

Before you start

We will ask you to log in with your ORCID iD. If you don’t have an iD, you can create one.

What is an ORCID iD?

An ORCID iD is a unique identifier that distinguishes you from everyone with the same or similar name.

Start now