Methodology

Overview

The composite ranking for wwileg.org draws on three bibliometric data sources: arXiv preprints, OpenAlex works and citations, and zbMATH editor-classified documents. Each source is scored independently, then merged using a weighted order-statistic formula.

arXiv (math.NT)

Preprints are fetched from the arXiv API in the math.NT (Number Theory) category using the search terms listed below. Author names are extracted from each matching paper. Two scores are computed: a raw paper count, and an eigenvector-centrality score from the co-authorship network. The two scores are combined and normalized to [0, 1].

Search terms used:

Legendre's conjecture
primes in short intervals
prime gaps
short interval
large gaps between primes
bounded gaps between primes
difference between consecutive primes
distribution of prime numbers
Andrica conjecture
Oppermann conjecture
Baker Harman Pintz
Hoheisel
Goldston Pintz Yildirim

OpenAlex

Works are fetched from OpenAlex filtered to the Mathematics field, using the same search terms as arXiv. For each qualifying work, all authors are recorded along with the work's cited-by count. Each author's score combines their total qualifying works and total accumulated citations, normalized to [0, 1].

zbMATH

Documents are fetched from the zbMATH API restricted to two Mathematics Subject Classification codes: 11N05 (Distribution and density of primes) and 11N36 (Applications of sieves to the theory of numbers). Author document counts are accumulated across both classes and normalized to [0, 1].

Score merging

For each researcher, the three normalized scores (arXiv, OpenAlex, zbMATH) are sorted best to worst. The composite score is a weighted average: 0.70 × best rank + 0.20 × middle rank + 0.10 × worst rank. Researchers who appear in only one or two sources have their missing ranks estimated by linear interpolation from the neighboring scores; those estimated values appear in square brackets in the directory.

Known limitations

Author-name disambiguation is not performed beyond the automatic reconciliation provided by each source. Researchers who publish under variant name spellings may be split across multiple entries. zbMATH classes 11N05 and 11N36 are broader than Legendre's conjecture alone; highly-cited analysts who work on sieve methods generally may score higher than their Legendre-specific output would warrant. The arXiv search terms include several broad phrases (for example, "short interval") that retrieve papers not directly about Legendre's conjecture; such papers increase the score of researchers who work on the general prime-gap and short-interval problems that are the closest known approaches to proving the conjecture.