\pdfrender

TextRenderingMode=2, LineWidth=0.1pt

A new elementary proof of the Prime Number Theorem

By  Florian K. Richter
Abstract

Let Ω​(n)Ω𝑛\Omega(n) denote the number of prime factors of n𝑛n. We show that for any bounded f:β„•β†’β„‚:𝑓→ℕℂf\colon\mathbb{N}\to\mathbb{C} one has

1Nβ€‹βˆ‘n=1Nf​(Ω​(n)+1)=1Nβ€‹βˆ‘n=1Nf​(Ω​(n))+oNβ†’βˆžβ€‹(1).1𝑁superscriptsubscript𝑛1𝑁𝑓Ω𝑛11𝑁superscriptsubscript𝑛1𝑁𝑓Ω𝑛subscripto→𝑁1\frac{1}{N}\sum_{n=1}^{N}\,f(\Omega(n)+1)=\frac{1}{N}\sum_{n=1}^{N}\,f(\Omega(n))+{\rm o}_{N\to\infty}(1).

This yields a new elementary proof of the Prime Number Theorem.

1.   Introduction

One of the most fundamental results in mathematics is the Prime Number Theorem, which states that

limNβ†’βˆž|{pβ©½N:p​prime}|N/log⁑N= 1.subscript→𝑁conditional-set𝑝𝑁𝑝prime𝑁𝑁1\lim_{N\to\infty}\leavevmode\nobreak\ \frac{|\{p\leqslant N:p\leavevmode\nobreak\ \text{prime}\}|}{{N}/{\log N}}\leavevmode\nobreak\ =\leavevmode\nobreak\ 1. (1.1)

It was conjectured independently by Gauß and Legendre towards the end of the 18thth{}^{\text{th}} century and proved independently by Hadamard and de la Vallée Poussin in the year 1896. Their proofs were similar in nature and relied on sophisticated analytic machinery from complex analysis developed throughout the 18thth{}^{\text{th}} and 19thth{}^{\text{th}} century by the combined effort of many great mathematicians of this era, including Euler, Dirichlet, Chebyshev, and Riemann. This method of proving the Prime Number Theorem became known as the analytic method. We refer the reader to [Apo00, Gol73b, Gol73a] for more details on the history behind the analytic proof and to [New80] for an abridged version of it; see also [Zag97].

Even though it was believed for a long time not to be possible, an elementary proof of the Prime Number Theorem was eventually found by ErdΕ‘s and Selberg in [Erd49, Sel49]. In this context, elementary does not necessarily mean simple, but refers to methods that avoid using complex analysis and instead rely only on rudimentary facts from calculus and basic arithmetic identities and inequalities. Their approach was based on Selberg’s β€œfundamental formula”, which states that

βˆ‘pβ©½xlog2⁑(p)+βˆ‘p​qβ©½xlog⁑(p)​log⁑(q)= 2​x​log⁑(x)+O​(x).subscript𝑝π‘₯superscript2𝑝subscriptπ‘π‘žπ‘₯π‘π‘ž2π‘₯π‘₯Oπ‘₯\sum_{p\leqslant x}\log^{2}(p)\,+\,\sum_{pq\leqslant x}\log(p)\log(q)\leavevmode\nobreak\ =\leavevmode\nobreak\ 2x\log(x)\,+\,{\rm O}(x). (1.2)

We refer to [Lev69] for a streamlined exposition of the ErdΕ‘s-Selberg proof, and to [Gol04] and [SG09] for the history behind it. See also [Sha50] for a short proof of (1.2) and [Dia82, Gra10] for more general surveys on this topic. A novel and dynamically inspired way of deriving the Prime Number Theorem from (1.2), which bears many similarities to the argument that we present in Section 2, was recently and independently discovered by McNamara [McN20].

Today we also know of other elementary ways of proving the Prime Number Theorem. For instance, an alternative elementary proof was found by Daboussi in [Dab84], using what he called the β€œconvolution method” (cf. [Dab89, p. 1]). We refer the reader to Chapter 4 in the book of Tenenbaum and MendΓ©s France [TMF00] for a friendly rendition of Daboussi’s argument. A third elementary proof, which is different from the proofs of ErdΕ‘s-Selberg and Daboussi, was provided by Hildebrand in [Hil86] and relies on a corollary of the large sieve ([Mon71, Corollary 3.2]) as a starting point.

The purpose of this paper is to provide yet another elementary proof of the Prime Number Theorem. More precisely, we prove the following result, which contains an equivalent form of the Prime Number Theorem as a special case:

Theorem 1.1.

Let Ω​(n)Ω𝑛\Omega(n) denote the number of prime factors of a positive integer n𝑛n (counted with multiplicities). Then for any bounded f:β„•β†’β„‚:𝑓→ℕℂf\colon\mathbb{N}\to\mathbb{C} one has

1Nβ€‹βˆ‘n=1Nf​(Ω​(n)+1)=1Nβ€‹βˆ‘n=1Nf​(Ω​(n))+oNβ†’βˆžβ€‹(1).1𝑁superscriptsubscript𝑛1𝑁𝑓Ω𝑛11𝑁superscriptsubscript𝑛1𝑁𝑓Ω𝑛subscripto→𝑁1\displaystyle\frac{1}{N}\sum_{n=1}^{N}\,f(\Omega(n)+1)\leavevmode\nobreak\ =\leavevmode\nobreak\ \frac{1}{N}\sum_{n=1}^{N}\,f(\Omega(n))\,+\,{\rm o}_{N\to\infty}(1). (1.3)

Letting λ​(n)=(βˆ’1)Ω​(n)πœ†π‘›superscript1Ω𝑛\lambda(n)=(-1)^{\Omega(n)} denote the classical Liouville function, it immediately follows from 1.1 applied to the sequence f​(n)=(βˆ’1)n𝑓𝑛superscript1𝑛f(n)=(-1)^{n} that

limNβ†’βˆž1Nβ€‹βˆ‘n=1Nλ​(n)= 0.subscript→𝑁1𝑁superscriptsubscript𝑛1π‘πœ†π‘›β€‰0\displaystyle\lim_{N\to\infty}\,\frac{1}{N}\sum_{n=1}^{N}\,\lambda(n)\,=\,0. (1.4)

This is a well-known equivalent form of the Prime Number Theorem.111The validity of (1.4) was first observed by von Mangoldt in [vM97, p. 852] and the equivalence between (1.4) and (1.1) was later realized by Landau (see [Lan99, Β§4] and [Lan11] and [Lan09, pp. 620–621] and [Lan09, pp. 631–632]). See also [Ten95, p. 55].

1.1 also recovers other results in multiplicative number theory. For instance, by considering f​(n)=ΞΆn𝑓𝑛superscriptπœπ‘›f(n)=\zeta^{n} where ΞΆβ‰ 1𝜁1\zeta\neq 1 is a mπ‘šm-th root of unity, we obtain from (1.3) that

limNβ†’βˆž1Nβ€‹βˆ‘n=1N΢Ω​(n)= 0.subscript→𝑁1𝑁superscriptsubscript𝑛1𝑁superscriptπœΞ©π‘›β€‰0\lim_{N\to\infty}\,\frac{1}{N}\sum_{n=1}^{N}\,\zeta^{\Omega(n)}\,=\,0.

This implies a theorem of Pillai and Selberg [Pil40, Sel39], which says that for mβˆˆβ„•π‘šβ„•m\in\mathbb{N} and r∈{0,1,…,mβˆ’1}π‘Ÿ01β€¦π‘š1r\in\{0,1,\ldots,m-1\} the set {nβˆˆβ„•:Ω​(n)≑rmodm}conditional-set𝑛ℕΩ𝑛moduloπ‘Ÿπ‘š\{n\in\mathbb{N}:\Omega(n)\equiv r\bmod m\} has asymptotic density 1/m1π‘š1/m. In a similar vein, (1.3) applied to sequences of the form f​(n)=e2​π​i​α​n𝑓𝑛superscript𝑒2πœ‹π‘–π›Όπ‘›f(n)=e^{2\pi i\alpha n} for Ξ±βˆˆβ„\β„šπ›Ό\β„β„š\alpha\in\mathbb{R}\backslash\mathbb{Q} yields a classical result of ErdΕ‘s and Delange (see [Erd46, p. 2, lines 4–5] and [Del58]), asserting that (Ω​(n)​α)nβˆˆβ„•subscriptΩ𝑛𝛼𝑛ℕ(\Omega(n)\alpha)_{n\in\mathbb{N}} is uniformly distributed mod 111 for any irrational α𝛼\alpha. 1.1 also recovers several results recently obtained by the author in [BR20], including [BR20, Theorem A].

The proof of 1.1 is self-contained (with the exception of Stirling’s approximation formula used in Section 3 without a proof) and was inspired by the author’s work in [BR20]. It is worth noting that this is the first proof of the Prime Number Theorem that builds on Chebyshev’s original idea of estimating the number of primes between n𝑛n and 2​n2𝑛2n.

2.   The proof

A well-known relation in number theory, which is often regarded as a corollary of the TurΓ‘n-Kubilius inequality (cf. [Dab75, Lemma 1], [KΓ‘t86, Eq. (3.1)], and [Ell79, Lemma 4.7]), states that for any finite set of primes 𝒫𝒫\mathcal{P} one has

lim supNβ†’βˆž1Nβ€‹βˆ‘n=1N|βˆ‘pβˆˆπ’«1p∣nβˆ’βˆ‘pβˆˆπ’«1p|2=O​(βˆ‘pβˆˆπ’«1p),subscriptlimit-supremum→𝑁1𝑁superscriptsubscript𝑛1𝑁superscriptsubscript𝑝𝒫subscript1conditional𝑝𝑛subscript𝑝𝒫1𝑝2Osubscript𝑝𝒫1𝑝\limsup_{N\to\infty}\,\frac{1}{N}\sum_{n=1}^{N}\,\Biggl{|}\,\sum_{p\in\mathcal{P}}1_{p\mid n}-\sum_{p\in\mathcal{P}}\text{\small$\dfrac{1}{p}$}\,\Biggr{|}^{2}\,=\,{\rm O}\Biggl{(}\sum_{p\in\mathcal{P}}\text{\small$\dfrac{1}{p}$}\Biggr{)}, (2.1)

where 1p∣nsubscript1conditional𝑝𝑛1_{p\mid n} denotes the function that is 111 if p𝑝p divides n𝑛n and 00 otherwise. It is common to interpret (2.1) using a probabilistic point of view: By considering {1,2,…,N}12…𝑁\{1,2,\ldots,N\} as a discrete probability space (with normalized counting measure as the probability measure) and βˆ‘pβˆˆπ’«1p∣nsubscript𝑝𝒫subscript1conditional𝑝𝑛\sum_{p\in\mathcal{P}}1_{p\mid n} as a random variable on this space, (2.1) says that for large N𝑁N the expected number of primes in 𝒫𝒫\mathcal{P} that divide a ”randomly chosen” n∈{1,…,N}𝑛1…𝑁n\in\{1,\ldots,N\} approximately equals βˆ‘pβˆˆπ’«1/psubscript𝑝𝒫1𝑝\sum_{p\in\mathcal{P}}1/p, with a standard deviation on the scale of (βˆ‘pβˆˆπ’«1/p)1/2superscriptsubscript𝑝𝒫1𝑝12(\sum_{p\in\mathcal{P}}1/p)^{1/2}.

An important role in our proof of 1.1 is played by a generalization of (2.1) where the finite set of primes 𝒫𝒫\mathcal{P} is replaced by an arbitrary finite set of positive integers BβŠ‚β„•π΅β„•B\subset\mathbb{N}.

Proposition 2.1.

Suppose BβŠ‚β„•π΅β„•B\subset\mathbb{N} is finite and non-empty. Then

1Nβ€‹βˆ‘n=1N|βˆ‘q∈B1q∣nβˆ’βˆ‘q∈B1q|2=βˆ‘q∈Bβˆ‘qβ€²βˆˆBΦ​(q,qβ€²)q​qβ€²+O​(|B|2N),1𝑁superscriptsubscript𝑛1𝑁superscriptsubscriptπ‘žπ΅subscript1conditionalπ‘žπ‘›subscriptπ‘žπ΅1π‘ž2subscriptπ‘žπ΅subscriptsuperscriptπ‘žβ€²π΅Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²Osuperscript𝐡2𝑁\frac{1}{N}\sum_{n=1}^{N}\,\Biggl{|}\,\sum_{q\in B}1_{q\mid n}-\sum_{q\in B}\text{\small$\dfrac{1}{q}$}\,\Biggr{|}^{2}\,=\,\sum_{q\in B}\sum_{q^{\prime}\in B}\text{\small$\dfrac{\Phi(q,q^{\prime})}{qq^{\prime}}$}+{\rm O}\biggl{(}\frac{|B|^{2}}{N}\biggr{)}, (2.2)

where Ξ¦:β„•Γ—β„•β†’β„•βˆͺ{0}:Ξ¦β†’β„•β„•β„•0\Phi\colon\mathbb{N}{\mkern 0.0mu\times\mkern-0.3mu}\mathbb{N}\to\mathbb{N}\cup\{0\} is the function Φ​(m,n)≔gcd⁑(m,n)βˆ’1β‰”Ξ¦π‘šπ‘›π‘šπ‘›1\Phi(m,n)\coloneqq\gcd(m,n)-1.

Note that (2.2) implies (2.1), because for any finite set of primes 𝒫𝒫\mathcal{P} one has

βˆ‘pβˆˆπ’«βˆ‘pβ€²βˆˆπ’«Ξ¦β€‹(p,pβ€²)p​pβ€²=βˆ‘pβˆˆπ’«1p​(1βˆ’1p)=O​(βˆ‘pβˆˆπ’«1p).subscript𝑝𝒫subscriptsuperscript𝑝′𝒫Φ𝑝superscript𝑝′𝑝superscript𝑝′subscript𝑝𝒫1𝑝11𝑝Osubscript𝑝𝒫1𝑝\sum_{p\in\mathcal{P}}\sum_{p^{\prime}\in\mathcal{P}}\text{\small$\dfrac{\Phi(p,p^{\prime})}{pp^{\prime}}$}\,=\,\sum_{p\in\mathcal{P}}\text{\small$\dfrac{1}{p}$}\Bigl{(}1-\text{\small$\dfrac{1}{p}$}\Bigr{)}\,=\,{\rm O}\Biggl{(}\sum_{p\in\mathcal{P}}\text{\small$\dfrac{1}{p}$}\Biggr{)}.
Proof of 2.1.

Define aβ‰”βˆ‘q∈B1/qβ‰”π‘Žsubscriptπ‘žπ΅1π‘ža\coloneqq\sum_{q\in B}1/q. By expanding the square in (2.2) we get 1Nβ€‹βˆ‘n=1N|βˆ‘q∈B1q∣nβˆ’βˆ‘q∈B1/q|2=S1βˆ’2​a​S2+a21𝑁superscriptsubscript𝑛1𝑁superscriptsubscriptπ‘žπ΅subscript1conditionalπ‘žπ‘›subscriptπ‘žπ΅1π‘ž2subscript𝑆12π‘Žsubscript𝑆2superscriptπ‘Ž2\frac{1}{N}\sum_{n=1}^{N}|\sum_{q\in B}1_{q\mid n}-\sum_{q\in B}{1}/{q}|^{2}=S_{1}-2aS_{2}+a^{2}, where S1≔1Nβ€‹βˆ‘n=1Nβˆ‘q,qβ€²βˆˆB1q∣n​1qβ€²βˆ£n≔subscript𝑆11𝑁superscriptsubscript𝑛1𝑁subscriptπ‘žsuperscriptπ‘žβ€²π΅subscript1conditionalπ‘žπ‘›subscript1conditionalsuperscriptπ‘žβ€²π‘›S_{1}\coloneqq\frac{1}{N}\sum_{n=1}^{N}\sum_{q,q^{\prime}\in B}1_{q\mid n}1_{q^{\prime}\mid n} and S2≔1Nβ€‹βˆ‘n=1Nβˆ‘q∈B1q∣n≔subscript𝑆21𝑁superscriptsubscript𝑛1𝑁subscriptπ‘žπ΅subscript1conditionalπ‘žπ‘›S_{2}\coloneqq\frac{1}{N}\sum_{n=1}^{N}\sum_{q\in B}1_{q\mid n}. Since 1Nβ€‹βˆ‘n=1N1q∣n=1/q+O​(1/N)1𝑁superscriptsubscript𝑛1𝑁subscript1conditionalπ‘žπ‘›1π‘žO1𝑁\frac{1}{N}\sum_{n=1}^{N}1_{q\mid n}=1/q+{\rm O}({1}/{N}), we obtain S2=a+O​(|B|/N)subscript𝑆2π‘ŽO𝐡𝑁S_{2}=a+{\rm O}({|B|}/{N}). On the other hand, 1Nβ€‹βˆ‘n=1N1q∣n​1qβ€²βˆ£n=gcd⁑(q,qβ€²)/q​qβ€²+O​(1/N)1𝑁superscriptsubscript𝑛1𝑁subscript1conditionalπ‘žπ‘›subscript1conditionalsuperscriptπ‘žβ€²π‘›π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²O1𝑁\frac{1}{N}\sum_{n=1}^{N}1_{q\mid n}1_{q^{\prime}\mid n}=\gcd(q,q^{\prime})/qq^{\prime}+{\rm O}({1}/{N}) implies S1=βˆ‘q∈Bβˆ‘qβ€²βˆˆBΦ​(q,qβ€²)/q​qβ€²+a2+O​(|B|2/N)subscript𝑆1subscriptπ‘žπ΅subscriptsuperscriptπ‘žβ€²π΅Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²superscriptπ‘Ž2Osuperscript𝐡2𝑁S_{1}=\sum_{q\in B}\sum_{q^{\prime}\in B}\Phi(q,q^{\prime})/qq^{\prime}+a^{2}+{\rm O}(|B|^{2}/N). Substituting these estimates into S1βˆ’2​a​S2+a2subscript𝑆12π‘Žsubscript𝑆2superscriptπ‘Ž2S_{1}-2aS_{2}+a^{2} finishes the proof of (2.2). ∎

Before we proceed further, it will be convenient to rewrite (2.2) using the language of averages. Given a finite set AβŠ‚β„•π΄β„•A\subset\mathbb{N} and an arithmetic function f:Aβ†’β„‚:𝑓→𝐴ℂf\colon A\to\mathbb{C}, we denote the CesΓ ro average and the logarithmic average of f𝑓f over A𝐴A respectively by

𝔼n∈A​f​(n)≔1|A|β€‹βˆ‘n∈Af​(n)and π”Όlogn∈A​f​(n)β‰”βˆ‘n∈Af​(n)/nβˆ‘n∈A 1/n.formulae-sequence≔𝑛𝐴superscript𝔼absent𝑓𝑛1𝐴subscript𝑛𝐴𝑓𝑛and β‰”𝑛𝐴superscript𝔼log𝑓𝑛subscript𝑛𝐴𝑓𝑛𝑛subscript𝑛𝐴1𝑛\underset{n\in A}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,f(n)\coloneqq\,\frac{1}{|A|}\sum_{n\in A}f(n)\qquad\text{and }\qquad\underset{n\in A}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,f(n)\coloneqq\,\frac{\sum_{n\in A}\,{f(n)}/{n}}{\sum_{n\in A}\,{1}/{n}}.

Also let [x]delimited-[]π‘₯[x] abbreviate the set {1,2,…,⌊xβŒ‹}12…π‘₯\{1,2,\ldots,\lfloor x\rfloor\}. After dividing both sides of (2.2) by (βˆ‘q∈B1/q)2superscriptsubscriptπ‘žπ΅1π‘ž2(\sum_{q\in B}1/q)^{2}, we obtain the following equivalent version of it expressed in terms of averages:

𝔼n∈[N]​|𝔼logq∈B​(q​1q∣nβˆ’1)|2=𝔼logq∈B​𝔼logqβ€²βˆˆB​Φ​(q,qβ€²)+O​(|B|2N).𝑛delimited-[]𝑁superscript𝔼absentsuperscriptπ‘žπ΅superscript𝔼logπ‘žsubscript1conditionalπ‘žπ‘›12π‘žπ΅superscript𝔼logsuperscriptπ‘žβ€²π΅superscript𝔼logΞ¦π‘žsuperscriptπ‘žβ€²Osuperscript𝐡2𝑁\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\left|\,\underset{q\in B}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\big{(}q1_{q\mid n}-1\big{)}\,\right|^{2}\,=\leavevmode\nobreak\ \underset{q\in B}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{q^{\prime}\in B}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\Phi(q,q^{\prime})+{\rm O}\biggl{(}\text{\small$\dfrac{|B|^{2}}{N}$}\biggr{)}. (2.3)

The following proposition is our main technical result. By combining it with (2.3), we will be able to finish the proof of 1.1 rather quickly.

Proposition 2.2.

For all Ξ·>0πœ‚0\eta>0, there exists k0βˆˆβ„•subscriptπ‘˜0β„•k_{0}\in\mathbb{N} such that for all kβ©Ύk0π‘˜subscriptπ‘˜0k\geqslant k_{0} there exist two finite, non-empty sets B1,B2βŠ‚β„•subscript𝐡1subscript𝐡2β„•B_{1},B_{2}\subset\mathbb{N} with the following properties:

  1. (a)

    all elements in B1subscript𝐡1B_{1} are primes and all elements in B2subscript𝐡2B_{2} are a product of exactly kπ‘˜k primes;

  2. (b)

    the sets B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2} have the same cardinality and if B1={p1<…<pt}subscript𝐡1subscript𝑝1…subscript𝑝𝑑B_{1}=\{p_{1}<\ldots<p_{t}\} and B2={q1<…<qt}subscript𝐡2subscriptπ‘ž1…subscriptπ‘žπ‘‘B_{2}=\{q_{1}<\ldots<q_{t}\} then (1βˆ’Ξ·)​pjβ©½qjβ©½(1+Ξ·)​pj1πœ‚subscript𝑝𝑗subscriptπ‘žπ‘—1πœ‚subscript𝑝𝑗(1-\eta)p_{j}\leqslant q_{j}\leqslant(1+\eta)p_{j} holds for all j=1,…,t𝑗1…𝑑j=1,\ldots,t;

  3. (c)

    𝔼m∈Bilog​𝔼n∈Bilog​Φ​(m,n)β©½Ξ·subscriptsuperscriptπ”Όπ‘šsubscript𝐡𝑖subscriptsuperscript𝔼𝑛subscriptπ΅π‘–Ξ¦π‘šπ‘›πœ‚\mathbb{E}^{\log}_{m\in B_{i}}\mathbb{E}^{\log}_{n\in B_{i}}\Phi(m,n)\leqslant\eta for i=1,2𝑖12i=1,2, where ΦΦ\Phi is as in 2.1.

Proof of 1.1 assuming 2.2.

Fix Ξ·>0πœ‚0\eta>0 and let k0βˆˆβ„•subscriptπ‘˜0β„•k_{0}\in\mathbb{N} be as guaranteed by 2.2. This means that for every kβ©Ύk0π‘˜subscriptπ‘˜0k\geqslant k_{0} we can find two finite and non-empty sets B1,B2βŠ‚β„•subscript𝐡1subscript𝐡2β„•B_{1},B_{2}\subset\mathbb{N} satisfying properties (a), (b), and (c). For any g:β„•β†’β„‚:𝑔→ℕℂg\colon\mathbb{N}\to\mathbb{C} with |g​(n)|β©½1𝑔𝑛1|g(n)|\leqslant 1 we thus have

|𝔼n∈[N]​g​(Ω​(n))βˆ’π”Όlogq∈B2​𝔼n∈[N/q]​g​(Ω​(q​n))|2superscript𝑛delimited-[]𝑁superscript𝔼absentπ‘”Ξ©π‘›π‘žsubscript𝐡2superscript𝔼log𝑛delimited-[]π‘π‘žsuperscript𝔼absentπ‘”Ξ©π‘žπ‘›2\displaystyle\left|\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,g(\Omega(n))-\underset{q\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\underset{n\in[\nicefrac{{N}}{{q}}]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,g(\Omega(qn))\right|^{2} =|𝔼n∈[N]​g​(Ω​(n))βˆ’π”Όlogq∈B2​𝔼n∈[N]​q​1q∣n​g​(Ω​(n))|2+O​(1N)absentsuperscript𝑛delimited-[]𝑁superscript𝔼absentπ‘”Ξ©π‘›π‘žsubscript𝐡2superscript𝔼log𝑛delimited-[]𝑁superscript𝔼absentπ‘žsubscript1conditionalπ‘žπ‘›π‘”Ξ©π‘›2O1𝑁\displaystyle=\left|\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,g(\Omega(n))-\underset{q\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,q1_{q\mid n}g(\Omega(n))\right|^{2}+{\rm O}\Big{(}\text{\small$\dfrac{1}{N}$}\Big{)}
⩽𝔼n∈[N]​|𝔼logq∈B2​(1βˆ’q​1q∣n)|2+O​(1N)absent𝑛delimited-[]𝑁superscript𝔼absentsuperscriptπ‘žsubscript𝐡2superscript𝔼log1π‘žsubscript1conditionalπ‘žπ‘›2O1𝑁\displaystyle\,\leqslant\,\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\left|\,\underset{q\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\big{(}1-q1_{q\mid n}\big{)}\,\right|^{2}+\,{\rm O}\Big{(}\text{\small$\dfrac{1}{N}$}\Big{)}
β©½Ξ·+O​(1N),absentπœ‚O1𝑁\displaystyle\,\leqslant\,\eta\,+\,{\rm O}\Big{(}\text{\small$\dfrac{1}{N}$}\Big{)},

where the second to last inequality follows from the Cauchy-Schwarz inequality and the last inequality follows from property (c) combined with (2.3). Since Ω​(n​q)=Ω​(n)+Ω​(q)Ξ©π‘›π‘žΞ©π‘›Ξ©π‘ž\Omega(nq)=\Omega(n)+\Omega(q), we get

𝔼n∈[N]​g​(Ω​(n))𝑛delimited-[]𝑁superscript𝔼absent𝑔Ω𝑛\displaystyle\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)) =𝔼logq∈B2​𝔼n∈[N/q]​g​(Ω​(n)+Ω​(q))+O​(Ξ·1/2+Nβˆ’1/2).absentπ‘žsubscript𝐡2superscript𝔼log𝑛delimited-[]π‘π‘žsuperscript𝔼absentπ‘”Ξ©π‘›Ξ©π‘žOsuperscriptπœ‚12superscript𝑁12\displaystyle\leavevmode\nobreak\ =\leavevmode\nobreak\ \underset{q\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{n\in[\nicefrac{{N}}{{q}}]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+\Omega(q))\leavevmode\nobreak\ +\leavevmode\nobreak\ {\rm O}\left(\eta^{1/2}+N^{-1/2}\right). (2.4)

An analogous calculation carried out with B1subscript𝐡1B_{1} in place of B2subscript𝐡2B_{2} and g​(n+kβˆ’1)π‘”π‘›π‘˜1g(n+k-1) in place of g​(n)𝑔𝑛g(n) yields

𝔼n∈[N]​g​(Ω​(n)+kβˆ’1)𝑛delimited-[]𝑁superscript𝔼absentπ‘”Ξ©π‘›π‘˜1\displaystyle\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+k-1) =𝔼logp∈B1​𝔼n∈[N/p]​g​(Ω​(n)+Ω​(p)+kβˆ’1)+O​(Ξ·1/2+Nβˆ’1/2).absent𝑝subscript𝐡1superscript𝔼log𝑛delimited-[]𝑁𝑝superscript𝔼absentπ‘”Ξ©π‘›Ξ©π‘π‘˜1Osuperscriptπœ‚12superscript𝑁12\displaystyle\leavevmode\nobreak\ =\leavevmode\nobreak\ \underset{p\in B_{1}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{n\in[\nicefrac{{N}}{{p}}]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+\Omega(p)+k-1)\leavevmode\nobreak\ +\leavevmode\nobreak\ {\rm O}\left(\eta^{1/2}+N^{-1/2}\right). (2.5)

Recall that B1subscript𝐡1B_{1} consists only of primes and B2subscript𝐡2B_{2} only of kπ‘˜k-almost primes, which means Ω​(p)=1Ω𝑝1\Omega(p)=1 for all p∈B1𝑝subscript𝐡1p\in B_{1} and Ω​(q)=kΞ©π‘žπ‘˜\Omega(q)=k for all q∈B2π‘žsubscript𝐡2q\in B_{2}. This allows us to rewrite (2.4) and (2.5) as

𝔼n∈[N]​g​(Ω​(n))𝑛delimited-[]𝑁superscript𝔼absent𝑔Ω𝑛\displaystyle\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)) =𝔼logq∈B2​𝔼n∈[N/q]​g​(Ω​(n)+k)+O​(Ξ·1/2+Nβˆ’1/2),absentπ‘žsubscript𝐡2superscript𝔼log𝑛delimited-[]π‘π‘žsuperscript𝔼absentπ‘”Ξ©π‘›π‘˜Osuperscriptπœ‚12superscript𝑁12\displaystyle\leavevmode\nobreak\ =\leavevmode\nobreak\ \underset{q\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{n\in[\nicefrac{{N}}{{q}}]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+k)\leavevmode\nobreak\ +\leavevmode\nobreak\ {\rm O}\left(\eta^{1/2}+N^{-1/2}\right), (2.6)
𝔼n∈[N]​g​(Ω​(n)+kβˆ’1)𝑛delimited-[]𝑁superscript𝔼absentπ‘”Ξ©π‘›π‘˜1\displaystyle\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+k-1) =𝔼logq∈B1​𝔼n∈[N/q]​g​(Ω​(n)+k)+O​(Ξ·1/2+Nβˆ’1/2).absentπ‘žsubscript𝐡1superscript𝔼log𝑛delimited-[]π‘π‘žsuperscript𝔼absentπ‘”Ξ©π‘›π‘˜Osuperscriptπœ‚12superscript𝑁12\displaystyle\leavevmode\nobreak\ =\leavevmode\nobreak\ \underset{q\in B_{1}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{n\in[\nicefrac{{N}}{{q}}]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+k)\leavevmode\nobreak\ +\leavevmode\nobreak\ {\rm O}\left(\eta^{1/2}+N^{-1/2}\right). (2.7)

Finally, let B1={p1<…<pt}subscript𝐡1subscript𝑝1…subscript𝑝𝑑B_{1}=\{p_{1}<\ldots<p_{t}\} and B2={q1<…<qt}subscript𝐡2subscriptπ‘ž1…subscriptπ‘žπ‘‘B_{2}=\{q_{1}<\ldots<q_{t}\} be enumerations of B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2}. Since (1βˆ’Ξ·)​pjβ©½qjβ©½(1+Ξ·)​pj1πœ‚subscript𝑝𝑗subscriptπ‘žπ‘—1πœ‚subscript𝑝𝑗(1-\eta)p_{j}\leqslant q_{j}\leqslant(1+\eta)p_{j}, it follows that 𝔼n∈[N/pj]​g​(Ω​(n)+k)=𝔼n∈[N/qj]​g​(Ω​(n)+k)+O​(Ξ·)subscript𝔼𝑛delimited-[]𝑁subscriptπ‘π‘—π‘”Ξ©π‘›π‘˜subscript𝔼𝑛delimited-[]𝑁subscriptπ‘žπ‘—π‘”Ξ©π‘›π‘˜Oπœ‚\mathbb{E}_{n\in[N/p_{j}]}g(\Omega(n)+k)=\mathbb{E}_{n\in[N/q_{j}]}g(\Omega(n)+k)+{\rm O}(\eta). Taking logarihtmic averages over B1={p1,…,pt}subscript𝐡1subscript𝑝1…subscript𝑝𝑑B_{1}=\{p_{1},\ldots,p_{t}\} and B2={q1,…,qt}subscript𝐡2subscriptπ‘ž1…subscriptπ‘žπ‘‘B_{2}=\{q_{1},\ldots,q_{t}\} (cf. [BR20, Lemma 2.3]) leaves us with

𝔼logp∈B1​𝔼n∈[N/p]​g​(Ω​(n)+k)=𝔼logq∈B2​𝔼n∈[N/q]​g​(Ω​(n)+k)+O​(Ξ·).𝑝subscript𝐡1superscript𝔼log𝑛delimited-[]𝑁𝑝superscript𝔼absentπ‘”Ξ©π‘›π‘˜π‘žsubscript𝐡2superscript𝔼log𝑛delimited-[]π‘π‘žsuperscript𝔼absentπ‘”Ξ©π‘›π‘˜Oπœ‚\underset{p\in B_{1}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{n\in[\nicefrac{{N}}{{p}}]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+k)\leavevmode\nobreak\ =\leavevmode\nobreak\ \underset{q\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{n\in[\nicefrac{{N}}{{q}}]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+k)\leavevmode\nobreak\ +\leavevmode\nobreak\ {\rm O}(\eta). (2.8)

From (2.6), (2.7), and (2.8) it follows that 𝔼n∈[N]​g​(Ω​(n))=𝔼n∈[N]​g​(Ω​(n)+kβˆ’1)+O​(Ξ·1/2+Nβˆ’1/2)subscript𝔼𝑛delimited-[]𝑁𝑔Ω𝑛subscript𝔼𝑛delimited-[]π‘π‘”Ξ©π‘›π‘˜1Osuperscriptπœ‚12superscript𝑁12\mathbb{E}_{n\in[N]}g(\Omega(n))=\mathbb{E}_{n\in[N]}g(\Omega(n)+k-1)+{\rm O}(\eta^{1/2}+N^{-1/2}). This holds for all kβ©Ύk0π‘˜subscriptπ‘˜0k\geqslant k_{0} and hence

𝔼n∈[N]​g​(Ω​(n)+k)=𝔼n∈[N]​g​(Ω​(n)+l)+O​(Ξ·1/2+Nβˆ’1/2)𝑛delimited-[]𝑁superscript𝔼absentπ‘”Ξ©π‘›π‘˜π‘›delimited-[]𝑁superscript𝔼absent𝑔Ω𝑛𝑙Osuperscriptπœ‚12superscript𝑁12\displaystyle\underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+k)\leavevmode\nobreak\ =\leavevmode\nobreak\ \underset{n\in[N]}{{{\mathbb{E}}}^{\leavevmode\nobreak\ }}\,\,g(\Omega(n)+l)\,+\,{\rm O}\left(\eta^{1/2}+N^{-1/2}\right) (2.9)

for all k,lβ©Ύk0π‘˜π‘™subscriptπ‘˜0k,l\geqslant k_{0}. Relation (1.3) now follows from (2.9) by taking k=k0π‘˜subscriptπ‘˜0k=k_{0}, l=k0+1𝑙subscriptπ‘˜01l=k_{0}+1, g​(n)=f​(nβˆ’k0)𝑔𝑛𝑓𝑛subscriptπ‘˜0g(n)=f(n-k_{0}), and letting Ξ·πœ‚\eta go to 00. ∎

3.   Proof of 2.2

The starting point for our proof of 2.2 are Chebyshev-type estimates on the number of primes in intervals. More precisely, we derive a rough lower bound on the number of primes in (8x,8x+1]superscript8π‘₯superscript8π‘₯1(8^{x},8^{x+1}], as well as a rough upper bound on the number of primes in (8x,8x+Ξ΅]superscript8π‘₯superscript8π‘₯πœ€(8^{x},8^{x+\varepsilon}] for small Ξ΅πœ€\varepsilon.

Proposition 3.1.

Let β„™β„™\mathbb{P} be the set of primes. There are x0β©Ύ1subscriptπ‘₯01x_{0}\geqslant 1 and Ξ΅0>0subscriptπœ€00\varepsilon_{0}>0 such that

  1. (i)

    |β„™βˆ©(8x,8x+1]|β©Ύ8xxβ„™superscript8π‘₯superscript8π‘₯1superscript8π‘₯π‘₯|\mathbb{P}\cap(8^{x},8^{x+1}]|\geqslant\frac{8^{x}}{x} for all xβ©Ύx0π‘₯subscriptπ‘₯0x\geqslant x_{0}, and

  2. (ii)

    |β„™βˆ©(8x,8x+Ξ΅]|⩽Ρ​ 8xxβ„™superscript8π‘₯superscript8π‘₯πœ€πœ€superscript8π‘₯π‘₯|\mathbb{P}\cap(8^{x},8^{x+\varepsilon}]|\leqslant\frac{\sqrt{\varepsilon}\,8^{x}}{x} for all xβ©Ύx0π‘₯subscriptπ‘₯0x\geqslant x_{0} and Ρ∈(0,Ξ΅0]πœ€0subscriptπœ€0\varepsilon\in(0,\varepsilon_{0}].

The ideas used in the proof of 3.1 are classical and date back to Chebyshev. We begin with the following lemma.

Lemma 3.2.

We have the asymptotic estimate

|β„™βˆ©(1,x]|β©Ύx​log⁑(2)log⁑x+O​(1).β„™1π‘₯π‘₯2π‘₯O1\big{|}\mathbb{P}\cap(1,x]\big{|}\,\geqslant\,\frac{x\log(2)}{\log x}+{\rm O}(1). (3.1)
Proof.

To obtain (3.1) for arbitrary positive reals xπ‘₯x, it is enough to prove it for all even natural numbers, i.e., x=2​nπ‘₯2𝑛x=2n. In this case, the key is to study the prime factorization of the binomial coefficient (2​nn)binomial2𝑛𝑛{2n}\choose{n}. Observe that there are ⌊m/pβŒ‹π‘šπ‘\lfloor m/p\rfloor many numbers in {1,…,m}1β€¦π‘š\{1,\ldots,m\} that are divisible by p𝑝p. Out of those, there are ⌊m/p2βŒ‹π‘šsuperscript𝑝2\lfloor m/p^{2}\rfloor many divisible by p2superscript𝑝2p^{2}, and out of those there are ⌊m/p3βŒ‹π‘šsuperscript𝑝3\lfloor m/p^{3}\rfloor many divisible by p3superscript𝑝3p^{3}, and so on. Therefore, if ν𝜈\nu is the largest exponent for which pΞ½β©½msuperscriptπ‘πœˆπ‘šp^{\nu}\leqslant m, then the power of p𝑝p in m!π‘šm! is equal to ⌊m/pβŒ‹+⌊m/p2βŒ‹+…+⌊m/pΞ½βŒ‹.π‘šπ‘π‘šsuperscript𝑝2β€¦π‘šsuperscriptπ‘πœˆ\lfloor m/p\rfloor+\lfloor m/p^{2}\rfloor+\ldots+\lfloor m/p^{\nu}\rfloor. In light of this observation, we see that the multiplicity of a prime p𝑝p in the prime factorization of (2​nn)=(2​n)!(n!)2binomial2𝑛𝑛2𝑛superscript𝑛2\binom{2n}{n}=\frac{(2n)!}{(n!)^{2}} is given by the formula

βˆ‘1β©½iβ©½Ξ½p⌊2​n/piβŒ‹βˆ’2β€‹βŒŠn/piβŒ‹,subscript1𝑖subscriptπœˆπ‘2𝑛superscript𝑝𝑖2𝑛superscript𝑝𝑖\sum_{1\leqslant i\leqslant\nu_{p}}\,\lfloor 2n/p^{i}\rfloor-2\lfloor n/p^{i}\rfloor, (3.2)

where Ξ½psubscriptπœˆπ‘\nu_{p} is the largest exponent for which pΞ½pβ©½2​nsuperscript𝑝subscriptπœˆπ‘2𝑛p^{\nu_{p}}\leqslant 2n. Since ⌊2​n/piβŒ‹βˆ’2β€‹βŒŠn/piβŒ‹β©½12𝑛superscript𝑝𝑖2𝑛superscript𝑝𝑖1\lfloor 2n/p^{i}\rfloor-2\lfloor n/p^{i}\rfloor\leqslant 1, we can estimate βˆ‘i=1Ξ½p⌊2​n/piβŒ‹βˆ’2β€‹βŒŠn/piβŒ‹β©½Ξ½psuperscriptsubscript𝑖1subscriptπœˆπ‘2𝑛superscript𝑝𝑖2𝑛superscript𝑝𝑖subscriptπœˆπ‘\sum_{i=1}^{\nu_{p}}\leavevmode\nobreak\ \lfloor 2n/p^{i}\rfloor-2\lfloor n/p^{i}\rfloor\leavevmode\nobreak\ \leqslant\leavevmode\nobreak\ \nu_{p}. This yields

\mbinom​2​n​n⩽∏pβˆˆβ„™βˆ©(1,2​n]pΞ½pβ©½(2​n)|β„™βˆ©(1,2​n]|,\mbinom2𝑛𝑛subscriptproduct𝑝ℙ12𝑛superscript𝑝subscriptπœˆπ‘superscript2𝑛ℙ12𝑛\mbinom{2n}{n}\,\leqslant\,\prod_{p\in\mathbb{P}\cap(1,2n]}p^{\nu_{p}}\,\leqslant\,(2n)^{|\mathbb{P}\cap(1,2n]|},

which after taking logarithms leaves us with

log⁑\mbinom​2​n​nβ©½log⁑(2​n)​|β„™βˆ©(1,2​n]|.\mbinom2𝑛𝑛2𝑛ℙ12𝑛\log\mbinom{2n}{n}\,\leqslant\,\log(2n){|\mathbb{P}\cap(1,2n]|}. (3.3)

Stirling’s approximation formula implies that log⁑(m!)=m​log⁑(m)βˆ’m+O​(log⁑m)π‘šπ‘šπ‘šπ‘šOπ‘š\log(m!)=m\log(m)-m+{\rm O}(\log m). This can now be used to finish the proof by approximating log⁑(2​nn)binomial2𝑛𝑛\log{}\binom{2n}{n} with 2​log⁑(2)​n+O​(log⁑n)22𝑛O𝑛2\log(2)n+{\rm O}(\log n) in (3.3). ∎

3.2 gives a reasonable lower bound on the asymptotic number of primes in (1,x]1π‘₯(1,x], which is important for the proof for part (i) of 3.1. For the proof of part (ii) we also need an upper bound.

Lemma 3.3.

Define β​(Οƒ)≔σ​log⁑(Οƒ)βˆ’(Οƒβˆ’1)​log⁑(Οƒβˆ’1)β‰”π›½πœŽπœŽπœŽπœŽ1𝜎1\beta(\sigma)\coloneqq\sigma\log(\sigma)-(\sigma-1)\log(\sigma-1). Then for all 1<Οƒβ©½161𝜎161<\sigma\leqslant 16,

|β„™βˆ©(x,σ​x]|⩽β​(Οƒ)​xlog⁑x+O​(1).β„™π‘₯𝜎π‘₯π›½πœŽπ‘₯π‘₯O1\big{|}\mathbb{P}\cap(x,\sigma x]\big{|}\leavevmode\nobreak\ \leqslant\leavevmode\nobreak\ \frac{\beta(\sigma)\,x}{\log x}+{\rm O}(1). (3.4)
Proof.

For convenience, let us write (σ​xx)binomial𝜎π‘₯π‘₯\binom{\sigma x}{x} for the quantity (βŒŠΟƒβ€‹xβŒ‹βŒŠxβŒ‹)binomial𝜎π‘₯π‘₯\binom{\lfloor\sigma x\rfloor}{\lfloor x\rfloor}. Observe that every prime number in the interval (x,σ​x]π‘₯𝜎π‘₯(x,\sigma x] divides the number (σ​xx)binomial𝜎π‘₯π‘₯\binom{\sigma x}{x}. Therefore, the number (σ​xx)binomial𝜎π‘₯π‘₯\binom{\sigma x}{x} is greater or equal than ∏pβˆˆβ„™βˆ©(x,σ​x]psubscriptproduct𝑝ℙπ‘₯𝜎π‘₯𝑝\prod_{p\in\mathbb{P}\cap(x,\sigma x]}p. Using ∏pβˆˆβ„™βˆ©(x,σ​x]pβ©Ύx|β„™βˆ©(x,σ​x]|subscriptproduct𝑝ℙπ‘₯𝜎π‘₯𝑝superscriptπ‘₯β„™π‘₯𝜎π‘₯\prod_{p\in\mathbb{P}\cap(x,\sigma x]}p\geqslant x^{|\mathbb{P}\cap(x,\sigma x]|} and taking logarithms, we obtain

log⁑\mbinom​σ​x​xβ©Ύlog⁑(x)​|β„™βˆ©(x,σ​x]|.\mbinom𝜎π‘₯π‘₯π‘₯β„™π‘₯𝜎π‘₯\log\mbinom{\sigma x}{x}\,\geqslant\,\log(x){|\mathbb{P}\cap(x,\sigma x]|}. (3.5)

Similarly as in the proof of 3.2, we can now use Stirling’s approximation formula, log⁑(m!)=m​log⁑(m)βˆ’m+O​(log⁑m)π‘šπ‘šπ‘šπ‘šOπ‘š\log(m!)=m\log(m)-m+{\rm O}(\log m), to estimate that

log⁑\mbinom​σ​x​x\mbinom𝜎π‘₯π‘₯\displaystyle\log\mbinom{\sigma x}{x} =βŒŠΟƒβ€‹xβŒ‹β€‹log⁑(βŒŠΟƒβ€‹xβŒ‹)βˆ’βŒŠxβŒ‹β€‹log⁑(⌊xβŒ‹)βˆ’(βŒŠΟƒβ€‹xβŒ‹βˆ’βŒŠxβŒ‹)​log⁑(βŒŠΟƒβ€‹xβŒ‹βˆ’βŒŠxβŒ‹)+O​(log⁑x)absent𝜎π‘₯𝜎π‘₯π‘₯π‘₯𝜎π‘₯π‘₯𝜎π‘₯π‘₯Oπ‘₯\displaystyle\,=\,\lfloor\sigma x\rfloor\log(\lfloor\sigma x\rfloor)-\lfloor x\rfloor\log(\lfloor x\rfloor)-(\lfloor\sigma x\rfloor-\lfloor x\rfloor)\log(\lfloor\sigma x\rfloor-\lfloor x\rfloor)+{\rm O}(\log x)
=σ​x​log⁑(σ​x)βˆ’x​log⁑(x)βˆ’(Οƒβˆ’1)​x​log⁑((Οƒβˆ’1)​x)+O​(log⁑x)absent𝜎π‘₯𝜎π‘₯π‘₯π‘₯𝜎1π‘₯𝜎1π‘₯Oπ‘₯\displaystyle\,=\,\sigma x\log(\sigma x)-x\log(x)-(\sigma-1)x\log((\sigma-1)x)+{\rm O}(\log x)
=σ​x​log⁑(Οƒ)βˆ’(Οƒβˆ’1)​x​log⁑(Οƒβˆ’1)+O​(log⁑x).absent𝜎π‘₯𝜎𝜎1π‘₯𝜎1Oπ‘₯\displaystyle\,=\,\sigma x\log(\sigma)-(\sigma-1)x\log(\sigma-1)+{\rm O}(\log x).

Together with (3.5), this proves (3.4). ∎

Proof of 3.1.

The proof of part (ii) simply follows from 3.3 (applied with Οƒ=8Ρ𝜎superscript8πœ€\sigma=8^{\varepsilon}) and the fact that the order of magnitude of Ξ΅πœ€\sqrt{\varepsilon} is much larger than the order of magnitude of β​(8Ξ΅)𝛽superscript8πœ€\beta(8^{\varepsilon}) as Ξ΅πœ€\varepsilon tends to 00.

For the proof of part (i), we start by rewriting the interval (8x,8x+1]superscript8π‘₯superscript8π‘₯1(8^{x},8^{x+1}] in the form (8x,8x+1]=(1,8x+1]\⋃0β©½nβ©½3​x(8x2n+1,8x2n].superscript8π‘₯superscript8π‘₯1\1superscript8π‘₯1subscript0𝑛3π‘₯superscript8π‘₯superscript2𝑛1superscript8π‘₯superscript2𝑛(8^{x},8^{x+1}]=(1,8^{x+1}]\backslash\bigcup_{0\leqslant n\leqslant 3x}(\tfrac{8^{x}}{2^{n+1}},\tfrac{8^{x}}{2^{n}}]. 3.2 gives the estimate |β„™βˆ©(1,8x+1]|β©Ύ8x+1/3​(x+1)+O​(1)β„™1superscript8π‘₯1superscript8π‘₯13π‘₯1O1|\mathbb{P}\cap(1,8^{x+1}]|\geqslant{8^{x+1}}/{3(x+1)}+{\rm O}(1), whereas 3.3 (applied with Οƒ=2𝜎2\sigma=2) gives the estimate |β„™βˆ©(8x/2n+1,8x/2n]|β©½8x/2n+1​x+O​(1)β„™superscript8π‘₯superscript2𝑛1superscript8π‘₯superscript2𝑛superscript8π‘₯superscript2𝑛1π‘₯O1|\mathbb{P}\cap({8^{x}}/{2^{n+1}},{8^{x}}/{2^{n}}]|\leqslant 8^{x}/{2^{n+1}x}+{\rm O}(1). Therefore

|β„™βˆ©(8x,8x+1]|β„™superscript8π‘₯superscript8π‘₯1\displaystyle\big{|}\mathbb{P}\cap(8^{x},8^{x+1}]\big{|} β©Ύ8x+13​(x+1)βˆ’βˆ‘0β©½nβ©½3​x8x2n+1​x+O​(x)β©Ύ8x+13​(x+1)βˆ’8xx+O​(x).absentsuperscript8π‘₯13π‘₯1subscript0𝑛3π‘₯superscript8π‘₯superscript2𝑛1π‘₯Oπ‘₯superscript8π‘₯13π‘₯1superscript8π‘₯π‘₯Oπ‘₯\displaystyle\leavevmode\nobreak\ \geqslant\leavevmode\nobreak\ \frac{8^{x+1}}{3(x+1)}\,-\,\sum_{0\leqslant n\leqslant 3x}\frac{8^{x}}{2^{n+1}x}\,+\,{\rm O}(x)\leavevmode\nobreak\ \geqslant\leavevmode\nobreak\ \frac{8^{x+1}}{3(x+1)}-\frac{8^{x}}{x}\,+\,{\rm O}(x).

This implies that if x0subscriptπ‘₯0x_{0} is sufficiently large then |β„™βˆ©(8x,8x+1]|β©Ύ8xxβ„™superscript8π‘₯superscript8π‘₯1superscript8π‘₯π‘₯|\mathbb{P}\cap(8^{x},8^{x+1}]|\geqslant\frac{8^{x}}{x} for all xβ©Ύx0π‘₯subscriptπ‘₯0x\geqslant x_{0}. ∎

3.1 is the only number-theoretic component in our proof of 2.2 and, as we have mentioned above, it doesn’t use any ideas that weren’t already available to Chebyshev. The rest of our argument is more combinatorial in nature.

The first part of 3.1 tells us that we can find a fair amount of primes in any interval of the form (8n,8n+1]superscript8𝑛superscript8𝑛1(8^{n},8^{n+1}]. However, we will need a bit more control over where these primes are within this interval. The following proposition tells us that in (8n,8n+1]superscript8𝑛superscript8𝑛1(8^{n},8^{n+1}] we can find two smaller intervals (8x,8x+Ξ΄]superscript8π‘₯superscript8π‘₯𝛿(8^{x},8^{x+\delta}] and (8y,8y+Ξ΄]superscript8𝑦superscript8𝑦𝛿(8^{y},8^{y+\delta}] which are not too close together but also not too far apart, and each containing a good amount of primes.

Lemma 3.4.

Let x0subscriptπ‘₯0x_{0} be as in 3.1. There exists Ξ΅1>0subscriptπœ€10\varepsilon_{1}>0 such that for all Ρ∈(0,Ξ΅1]πœ€0subscriptπœ€1\varepsilon\in(0,\varepsilon_{1}] and all δ∈(0,1)𝛿01\delta\in(0,1) there exists D=D​(Ξ΅,Ξ΄)∈(0,1)π·π·πœ€π›Ώ01D=D(\varepsilon,\delta)\in(0,1) with the following property: For all nβ©Ύx0𝑛subscriptπ‘₯0n\geqslant x_{0} there are x,y∈[n,n+1)π‘₯𝑦𝑛𝑛1x,y\in[n,n+1) with Ξ΅4<yβˆ’x<Ξ΅superscriptπœ€4𝑦π‘₯πœ€\varepsilon^{4}<y-x<\varepsilon such that

|β„™βˆ©(8x,8x+Ξ΄]|β©ΎD​8nn,and|β„™βˆ©(8y,8y+Ξ΄]|β©ΎD​8nn.formulae-sequenceβ„™superscript8π‘₯superscript8π‘₯𝛿𝐷superscript8𝑛𝑛andβ„™superscript8𝑦superscript8𝑦𝛿𝐷superscript8𝑛𝑛\big{|}\mathbb{P}\cap(8^{x},8^{x+\delta}]\big{|}\,\geqslant\,\frac{D8^{n}}{n},\qquad\text{and}\qquad\big{|}\mathbb{P}\cap(8^{y},8^{y+\delta}]\big{|}\,\geqslant\,\frac{D8^{n}}{n}.
Proof.

As guaranteed by 3.1, the number of primes in (8n,8n+1]superscript8𝑛superscript8𝑛1(8^{n},8^{n+1}] is at least 8n/nsuperscript8𝑛𝑛{8^{n}}/{n}. Therefore, by the Pigeonhole Principle, for some t∈[n,n+1)𝑑𝑛𝑛1t\in[n,n+1) the number of primes in (8t,8t+Ξ΅]superscript8𝑑superscript8π‘‘πœ€(8^{t},8^{t+\varepsilon}] is at least Ρ​8n/2​nπœ€superscript8𝑛2𝑛\varepsilon 8^{n}/2n. We can then cover the interval (8t,8t+Ξ΅]superscript8𝑑superscript8π‘‘πœ€(8^{t},8^{t+\varepsilon}] by Kβ‰”βŒˆΞ΅βˆ’3βŒ‰β‰”πΎsuperscriptπœ€3K\coloneqq\lceil\varepsilon^{-3}\rceil many smaller intervals in the following way:

(8t,8t+Ξ΅]=(8t,8t+Ξ΅4]βˆͺ(8t+Ξ΅4,8t+2​Ρ4]βˆͺ…βˆͺ(8t+(Kβˆ’1)​Ρ4,8t+K​Ρ4].superscript8𝑑superscript8π‘‘πœ€superscript8𝑑superscript8𝑑superscriptπœ€4superscript8𝑑superscriptπœ€4superscript8𝑑2superscriptπœ€4…superscript8𝑑𝐾1superscriptπœ€4superscript8𝑑𝐾superscriptπœ€4(8^{t},8^{t+\varepsilon}]=\big{(}8^{t},8^{t+\varepsilon^{4}}\big{]}\cup\big{(}8^{t+\varepsilon^{4}},8^{t+2\varepsilon^{4}}\big{]}\cup\ldots\cup\big{(}8^{t+(K-1)\varepsilon^{4}},8^{t+K\varepsilon^{4}}\big{]}.

By 3.3, each of the intervals (8t+i​Ρ4,8t+(i+1)​Ρ4]superscript8𝑑𝑖superscriptπœ€4superscript8𝑑𝑖1superscriptπœ€4(8^{t+i\varepsilon^{4}},8^{t+(i+1)\varepsilon^{4}}] contains at most O​(Ξ΅2​8n/n)Osuperscriptπœ€2superscript8𝑛𝑛{\rm O}({\varepsilon^{2}8^{n}}/{n}) many primes, whereby the whole interval (8t,8t+Ξ΅]superscript8𝑑superscript8π‘‘πœ€(8^{t},8^{t+\varepsilon}] contains at least O​(Ρ​8n/n)Oπœ€superscript8𝑛𝑛{\rm O}({\varepsilon 8^{n}}/{n}) many primes. It follows that if Ξ΅πœ€\varepsilon is chosen sufficiently small, say smaller than some threshold Ξ΅1subscriptπœ€1\varepsilon_{1}, then we can find two non-consecutive a,b∈{0,1,…,Kβˆ’1}π‘Žπ‘01…𝐾1a,b\in\{0,1,\ldots,K-1\} such that the intervals (8t+a​Ρ4,8t+(a+1)​Ρ4]superscript8π‘‘π‘Žsuperscriptπœ€4superscript8π‘‘π‘Ž1superscriptπœ€4(8^{t+a\varepsilon^{4}},8^{t+(a+1)\varepsilon^{4}}] and (8t+b​Ρ4,8t+(b+1)​Ρ4]superscript8𝑑𝑏superscriptπœ€4superscript8𝑑𝑏1superscriptπœ€4(8^{t+b\varepsilon^{4}},8^{t+(b+1)\varepsilon^{4}}] contain at least O​(Ξ΅4​8n/n)Osuperscriptπœ€4superscript8𝑛𝑛{\rm O}({\varepsilon^{4}8^{n}}/{n}) many primes. Using the Pigeonhole Principle once more we can then find for every δ∈(0,1)𝛿01\delta\in(0,1) some x∈[t+a​Ρ4,t+(a+1)​Ρ4)π‘₯π‘‘π‘Žsuperscriptπœ€4π‘‘π‘Ž1superscriptπœ€4x\in[t+a\varepsilon^{4},t+(a+1)\varepsilon^{4}) and some y∈[t+b​Ρ4,t+(b+1)​Ρ4)𝑦𝑑𝑏superscriptπœ€4𝑑𝑏1superscriptπœ€4y\in[t+b\varepsilon^{4},t+(b+1)\varepsilon^{4}) such that the intervals (8x,8x+Ξ΄]superscript8π‘₯superscript8π‘₯𝛿(8^{x},8^{x+\delta}] and (8y,8y+Ξ΄]superscript8𝑦superscript8𝑦𝛿(8^{y},8^{y+\delta}] contain at least O​(δ​Ρ4​8n/n)O𝛿superscriptπœ€4superscript8𝑛𝑛{\rm O}({\delta\varepsilon^{4}8^{n}}/{n}) many primes. Since aπ‘Ža and b𝑏b are non-consecutive, we have yβˆ’x>Ξ΅4𝑦π‘₯superscriptπœ€4y-x>\varepsilon^{4}, and since x,y∈[t,t+Ξ΅)π‘₯π‘¦π‘‘π‘‘πœ€x,y\in[t,t+\varepsilon) we have yβˆ’x<Ρ𝑦π‘₯πœ€y-x<\varepsilon. ∎

The final ingredient in our proof of 2.2 is a purely combinatorial lemma.

Lemma 3.5.

Fix x0β©Ύ1subscriptπ‘₯01x_{0}\geqslant 1 and Ξ΅>0πœ€0\varepsilon>0. Suppose 𝒳𝒳\mathcal{X} is a subset of ℝℝ\mathbb{R} with the property that for every nβ©Ύx0𝑛subscriptπ‘₯0n\geqslant x_{0} there exist x,yβˆˆπ’³βˆ©[n,n+1)π‘₯𝑦𝒳𝑛𝑛1x,y\in\mathcal{X}\cap[n,n+1) with Ξ΅4<yβˆ’x<Ξ΅superscriptπœ€4𝑦π‘₯πœ€\varepsilon^{4}<y-x<\varepsilon. Let k⩾⌈2/Ξ΅4βŒ‰π‘˜2superscriptπœ€4k\geqslant\lceil 2/\varepsilon^{4}\rceil. Then for all n1,…,nk∈{nβˆˆβ„•:nβ©Ύx0}subscript𝑛1…subscriptπ‘›π‘˜conditional-set𝑛ℕ𝑛subscriptπ‘₯0n_{1},\ldots,n_{k}\in\{n\in\mathbb{N}:n\geqslant x_{0}\} there exist z,z1,…,zkβˆˆπ’³π‘§subscript𝑧1…subscriptπ‘§π‘˜π’³z,z_{1},\ldots,z_{k}\in\mathcal{X} such that

  1. (I)

    zi∈[ni,ni+1)subscript𝑧𝑖subscript𝑛𝑖subscript𝑛𝑖1z_{i}\in[n_{i},n_{i}+1) for all 1β©½iβ©½k1π‘–π‘˜1\leqslant i\leqslant k;

  2. (II)

    z1+…+zk∈[z,z+Ξ΅)subscript𝑧1…subscriptπ‘§π‘˜π‘§π‘§πœ€z_{1}+\ldots+z_{k}\in[z,z+\varepsilon).

Proof.

Let n1,…,nk∈{nβˆˆβ„•:nβ©Ύx0}subscript𝑛1…subscriptπ‘›π‘˜conditional-set𝑛ℕ𝑛subscriptπ‘₯0n_{1},\ldots,n_{k}\in\{n\in\mathbb{N}:n\geqslant x_{0}\}. According to the hypothesis of the lemma, we can find for every i=1,…,k𝑖1β€¦π‘˜i=1,\ldots,k two numbers xi,yi∈[ni,ni+1)βˆ©π’³subscriptπ‘₯𝑖subscript𝑦𝑖subscript𝑛𝑖subscript𝑛𝑖1𝒳x_{i},y_{i}\in[n_{i},n_{i}+1)\cap\mathcal{X} with Ξ΅4<yiβˆ’xi<Ξ΅superscriptπœ€4subscript𝑦𝑖subscriptπ‘₯π‘–πœ€\varepsilon^{4}<y_{i}-x_{i}<\varepsilon. Define, for every i=0,1,…,k𝑖01β€¦π‘˜i=0,1,\ldots,k, the number ui≔x1+…+xi+yi+1+…+yk.≔subscript𝑒𝑖subscriptπ‘₯1…subscriptπ‘₯𝑖subscript𝑦𝑖1…subscriptπ‘¦π‘˜u_{i}\coloneqq x_{1}+\ldots+x_{i}+y_{i+1}+\ldots+y_{k}. Since Ξ΅4<yiβˆ’xisuperscriptπœ€4subscript𝑦𝑖subscriptπ‘₯𝑖\varepsilon^{4}<y_{i}-x_{i} we have ukβˆ’u0β©Ύk​Ρ4β©Ύ2subscriptπ‘’π‘˜subscript𝑒0π‘˜superscriptπœ€42u_{k}-u_{0}\geqslant k\varepsilon^{4}\geqslant 2. This implies that there exists some zβˆˆπ’³π‘§π’³z\in\mathcal{X} with u0<z<uksubscript𝑒0𝑧subscriptπ‘’π‘˜u_{0}<z<u_{k}. Since ui+1βˆ’ui<Ξ΅subscript𝑒𝑖1subscriptπ‘’π‘–πœ€u_{i+1}-u_{i}<\varepsilon, there is some i0∈{0,1,…,k}subscript𝑖001β€¦π‘˜i_{0}\in\{0,1,\ldots,k\} such that ui0∈[z,z+Ξ΅)subscript𝑒subscript𝑖0π‘§π‘§πœ€u_{i_{0}}\in[z,z+\varepsilon). Setting zi≔xi≔subscript𝑧𝑖subscriptπ‘₯𝑖z_{i}\coloneqq x_{i} for iβ©½i0𝑖subscript𝑖0i\leqslant i_{0} and zi≔yi≔subscript𝑧𝑖subscript𝑦𝑖z_{i}\coloneqq y_{i} for i>i0𝑖subscript𝑖0i>i_{0}, we obtain z1+…+zk=ui0∈[z,z+Ξ΅)subscript𝑧1…subscriptπ‘§π‘˜subscript𝑒subscript𝑖0π‘§π‘§πœ€z_{1}+\ldots+z_{k}=u_{i_{0}}\in[z,z+\varepsilon) as desired. ∎

Proof of 2.2.

Let η∈(0,1)πœ‚01\eta\in(0,1) be given. Let x0subscriptπ‘₯0x_{0} and Ξ΅0subscriptπœ€0\varepsilon_{0} be as in 3.1 and let Ξ΅1subscriptπœ€1\varepsilon_{1} be as in 3.4. Pick any Ξ΅>0πœ€0\varepsilon>0 with Ξ΅<min⁑{Ξ΅0,Ξ΅1,log⁑(1+Ξ·)/log⁑(64)}πœ€subscriptπœ€0subscriptπœ€11πœ‚64\varepsilon<\min\{\varepsilon_{0},\varepsilon_{1},{\log(1+\eta)}/{\log(64)}\} and set k0β‰”βŒˆ2/Ξ΅4βŒ‰β‰”subscriptπ‘˜02superscriptπœ€4k_{0}\coloneqq\lceil 2/\varepsilon^{4}\rceil. We claim that k0subscriptπ‘˜0k_{0} is as desired, meaning that for all kβ©Ύk0π‘˜subscriptπ‘˜0k\geqslant k_{0} there exist two finite, non-empty sets B1,B2βŠ‚β„•subscript𝐡1subscript𝐡2β„•B_{1},B_{2}\subset\mathbb{N} satisfying properties (a), (b), and (c).

To verify this claim, fix kβ©Ύk0π‘˜subscriptπ‘˜0k\geqslant k_{0}, set δ≔Ρ/kβ‰”π›Ώπœ€π‘˜\delta\coloneqq{\varepsilon}/{k}, and let D=D​(Ξ΅,Ξ΄)π·π·πœ€π›ΏD=D(\varepsilon,\delta) be as in 3.4. Define

𝒳≔{xβ©Ύx0:|β„™βˆ©(8x,8x+Ξ΄]|β©ΎD​8⌊xβŒ‹/⌊xβŒ‹},≔𝒳conditional-setπ‘₯subscriptπ‘₯0β„™superscript8π‘₯superscript8π‘₯𝛿𝐷superscript8π‘₯π‘₯\mathcal{X}\,\coloneqq\,\Big{\{}x\geqslant x_{0}:\big{|}\mathbb{P}\cap(8^{x},8^{x+\delta}]\big{|}\geqslant{D8^{\lfloor x\rfloor}}/{\lfloor x\rfloor}\Big{\}},

and for every xβˆˆπ’³π‘₯𝒳x\in\mathcal{X} let Pxsubscript𝑃π‘₯P_{x} be a subset of β„™βˆ©(8x,8x+Ξ΄]β„™superscript8π‘₯superscript8π‘₯𝛿\mathbb{P}\cap(8^{x},8^{x+\delta}] with |Px|=⌊D​8⌊xβŒ‹/⌊xβŒ‹βŒ‹subscript𝑃π‘₯𝐷superscript8π‘₯π‘₯|P_{x}|=\lfloor{D8^{\lfloor x\rfloor}}/{\lfloor x\rfloor}\rfloor. The sets Pxsubscript𝑃π‘₯P_{x} are the building blocks from which we will construct B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2}.

According to 3.4 the set 𝒳𝒳\mathcal{X} satisfies the hypothesis of 3.5, which allows us to find for all nβ†’=(n1,…,nk)∈{nβˆˆβ„•:nβ©Ύx0}k→𝑛subscript𝑛1…subscriptπ‘›π‘˜superscriptconditional-set𝑛ℕ𝑛subscriptπ‘₯0π‘˜\vec{n}=(n_{1},\ldots,n_{k})\in\{n\in\mathbb{N}:n\geqslant x_{0}\}^{k} numbers znβ†’,z1,nβ†’,…,zk,nβ†’βˆˆπ’³subscript𝑧→𝑛subscript𝑧1→𝑛…subscriptπ‘§π‘˜β†’π‘›π’³z_{\vec{n}},z_{1,\vec{n}},\ldots,z_{k,\vec{n}}\in\mathcal{X} such that

  1. (I)

    zi,nβ†’βˆˆ[ni,ni+1)subscript𝑧𝑖→𝑛subscript𝑛𝑖subscript𝑛𝑖1z_{i,\vec{n}}\in[n_{i},n_{i}+1) for all 1β©½iβ©½k1π‘–π‘˜1\leqslant i\leqslant k, and

  2. (II)

    z1,nβ†’+…+zk,nβ†’βˆˆ[znβ†’,znβ†’+Ξ΅)subscript𝑧1→𝑛…subscriptπ‘§π‘˜β†’π‘›subscript𝑧→𝑛subscriptπ‘§β†’π‘›πœ€z_{1,\vec{n}}+\ldots+z_{k,\vec{n}}\in[z_{\vec{n}},z_{\vec{n}}+\varepsilon).

Note that Property (I) and the definition of Pxsubscript𝑃π‘₯P_{x} imply

|Pzi,nβ†’|=⌊D​8ni/niβŒ‹.subscript𝑃subscript𝑧𝑖→𝑛𝐷superscript8subscript𝑛𝑖subscript𝑛𝑖\displaystyle|P_{z_{i,\vec{n}}}|\,=\,\left\lfloor{D8^{n_{i}}}/{n_{i}}\right\rfloor. (3.6)

Next, let N=N​(D,Ξ·,k)π‘π‘π·πœ‚π‘˜N=N(D,\eta,k) be a constant that is to be determined later, and define sets A1,…,AkβŠ‚{nβˆˆβ„•:nβ©Ύx0}subscript𝐴1…subscriptπ΄π‘˜conditional-set𝑛ℕ𝑛subscriptπ‘₯0A_{1},\ldots,A_{k}\subset\{n\in\mathbb{N}:n\geqslant x_{0}\} in the following way: Pick any s1βˆˆβ„•subscript𝑠1β„•s_{1}\in\mathbb{N} with s1>max⁑{x0,2​k}subscript𝑠1subscriptπ‘₯02π‘˜s_{1}>\max\{x_{0},2k\} and let A1subscript𝐴1A_{1} be any finite subset of s1​ℕ={s1​n:nβˆˆβ„•}subscript𝑠1β„•conditional-setsubscript𝑠1𝑛𝑛ℕs_{1}\mathbb{N}=\{s_{1}n:n\in\mathbb{N}\} with βˆ‘n∈A11/nβ©ΎNsubscript𝑛subscript𝐴11𝑛𝑁\sum_{n\in A_{1}}1/n\geqslant N. Then, assuming Aisubscript𝐴𝑖A_{i} has already been found, take any si+1>max⁑(A1+…+Ai)subscript𝑠𝑖1subscript𝐴1…subscript𝐴𝑖s_{i+1}>\max(A_{1}+\ldots+A_{i}) and let Ai+1subscript𝐴𝑖1A_{i+1} be any finite subset of si+1​ℕsubscript𝑠𝑖1β„•s_{i+1}\mathbb{N} with βˆ‘n∈Ai+11/nβ©ΎNsubscript𝑛subscript𝐴𝑖11𝑛𝑁\sum_{n\in A_{i+1}}1/n\geqslant N. Following this procedure until i=kπ‘–π‘˜i=k, we end up with a family of finite sets A1,…,Aksubscript𝐴1…subscriptπ΄π‘˜A_{1},\ldots,A_{k} with the following property:

  1. (A)

    For any (n1,…,nk)β‰ (n1β€²,…,nkβ€²)∈A1×…×Aksubscript𝑛1…subscriptπ‘›π‘˜superscriptsubscript𝑛1′…superscriptsubscriptπ‘›π‘˜β€²subscript𝐴1…subscriptπ΄π‘˜(n_{1},\ldots,n_{k})\neq(n_{1}^{\prime},\ldots,n_{k}^{\prime})\in A_{1}\times\ldots\times A_{k} the distance between n1+…+nksubscript𝑛1…subscriptπ‘›π‘˜n_{1}+\ldots+n_{k} and n1β€²+…+nkβ€²superscriptsubscript𝑛1′…superscriptsubscriptπ‘›π‘˜β€²n_{1}^{\prime}+\ldots+n_{k}^{\prime} is at least 2​k2π‘˜2k.

We are now ready to define the sets B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2}. The set B2subscript𝐡2B_{2} is defined as

B2≔⋃nβ†’βˆˆA1×…×AkPz1,n→⋅…⋅Pzk,nβ†’.≔subscript𝐡2subscript→𝑛subscript𝐴1…subscriptπ΄π‘˜β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›\displaystyle B_{2}\coloneqq\bigcup_{\vec{n}\in A_{1}\times\ldots\times A_{k}}P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}}. (3.7)

According to (3.6) we have

|Pz1,n→⋅…⋅Pzk,nβ†’|β©½Dk​8n1+…+nkn1⋅…⋅nkβ©½Dk​8n1+…+nkn1+…+nkβ©½D​8⌊znβ†’βŒ‹βŒŠznβ†’βŒ‹,β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›superscriptπ·π‘˜superscript8subscript𝑛1…subscriptπ‘›π‘˜β‹…subscript𝑛1…subscriptπ‘›π‘˜superscriptπ·π‘˜superscript8subscript𝑛1…subscriptπ‘›π‘˜subscript𝑛1…subscriptπ‘›π‘˜π·superscript8subscript𝑧→𝑛subscript𝑧→𝑛\big{|}P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}}\big{|}\,\leqslant\,\frac{D^{k}8^{n_{1}+\ldots+n_{k}}}{n_{1}\cdot\ldots\cdot n_{k}}\,\leqslant\,\frac{D^{k}8^{n_{1}+\ldots+n_{k}}}{n_{1}+\ldots+n_{k}}\,\leqslant\,\frac{D8^{\lfloor z_{\vec{n}}\rfloor}}{\lfloor z_{\vec{n}}\rfloor},

where the last inequality follows from Dkβ©½Dsuperscriptπ·π‘˜π·D^{k}\leqslant D and n1+…+nk⩽⌊znβ†’βŒ‹subscript𝑛1…subscriptπ‘›π‘˜subscript𝑧→𝑛n_{1}+\ldots+n_{k}\leqslant\lfloor z_{\vec{n}}\rfloor. Therefore |Pz1,n→⋅…⋅Pzk,nβ†’|β©½|Pznβ†’|β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›subscript𝑃subscript𝑧→𝑛|P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}}|\,\leqslant\,|P_{z_{\vec{n}}}|, which guarantees the existence of a set Qnβ†’βŠ‚Pznβ†’subscript𝑄→𝑛subscript𝑃subscript𝑧→𝑛Q_{\vec{n}}\subset P_{z_{\vec{n}}} with |Qnβ†’|=|Pz1,n→⋅…⋅Pzk,nβ†’|subscript𝑄→𝑛⋅subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›|Q_{\vec{n}}|=|P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}}|. Define B1subscript𝐡1B_{1} to be

B1≔⋃nβ†’βˆˆA1×…×AkQnβ†’.≔subscript𝐡1subscript→𝑛subscript𝐴1…subscriptπ΄π‘˜subscript𝑄→𝑛\displaystyle B_{1}\coloneqq\bigcup_{\vec{n}\in A_{1}\times\ldots\times A_{k}}Q_{\vec{n}}. (3.8)

It remains to show that B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2} satisfy properties (a), (b), and (c).

Proof that B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2} satisfy (a).

By construction, B1subscript𝐡1B_{1} consists only of primes and B2subscript𝐡2B_{2} only of numbers that are a product of kπ‘˜k primes. ∎

Proof that B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2} satisfy (b).

By definition we have Pzi,nβ†’βŠ‚(8zi,nβ†’,8zi,nβ†’+Ξ΄]subscript𝑃subscript𝑧𝑖→𝑛superscript8subscript𝑧𝑖→𝑛superscript8subscript𝑧𝑖→𝑛𝛿P_{z_{i,\vec{n}}}\subset(8^{z_{i,\vec{n}}},8^{z_{i,\vec{n}}+\delta}], and so

Pz1,n→⋅…⋅Pzk,nβ†’β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›\displaystyle P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}} βŠ‚(8z1,nβ†’,8z1,nβ†’+Ξ΄]⋅…⋅(8zk,nβ†’,8zk,nβ†’+Ξ΄]absentβ‹…superscript8subscript𝑧1→𝑛superscript8subscript𝑧1→𝑛𝛿…superscript8subscriptπ‘§π‘˜β†’π‘›superscript8subscriptπ‘§π‘˜β†’π‘›π›Ώ\displaystyle\,\subset\,(8^{z_{1,\vec{n}}},8^{z_{1,\vec{n}}+\delta}]\cdot\ldots\cdot(8^{z_{k,\vec{n}}},8^{z_{k,\vec{n}}+\delta}]
βŠ‚(8z1,nβ†’+…+zk,nβ†’,8z1,nβ†’+…+zk,nβ†’+k​δ]absentsuperscript8subscript𝑧1→𝑛…subscriptπ‘§π‘˜β†’π‘›superscript8subscript𝑧1→𝑛…subscriptπ‘§π‘˜β†’π‘›π‘˜π›Ώ\displaystyle\,\subset\,(8^{z_{1,\vec{n}}+\ldots+z_{k,\vec{n}}},8^{z_{1,\vec{n}}+\ldots+z_{k,\vec{n}}+k\delta}]
βŠ‚(8znβ†’,8znβ†’+2​Ρ],absentsuperscript8subscript𝑧→𝑛superscript8subscript𝑧→𝑛2πœ€\displaystyle\,\subset\,(8^{z_{\vec{n}}},8^{z_{\vec{n}}+2\varepsilon}],

where the last inclusion follows from Property (II) and the fact that kβ€‹Ξ΄β©½Ξ΅π‘˜π›Ώπœ€k\delta\leqslant\varepsilon. Also by definition we have Qnβ†’βŠ‚(8znβ†’,8znβ†’+Ξ΄]βŠ‚(8znβ†’,8znβ†’+2​Ρ]subscript𝑄→𝑛superscript8subscript𝑧→𝑛superscript8subscript𝑧→𝑛𝛿superscript8subscript𝑧→𝑛superscript8subscript𝑧→𝑛2πœ€Q_{\vec{n}}\subset(8^{z_{\vec{n}}},8^{z_{\vec{n}}+\delta}]\subset(8^{z_{\vec{n}}},8^{z_{\vec{n}}+2\varepsilon}]. Using the fact that |n1+…+nkβˆ’znβ†’|β©½k+1subscript𝑛1…subscriptπ‘›π‘˜subscriptπ‘§β†’π‘›π‘˜1|n_{1}+\ldots+n_{k}-z_{\vec{n}}|\leqslant k+1 and Property (A), we conclude that (Pz1,n→⋅…⋅Pzk,nβ†’)∩(Pz1,n→′⋅…⋅Pzk,nβ†’β€²)=βˆ…β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›β‹…subscript𝑃subscript𝑧1superscript→𝑛′…subscript𝑃subscriptπ‘§π‘˜superscript→𝑛′(P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}})\cap(P_{z_{1,\vec{n}^{\prime}}}\cdot\ldots\cdot P_{z_{k,\vec{n}^{\prime}}})=\emptyset and Qnβ†’βˆ©Qnβ†’β€²=βˆ…subscript𝑄→𝑛subscript𝑄superscript→𝑛′Q_{\vec{n}}\cap Q_{\vec{n}^{\prime}}=\emptyset whenever nβ†’β‰ n→′→𝑛superscript→𝑛′\vec{n}\neq\vec{n}^{\prime}. Since |Pz1,n→⋅…⋅Pzk,nβ†’|=|Qnβ†’|β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›subscript𝑄→𝑛|P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}}|=|Q_{\vec{n}}|, we immediately get that B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2} have the same cardinality. Moreover, both Qnβ†’subscript𝑄→𝑛Q_{\vec{n}} and Pz1,n→⋅…⋅Pzk,nβ†’β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}} belong to the interval (8znβ†’,8znβ†’+2​Ρ]superscript8subscript𝑧→𝑛superscript8subscript𝑧→𝑛2πœ€(8^{z_{\vec{n}}},8^{z_{\vec{n}}+2\varepsilon}], which implies that the ratio between any element in Pz1,n→⋅…⋅Pzk,nβ†’β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}} and any element in Qnβ†’subscript𝑄→𝑛Q_{\vec{n}} lies between 8βˆ’2​Ρsuperscript82πœ€8^{-2\varepsilon} and 82​Ρsuperscript82πœ€8^{2\varepsilon}. Since 82​Ρ⩽(1+Ξ·)superscript82πœ€1πœ‚8^{2\varepsilon}\leqslant(1+\eta), it follows that for enumerations Pz1,n→⋅…⋅Pzk,nβ†’={q1<…<qr}β‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›subscriptπ‘ž1…subscriptπ‘žπ‘ŸP_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}}=\{q_{1}<\ldots<q_{r}\} and Qnβ†’={p1<…<pr}subscript𝑄→𝑛subscript𝑝1…subscriptπ‘π‘ŸQ_{\vec{n}}=\{p_{1}<\ldots<p_{r}\} we have (1βˆ’Ξ·)​pjβ©½qjβ©½(1+Ξ·)​pj1πœ‚subscript𝑝𝑗subscriptπ‘žπ‘—1πœ‚subscript𝑝𝑗(1-\eta)p_{j}\leqslant q_{j}\leqslant(1+\eta)p_{j} for all j=1,…,r𝑗1β€¦π‘Ÿj=1,\ldots,r. This property now easily extends to enumerations of B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2}. ∎

Proof that B1subscript𝐡1B_{1} and B2subscript𝐡2B_{2} satisfy (c).

First, let us show 𝔼p∈B1log​𝔼pβ€²βˆˆB1log​Φ​(p,pβ€²)β©½Ξ·subscriptsuperscript𝔼𝑝subscript𝐡1subscriptsuperscript𝔼superscript𝑝′subscript𝐡1Φ𝑝superscriptπ‘β€²πœ‚\mathbb{E}^{\log}_{p\in B_{1}}\mathbb{E}^{\log}_{p^{\prime}\in B_{1}}\Phi(p,p^{\prime})\leqslant\eta. Since B1subscript𝐡1B_{1} consists only of primes, we have Φ​(p,pβ€²)=0Φ𝑝superscript𝑝′0\Phi(p,p^{\prime})=0 unless p=p′𝑝superscript𝑝′p=p^{\prime}. Therefore

𝔼logp∈B1​𝔼logpβ€²βˆˆB1​Φ​(p,pβ€²)=βˆ‘p,pβ€²βˆˆB1Φ​(p,pβ€²)p​pβ€²βˆ‘p,pβ€²βˆˆB11p​pβ€²β©½βˆ‘p∈B11pβˆ‘p,pβ€²βˆˆB11p​pβ€²=1βˆ‘p∈B11p.𝑝subscript𝐡1superscript𝔼logsuperscript𝑝′subscript𝐡1superscript𝔼logΦ𝑝superscript𝑝′subscript𝑝superscript𝑝′subscript𝐡1Φ𝑝superscript𝑝′𝑝superscript𝑝′subscript𝑝superscript𝑝′subscript𝐡11𝑝superscript𝑝′subscript𝑝subscript𝐡11𝑝subscript𝑝superscript𝑝′subscript𝐡11𝑝superscript𝑝′1subscript𝑝subscript𝐡11𝑝\displaystyle\underset{p\in B_{1}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{p^{\prime}\in B_{1}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\Phi(p,p^{\prime})\,=\,\frac{\sum_{p,p^{\prime}\in B_{1}}\frac{\Phi(p,p^{\prime})}{pp^{\prime}}}{\sum_{p,p^{\prime}\in B_{1}}\frac{1}{pp^{\prime}}}\,\leqslant\,\frac{\sum_{p\in B_{1}}\frac{1}{p}}{\sum_{p,p^{\prime}\in B_{1}}\frac{1}{pp^{\prime}}}\,=\,\frac{1}{\sum_{p\in B_{1}}\frac{1}{p}}. (3.9)

Note, in view of (3.8) we have βˆ‘p∈B11/p=βˆ‘nβ†’βˆˆA1×…×Akβˆ‘p∈Qnβ†’1/psubscript𝑝subscript𝐡11𝑝subscript→𝑛subscript𝐴1…subscriptπ΄π‘˜subscript𝑝subscript𝑄→𝑛1𝑝\sum_{p\in B_{1}}1/p=\sum_{\vec{n}\in A_{1}\times\ldots\times A_{k}}\sum_{p\in Q_{\vec{n}}}1/p and since any element in Qnβ†’subscript𝑄→𝑛Q_{\vec{n}} is smaller than 8n1+…+nk+k+1superscript8subscript𝑛1…subscriptπ‘›π‘˜π‘˜18^{n_{1}+\ldots+n_{k}+k+1} we have

βˆ‘p∈B11pβ©Ύβˆ‘nβ†’=(n1,…,nk)∈A1×…×Ak|Qnβ†’|8k+1​8n1+…+nk.subscript𝑝subscript𝐡11𝑝subscript→𝑛subscript𝑛1…subscriptπ‘›π‘˜subscript𝐴1…subscriptπ΄π‘˜subscript𝑄→𝑛superscript8π‘˜1superscript8subscript𝑛1…subscriptπ‘›π‘˜\sum_{p\in B_{1}}\text{\small$\dfrac{1}{p}$}\,\geqslant\,\sum_{\vec{n}=(n_{1},\ldots,n_{k})\in A_{1}\times\ldots\times A_{k}}\text{\small$\dfrac{|Q_{\vec{n}}|}{8^{k+1}8^{n_{1}+\ldots+n_{k}}}$}.

Then, using (3.6) and |Qnβ†’|=|Pz1,n→⋅…⋅Pzk,nβ†’|subscript𝑄→𝑛⋅subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›|Q_{\vec{n}}|=|P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}}| we can estimate |Qnβ†’|β©ΎDk​8βˆ’k​8n1+…+nk/(n1⋅…⋅nk)subscript𝑄→𝑛superscriptπ·π‘˜superscript8π‘˜superscript8subscript𝑛1…subscriptπ‘›π‘˜β‹…subscript𝑛1…subscriptπ‘›π‘˜|Q_{\vec{n}}|\geqslant D^{k}8^{-k}8^{n_{1}+\ldots+n_{k}}/(n_{1}\cdot\ldots\cdot n_{k}), which implies

βˆ‘p∈B11pβ©Ύβˆ‘(n1,…,nk)∈A1×…×AkDk82​k+1​n1⋅…⋅nkβ©ΎDk​Nk82​k+1.subscript𝑝subscript𝐡11𝑝subscriptsubscript𝑛1…subscriptπ‘›π‘˜subscript𝐴1…subscriptπ΄π‘˜superscriptπ·π‘˜β‹…superscript82π‘˜1subscript𝑛1…subscriptπ‘›π‘˜superscriptπ·π‘˜superscriptπ‘π‘˜superscript82π‘˜1\displaystyle\sum_{p\in B_{1}}\text{\small$\dfrac{1}{p}$}\,\geqslant\,\sum_{(n_{1},\ldots,n_{k})\in A_{1}\times\ldots\times A_{k}}\text{\small$\dfrac{D^{k}}{8^{2k+1}\,n_{1}\cdot\ldots\cdot n_{k}}$}\,\geqslant\,\text{\small$\dfrac{D^{k}N^{k}}{8^{2k+1}}$}. (3.10)

Thus, if N𝑁N was chosen sufficiently large, then 𝔼p∈B1log​𝔼pβ€²βˆˆB1log​Φ​(p,pβ€²)β©½Ξ·subscriptsuperscript𝔼𝑝subscript𝐡1subscriptsuperscript𝔼superscript𝑝′subscript𝐡1Φ𝑝superscriptπ‘β€²πœ‚\mathbb{E}^{\log}_{p\in B_{1}}\mathbb{E}^{\log}_{p^{\prime}\in B_{1}}\Phi(p,p^{\prime})\leqslant\eta follows from combining (3.9) and (3.10).

Next, let us show 𝔼q∈B2log​𝔼qβ€²βˆˆB2log​Φ​(q,qβ€²)β©½Ξ·subscriptsuperscriptπ”Όπ‘žsubscript𝐡2subscriptsuperscript𝔼superscriptπ‘žβ€²subscript𝐡2Ξ¦π‘žsuperscriptπ‘žβ€²πœ‚\mathbb{E}^{\log}_{q\in B_{2}}\mathbb{E}^{\log}_{q^{\prime}\in B_{2}}\Phi(q,q^{\prime})\leqslant\eta. In view of (3.7) we have

βˆ‘q,qβ€²βˆˆB2Φ​(q,qβ€²)q​qβ€²=βˆ‘nβ†’,nβ†’β€²βˆˆA1×…×Akβˆ‘q∈Pz1,n→​⋯​Pzk,nβ†’βˆ‘qβ€²βˆˆPz1,n→′​⋯​Pzk,n→′Φ​(q,qβ€²)q​qβ€².subscriptπ‘žsuperscriptπ‘žβ€²subscript𝐡2Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²subscript→𝑛superscript→𝑛′subscript𝐴1…subscriptπ΄π‘˜subscriptπ‘žsubscript𝑃subscript𝑧1→𝑛⋯subscript𝑃subscriptπ‘§π‘˜β†’π‘›subscriptsuperscriptπ‘žβ€²subscript𝑃subscript𝑧1superscript→𝑛′⋯subscript𝑃subscriptπ‘§π‘˜superscriptβ†’π‘›β€²Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²\displaystyle\sum_{q,q^{\prime}\in B_{2}}\text{\small$\dfrac{\Phi(q,q^{\prime})}{qq^{\prime}}$}\leavevmode\nobreak\ =\leavevmode\nobreak\ \sum_{\vec{n},\vec{n}^{\prime}\in A_{1}\times\ldots\times A_{k}}\leavevmode\nobreak\ \leavevmode\nobreak\ \sum_{q\in P_{z_{1,\vec{n}}}\cdots P_{z_{k,\vec{n}}}}\leavevmode\nobreak\ \leavevmode\nobreak\ \sum_{q^{\prime}\in P_{z_{1,\vec{n}^{\prime}}}\cdots P_{z_{k,\vec{n}^{\prime}}}}\text{\small$\dfrac{\Phi(q,q^{\prime})}{qq^{\prime}}$}. (3.11)

If q∈Pz1,n→⋅…⋅Pzk,nβ†’π‘žβ‹…subscript𝑃subscript𝑧1→𝑛…subscript𝑃subscriptπ‘§π‘˜β†’π‘›q\in P_{z_{1,\vec{n}}}\cdot\ldots\cdot P_{z_{k,\vec{n}}} and qβ€²βˆˆPz1,n→′⋅…⋅Pzk,nβ†’β€²superscriptπ‘žβ€²β‹…subscript𝑃subscript𝑧1superscript→𝑛′…subscript𝑃subscriptπ‘§π‘˜superscript→𝑛′q^{\prime}\in P_{z_{1,\vec{n}^{\prime}}}\cdot\ldots\cdot P_{z_{k,\vec{n}^{\prime}}} are coprime then Φ​(q,qβ€²)=0Ξ¦π‘žsuperscriptπ‘žβ€²0\Phi(q,q^{\prime})=0. Hence, such a pair does not contribute to (3.11). On the other hand, if qπ‘žq and qβ€²superscriptπ‘žβ€²q^{\prime} are not coprime then there must exist a finite non-empty set FβŠ‚{1,…,k}𝐹1β€¦π‘˜F\subset\{1,\ldots,k\}, a number u∈∏i∈FPzi,n→′𝑒subscriptproduct𝑖𝐹subscript𝑃subscript𝑧𝑖superscript→𝑛′u\in\prod_{i\in F}P_{z_{i,\vec{n}^{\prime}}}, and a number uβ€²βˆˆβˆiβˆ‰FPzi,nβ†’β€²superscript𝑒′subscriptproduct𝑖𝐹subscript𝑃subscript𝑧𝑖superscript→𝑛′u^{\prime}\in\prod_{i\notin F}P_{z_{i,\vec{n}^{\prime}}} such that qβ€²=u​uβ€²superscriptπ‘žβ€²π‘’superscript𝑒′q^{\prime}=uu^{\prime} and gcd⁑(q,qβ€²)=uπ‘žsuperscriptπ‘žβ€²π‘’\gcd(q,q^{\prime})=u; note that this only happens when ni=niβ€²subscript𝑛𝑖superscriptsubscript𝑛𝑖′n_{i}=n_{i}^{\prime} for all i∈F𝑖𝐹i\in F because Pzi,nβ†’subscript𝑃subscript𝑧𝑖→𝑛P_{z_{i,\vec{n}}} and Pzi,nβ†’β€²subscript𝑃subscript𝑧𝑖superscript→𝑛′P_{z_{i,\vec{n}^{\prime}}} are disjoint otherwise. In this case, we have

Φ​(q,qβ€²)q​qβ€²=uβˆ’1q​qβ€²β©½1q​uβ€²,Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²π‘’1π‘žsuperscriptπ‘žβ€²1π‘žsuperscript𝑒′\frac{\Phi(q,q^{\prime})}{qq^{\prime}}\,=\,\frac{u-1}{qq^{\prime}}\,\leqslant\,\frac{1}{qu^{\prime}},

and therefore222In the published version of this paper the restriction ni=niβ€²subscript𝑛𝑖superscriptsubscript𝑛𝑖′n_{i}=n_{i}^{\prime}, i∈F𝑖𝐹i\in F, in the subscript of the second summation on the right hand side of equation (3.12) is missing, leading to a mistake that carries through the rest of the argument. This is corrected here. We thank Corentin Darreye for bringing this issue to our attention.

βˆ‘q,qβ€²βˆˆB2Φ​(q,qβ€²)q​qβ€²subscriptπ‘žsuperscriptπ‘žβ€²subscript𝐡2Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²\displaystyle\sum_{q,q^{\prime}\in B_{2}}\text{\small$\dfrac{\Phi(q,q^{\prime})}{qq^{\prime}}$} β©½βˆ‘FβŠ‚{1,…,k}Fβ‰ βˆ…βˆ‘nβ†’,nβ†’β€²βˆˆA1×…×Akni=niβ€²,i∈Fβˆ‘q∈Pz1,n→​⋯​Pzk,nβ†’βˆ‘uβ€²βˆˆβˆiβˆ‰FPzi,nβ†’β€²1q​uβ€².absentsubscriptFRACOP𝐹1β€¦π‘˜πΉsubscriptFRACOP→𝑛superscript→𝑛′subscript𝐴1…subscriptπ΄π‘˜formulae-sequencesubscript𝑛𝑖superscriptsubscript𝑛𝑖′𝑖𝐹subscriptπ‘žsubscript𝑃subscript𝑧1→𝑛⋯subscript𝑃subscriptπ‘§π‘˜β†’π‘›subscriptsuperscript𝑒′subscriptproduct𝑖𝐹subscript𝑃subscript𝑧𝑖superscript→𝑛′1π‘žsuperscript𝑒′\displaystyle\leavevmode\nobreak\ \leqslant\leavevmode\nobreak\ \sum_{F\subset\{1,\ldots,k\}\atop F\neq\emptyset}\leavevmode\nobreak\ \leavevmode\nobreak\ \sum_{\vec{n},\vec{n}^{\prime}\in A_{1}\times\ldots\times A_{k}\atop{n_{i}=n_{i}^{\prime},\leavevmode\nobreak\ i\in F}}\leavevmode\nobreak\ \leavevmode\nobreak\ \sum_{q\in P_{z_{1,\vec{n}}}\cdots P_{z_{k,\vec{n}}}}\leavevmode\nobreak\ \leavevmode\nobreak\ \sum_{u^{\prime}\in\prod_{i\notin F}P_{z_{i,\vec{n}^{\prime}}}}\text{\small$\dfrac{1}{qu^{\prime}}$}. (3.12)

Next, we can use Pzi,nβ†’βŠ‚(8ni,8ni+1]subscript𝑃subscript𝑧𝑖→𝑛superscript8subscript𝑛𝑖superscript8subscript𝑛𝑖1P_{z_{i,\vec{n}}}\subset(8^{n_{i}},8^{n_{i}+1}] and Pzi,nβ†’β€²βŠ‚(8niβ€²,8niβ€²+1]subscript𝑃subscript𝑧𝑖superscript→𝑛′superscript8superscriptsubscript𝑛𝑖′superscript8superscriptsubscript𝑛𝑖′1P_{z_{i,\vec{n}^{\prime}}}\subset(8^{n_{i}^{\prime}},8^{n_{i}^{\prime}+1}] to deduce that

βˆ‘q∈Pz1,n→​⋯​Pzk,nβ†’βˆ‘uβ€²βˆˆβˆiβˆ‰FPzi,nβ†’β€²1q​uβ€²subscriptπ‘žsubscript𝑃subscript𝑧1→𝑛⋯subscript𝑃subscriptπ‘§π‘˜β†’π‘›subscriptsuperscript𝑒′subscriptproduct𝑖𝐹subscript𝑃subscript𝑧𝑖superscript→𝑛′1π‘žsuperscript𝑒′\displaystyle\sum_{q\in P_{z_{1,\vec{n}}}\cdots P_{z_{k,\vec{n}}}}\leavevmode\nobreak\ \sum_{u^{\prime}\in\prod_{i\notin F}P_{z_{i,\vec{n}^{\prime}}}}\text{\small$\dfrac{1}{qu^{\prime}}$} β©½βˆ‘q∈Pz1,n→​⋯​Pzk,nβ†’βˆ‘uβ€²βˆˆβˆiβˆ‰FPzi,nβ†’β€²(∏i=1k18ni)​(∏iβˆ‰F18niβ€²)absentsubscriptπ‘žsubscript𝑃subscript𝑧1→𝑛⋯subscript𝑃subscriptπ‘§π‘˜β†’π‘›subscriptsuperscript𝑒′subscriptproduct𝑖𝐹subscript𝑃subscript𝑧𝑖superscript→𝑛′superscriptsubscriptproduct𝑖1π‘˜1superscript8subscript𝑛𝑖subscriptproduct𝑖𝐹1superscript8superscriptsubscript𝑛𝑖′\displaystyle\,\leqslant\,\sum_{q\in P_{z_{1,\vec{n}}}\cdots P_{z_{k,\vec{n}}}}\leavevmode\nobreak\ \sum_{u^{\prime}\in\prod_{i\notin F}P_{z_{i,\vec{n}^{\prime}}}}\left(\prod_{i=1}^{k}\text{\small$\dfrac{1}{8^{n_{i}}}$}\right)\left(\prod_{i\notin F}\text{\small$\dfrac{1}{8^{n_{i}^{\prime}}}$}\right)
=(∏i=1k|Pzi,nβ†’|8ni)​(∏iβˆ‰F|Pzi,nβ†’β€²|8niβ€²).absentsuperscriptsubscriptproduct𝑖1π‘˜subscript𝑃subscript𝑧𝑖→𝑛superscript8subscript𝑛𝑖subscriptproduct𝑖𝐹subscript𝑃subscript𝑧𝑖superscript→𝑛′superscript8superscriptsubscript𝑛𝑖′\displaystyle\,=\,\left(\prod_{i=1}^{k}\text{\small$\dfrac{|P_{z_{i,\vec{n}}}|}{8^{n_{i}}}$}\right)\left(\prod_{i\notin F}\text{\small$\dfrac{|P_{z_{i,\vec{n}^{\prime}}}|}{8^{n_{i}^{\prime}}}$}\right).

Thereafter, using (3.6), we get

(∏i=1k|Pzi,nβ†’|8ni)​(∏iβˆ‰F|Pzi,nβ†’β€²|8niβ€²)β©½(∏i=1kDni)​(∏iβˆ‰FDniβ€²).superscriptsubscriptproduct𝑖1π‘˜subscript𝑃subscript𝑧𝑖→𝑛superscript8subscript𝑛𝑖subscriptproduct𝑖𝐹subscript𝑃subscript𝑧𝑖superscript→𝑛′superscript8superscriptsubscript𝑛𝑖′superscriptsubscriptproduct𝑖1π‘˜π·subscript𝑛𝑖subscriptproduct𝑖𝐹𝐷superscriptsubscript𝑛𝑖′\left(\prod_{i=1}^{k}\text{\small$\dfrac{|P_{z_{i,\vec{n}}}|}{8^{n_{i}}}$}\right)\left(\prod_{i\notin F}\text{\small$\dfrac{|P_{z_{i,\vec{n}^{\prime}}}|}{8^{n_{i}^{\prime}}}$}\right)\,\leqslant\,\left(\prod_{i=1}^{k}\text{\small$\dfrac{D}{n_{i}}$}\right)\left(\prod_{i\notin F}\text{\small$\dfrac{D}{n_{i}^{\prime}}$}\right).

All together, this implies that

βˆ‘q,qβ€²βˆˆB2Φ​(q,qβ€²)q​qβ€²subscriptπ‘žsuperscriptπ‘žβ€²subscript𝐡2Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²\displaystyle\sum_{q,q^{\prime}\in B_{2}}\text{\small$\dfrac{\Phi(q,q^{\prime})}{qq^{\prime}}$} β©½βˆ‘FβŠ‚{1,…,k}Fβ‰ βˆ…βˆ‘nβ†’,nβ†’β€²βˆˆA1×…×Akni=niβ€²,i∈F(∏i=1kDni)​(∏iβˆ‰FDniβ€²)absentsubscriptFRACOP𝐹1β€¦π‘˜πΉsubscriptFRACOP→𝑛superscript→𝑛′subscript𝐴1…subscriptπ΄π‘˜formulae-sequencesubscript𝑛𝑖superscriptsubscript𝑛𝑖′𝑖𝐹superscriptsubscriptproduct𝑖1π‘˜π·subscript𝑛𝑖subscriptproduct𝑖𝐹𝐷superscriptsubscript𝑛𝑖′\displaystyle\leavevmode\nobreak\ \leqslant\leavevmode\nobreak\ \sum_{F\subset\{1,\ldots,k\}\atop F\neq\emptyset}\leavevmode\nobreak\ \leavevmode\nobreak\ \sum_{\vec{n},\vec{n}^{\prime}\in A_{1}\times\ldots\times A_{k}\atop{n_{i}=n_{i}^{\prime},\leavevmode\nobreak\ i\in F}}\leavevmode\nobreak\ \leavevmode\nobreak\ \left(\prod_{i=1}^{k}\text{\small$\dfrac{D}{n_{i}}$}\right)\left(\prod_{i\notin F}\text{\small$\dfrac{D}{n_{i}^{\prime}}$}\right)
=βˆ‘FβŠ‚{1,…,k}Fβ‰ βˆ…D2​kβˆ’|F|​(∏i=1kβˆ‘n∈Ai1ni)​(∏iβˆ‰Fβˆ‘n∈Ai1niβ€²).absentsubscriptFRACOP𝐹1β€¦π‘˜πΉsuperscript𝐷2π‘˜πΉsuperscriptsubscriptproduct𝑖1π‘˜subscript𝑛subscript𝐴𝑖1subscript𝑛𝑖subscriptproduct𝑖𝐹subscript𝑛subscript𝐴𝑖1superscriptsubscript𝑛𝑖′\displaystyle\leavevmode\nobreak\ =\leavevmode\nobreak\ \sum_{F\subset\{1,\ldots,k\}\atop F\neq\emptyset}\leavevmode\nobreak\ {D^{2k-|F|}}\left(\prod_{i=1}^{k}\sum_{n\in A_{i}}\text{\small$\dfrac{1}{n_{i}}$}\right)\left(\prod_{i\notin F}\sum_{n\in A_{i}}\text{\small$\dfrac{1}{n_{i}^{\prime}}$}\right).

A calculation similar to (3.10) shows that βˆ‘q∈B21qβ©ΎDk83​kβ€‹βˆi=1kβˆ‘n∈Ai1nsubscriptπ‘žsubscript𝐡21π‘žsuperscriptπ·π‘˜superscript83π‘˜superscriptsubscriptproduct𝑖1π‘˜subscript𝑛subscript𝐴𝑖1𝑛\sum_{q\in B_{2}}\frac{1}{q}\geqslant\frac{D^{k}}{8^{3k}}\prod_{i=1}^{k}\sum_{n\in A_{i}}\frac{1}{n}. Putting everything together gives

𝔼logq∈B2​𝔼logqβ€²βˆˆB2​Φ​(q,qβ€²)π‘žsubscript𝐡2superscript𝔼logsuperscriptπ‘žβ€²subscript𝐡2superscript𝔼logΞ¦π‘žsuperscriptπ‘žβ€²\displaystyle\underset{q\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\,\underset{q^{\prime}\in B_{2}}{{{\mathbb{E}}}^{\text{\footnotesize log}}}\,\Phi(q,q^{\prime}) =βˆ‘q,qβ€²βˆˆB2Φ​(q,qβ€²)q​qβ€²βˆ‘q,qβ€²βˆˆB21q​qβ€²absentsubscriptπ‘žsuperscriptπ‘žβ€²subscript𝐡2Ξ¦π‘žsuperscriptπ‘žβ€²π‘žsuperscriptπ‘žβ€²subscriptπ‘žsuperscriptπ‘žβ€²subscript𝐡21π‘žsuperscriptπ‘žβ€²\displaystyle\,=\,\frac{\sum_{q,q^{\prime}\in B_{2}}\frac{\Phi(q,q^{\prime})}{qq^{\prime}}}{\sum_{q,q^{\prime}\in B_{2}}\frac{1}{qq^{\prime}}}
β©½βˆ‘FβŠ‚{1,…,k},Fβ‰ βˆ…D2​kβˆ’|F|​(∏i=1kβˆ‘n∈Ai1n)​(∏iβˆ‰Fβˆ‘n∈Ai1n)(Dk83​kβ€‹βˆi=1kβˆ‘n∈Ai1n)​(Dk83​kβ€‹βˆi=1kβˆ‘n∈Ai1n)absentsubscriptformulae-sequence𝐹1β€¦π‘˜πΉsuperscript𝐷2π‘˜πΉsuperscriptsubscriptproduct𝑖1π‘˜subscript𝑛subscript𝐴𝑖1𝑛subscriptproduct𝑖𝐹subscript𝑛subscript𝐴𝑖1𝑛superscriptπ·π‘˜superscript83π‘˜superscriptsubscriptproduct𝑖1π‘˜subscript𝑛subscript𝐴𝑖1𝑛superscriptπ·π‘˜superscript83π‘˜superscriptsubscriptproduct𝑖1π‘˜subscript𝑛subscript𝐴𝑖1𝑛\displaystyle\,\leqslant\,\frac{\sum_{F\subset\{1,\ldots,k\},\,F\neq\emptyset}\leavevmode\nobreak\ {D^{2k-|F|}}\left(\prod_{i=1}^{k}\sum_{n\in A_{i}}\frac{1}{n}\right)\left(\prod_{i\notin F}\sum_{n\in A_{i}}\frac{1}{n}\right)}{\left(\frac{D^{k}}{8^{3k}}\prod_{i=1}^{k}\sum_{n\in A_{i}}\frac{1}{n}\right)\left(\frac{D^{k}}{8^{3k}}\prod_{i=1}^{k}\sum_{n\in A_{i}}\frac{1}{n}\right)}
β©½βˆ‘FβŠ‚{1,…,k}Fβ‰ βˆ…86​k​Dβˆ’|F|(∏i∈Fβˆ‘n∈Ai1n).absentsubscriptFRACOP𝐹1β€¦π‘˜πΉsuperscript86π‘˜superscript𝐷𝐹subscriptproduct𝑖𝐹subscript𝑛subscript𝐴𝑖1𝑛\displaystyle\,\leqslant\,\sum_{F\subset\{1,\ldots,k\}\atop F\neq\emptyset}\frac{{8^{6k}D^{-|F|}}}{\left(\prod_{i\in F}\sum_{n\in A_{i}}\frac{1}{n}\right)}.

Finally, since βˆ‘n∈Ai1/nβ©ΎNsubscript𝑛subscript𝐴𝑖1𝑛𝑁\sum_{n\in A_{i}}1/n\geqslant N, we see that 𝔼q∈B2log​𝔼qβ€²βˆˆB2log​Φ​(q,qβ€²)β©½Ξ·subscriptsuperscriptπ”Όπ‘žsubscript𝐡2subscriptsuperscript𝔼superscriptπ‘žβ€²subscript𝐡2Ξ¦π‘žsuperscriptπ‘žβ€²πœ‚\mathbb{E}^{\log}_{q\in B_{2}}\mathbb{E}^{\log}_{q^{\prime}\in B_{2}}\Phi(q,q^{\prime})\leqslant\eta as long as N𝑁N was chosen sufficiently large. ∎

This completes the proof of 2.2. ∎

Acknowledgments:

The author thanks Vitaly Bergelson and Redmond McNamara for commenting on an earlier version of this paper, and the anonymous referee for providing numerous helpful suggestions. This work was supported by the National Science Foundation under grant number DMS 1901453.

References

  • [Apo00] T. M. Apostol, A centennial history of the prime number theorem, in Number theory, Trends Math., BirkhΓ€user, Basel, 2000, pp. 1–14. https://doi.org/10.1007/978-3-0348-7023-8_1.
  • [BR20] V. Bergelson and F. K. Richter, Dynamical generalizations of the Prime Number Theorem and disjointness of additive and multiplicative semigroup actions, ArXiv e-prints (2020). Available at https://arxiv.org/abs/2002.03498.
  • [Dab75] H. Daboussi, Fonctions multiplicatives presque pΓ©riodiques B, in JournΓ©es ArithmΓ©tiques de Bordeaux (Conf., Univ. Bordeaux, Bordeaux, 1974), tome 24–25, Soc. Math. France, Paris, 1975, pp. 321–324. Available at http://www.numdam.org/item/AST_1975__24-25__321_0.
  • [Dab84] H. Daboussi, Sur le thΓ©orΓ¨me des nombres premiers, C. R. Acad. Sci. Paris SΓ©r. I Math. 298 no. 8 (1984), 161–164.
  • [Dab89] H. Daboussi, On the prime number theorem for arithmetic progressions, J. Number Theory 31 no. 3 (1989), 243–254. https://doi.org/10.1016/0022-314X(89)90071-1.
  • [Del58] H. Delange, On some arithmetical functions, Illinois J. Math. 2 (1958), 81–87. Available at http://projecteuclid.org/euclid.ijm/1255380835.
  • [Dia82] H. G. Diamond, Elementary methods in the study of the distribution of prime numbers, Bull. Amer. Math. Soc. (N.S.) 7 no. 3 (1982), 553–589. https://doi.org/10.1090/S0273-0979-1982-15057-1.
  • [Ell79] P. D. T. A. Elliott, Probabilistic number theory. I, Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Science] 239, Springer-Verlag, New York-Berlin, 1979, Mean-value theorems. https://doi.org/10.1007/978-1-4612-9989-9.
  • [Erd46] P. ErdΕ‘s, On the distribution function of additive functions, Ann. of Math. (2) 47 (1946), 1–20. https://doi.org/10.2307/1969031.
  • [Erd49] P. ErdΕ‘s, On a new method in elementary number theory which leads to an elementary proof of the prime number theorem, Proc. Nat. Acad. Sci. U.S.A. 35 (1949), 374–384. https://doi.org/10.1073/pnas.35.7.374.
  • [Gol04] D. Goldfeld, The elementary proof of the prime number theorem: an historical perspective, in Number theory (New York, 2003), Springer, New York, 2004, pp. 179–192. https://doi.org/10.1007/978-1-4419-9060-0_10.
  • [Gol73a] L. J. Goldstein, Correction to: β€œA history of the prime number theorem” (Amer. Math. Monthly 80 (1973), 599–615), Amer. Math. Monthly 80 (1973), 1115. https://doi.org/10.2307/2318546.
  • [Gol73b] L. J. Goldstein, A history of the prime number theorem, Amer. Math. Monthly 80 (1973), 599–615. https://doi.org/10.2307/2319162.
  • [Gra10] A. Granville, Different approaches to the distribution of primes, Milan J. Math. 78 no. 1 (2010), 65–84. https://doi.org/10.1007/s00032-010-0122-7.
  • [Hil86] A. Hildebrand, The prime number theorem via the large sieve, Mathematika 33 no. 1 (1986), 23–30. https://doi.org/10.1112/S002557930001384X.
  • [KΓ‘t86] I. KΓ‘tai, A remark on a theorem of H. Daboussi, Acta Math. Hungar. 47 no. 1-2 (1986), 223–225. https://doi.org/10.1007/BF01949145.
  • [Lan99] E. Landau, Neuer Beweis der Gleichung βˆ‘k=1βˆžΞΌβ€‹(k)ksuperscriptsubscriptπ‘˜1πœ‡π‘˜π‘˜\sum_{k=1}^{\infty}\frac{\mu(k)}{k}, Ph.D. thesis, Friedrich-Wilhelms-IniversitΓ€t zu Berlin, 1899. Available at http://resolver.sub.uni-goettingen.de/purl?PPN317979566.
  • [Lan09] E. Landau, Handbuch der Lehre von der Verteilung der Primzahlen - Zweiter Band, 2, Leipzig Und Berlin: B. G. Teubner, 1909. Available at http://name.umdl.umich.edu/ABV2766.0001.001.
  • [Lan11] E. Landau, Über die Γ€quivalenz zweier HauptsΓ€tze der analytischen Zahlentheorie, Sitzungsberichte der Kaiserlichen Akadamie der Wissenschaften in Wien, Mathematisch-Naturwissenschaftliche Klasse 120 no. 2a (1911), 1–16.
  • [Lev69] N. Levinson, A motivated account of an elementary proof of the prime number theorem, Amer. Math. Monthly 76 (1969), 225–245. https://doi.org/10.2307/2316361.
  • [vM97] H. von Mangoldt, Beweis der Gleichung βˆ‘k=1βˆžΞΌβ€‹(k)k=0superscriptsubscriptπ‘˜1πœ‡π‘˜π‘˜0\sum_{k=1}^{\infty}\frac{\mu(k)}{k}=0, Sitzungsberichte der KΓΆniglich Preussischen Akademie der Wissenschaften zu Berlin, Zweiter Halbband – Juli bis Dezember (1897), 835–852. Available at https://biodiversitylibrary.org/page/29982413.
  • [McN20] R. McNamara, A dynamical proof of the Prime Number Theorem, ArXiv e-prints (2020). Available at https://arxiv.org/abs/2002.04007.
  • [Mon71] H. L. Montgomery, Topics in multiplicative number theory, Lecture Notes in Mathematics, Vol. 227, Springer-Verlag, Berlin-New York, 1971.
  • [New80] D. J. Newman, Simple analytic proof of the prime number theorem, Amer. Math. Monthly 87 no. 9 (1980), 693–696. https://doi.org/10.2307/2321853.
  • [Pil40] S. S. Pillai, Generalisation of a theorem of Mangoldt, Proc. Indian Acad. Sci., Sect. A. 11 (1940), 13–20.
  • [Sel49] A. Selberg, An elementary proof of the prime-number theorem, Ann. of Math. (2) 50 (1949), 305–313. https://doi.org/10.2307/1969455.
  • [Sel39] S. Selberg, Zur Theorie der quadratfreien Zahlen, Math. Z. 44 no. 1 (1939), 306–318. https://doi.org/10.1007/BF01210655.
  • [Sha50] H. N. Shapiro, On a theorem of Selberg and generalizations, Ann. of Math. (2) 51 (1950), 485–497. https://doi.org/10.2307/1969336.
  • [SG09] J. Spencer and R. Graham, The elementary proof of the prime number theorem, Math. Intelligencer 31 no. 3 (2009), 18–23, With a note on the controversy by E. G. Straus and a postscript by Carl Pomerance. https://doi.org/10.1007/s00283-009-9063-9.
  • [Ten95] G. Tenenbaum, Introduction to analytic and probabilistic number theory, Cambridge Studies in Advanced Mathematics 46, Cambridge University Press, Cambridge, 1995, Translated from the second French edition (1995) by C. B. Thomas.
  • [TMF00] G. Tenenbaum and M. MendΓ¨s France, The prime numbers and their distribution, Student Mathematical Library 6, American Mathematical Society, Providence, RI, 2000, Translated from the 1997 French original by Philip G. Spain. https://doi.org/10.1090/stml/006.
  • [Zag97] D. Zagier, Newman’s short proof of the prime number theorem, Amer. Math. Monthly 104 no. 8 (1997), 705–708. https://doi.org/10.2307/2975232.

Florian K. Richter
Northwestern University
fkr@northwestern.edu