Divisibility


If you've never encountered any number theory before, AoPS is a good place to start.

Resources
		AoPS	Alcumus	practice problems, set focus to number theory!
		AoPS	Intro to NT	good book

Resources


IUSACO	13.1, 13.2 - Elementary Number Theory	module is based off this
David Altizio	Divisors and Divisibility
CPH	21.1 - Primes & Factors
PAPS	16.1, 16.2 - Number Theory
MONT	1, 3.1, and 3.2 - Divisors
AoPS	Number Theory	nice proofs and problems

Prime Factorization

A positive integer $a$ is called a divisor or a factor of a non-negative integer $b$ if $b$ is divisible by $a$ , which means that there exists some integer $k$ such that $b = ka$ . An integer $n > 1$ is prime if its only divisors are $1$ and $n$ . Integers greater than $1$ that are not prime are composite.

Every positive integer has a unique prime factorization: a way of decomposing it into a product of primes, as follows:

$$n = {p_1}^{a_1} {p_2}^{a_2} \cdots {p_k}^{a_k}$$

where the $p_i$ are distinct primes and the $a_i$ are positive integers.

Now, we will discuss how to find the prime factorization of any positive integer.

from typing import List


def factor(n: int) -> List[int]:
	ret = []
	i = 2
	while i * i <= n:
		while n % i == 0:
			ret.append(i)
			n //= i
		i += 1
	if n > 1:
		ret.append(n)
	return ret

This algorithm runs in $\mathcal{O}(\sqrt{n})$ time, because the outer loop checks divisibility for at most $\sqrt{n}$ values of $i$. Even though there is a second while loop nested inside it, each division of $n$ by $i$ shrinks $n$, so the outer loop runs fewer iterations, which actually speeds up the code.

Let's look at an example of how this algorithm works, for $n = 252$ .

$i$	$n$	$\texttt{ret}$
$2$	$252$	$\{\}$
$2$	$126$	$\{2\}$
$2$	$63$	$\{2, 2\}$
$3$	$21$	$\{2, 2, 3\}$
$3$	$7$	$\{2, 2, 3, 3\}$

At this point, the outer loop terminates, because $i = 3$ already exceeds $\lfloor \sqrt{7} \rfloor = 2$, so no later value of $i$ can satisfy $i \cdot i \le n$. In the last step, we add $7$ to the list of factors $\texttt{ret}$, because it otherwise won't be added, for a final prime factorization of $\{2, 2, 3, 3, 7\}$.
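To match the ${p_i}^{a_i}$ form used earlier, the flat list of primes can be grouped into exponents. The following is a minimal sketch: the helper name `factor_exponents` is an assumption made for illustration and is not part of the original module, and it repeats the trial-division loop so that it runs on its own.

```python
from typing import Dict


def factor_exponents(n: int) -> Dict[int, int]:
    """Map each prime factor of n to its exponent (hypothetical helper)."""
    exponents: Dict[int, int] = {}
    i = 2
    while i * i <= n:
        while n % i == 0:
            exponents[i] = exponents.get(i, 0) + 1
            n //= i
        i += 1
    if n > 1:
        # whatever is left over is a single prime factor
        exponents[n] = exponents.get(n, 0) + 1
    return exponents


print(factor_exponents(252))  # {2: 2, 3: 2, 7: 1}
```

This corresponds to $252 = 2^2 \cdot 3^2 \cdot 7$.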

Counting Divisors

CSES - Easy

Focus Problem – try your best to solve this problem before continuing!

Solution - Counting Divisors

The most straightforward solution is just to do what the problem asks us to do - for each $x$ , find the number of divisors of $x$ in $\mathcal{O}(\sqrt x)$ time.

Warning!

Due to Python's big constant factor, the following code TLEs on quite a few test cases.

ans = []
for _ in range(int(input())):
	div_num = 0
	x = int(input())
	i = 1
	while i * i <= x:
		if x % i == 0:
			div_num += 1 if i**2 == x else 2
		i += 1
	ans.append(div_num)

print("\n".join(str(i) for i in ans))

This solution runs in $\mathcal{O}(n \sqrt x)$ time, which is just fast enough to get AC in a faster language (as the warning above notes, not in Python). However, we can actually speed this up to get an $\mathcal{O}((x + n) \log x)$ solution!

First, let's discuss an important property of the prime factorization. Consider:

$$x = {p_1}^{a_1} {p_2}^{a_2} \cdots {p_k}^{a_k}$$

Then the number of divisors of $x$ is simply $(a_1 + 1) \cdot (a_2 + 1) \cdots (a_k + 1)$ .

Why is this true? The exponent of $p_i$ in any divisor of $x$ must be in the range $[0, a_i]$, and each different choice of exponents results in a different divisor, so each $p_i$ contributes a factor of $a_i + 1$ to the product.
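The formula can be checked against a direct count. The sketch below is only an illustration: the function names are assumptions, and it factorizes by plain trial division in $\mathcal{O}(\sqrt x)$ rather than with any faster method.

```python
def count_divisors(x: int) -> int:
    """Number of divisors of x via (a_1 + 1)(a_2 + 1)...(a_k + 1)."""
    result = 1
    p = 2
    while p * p <= x:
        exponent = 0
        while x % p == 0:
            exponent += 1
            x //= p
        # exponent is 0 when p does not divide x, which multiplies by 1
        result *= exponent + 1
        p += 1
    if x > 1:
        # one remaining prime with exponent 1
        result *= 2
    return result


def count_divisors_brute(x: int) -> int:
    """Direct count, used only to cross-check the formula."""
    return sum(1 for d in range(1, x + 1) if x % d == 0)


print(count_divisors(252))  # 18
```

Since $252 = 2^2 \cdot 3^2 \cdot 7$, the product is $(2 + 1)(2 + 1)(1 + 1) = 18$.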

$x$ can have $\mathcal{O}(\log x)$ distinct prime factors, so if we can find the prime factorization of $x$ efficiently, we can use it with the above property to answer queries in $\mathcal{O}(\log x)$ time instead of the previous $\mathcal{O}(\sqrt x)$ time.

Here's how we find the prime factorization of $x$ in $\mathcal{O}(\log x)$ time with $\mathcal{O}(x \log x)$ preprocessing:

For each $k \leq 10^6$, find any prime number that divides $k$. To find this, we can use the Sieve of Eratosthenes, which runs in $\mathcal{O}(n \log n)$, where $n$ is the largest number we consider. There's also a version of the sieve that runs in linear time, but we won't be needing it.
We can find the prime factorization of $x$ by repeatedly dividing it by the prime numbers we calculated earlier until $x = 1$ .

Using this method gives us the following code:

MAX_N = 10**6

# max_div[i] contains the largest prime that can go into i
max_div = [0 for _ in range(MAX_N + 1)]
for i in range(2, MAX_N + 1):
	if max_div[i] == 0:
		for j in range(i, MAX_N + 1, i):
			max_div[j] = i

ans = []
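A minimal sketch of how the query step might use such a table follows. It is not the module's own code: `LIMIT`, `build_max_div`, and `count_divisors_sieve` are names assumed for illustration, and the limit is kept small so the example runs quickly.

```python
from typing import List

LIMIT = 1000  # hypothetical small bound; the problem itself goes up to 10**6


def build_max_div(limit: int) -> List[int]:
    """max_div[i] is the largest prime dividing i (0 for i < 2)."""
    max_div = [0] * (limit + 1)
    for i in range(2, limit + 1):
        if max_div[i] == 0:  # no smaller prime marked i, so i is prime
            for j in range(i, limit + 1, i):
                max_div[j] = i
    return max_div


def count_divisors_sieve(x: int, max_div: List[int]) -> int:
    """Peel off one prime at a time using the precomputed table."""
    div_num = 1
    while x != 1:
        prime = max_div[x]
        count = 0
        while x % prime == 0:
            count += 1
            x //= prime
        div_num *= count + 1
    return div_num


max_div = build_max_div(LIMIT)
print(count_divisors_sieve(252, max_div))  # 18
```

Each pass of the outer loop removes one distinct prime completely, so a query performs $\mathcal{O}(\log x)$ divisions in total.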

GCD & LCM

GCD

The greatest common divisor (GCD) of two integers $a$ and $b$ is the largest integer that is a factor of both $a$ and $b$ . In order to find the GCD of two non-negative integers, we use the Euclidean Algorithm, which is as follows:

$$\gcd(a, b) = \begin{cases} a & b = 0 \\ \gcd(b, a \bmod b) & b \neq 0 \end{cases}$$

This algorithm can be implemented with a recursive function as follows:

def gcd(a: int, b: int) -> int:
	return a if b == 0 else gcd(b, a % b)

You won't have to actually implement this in-contest, as Python's built-in math module provides gcd and lcm functions (math.lcm requires Python 3.9 or later).

This function runs in $\mathcal{O}(\log ab)$ time because $a\le b \implies b \bmod a <\frac{b}{2}$, so after at most one initial swap the product of the two arguments at least halves on every call.

The worst-case scenario for the Euclidean algorithm is when $a$ and $b$ are consecutive Fibonacci numbers $F_n$ and $F_{n + 1}$ . In this case, the algorithm will calculate that $\gcd(F_n, F_{n + 1}) = \gcd(F_{n - 1}, F_n) = \dots = \gcd(0, F_1)$ . This takes a total of $n+1$ calls, which is proportional to $\log \left(F_n F_{n+1}\right)$ .
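The call count for consecutive Fibonacci numbers can be observed directly. The sketch below is an illustration under stated assumptions: `gcd_calls` is a hypothetical helper that mirrors the recursion iteratively and counts one call per pair of arguments, and Fibonacci numbers are indexed so that $F_1 = F_2 = 1$.

```python
def gcd_calls(a: int, b: int) -> int:
    """Number of calls the recursive Euclidean algorithm makes for (a, b)."""
    calls = 1  # the initial call
    while b != 0:
        a, b = b, a % b  # same step as gcd(b, a % b)
        calls += 1
    return calls


fib = [0, 1]  # fib[k] is F_k, with F_1 = F_2 = 1
while len(fib) < 32:
    fib.append(fib[-1] + fib[-2])

print(gcd_calls(fib[10], fib[11]))  # 11
```

Here $F_{10} = 55$ and $F_{11} = 89$, and the count matches the $n + 1$ calls stated above.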

LCM

The least common multiple (LCM) of two integers $a$ and $b$ is the smallest positive integer divisible by both $a$ and $b$. The LCM can be calculated with the GCD using this property:

$$\operatorname{lcm}(a, b) = \frac{a \cdot b}{\gcd(a, b)}$$

Warning!

Coding $\operatorname{lcm}$ as a * b / gcd(a, b) might cause integer overflow if the value of a * b is greater than the max size of the data type of a * b (e.g. the max size of int in C++ and Java is around 2 billion). Dividing a by gcd(a, b) first, then multiplying it by b will prevent integer overflow if the result fits in an int.

Also, these two functions are associative, meaning that if we want to take the GCD or LCM of more than two elements, we can do so two at a time, in any order. For example,

$$\gcd(a_1, a_2, a_3, a_4) = \gcd(a_1, \gcd(a_2, \gcd(a_3, a_4))).$$
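Because of associativity, folding a two-argument function over a list is enough. The following sketch assumes positive inputs; `lcm`, `gcd_all`, and `lcm_all` are illustrative names, and the two-argument `lcm` is written by hand so the example does not depend on `math.lcm`. Python integers do not overflow, but the divide-first order is kept to mirror what matters in fixed-width languages.

```python
from functools import reduce
from math import gcd
from typing import List


def lcm(a: int, b: int) -> int:
    """LCM of two positive integers, dividing before multiplying."""
    return a // gcd(a, b) * b


def gcd_all(values: List[int]) -> int:
    """GCD of a non-empty list, taken two elements at a time."""
    return reduce(gcd, values)


def lcm_all(values: List[int]) -> int:
    """LCM of a non-empty list, taken two elements at a time."""
    return reduce(lcm, values)


print(gcd_all([12, 18, 30, 42]))  # 6
print(lcm_all([4, 6, 10]))  # 60
```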

Euler's Totient Function

Resources
		cp-algo	Euler's Totient Function	Theory and Exercises
		CF	Euler's phi function, its properties, and how to compute it	Well-covered article

Properties

Euler's totient function, written using phi as $\phi(n)$, counts the number of positive integers in the interval $[1,n]$ which are coprime to $n$. Two numbers $a$ and $b$ are coprime if their greatest common divisor is equal to 1, i.e. $\gcd(a,b)=1$.

Here are the values of $\phi(n)$ for the first 20 numbers:

n	1	2	3	4	5	6	7	8	9	10	11	12	13	14	15	16	17	18	19	20
$\phi(n)$	1	1	2	2	4	2	6	4	6	4	10	4	12	6	8	8	16	6	18	8

The totient function is multiplicative, meaning that $\phi(nm)=\phi(n) \cdot \phi(m)$ whenever $n$ and $m$ are coprime, i.e. $\gcd(n, m)=1$. For example, $\phi(15)=\phi(3 \cdot 5)=\phi(3) \cdot \phi(5) = 2 \cdot 4 = 8$.
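Both the table and the multiplicative property can be checked straight from the definition. This is a brute-force sketch meant only for small $n$; the name `phi_brute` is an assumption for illustration.

```python
from math import gcd


def phi_brute(n: int) -> int:
    """Count the integers in [1, n] that are coprime to n."""
    return sum(1 for x in range(1, n + 1) if gcd(x, n) == 1)


print([phi_brute(n) for n in range(1, 11)])  # [1, 1, 2, 2, 4, 2, 6, 4, 6, 4]
print(phi_brute(15), phi_brute(3) * phi_brute(5))  # 8 8
print(phi_brute(4), phi_brute(2) * phi_brute(2))  # 2 1
```

The last line shows why coprimality is required: $2$ and $2$ are not coprime, and $\phi(4) \neq \phi(2) \cdot \phi(2)$.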

Let's take a look at some edge cases for $\phi(n)$ :

If $n$ is a prime number, then $\phi(n)=n-1$ because $\gcd(n, x)=1$ for all $1 \leq x < n$.
If $n$ is a power of a prime number, $n=p^q$ where $p$ is a prime number and $q \geq 1$, then exactly $p^{q-1}$ of the numbers in $[1, p^q]$ are divisible by $p$, so $\phi(p^q)=p^{q} - p^{q-1} = p^{q-1}(p - 1)$.

Using the multiplicative property and the last edge case we can compute the value of $\phi(n)$ from the factorization of number $n$ . Let the factorization be $n=p_1^{q_1} \cdot p_2^{q_2} \cdot \ldots \cdot p_k^{q_k}$ where $p_i$ is a prime factor of $n$ , then:

$$\phi(n)=\phi(p_1^{q_1}) \cdot \phi(p_2^{q_2}) \cdot \ldots \cdot \phi(p_k^{q_k}) = p_1^{q_1-1}(p_1 - 1) \cdot p_2^{q_2-1}(p_2 - 1) \cdot \ldots \cdot p_k^{q_k-1}(p_k - 1)$$

Below is an implementation that computes $\phi(n)$ by factorizing $n$ in $\mathcal{O}(\sqrt{n})$ time. It can be a little bit tricky to understand why we subtract $ans/p$ from $ans$. Suppose $ans=p^q \cdot x$, where $p$ is a prime factor and $x$ is the rest of the prime factorization. By subtracting $\frac{ans}{p}=p^{q-1} \cdot x$ we end up with $p^q \cdot x - p^{q-1} \cdot x = p^{q-1} \cdot x \cdot (p - 1)$, which is exactly the form of $\phi(n)$ described above.

def phi(n: int) -> int:
	ans = n
	p = 2
	while p * p <= n:
		if n % p == 0:
			while n % p == 0:
				n //= p
			ans -= ans // p
		p += 1

	if n > 1:
		ans -= ans // n

	return ans

In many problems we need the totient of every number between $1$ and $n$, and factorizing each number separately is not efficient. Instead, we can use the same idea as the Sieve of Eratosthenes: for each prime $i$, update every multiple of $i$. Since this is almost the same as the Sieve of Eratosthenes, the time complexity is $\mathcal{O}(N\log\log{N})$.

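MAX_N = 10**6
# phi[i] will hold the totient of i for 1 <= i < MAX_N
phi = [0] * MAX_N


def precompute():
	for i in range(1, MAX_N):
		phi[i] = i

	for i in range(2, MAX_N):
		# If i is prime, no smaller prime has reduced phi[i] yet
		if phi[i] == i:
			for j in range(i, MAX_N, i):
				phi[j] -= phi[j] // i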
GCDEX - GCD Extreme

SPOJ - Hard

Focus Problem – try your best to solve this problem before continuing!

Solution

We are asked to compute the sum of GCDs for all pairs $(i, j)$ such that $1 \le i < j \le n$ :

$$\sum_{i=1}^{n} \sum_{j=i+1}^{n} \gcd(i,j).$$

Let's define a helper function $f(n)$ which computes the sum of GCDs for a fixed second element $n$ :

$$f(n) = \sum_{i=1}^{n} \gcd(i, n).$$

This function sums $\gcd(i, n)$ for all $i \le n$ . The terms where $\gcd(i, n) = d$ are exactly those where $d$ divides $n$ and $\gcd(\frac{i}{d}, \frac{n}{d}) = 1$ . The number of such integers $i$ is given by Euler's totient function $\phi(\frac{n}{d})$ . Thus, we can rewrite $f(n)$ as a sum over the divisors of $n$ :

$$f(n) = \sum_{d|n} d \cdot \phi\left(\frac{n}{d}\right).$$

We can compute $f(n)$ for all $n$ up to $10^6$ efficiently. Instead of factoring every number, we iterate through every possible divisor $i$ and update all its multiples $j$ . For a fixed $i$ , we add the contribution $i \cdot \phi(\frac{j}{i})$ to $f(j)$ for all $j$ that are multiples of $i$ .

Finally, the problem asks for pairs where $i < j$ . The value $f(j)$ includes the case $i=j$ (where $\gcd(j, j) = j$ ), which we must exclude. The answer for a given $n$ is the prefix sum of these adjusted values:

$$\text{ans}[n] = \sum_{j=1}^{n} (f(j) - j).$$

The total complexity of this precomputation is bounded by the harmonic series sum: $\sum_{i=1}^{N} \frac{N}{i} \approx N \ln N$.
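The steps above can be sketched in Python as follows. This is an illustration rather than a submission-ready solution: `gcd_pair_sums` is an assumed name, the limit is kept small, and a real submission would precompute up to $10^6$ once and then answer each query from the table.

```python
from typing import List


def gcd_pair_sums(limit: int) -> List[int]:
    """ans[n] = sum of gcd(i, j) over 1 <= i < j <= n, for every n <= limit."""
    # totient sieve: phi[i] == i still holds exactly when i is prime
    phi = list(range(limit + 1))
    for i in range(2, limit + 1):
        if phi[i] == i:
            for j in range(i, limit + 1, i):
                phi[j] -= phi[j] // i

    # f[j] = sum over divisors i of j of i * phi(j / i)
    f = [0] * (limit + 1)
    for i in range(1, limit + 1):
        for j in range(i, limit + 1, i):
            f[j] += i * phi[j // i]

    # prefix sums of f(j) - j, which drops the i == j term of each f(j)
    ans = [0] * (limit + 1)
    for j in range(1, limit + 1):
        ans[j] = ans[j - 1] + f[j] - j
    return ans


ans = gcd_pair_sums(100)
print(ans[10])  # 67
```

The nested loop over $i$ and its multiples $j$ is where the harmonic-sum bound comes from.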

#include <bits/stdc++.h>
using namespace std;
using ll = long long;
const int MAX_N = 1e6;

ll phi[MAX_N + 1];
ll f[MAX_N + 1];
ll sum[MAX_N + 1];

int main() {

Problems

Source	Problem Name	Difficulty	Tags
AC	Div Game	Easy	Prime Factorization
CF	Product 1 Modulo N	Easy	Divisibility, Modular Arithmetic
CF	Power Products	Easy	NT
CF	Diluc and Kaeya	Easy	Divisibility
CSES	Permutation Rounds	Easy	Functional Graph, Prime Factorization
CSES	Common Divisors	Normal	Divisibility
CF	Orac and LCM	Normal	Prime Factorization
CC	Maximum of GCDs	Normal	Divisibility
CSES	Sum of Divisors	Hard	Divisibility
SPOJ	LCM Sum	Hard	Divisibility, Euler Totient, LCM
CF	The Number of Pairs	Hard	Divisibility
AC	sqrt(n²+n+X)	Hard	Divisibility
