Algorithm and Complexity 2, année 2024

The class NP

Definition

The class \(NP\) is the non-deterministic version of \(P\), that is, the set of languages that can be decided in non-deterministic polynomial time.

More formally, \(\textrm{P}=\bigcup_{d}\textrm{NTIME}(n^d)\).

As we will see in this lecture and the next, it is a robust class containing many natural problems.

The P vs NP problem

It is not known whether \(P \neq NP\) and it is one of the major open question in theoretical computer science. While it is obvious that \(P \subseteq NP\), it is widely believed that \(NP \not \subseteq P\). We will see later that using the notion of NP-completeness, this conjecture is used as evidence of the non-existence of a polynomial time algorithm for some problems.

Examples (Graph problems)

Certificates

Alternative defintion using certificates: a language \(L\) is in NP if there exists a polynomial \(p\) and a DTM \(M\) running in polynomial time such that \(x \in L\) if and only if there exists \(y \in \Sigma^{p(|x|)}\), called the certificate, such that \(M(x\#y)\) accepts.

Important:

Proving problems are in NP

The previous characterization of NP is more useful in practice when one tries to prove that a language is in NP. Many languages in are asking to check whether some problem has a solution:

Examples:

Such problems are often in NP. To prove this, it is sufficient to prove the following:

Exercise: use this approach to prove that CLIQUE is NP.

Tricky problem: ILP.

Reductions and NP-completeness

Complexity Theory is heavily based on the notion of reductions between problems. Informally, a decision problem \(A\) reduces to a decision problem \(B\) if whenever we have an algorithm for \(B\), then we can use it to solve problem \(A\).

Many-one Reduction

Let \(L,K\) be languages. A many-one reduction from \(L\) to \(K\) is a function \(f : \Sigma^* \to \Sigma^*\) such that for every \(x \in \Sigma^*\), \(x \in K\) if and only if \(f(x) \in L\).

If there exists a many-one reduction from \(L\) to \(K\), we say that \(K\) is many-one reducible to \(L\) and denote it by \(K \leq L\).

If moreover \(f\) is polynomial time computable, we say that \(K\) is Karp-reducible to \(L\) and denote it by \(K \leq_p L\).

Intuitively, it means that \(K\) is easier than \(L\), in the sense that if we know how to solve \(L\), we directly get an algorithm for \(K\).

Example

NP-hardness and NP-completeness

A language \(L\) is NP-hard if for every \(K \in NP\), \(K\) is Karp-reducible to \(L\). A language \(L\) is \(NP\)-complete if it is NP-hard and if \(L \in NP\).

Theorem (Cook-Levin, 71): There exist an NP-complete problem.

NP-completeness is a central notion in complexity theory. Indeed, by definition, if one can find a polynomial time algorithm for an NP-complete problem, then \(P = NP\). Proving a problem to be NP-complete is thus strong evidence that it does not admit a polynomial time algorithm (or if it were, it will be a major breakthrough).

As it turns out, many natural problems are NP-complete. The hard lifting was done by Cook and Levin to exhibit the first natural NP-complete problem, namely SAT. Once one has an NP-complete problem \(L\), to show that a problem \(K\) is NP-complete, it is sufficient to show that \(K \in NP\) and exhibit a reduction from \(L\) to \(K\).

Lemma: If \(L\) is NP-complete, \(K \in NP\) and \(L \leq_p K\) then \(K\) is NP-complete.

Exercise session

  1. Show that \(IS\) is in \(NP\) and that \(IS \leq_p CLIQUE\) and \(CLIQUE \leq_p IS\) where \(CLIQUE(G,k)\) is the problem of deciding whether there exists a clique of size at least \(k\) in \(G\) and \(IS(G,k)\) is the problem of deciding whether there exists an independent set of size at least \(k\) in \(G\).

  2. Show that \(CLIQUE \leq_p VERTEX-COVER\) where \(VERTEX-COVER(G,k)\) is the problem of deciding whether a graph \(G\) has a vertex cover of size at most \(k\) (a set \(S\) of vertices such that every edge of \(G\) contains at least one vertex from \(S\)).

    Hint: if \(S\) is a vertex-cover of \(G\), what can you say about \(V \setminus S\)?

  3. Show that \(VERTEX-COVER \leq_P 01-ILP\) where \(01\)-ILP is the problem of deciding whether there exists a vector \(x\) with entries in \(\{0,1\}\) such that given an integer matrix \(C\) and an integer vector \(d\), \(Cx \geq d\).

    Hint: See the vertices of \(G\) as variables in the ILP.

  4. Show that \(VERTEX-COVER \leq_P HAMILTON\) where \(HAMILTON\) is the problem of deciding whether given a graph \(G\), there exists an Hamiltonian path in the graph (ie, a graph going through every vertex exactly once)?

  5. Does PRIME reduces to COMPOSITE? Related: is PRIME in NP? (Yes but tricky).

The problem SAT

First we define what is a boolean formula as a formal mathematical object. We do it by induction.

Let \(X\) be a set of countably many indexed variable \(\{x_0, x_1,\ldots,\}\).

By induction, assume we have \(\phi\) and \(\psi\) which are boolean formulas, then the following formulas are boolean formulas:

An evaluation is a function from \(X\) to \(\{0, 1\}\). Given a formula and an evaluation \(E\), we define the operation \(E \models \phi\) (reads \(E\text{ models }\phi\)) by induction as follow:

A formula \(\phi\) is satisfiable if their exists an evaluation \(E\) such that \(E\models \phi\).

The SAT problem take in input a boolean formula and accept it if it is satisfiable.

Theorem. SAT is NP-complete.

Exercise

Two formulas are said to be equivalent if they agree on all evaluations.

The EQUIVALENT problem take two boolean formulas in input and accept if they are equivalent.

Show that EQUIVALENT and SAT are co-Karp-reducible.

3-SAT

We are going to show that even a restriction of SAT is NP-complete.

We first define what it means for a boolean formulas to be in CNF (conjunctive normal forms).

Finally, a formulas is a \(k\)-CNF if all its literals have at most \(3\) variables.

The problem 3-SAT is a restriction of SAT to boolean formulas which are \(3\)-CNF.

Proposition. Every boolean formulas is equivalent to a CNF-formulas.

Theorem. 3-SAT is NP-complete.

It is sufficient to show that we can reduce SAT to 3-SAT.

Theorem. 2-SAT is P.

On machine exercises

TODO

Plan of lecture:

Theorem. (COOK LEVIN) SAT is NP-complete Proof sketch. representing sequence of configuration of a poly NTM with booleans formulas and checking correctness locally.

Proposition. Every Boolean function is equivalent to a CNF Boolean formulas.

Theorem. k-SAT is NP-complete, \(k\geq 3\). Proof sketch. - Encoding a full formulas by materializing the formulas tree. - Building a 3-CNF formulas Psize from the initial one which is SAT iff the initial is SAT.

Theorem. 2-SAT is in P. Proof sketch. - Building the implication graph. - Check that no cycle frop \(x\) going through \(\lnot x\). - Assign to 0 all nodes \(x\) such that their is a path \(x\) to \(\lnot x\). - Greedy algorithm assigning values to CFC in revese topological order.

Exercise Session


Compiled the: mar. 21 janv. 2025 22:04:09 CET