Basics: The Halting Problem

0 thoughts on “Basics: The Halting Problem”

Binil Thomas February 6, 2007 at 11:02 pm

Mark,
Thanks for explaining this. I could not follow the last part of your article.
Suppose we limit ourselves to just thinking about functions on the natural numbers: {f | f:N→N}. How many functions are there in that set? It’s an infinite set – and it’s an infinite set larger than the infinite set of natural numbers.
Can such an assertion be made, since both sets are infinite?
For a computing machine, you can map every one of its programs to a natural number
Did you mean that since each program is distinct, each can be named with a natural number – or am I missing something?
Thanks,
Binil

Loading...

Reply ↓
Anonymous February 6, 2007 at 11:26 pm

Can such an assertion be made, since both sets are infinite?

Hey! That’s a wonderful suggestion for another “basics” post. (Binil: In short, yes, that assertion makes sense, and yes, you’re missing something. Hopefully something that will be covered in another basics post, or by the other commenters shortly)

Loading...

Reply ↓
Daniel Martin February 6, 2007 at 11:34 pm

I remember how bizarre I found it when I realized a few years ago that there are only countably many English-language descriptions of numbers. This means that there exist numbers between 0 and 1 which cannot be described. (“Described” here means a description which specifies the number uniquely. So “one-half square root of two” is an adequate description as is “one over pi”, but “irrational” isn’t)
I sometimes imagine something out of the Cthulhu mythos bringing one of those numbers to drive a mathematician insane. (or not; without the axiom of choice it’s awfully hard to get ahold of those things, and I don’t know how the Elder Gods feel about the Banach-Tarski paradox)
By the way, signing in with typepad only seems to cause my comments to appear as “Anonymous” (as in my short comment above)

Loading...

Reply ↓
Tordek February 6, 2007 at 11:48 pm

Aleph-0 vs Aleph-1 if you want the explanation of different infinite sizes.
An example of the infinite functions F:
F1(n)=1
F2(n)=2
…
Fm(n)=m
There you have infinite functions: each receives one variable and returns m (the function’s own number). Now, since you’ve “spent” every natural number numbering those functions, you just need to add one more, and you’ve run out of numbers (there is one more). If that is not enough, you also have another group of infinite functions: Addn(x)=x+n.
Repeat for multiplication, division, and all the operations.
Correct me if I’m wrong.

Loading...

Reply ↓
Zombie February 7, 2007 at 12:07 am

Tordek, I’m not sure that proof quite works, you could say the same thing about rationals: n/1 exhausts all the integers, but there is another mapping that is 1:1. In fact, the set of all functions you give are countable, because they’re all computable and can be encoded with a finite bit string.
However, if you assume functions are countable, then you can label every single one of them Fm(n) for some integer m. Now pick a new function G where G(n) != Fn(n). G is different from Fn for all n, but ought to be in the set of all Fm if integer functions are coutable, which is a contradiction. So integer functions of integers are not countable.

Loading...

Reply ↓
grigory February 7, 2007 at 12:10 am

For more good intro material regarding the halting problem, have a look at this link:
http://www.scottaaronson.com/democritus/lec3.html

Loading...

Reply ↓
Chad Groft February 7, 2007 at 1:06 am

MarkCC:

Take the halting oracle program H. Write a program T which includes H, and which does the following:

* If φH(T,i)=YES (meaning that the halting oracle says that T will halt on input I), then T will run an endless loop, thus never halting.

* If φH(T,i)=NO (meaning that the halting oracle says that T will not halt on input i), then T will immediately halt.

This argument doesn’t work, because it’s not clear that you can write such a T. You certainly couldn’t embed the source code of T in the source code of T, for example.
What you actually have to do is make the program part of the input, as follows: Suppose you can write H. Write a program T which halts on input i iff program i does not halt on input i. You can certainly do this, given H. Now, what happens if you run program T on input T? By the program’s own logic, it halts iff it does not halt. This is a contradiction, so our assumption (that a program H exists) is false.
It’s almost exactly like Russell’s Paradox (T halts on precisely those programs that do not halt on themselves; does T halt on itself?). The trick is treating the input i both as a program and as input to a program.

Loading...

Reply ↓
Chad Groft February 7, 2007 at 1:19 am

Binil/Tordek/Zombie: Yes, there are different infinite cardinalities. What Zombie says, both regarding Tordek’s “proof” and a corrected proof, is entirely correct. However, the distinction is not quite as simple as Aleph-0 versus Aleph-1.
To be more specific, Aleph-0 (also called Beth-0) is the smallest infinite cardinality, that of the natural numbers. Aleph-1 is the very next smallest cardinality; it’s uncountable, but a set of size Aleph-1 embeds into any uncountable set. (No, it’s not obvious that there should be such a cardinality, but given the usual axioms of set theory (ZFC) there is.) Aleph-2 is the next cardinality after Aleph-1, and so forth.
The set of all functions f: N -> N actually has the same cardinality as the set of all subsets of N, denoted P(N). This cardinality is sometimes called Beth-1. Beth-2 is the cardinality of PP(N), etc.
Now, Beth-1 and Aleph-1 may well be the same. The statement “Beth-1 = Aleph-1” is one form of the well-known Continuum Hypothesis, and it may be true; Gödel showed it was consistent with ZFC. However, it may be false; Paul Cohen showed that its negation is also consistent with ZFC, and in fact Beth-1 can be “forced” arbitrarily high in the Aleph hierarchy (which goes much higher than just Aleph-n for all natural numbers n).
Once again, we’ve gone much further than a “basics” article.

Loading...

Reply ↓
noone February 7, 2007 at 2:19 am

Chad Groft: Actually, it does work, you just need some more basics – which I won’t give here, because it is a bit much, but provided you believe me that any sufficiently powerful programming language is equivalent, we actually can construct T explicitly (in Pseudocode – feel free to translate it to something you know):
function T(x)
if H(x) then // if the program with the index x halts,
while true do ; // do an infinite loop.
else // but if it does not halt,
return 0. // return 0 and stop.
Except for the call to the Halting Oracle H this function is obivously computable, therefore T is computable if and only if H is computable.
As we have constructed T, we can compute its index (easily, as it is already represented as a number within the computer). If we apply T to its own index we get that T halts if and only if T does not halt, which is a contradiction. Therefore, T cannot be computable and by the previous observation so can’t H.
As I said, this contains some assertions which are also basic, but too numerous for such a comment, so you better look them up yourself.
Also, this commenting facility doesn’t work with text browsers.

Loading...

Reply ↓
Thony C. February 7, 2007 at 2:47 am

The priority for the halting problem should probably go to Emil Post and not Turing

Loading...

Reply ↓
Michael Ralston February 7, 2007 at 6:40 am

The Kleene Recursion Theorem strikes!
It’s possible to, given ‘programming languages’ that satisfy certain properties (which Turing machines do), actually take any program, and algorithmically modify it to produce a new program that has the same output as the first program if that first program were given itself as input.

Loading...

Reply ↓
Mark C. Chu-Carroll February 7, 2007 at 8:11 am

Chad:
If you’ve got a Gödel numbering of all programs in a language, it’s possible via a quine-like technique to include T’s number in itself.

Loading...

Reply ↓
Confused February 7, 2007 at 8:36 am

Hey Mark,
I’ve been following your blog for a couple of days now and its very interesting. But there are a couple of things that I don’t understand in this post :
For instance :
Take the halting oracle program H. Write a program T which includes H, and which does the following:
* If φH(T,i)=YES (meaning that the halting oracle says that T will halt on input I), then T will run an endless loop, thus never halting.
* If φH(T,i)=NO (meaning that the halting oracle says that T will not halt on input i), then T will immediately halt.
So T completely defeats the Halting oracle. No matter what you do, for any program that claims to be able to tell whether other programs halt, there are programs for which they can’t get the right answer. There’s always a program that for a given halting oracle will do exactly what the oracle says it won’t. And that, in turn, means that there are some functions that simply cannot be computed. And here’s the neat twist: given a halting oracle H, it’s easy to write a program which will generate the program that defeats H – that is, the Halting oracle is non-computable, but if it were computable, then the program that defeats it would also be computable.
This part defeated me. 🙂
Could you possibly explain this at an even more basic level ?

Loading...

Reply ↓
Xanthir, FCD February 7, 2007 at 11:08 am

Confused:
Hopefully I can clear this up a bit. It is a bit confusing, especially if you don’t grasp that programs can include themselves within themselves (anything that’s equivalent to a Turing machine can do this).
All right, so you’ve got the Halting Oracle. It takes some program P and some input I, and tells you whether or not P(I) will halt. Calling the Oracle looks like this: H(P,I). H will return YES if P(I) halts, and NO if P(I) loops forever. (Note that H doesn’t actually *run* P(I) – if it did that, it would loop endlessly whenever P did. It just knows how to analyze P so that it can tell what it will do without running it.)
The important thing to hold in mind is that H can really take *any* program. If we ran, say, H(H,I) (passing H to itself), then it would always return YES, because the Oracle is guaranteed to halt and give an nanswer.
Now, again, P can be any program at all. But we’re going to make a very special program, called T. T contains the code for H (so it can run it) and the code for itself (which is possible, though weird), and basically does just two things (I’ll write this in pseudocode):
Program T(data I)
{
If (H(T,I) == YES) (H says that T(I) will halt)
Then loop forever (don’t halt)
If (H(T,I) == NO) (H says that T(I) will loop forever)
Then stop (halt immediately)
}
So, if H(T,I) would return YES, it has to return NO. If it would return NO, it has to return YES. It can’t do both, and so either answer produces a paradox (similar to the “This statement is false” paradox – both work on self-reference). Thus, the Halting Oracle cannot exist (proof by contradiction).
The last paragraph of Mark’s is just a nice bonus. If the Halting Oracle actually existed (was computable), then T is guaranteed to be computable (it must exist as well). This prevents any objections of, “Well, that would be true IF we could construct T, but we can’t, so the Halting Oracle is still possible.”
Does that help?

Loading...

Reply ↓
grammar SS February 7, 2007 at 11:40 am

Can you write a program which can by looking at P can determine whether or running P will eventually stop?Can you write a program which, by looking at P, can determine whether or not running P will eventually stop?

Loading...

Reply ↓
krisztian pinter February 7, 2007 at 12:52 pm

back in my university days, i thought math is good, but boring topik, with no surprises. recently, i read a lot of absolutely crazy math things, so i had to revise my opinion. math contains more surprises then, say, chemistry. to be on topic, here is a related outrageous statement: most of the real numbers are not computable.
http://www.math.hmc.edu/funfacts/ffiles/30004.8.shtml
to be precise, the article is about writing out the decimal digits, but i think it is not the major point here.

Loading...

Reply ↓
Chad Groft February 7, 2007 at 2:10 pm

If you’ve got a Gödel numbering of all programs in a language, it’s possible via a quine-like technique to include T’s number in itself.

You’re probably right, but it’s certainly not clear. The version I proposed above is the one I’ve always seen in discussions of the subject, and it’s more immediate. To make it especially clear, one can think of the numbers as equivalent to text files, which one can then try to interpret as source code or as data.

Loading...

Reply ↓
Xanthir, FCD February 7, 2007 at 2:27 pm

krisztian pinter:
Yep, that is a crazy but true statement. Most reals are not computable, and most programs aren’t Turing-recognizable.
Basically, if a number is computable, then there is (obviously), a program that will compute it. You can translate any program into an integer (one way is to use an analog of Godel numbering). Thus, the set of all possible programs is no larger than the set of integers. The set of reals is strictly larger than the integers, so there must be an infinite number of uncomputable reals!
The proof that they give is probably easier, though, especially if given soon after the proof that the rationals are the same size as the integers (both proofs involve organizing the set into an infinite ordered set of finite groups).

Loading...

Reply ↓
Foxy February 7, 2007 at 2:40 pm

I think I understand that there are so many more uncomputable things than computable.
However, the only description I’ve ever heard of them is the halting problem, or modifications therein.
Are there any other distinct concrete examples of uncomputable problems?
Or, sort of like NP problems mapping on to each other, could you establish some sort of equivalence between uncomputable problems?
Honestly, I don’t know if I really understand what that means. But it sounds like an interesting question.

Loading...

Reply ↓
Stephen Uitti February 7, 2007 at 2:52 pm

Let’s say you have a program, p, which runs under Linux on an X86. Can you write a program q that can predict the output of p?
Maybe. If p is inefficiently coded, a program q might execute quicker, and finish first. Or, perhaps a copy of p, we’ll call q, could be compiled with optimization turned on, and therefore finish first. If p takes sufficiently long to run, it might be quicker to wait until next year, when the new x86 is faster. The a copy of p, which we’ll call q, could predict the result from p. For a little while, there was this program that could translate x86 code to Alpha native code – and run it faster than any existing x86.
I took Operating Systems with Peter Denning. Denning came up with a proof that using a Least Recently Used algorithm was the best one could do for page replacement. Interesting, and gospel in the industry for years, but sadly wrong. One of the assumptions was that the operating system couldn’t predict the page access patterns of random programs. This is, of course, false. Many programs show sequential or random memory access patterns. The operating system can see this, recognize it, and perform much better than LRU. For example, if the program is doing sequential access, the OS can perform read ahead (and read more than a page at a time which can be much faster with disk drives), and as soon as a page is actually used, it can instantly schedule that bit for page out. If the page is clean (wasn’t written to), it can be simply deallocated. This sort of OS can allow programs to get closer to the performance of sequential programmed I/O using the memory system. Very practical.
So, the difference between theory and practice is that, in theory, they’re the same. Great theory is no excuse to stop thinking. That would be prejudice.
One of the old jokes was this: The new Cray supercomputer is so fast that it can compute an infinite loop in 17 minutes!
Another: The new Cray supercomputer is so fast, that it takes five halt instructions in a row to get it to stop!
On the subject of fast computers, i understand that there is a Linux laptop in the International Space Station. At 17,000 miles per hour, it’s the fastest computer ever!

Loading...

Reply ↓
Ørjan Johansen February 7, 2007 at 4:45 pm

I find it interesting that nearly every result mentioned in this article and its comments so far is frequently proved in essentially the same way: using the technique of diagonalization. That is, by making some kind of “function” which applies any argument in a set to itself. Either to prove that the function is not in the set, or to get a function that somehow evaluates to itself when applied to itself.
This includes:
The halting problem
Gõdel’s incompleteness theorems
Kleene’s recursion theorem (I think)
The construction of quines
Russell’s paradox
There being more real numbers than integers
There being more integer functions than integers

Loading...

Reply ↓
Mark W February 7, 2007 at 5:40 pm

I wanted to know whether it is possible to write a perfect spam filter engine. My hunch based on this article seems to be no unfortunately.

Loading...

Reply ↓
Chad Groft February 7, 2007 at 10:25 pm

Are there any other distinct concrete examples of uncomputable problems?

Many. Let C be the set of all (partial) computable functions N -> N. Take a subset S of C, not the empty set, not all of C. There is no program that will accept an integer n as input and tell you whether it codes for a program whose function is in S. So there’s no program which will tell you
* whether program n is defined on all inputs;
* whether two programs m and n define the same function;
* whether program m is an extension of program n (m and n are input).
You can probably come up with some yourself.
Moving away from computability theory per se, there are a number of problems in group theory that are not solvable, but I don’t see anything on group theory in the archives, so I’ll stay away from that.
Let’s look at topology instead. Many “nice” topological spaces can be triangulated; that is, divided into (a finite number of) vertices, edges, triangles, tetrahedra, and so on up. A triangulation can serve as input to a program; one inputs the number of vertices, the pairs of vertices spanned by some edge, the triples spanned by some triangle, etc. There is no program which will take two trianguations as input and determined whether their spaces are equivalent, i.e., homeomorphic. There are other reasonable ways to specify a space (cutting out by algebraic equations comes to mind, but there are others), and they suffer from the same problem.

Loading...

Reply ↓
Ørjan Johansen February 8, 2007 at 7:35 am

Are there any other distinct concrete examples of uncomputable problems?
Or, sort of like NP problems mapping on to each other, could you establish some sort of equivalence between uncomputable problems?

Yes. In fact the whole theory of NP-completeness and complexity has borrowed basic ideas from the older theory of Turing-completeness and computability/recursiveness.
Finding uncomputable problems or Turing-complete systems by reducing to previous known ones (and ultimately to Turing machines), inspired finding NP-complete problems in the same way. There are long lists in each case, from many different areas of discrete mathematics. (Just the other day I saw a topic at Lambda the Ultimate about someone proving the undecidability of a programming language type system in this way.)
As another example, the basic proof that there is an endless succession of distinct complexity classes (exponential time problems not solvable in polynomial time, for example) is yet another adaptation of diagonalization and the halting problem.

Loading...

Reply ↓
JimmychuckMcC February 8, 2007 at 9:25 am

Enjoyed the post. What’s the deal with Lowenheim-Skolem?

Loading...

Reply ↓
Torbjörn Larsson February 8, 2007 at 12:39 pm

This thread give me the impression that while math has been busy with mainly discover what is possible, computer science is busy discovering what is not which is also useful. The only math impossibility thing this layman can remember right now is an example of a differential equation that has no solutions and full singularity descriptions.

there is a Linux laptop in the International Space Station. At 17,000 miles per hour, it’s the fastest computer ever!

I hate to nitpick a good joke, but this should be the fastest direct-accessed computer. The Pioneer’s and Voyager’s onboards must be the fastest.

I wanted to know whether it is possible to write a perfect spam filter engine. My hunch based on this article seems to be no unfortunately.

Interesting. I can’t help think that these things could be true and possibly connected by halting problems, and so extending Ørjans list:
* Not possible to write perfect spam filter. (Mark W hunch.)
* Not possible to write perfect compression program. (Torbjörn L hunch from Mark’s hunch.)
* Not possible to catch all patterns in one complexity measure. (Murray Gell-Mann.)
* Not possible to generally find simplest algorithms for given functions. (Chaitin?)
* Not possible to find one optimization or filter method that handles all inputs. (NFL theorem?)

Loading...

Reply ↓
Zombie February 8, 2007 at 7:03 pm

The “perfect spam filter” isn’t really comparable to the rest, though, because the limit isn’t necessarily a matter of fundamental computability. It’s either a specification problem (define spam) or better yet, it’s what’s semi-jokingly referred to as “AI-complete”, suggesting it requires a simulation of human consciousness to avoid false positives/negatives inherent in any simpler filter, and to keep up with spammer’s attempts to circumvent filtering.

Loading...

Reply ↓
Luke Breuer February 8, 2007 at 11:58 pm

Surely it is possible to tell if certain programs will halt on certain inputs. For example, a program with no loops (like Hello, World!) is guaranteed to halt. Given this, there must be some way to describe programs for which the halting problem can be solved. Comments?

Loading...

Reply ↓
Stu Savory February 9, 2007 at 6:51 am

Can you give us – or point us to – the state table for the current smallest UTM please?

Loading...

Reply ↓
Mark C. Chu-Carroll February 9, 2007 at 8:34 am

Luke:
There are a number of classes of programs for which the halting problem is solvable.
The largest easy-to-understand one that I can think of is the class of primitive recursive functions. Speaking very loosely, the primitive recursive functions are the class of all functions that have one parameter whose value reduces with each iteration, and the program halts when it reaches 0.
A lot of common things that we do are primitive recursive. Most interesting mathematical functions that we use in everyday programming are PR; many (if not most) algorithms that work on common data structures are PR.

Loading...

Reply ↓
Mark C. Chu-Carroll February 9, 2007 at 8:46 am

Stu:
I’d need some time to look up the actual state table…
There’s also some dispute about just how to measure the size of the machine… i.e., do the number of transitions count?
Using a measure of states×symbols, Minsky did a 4-state 7-symbol machine, which is one credible candidate for smallest.
Wolfram did an even smaller one, with 2 states and 3 symbols, but I think it’s sort-of cheating. The Wolfram solution is a Turing machine implementation of a cellular automaton, where you could implement a Turing machine using the CA. The reason that I think it’s cheating is because to be a real UTM, you need to be able to describe how to encode an arbitrary Turing machine as an input for your UTM; Wolfram has shown that you can do the encoding, but not how – and much of the real functionality of the TM being run by the UTM will need to be implemented as a part of the input.
So I tend to support the Minsky approach.
I’ll probably hunt down Minsky’s UTM, and do a post about it, showing how it works. That’s something I’ve wanted to do for a long time; maybe now’s a good time to bite the bullet and do it!

Loading...

Reply ↓
Blake Stacey February 9, 2007 at 12:35 pm

The Feynman Lectures on Computation give a diagram for a UTM. It’s not the smallest possible, since as Feynman says one could be slightly more clever and use the same state for multiple purposes. I don’t have the book in front of me, but I recall a statement to the effect that one could make a UTM with just two symbols and many states, or just two states and many symbols — so I guess we’d have to clarify our notion of “size” before we could decide upon a “smallest” UTM.
MathWorld describes Minsky’s 1962 construction as a “7-state 4-color” machine, where in Wolfram-ese, “color” means what the rest of us call symbols on the tape.

Loading...

Reply ↓
Blake Stacey February 9, 2007 at 12:51 pm

While looking for papers about minimal UTMs, I stumbled across one by P. W. K. Rothemund entitled “A DNA and restriction enzyme implementation of Turing Machines” (1995, PostScript link). The abstract follows:

Bacteria employ restriction enzymes to cut or restrict DNA at or near specific words in a unique way. Many restriction enzymes cut the two strands of double-stranded DNA at different positions leaving overhangs of single-stranded DNA. Two pieces of DNA may be rejoined or ligated if their terminal overhangs are complementary. Using these operations fragments of DNA, or oligonucleotides, may be inserted and deleted from a circular piece of plasmid DNA. We propose an encoding for the transition table of a Turing machine in DNA oligonucleotides and a corresponding series of restrictions and ligations of those oligonucleotides that, when performed on circular DNA encoding an instantaneous description of a Turing machine, simulate the operation of the Turing machine encoded in those oligonucleotides. DNA based Turing machines have been proposed by Charles Bennett but the invoke imaginary enzymes to perform the state–symbol transitions. Our approach differs in that every operation can be performed using commercially available restriction enzymes and ligases.

Rothemund accepts Minsky’s construction as “the smallest UTM known” (I believe it’s older than Wolfram’s CA-oriented trick), and he gives a DNA implementation in section 6.

Loading...

Reply ↓
Ørjan Johansen February 9, 2007 at 7:00 pm

I recall that another way to understand primitive recursive functions is as programs allowing loops, but only “for” loops where the maximal number of repetitions is calculated before the loop starts.

Loading...

Reply ↓
Xanthir, FCD February 9, 2007 at 10:44 pm

Orjan: (And I apologize for the incorrect letter)
I think that’s an accurate rendering of primitive recursion. It seems to be a pretty straightforward opposite of arbitrary recursion, where you determine the number of loops inside the loop code.

Loading...

Reply ↓
Blake Stacey February 13, 2007 at 9:32 pm

The 2-state, 5-symbol (“Rule 110”) UTM implementation was actually discovered by Matthew Cook, an employee of Wolfram.

Loading...

Reply ↓