Deconstructing "K&R C"

zedshaw · on Feb 29, 2012

Just for the record, I am as sick of this part of the book showing up on HN as everyone else. I do not post it here, other people do, and I apologize in advance for it showing up here all the time half-finished.

But, if you have a problem with this I ask that you write code disproving that the copy() function does not terminate for all possible inputs. Or, that you write a function that can check if a C style string is valid. If someone can do that it'd help me with the book.

eugenejen · on Feb 29, 2012

Zed,

Please don't feel bad or sorry here. Your critic to the code is VALID. Besides, you are not the only one who have understood the problem in K&R. I had a boss who was aware of this since 90's and talked about it when I was quite ignorant of his points then until I got bitten. But I am glad you rediscovered the problem and made an effort to write it on web.

K&R has its value in historical context. I hope you are not discouraged by the general reaction and I really think we should put more effort in creating new material to teach people in modern context. Please keep on working on it!

thatmiddleway · on Feb 29, 2012

sorry zed. wasn't trying to flame or steal. I'm just curious as to what people say.

jleader · on March 5, 2012

For people who say "C strings are null-terminated and you don't pass their length, get used to it", let me point out https://www.securecoding.cert.org/confluence/display/seccode..., which is part of the "CERT C Secure Coding Standard", and the new bounds-checking features in C11 (which I haven't actually seen yet).

Turing_Machine · on Feb 29, 2012

"because it does not terminate cleanly for most possible inputs"

"Most possible inputs", meaning "random garbage"? Insisting that an algorithm "terminate cleanly" on every possible input is... a somewhat high standard.

http://en.wikipedia.org/wiki/Halting_problem

phleet · on Feb 29, 2012

I'm a little confused by the reference to the Halting problem.

You don't need to solve the Halting problem to prove that a specific algorithm will terminate.

Random garbage here is still constrained because the input is still a sequence of bytes.

Turing_Machine · on Feb 29, 2012

Nope, you don't, in this specific case. My issue is with him asserting that an algorithm is automatically "broken" because you can't determine whether or not it will terminate. That standard would exclude every single programming language in common use.

lmkg · on Feb 29, 2012

First, programming languages aren't algorithms.

Quite a few algorithms can be proven to terminate on all inputs (even malformed ones). Zed is asserting that when you have available an algorithm that will definitely terminate, you should tend to prefer it over an algorithm that may not. I tend to agree, especially in an educational text because that's where a lot of people will start picking up habits, both good and bad. Non-terminating algorithms aren't broken in general, but this one is because there is a terminating alternative.

Turing_Machine · on Feb 29, 2012

"First, programming languages aren't algorithms."

True, but irrelevant.

Any Turing-complete language will allow you to implement functions that don't terminate, and indeed in general implement functions for which it is impossible to determine whether they terminate or not.

Better?

"Non-terminating algorithms aren't broken in general, but this one is because there is a terminating alternative."

An algorithm is only "broken" if it doesn't work. That's not the case here. Among working algorithms, one may be a better than another because it is faster, takes up less memory, is more robust when faced with garbage inputs... many engineering tradeoffs. His implementation is probably safer, but it's going to use more space and be slower. Now, that's likely a good tradeoff for most situations, but not all.

However, his claim that the algorithm is "defective" because it doesn't work on random garbage is absurd. It's as if someone claimed that a car was "defective" because it grinds to a halt when you fill the tank with soft-serve ice cream, or that the most common algorithm for integer addition is "defective" because it doesn't work for vectors.

"Safer" I could live with.

Turing_Machine · on Feb 29, 2012

To be specific, my main issue is with this statement: "This provides a formal proof that the function is defective because there are possible inputs that causes the while-loop to run forever or overflow the target."

By that standard, one can easily construct a proof that any Turing-complete language is "defective".

zedshaw · on Feb 29, 2012

Funny because I actually solve it no problem by using that wonderful technology called a for-loop, which does terminate.

So it's only a halting problem when you use K&R's code.

Turing_Machine · on Feb 29, 2012

Unless your for index overflows the MAXINT for the platform, or unless the system runs out of memory, or unless..

Your standard is unrealistically high.

emmelaich · on Feb 29, 2012

So, stop before you reach MAXINT. It's changing the meaning of strlen, but who cares?

thatmiddleway · on Feb 29, 2012

I'm interested to see what people have to say about this as someone who is looking in to learning C.

ranit8 · on Feb 29, 2012

Was this thread not enough for you? http://news.ycombinator.com/item?id=3448573 The topic is good, but couldn't you wait a bit more to resurface it?