Kernighan's Law: You are not smart enough to debug it (github.com/dwmkerr)
289 points by dwmkerr on Feb 17, 2020 | 153 comments



Over 20 years ago, after I had devised a way to use Gray Code to embed a clock into a signal without increasing the signal's modulated bandwidth, one of my engineers said that someday there'd be a "Moyer's Law". I thought that was funny, so I created one myself:

Moyer's Law: "When we can finally print from any computer to any printer consistently, there will be no CS problems left to solve."

Since every good law has a corollary, I created one of those too (simply to be funnier):

The Corollary to Moyer's Law: "The printed dates will still be wrong."

While PDFs have brought us closer to "universal printing", I won't claim we're anywhere close to solving all CS problems. Sadly, date conversion and formatting continue to be problems (hint: consider UTC ISO 8601 or RFC 3339 dates/times for the JSON representation).
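For what it's worth, here is a minimal sketch in Python (my own illustration, nothing from the repo) of that hint in practice:

    import json
    from datetime import datetime, timezone

    # Emit the timestamp as a UTC ISO 8601 / RFC 3339 string instead of a
    # locale-dependent date format.
    payload = {"printed_at": datetime.now(timezone.utc).isoformat()}
    print(json.dumps(payload))
    # e.g. {"printed_at": "2020-02-17T17:22:45.123456+00:00"}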

P.S. I don't actually think I'm smart enough to have a law named after me ... nor have I really contributed enough to "our art".

- https://en.wikipedia.org/wiki/Gray_code

- https://www.electronics-notes.com/articles/radio/modulation/...


Moyer's Law is legit. Thank you for coining it.

For some reason I've gravitated to printing since my earliest computing days and your law speaks to me deeply. I've recently been working on a source code printing app (yes, I'm nuts) and have been blown away by how bad printers are today. I thought the troubles I had at home were "just me".

Cheers to you!


Love it :) I think "consistently connect to a projector" is a similar problem. Moyer's Law seems popular here and has generated a great conversation; I'll raise it as a suggestion on the repo and see what people think!


I guess since it's lasted 20+ years, perhaps it's more prescient than I thought?

EDIT: I see that you've submitted an RFC but it misses (what I think is) the point. It's really the corollary of the law that is important as it states how hard it is to do date/time correctly (and you'll notice that the bulk of the discussion here is on the corollary). As I noted above, this was intended to be funny and including it in a list with real laws is just that much funnier!


Do you mind explaining the gray-code-clock technique?


Temporal literals really deserve dedicated syntax in JSON. I'm tired of parsing yet another "18.2.2020".


That's a can of hairy worms right there.

Check out "Falsehoods programmers believe about time": https://infiniteundo.com/post/25326999628/falsehoods-program...


Hairy indeed.

I just spent 2 days programming a timezone selector in a React form that changes the displayed date/time as you switch timezones, while the underlying UTC representation stays unchanged.

All this without loading 500 kB of JS timezone libraries. I only used the Intl API and tz.js [1].

The trick was simply to temporarily shift the date by the offset difference between the local time zone and the edited time zone. :)

[1] https://github.com/dynasty-com/tzjs
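For the curious, here is the same offset-shifting idea sketched in Python with the standard zoneinfo module (my own illustration, assuming Python 3.9+, not the actual React/Intl code):

    from datetime import datetime, timezone
    from zoneinfo import ZoneInfo

    def shift_for_display(instant_utc: datetime, edited_tz: str) -> datetime:
        # Shift the instant by (edited-zone offset - local offset) so that a
        # widget which only renders local time shows the wall clock of the
        # edited zone. The real UTC value is stored elsewhere and never changes.
        target_offset = instant_utc.astimezone(ZoneInfo(edited_tz)).utcoffset()
        local_offset = instant_utc.astimezone().utcoffset()
        return instant_utc + (target_offset - local_offset)

    instant = datetime(2020, 2, 17, 17, 22, tzinfo=timezone.utc)
    print(shift_for_display(instant, "America/New_York").astimezone())
    # Caveat: the shifted value can land on the other side of a DST transition,
    # which is exactly where this hack breaks (as pointed out downthread).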


> The trick was simply to temporarily shift the date by the offset difference between the local time zone and the edited time zone. :)

If you're near a daylight saving time change then that will go wrong.


Yeah, not to be rude (we've all been there) but that hack is bound to be borken somehow.

Really, this time & timezone stuff should be handled by the OS; system calls or libs should be more sophisticated so we aren't "solving" these problems over and over again.

But I'm about to start ranting about Unicode, so I'll shut up now... ;-)


I agree, but I'd like the generic implementation of that. JSON Schema never really took off, and I believe part of the reason is that there's no way to indicate what type might be contained in a string or number (I'm okay with JSON booleans and null). As ugly as it could get, adding XML Schemas to XML documents did in fact help the parser.

The reason I said I'd like the generic version is that there are other types that we use consistently. There's a very nice RFC available for telephone numbers that we've started following, and we can marshal/unmarshal pretty easily from strongly typed languages (where we control the code), but wouldn't it be nice if there were a standard way (within the JSON) to let systems know it was a telephone number?

- https://tools.ietf.org/html/rfc3966
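For what it's worth, here is a hedged sketch of what JSON Schema offers today: "format": "date-time" (RFC 3339) is standardized, while there is no standard telephone format, so a pattern is about the best you can do. The field names below are made up, and validators are allowed to ignore formats they don't recognize:

    # Illustrative JSON Schema, written as a Python dict.
    schema = {
        "type": "object",
        "properties": {
            "arrival": {"type": "string", "format": "date-time"},  # RFC 3339
            "phone": {"type": "string", "pattern": r"^\+[1-9]\d{1,14}$"},  # rough E.164 shape
        },
    }
    # Libraries such as jsonschema can enforce "format", but only when a format
    # checker is configured; otherwise unknown or unchecked formats pass silently.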


I dunno, JSON Schema certainly seems to have found a lot of traction where it makes sense: web responses, etc. That may to some extent be a POV thing, as I work on a lot of OpenAPI-consuming and -producing systems myself, but I never really have to dig around to find schemas and the like for the stuff I really care about.

VSCode even has JSON Schema support built in, which is cool. I use that a lot.


Adding a schema meta-language just creates unnecessary complexity. Instead of one language, now there are two.

If you're going to invent a new language like json-schema, you might as well skip to json2.

People who care about schemas already enforce this at the serialization layer where the objects being mapped to implicitly are the schema.

Pushing the schema down to the protocol just adds complexity and bloat.


> the objects being mapped to implicitly are the schema

Dynamic languages don't have an enforced schema. You can do it manually, but a schema is easier, special-purpose, declarative.

Plus, data schema languages are typically far more expressive than programming languages. For example, Java doesn't even have non-nullable types (which C had, as non-typedef structs). They're closer to a database schema.


I think it's time for a JSON version 2.

a) no mandatory quotes for dict keys
b) dates, times, and intervals in ISO 8601 format
c) optional type specifiers for strings, so we can add e.g. IPv4 and IPv6 addresses: { remote: "127.0.0.1"#ip4 }

e.g. { guests: 42, arrival: @2020-02-17T17:22:45+00:00, duration: @@13:47:30 }
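Until something like that exists, one common workaround is a decoding hook that revives date-looking strings. A Python sketch; the "revive anything that parses as ISO 8601" convention is my own illustrative choice, not a standard:

    import json
    from datetime import datetime

    def revive_dates(obj):
        # Any string value that parses as ISO 8601 becomes a datetime;
        # everything else is left untouched.
        for key, value in obj.items():
            if isinstance(value, str):
                try:
                    obj[key] = datetime.fromisoformat(value)
                except ValueError:
                    pass
        return obj

    doc = json.loads('{"guests": 42, "arrival": "2020-02-17T17:22:45+00:00"}',
                     object_hook=revive_dates)
    print(doc["arrival"])  # 2020-02-17 17:22:45+00:00, offset preserved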


There’s a corollary in there to Greenspun’s tenth.

Any sufficiently complicated serialization technology contains an ad-hoc, informally-specified, bug-ridden, slow implementation of half of ASN.1.


Name "JSON" would be weird, because it wouldn't be JavaScript-compatible syntax anymore. I know, that few people would agree with me, but I would propose `new Date(2020, 2, 17, 17, 22, 45)` syntax, even if nobody uses `eval` to parse JSON, keeping historical heritage is important. And if you need timezone, something like `Date.parse("2011-10-10T14:48:00.000+09:00")` could be used. Now it's not a real constructor or function calls, it's just a syntax, but it's still compatible with JavaScript.


And comments.


CBOR (https://cbor.io/) allows custom types, but it's a binary format and not anywhere near as popular.


My theory is that XML has got all the uglies you could ever want (and more), so why not just use that instead of defiling JSON?

Also, thingo aggressively protected JSON's simplicity, banishing comments because people were using them as parser directives.


> JSON schema never really took off

I see JSON Schemas used all over. Advertising, Medical, Banking, etc.


"18.2.2020" seems easy. I once walked into a meeting room where the whiteboard said "Deadline: 4/7/3". No idea what I was reading.


18.2.2020 is easy if you recognize that 2020 must be a year and 18 can't be a month. But if you want to parse 18.2.2020, you probably want the same parser to handle 1.2.2020.
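Assuming day-first dates, one parser really does cover both; a Python sketch (%d and %m accept single-digit values):

    from datetime import datetime

    # One pattern handles both zero-padded and single-digit day/month.
    for raw in ("18.2.2020", "1.2.2020"):
        print(datetime.strptime(raw, "%d.%m.%Y").date())
    # 2020-02-18
    # 2020-02-01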


To be universally applicable it would have to support at least date, time, time zone and time zone database version (since time zone definitions keep changing). You would then have to define how each of these look in a backward- and forward-compatible manner and define what every combination of them means. For example, a time and time zone but no date or time zone database could mean that time of day in that time zone on any day, using the time zone database active at that date and time. Not saying it can't be done, but it's a big job.
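As a purely illustrative sketch (the field names are mine, not a proposal), such a value might carry:

    from dataclasses import dataclass
    from typing import Optional

    @dataclass
    class TemporalLiteral:
        date: Optional[str] = None          # "2020-02-17"
        time: Optional[str] = None          # "17:22:45"
        zone: Optional[str] = None          # "Europe/Berlin"
        tzdb_version: Optional[str] = None  # "2019c"; zone rules keep changing

    # "17:30 in Berlin on any day, under whatever rules apply at that time":
    recurring = TemporalLiteral(time="17:30:00", zone="Europe/Berlin")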


Just use Unix time everywhere.


I would just use ISO 8601 everywhere. I would happily use it in ordinary life! But it's not always me who builds the API.


Yeah but what about relativity?

ISO8601 won't scale to universe-level applications!

:)


Unix time still has issues. It officially pauses for a second when leap seconds happen. You can't actually calculate a delta in seconds between two unix timestamps without a database of when the leap seconds were.
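Concretely: a leap second was inserted at 2016-12-31T23:59:60Z, and neither Unix time nor ordinary datetime libraries model it, so a naive subtraction across it comes out one SI second short. A Python sketch:

    from datetime import datetime, timezone

    start = datetime(2016, 12, 31, tzinfo=timezone.utc)
    end = datetime(2017, 1, 1, tzinfo=timezone.utc)

    print((end - start).total_seconds())  # 86400.0
    # 86401 SI seconds actually elapsed; knowing that requires a leap-second table.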


On calculating a delta, isn't there the exact same problem with UTC timestamps? Unless one of the ends of your delta is the exact 23:59:60 moment, there's no way to account for possible leap seconds in the middle of your range without just having a list of them.


Totally! Just pointing out that unix timestamps don't solve everything (even before getting to relativity).

International Atomic Time (TAI), which differs from UTC by 37 seconds since it doesn't count leap seconds, solves everything I know of. Although the clocks aren't in a single reference frame, the procedure for measuring their differences and averaging them to define TAI is well defined, and so sets an objective standard for "what time is it on Earth".


Presumably you mean unix time as a numeric scalar in the JSON. That is still not self-describing - is it time, or just a number? Which scalar data type should your parser use? And is it seconds since epoch or milliseconds since epoch?
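To make the ambiguity concrete, here is a Python sketch with a made-up value; the same integer lands half a century apart depending on the unit you assume:

    from datetime import datetime, timezone

    raw = 1581955200  # seconds or milliseconds since the epoch?

    print(datetime.fromtimestamp(raw, tz=timezone.utc))
    # 2020-02-17 16:00:00+00:00        (read as seconds)
    print(datetime.fromtimestamp(raw / 1000, tz=timezone.utc))
    # 1970-01-19 07:25:55.200000+00:00 (read as milliseconds)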


It should be maintained as a numeric scalar until you are going to do something with the value... and if you are going to do something with the value, you should know if it is a date or not.

JSON isn't meant to be a self-describing format. There is JSON Schema or the like if that is what you are after.


JSON isn't meant to be a self-describing format.

And yet to a very large extent, it is. Strings, numbers, booleans, arrays, and associative maps made the cut. Timestamps would be a pretty reasonable addition. It would certainly cut out all the controversy here.


> is it time, or just a number?

Yes :)

That's the beauty of it: time is just a number (of seconds since 1 January 1970).

> Which scalar data type should your parser use?

Integer, since it's an integer number of seconds.

> And is it seconds since epoch or milliseconds since epoch?

Unix time is always seconds.


> > That is still not self-describing - is it time, or just a number?

> That's the beauty of it: time is just a number (of seconds since 1 January 1970).

Having a bare number where the units and zero point rely on out-of-band information is not self-describing.


JSON has never been self-describing. Literally every use of JSON is fundamentally dependent upon some degree of out-of-band information.

If you want something self-describing, maybe look into XML?


> > is it time, or just a number?

> Yes :)

> That's the beauty of it: time is just a number (of seconds since 1 January 1970).

> > Which scalar data type should your parser use?

> Integer, since it's an integer number of seconds.

> > And is it seconds since epoch or milliseconds since epoch?

> Unix time is always seconds.

Whilst time can indeed be modelled as "just a number", Unix time spectacularly fails to achieve even that.

* It can go backwards
* Simple subtractions do not give accurate intervals

It's not a good measure of time. Don't use it as such.


> Unix time is always seconds.

Then it is a complete non-starter for "just use ______". I work on software that requires millisecond precision (honestly it would benefit from even greater precision) at the transport layer. It's not even really doing anything spectacularly complex or unusual. Seconds simply aren't sufficient for tons and tons of use cases.


> Unix time is always seconds.

JavaScript time (the 'JS' in JSON) is always milliseconds.


Unix time is always seconds.

Wasn't there a BSD that used double instead of long for time_t for a while ages ago?


PDFs notwithstanding, I think we've actually moved backwards over the last thirty years, in terms of printing "just working".

Fortunately, we now have teleconferencing to take our minds off of it.


"Fortunately, we now have teleconferencing to take our minds off of it."

"Does anyone have an HDMI to DisplayPort adapater on them? What's the phone number for this conference again? What? You only have Slack installed? No, we decided to start using Microsoft Teams for these. Can you speak up? I'm having trouble hearing you. Maybe try dialing in again?"


"Hey does anyone know the guest wifi password?"

"Sorry, I was double muted"

"Can you turn mute on? There's a lot of echo on your end"

"Sorry, we're just trying to work out which mic works"


We had to contact our desktop support group to program the bridge number into the conferencing telephone we use for our stand-ups every day because they'd disabled the ability to program it from the touch-screen and keypad. Cisco VoIP phones now have echo cancelling that I think is as good as the old Polycom Star Trek phones, but having them managed by CCM may be a step backwards.

P.S. We're still manually dialing every day while waiting for our ticket to be processed.


Perhaps home printers are a wasteland, but the copy/printers and plotters at my work are pretty dang reliable, considering they do thousands of pages or thousands of linear feet a day sometimes.


Indeed, but this is the "easy" case, and I suspect your IT guys/gals spent a considerable amount of time setting up and shaking the bugs out of this setup.

It's not that printers are physically unreliable, although they certainly can be. It's the logical complexity of getting an image (or text) from one device, via the myriad protocols, connectors (or wireless), page layout languages, etc., correctly onto the printer. And for added fun, even after RMS's long trek, some printers still aren't open enough to drive without secret software.


You mean you can’t still just do:

   PR#1
To print?


It reminds me of typical code written by scientists.

IMHO that's where the gap between raw intelligence and software engineering skills (including good practices) is biggest. As a result, most of the code ends up as code-golf-style contraptions, incomprehensible to anyone else (including their future selves).

Full disclosure: I used to be that guy who liked "clever hacks". Now I try to make code readable in the first place (largely thanks to the Python philosophy).


I remember in college, even Python often turned into "code golf" style competitions. Could we collapse this 10-line block into an incomprehensible nested list comprehension? Why not!? Surely 1 line is better than 10 lines!
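Something in this spirit (a contrived sketch; the data and names are made up):

    matrix = [[1, 2, 3], [4, 5, 6]]

    # Loop version: pedestrian but obvious.
    evens_squared = []
    for row in matrix:
        new_row = []
        for value in row:
            if value % 2 == 0:
                new_row.append(value ** 2)
        evens_squared.append(new_row)

    # "Golfed" version: the same thing as a doubly-nested comprehension.
    evens_squared = [[value ** 2 for value in row if value % 2 == 0] for row in matrix]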

Later in college when we started doing more team projects, comprehensible code became much more important. No one had the time to understand someone's doubly-nested list comprehension. We just wanted to finish the project and get on with life. In this sense, I think the moment you read someone else's shitty code is the moment you realize that you need to write good code yourself.

Still, I'd say it was good practice and helped me think about code in different ways.


There is an element of relativity to this though. I guarantee that there are programmers and teams for whom nested list comprehensions are much more comprehensible than 10 lines of loop code that could be doing anything and have to be carefully analyzed to be understood. This holds for just about any programming technique that you might encounter.

Of course, Kernighan's law still applies, it's just that the definition of "too clever" recedes with skill and experience.


A counterexample: I saw a blog post by a CS professor picking apart a single-line implementation of the Sieve of Eratosthenes. She points out that the first clue is that its performance is bad, like really, really bad. And then she shows it's not actually implementing the Sieve of Eratosthenes.
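I don't have the post in front of me, but the shape of the argument is something like this (my own Python illustration, not her code):

    # The "clever" version people call a sieve: it is really trial division,
    # testing each candidate against every prime found so far, which is far
    # slower than a true sieve.
    def fake_sieve(n):
        primes = []
        for candidate in range(2, n + 1):
            if all(candidate % p for p in primes):
                primes.append(candidate)
        return primes

    # The genuine Sieve of Eratosthenes: cross off multiples, never divide.
    def real_sieve(n):
        is_prime = [True] * (n + 1)
        is_prime[0:2] = [False, False]
        for p in range(2, int(n ** 0.5) + 1):
            if is_prime[p]:
                for multiple in range(p * p, n + 1, p):
                    is_prime[multiple] = False
        return [i for i, flag in enumerate(is_prime) if flag]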

I might just be dumb but I've been bitten so many times I'm dubious anything can be verified by inspection.


I have experienced this.

The most brilliant programmer I've ever worked with wrote hardly comprehensible code. Since they could understand others' code easily, they didn't feel the need to write readable code because, in their words, "What do you mean you don't understand it? It compiles, it runs, it works, you don't need more than that".


Some parts of Python are good for readable code, some are not.

I bemoan the lack of nice chaining. Some libraries provide APIs that allow chaining, e.g. Pandas. In other cases, I often find JavaScript (map/filter) and R (dplyr pipe operator) pipelines nicer to read.

However, the Zen of Python can (and IMHO should) be adopted by other languages.


I'll second this.

One grad student friend of mine traded me a case of beer for a Saturday of help on his code. It was some code to record spike timings in the olfactory cortex and also control valves (smelly research).

By the 12th nested for-loop, I gave him back the beer.


I was helping a guy with some text-based game he was working on that had some pretty clever bits of code for constructing scene description text.

The problem was that once in a while you'd just get some garbled text on the screen.

Eventually I figured out why: Some of the sentence fragments were malloc'ed, others were returned from the stack. Once the description got over a certain number of bytes he was smashing his own stack. I say 'his' but the problem was introduced by an earlier collaborator.

Buddy, I really want to help you with this but I (much younger me) am not up for unwinding a use-after-free bug of this level of recursion. Good luck, I'm out.


I thought this story was going to end with you consuming the beer on the spot.

On several occasions, and with lesser crimes, I've told the person something like "I think you are confusing yourself with your own code. I want you to change to meaningful variable names (stop recycling variables) and factor out a couple of child functions here, and here. If you still can't see the problem, come get me and we'll try this again."


> I thought this story was going to end with you consuming the beer on the spot.

Haha, no way. Grad students are poor as is. Drinking a month's beer budget in front of the guy would be bad, leaving him with that code was cruel enough.


Twelve nested loops, now that falls under the not-so-smart kind of code.

Less code golf, more hackathon-style "one more copy-paste and it should work". It has its use cases; readability and maintainability just aren't among them.


This. I'm not saying I know much about good software engineering practices, but I used to be interested in amateur astronomy and spent time translating Fortran and BASIC code written by astronomers to C, and it was awful bordering on enraging. Several years ago I discouraged someone who called me about a job supporting supercomputing applications for scientists - perhaps subconsciously due to this prejudice.


You shouldn't feel bad about this, you might have saved someone from a lifetime of fixing wacky science bugs in exchange for not a lot of money.


>someone

Me?


I am one of those who moved from Perl to Python. As a Perl coder I loved writing clever hacks. After a few months of Python I began appreciating simplicity and elegance in code.


"The competent programmer is fully aware of the strictly limited size of his own skull; therefore he approaches the programming task in full humility, and among other things he avoids clever tricks like the plague. " - EWD 340 [1]

[1] E.W. Dijkstra - The Humble Programmer, ACM Turing Lecture 1972 (https://www.cs.utexas.edu/~EWD/transcriptions/EWD03xx/EWD340...)


Really interesting to read this now, almost 50 years later. So much has changed — and so little has changed.


Absolutely. Many of Dijkstra's EWDs have aged remarkably well. [1]

[1] https://www.cs.utexas.edu/users/EWD/


There's a saying in my family, which is full of good cooks:

The sign of a great cook is not the ability to make great dishes -- it's being able to fix dishes that someone else screwed up!


Nice one. Added a paraphrased version to https://github.com/globalcitizen/taoup


Wow! Interesting to see "firsthand" how the various bits of folk wisdom we come across make it into fortunes. Nice work on this project by the way!


I've always found this one a bit peculiar - usually debugging is about looking at something on a simpler, lower level than implementation. Rather than looking at a whole set of logic, let's step through it piece by piece, look at each operation. When done slowly with care and understanding of the lower level components in the system, debugging code is simpler than learning and implementing most architectural patterns in all but the most extreme of circumstances (compiler bugs, hardware faults).

Or maybe I just spent too much time in OllyDbg at a young age to notice.


> usually debugging is about looking at something on a simpler, lower level than implementation

the lower you go, the less context you have about the author's intent. with too-clever code it's easy to miss the forest for the trees.

e.g. it usually works fine to fix spelling errors at word level, but less well to restructure complex sentances without larger context.


Please tell me “sentances” was intentional.


Waht maeks yuo tihnk twas intantional?


As others have pointed out, while it's easy in principle, it's not easy if you're dealing with actually complex (and clever) structures.

I don't know if you've ever had that experience in your OllyDbg-using days, but did you ever try to analyse software that was using some form of hashing function, or cryptography (ECDSA, for instance), with OllyDbg, without knowing what it was? The idea is kind of like that. While it's easy to get a concrete idea of what that block of code is doing (feed it an ASCII string and some memory structures, it spits gobbledygook out), it's not so easy to look at it and conclude "Ha! This implements AES-CBC!" without some strong intuition or experience.


I will note that it doesn't actually take all that much reverse engineering experience to be able to recognize most algorithms if you know how they work "in code".


If they're inlined and optimized, it's often hard to tell where they begin and end... and if anything else is interleaved into the beginning or end.


Hard, but not impossible. Usually the really annoying bit is if it ends up being split across multiple functions with stuff in between.


I will usually plop a breakpoint at where the problem is presenting itself (often in a controller or view), scan up the stack to find where I think the problem is actually being caused, then break at that point and walk through the code as you suggest.

If I understand the application really well often I don't need to do the first step, but it can be a handy shortcut.


I think the law could be put more generally: specifying how to do X using tool-set Y is half as hard as discovering the odd corner case where tool-set Y breaks down.

So, yeah, specifying an architecture may be easier than debugging a routine but discovering the "hole" in the architecture can be twice as hard as specifying it.


> Or maybe I just spent too much time in OllyDbg.

Yes. Because if you have to use a debugger to understand a piece of code (your own or someone else's), the author has already failed. Good code can be read and understood without any additional tools.


For complex code, the debugger alone won't give you understanding. You can see how it works, but you can't get the design of why it works (or why it should work, had it been implemented to the design).


I don't know, if I can debug code on my workstation, that does not seem extremely difficult. Time consuming, if anything, but I feel like anyone with sufficient grit could do that, regardless of education.

Tracking down a bug and fixing it with a one-liner does not feel like productive programming (though it brings value to the business).

If I can't debug code on my workstation, e.g. it's running on Kubernetes with service mesh and a cool feature must be added, that's when all hell breaks loose. For example, debugging why Kubernetes deployed on OpenStack will push an image to its internal repository but won't pull from it, that is hell and I'd much rather debug HTTPD than that.


For simple bugs, yes. However, if the bug is a one-in-a-million race condition between threads, the lock may be held/released at the right time everywhere except one place where it isn't even obvious the other thread can write the data. This is particularly bad if there are performance considerations and the design ensures that some places which look like a race actually are not, so adding a lock there would be a bad thing. Without fully understanding the complex design you cannot add the needed lock without adding unneeded locks that kill performance.


> However if the bug is a one in a million

This is what RR is for: you only need to be able to reproduce the bug once inside RR.

> ensures some places where it looks like a race are actually not

Uh be careful, if you're writing in a high level language... such as C: there are no safe data races.

https://software.intel.com/en-us/blogs/2013/01/06/benign-dat...

The compiler is free to reorder operations in ways that make what appears to be a safe data race unsafe.


There are limits you can place on the compiler. There are atomics in the compiler. There are times when you know you are safe because there is a future write that is memory barrier protected so even though there is a potential race, before the other threads can read it this thread will write something else. There are times where you know the other thread isn't started yet. There are platform specific ways to flush all the caches long after the race in question which can be tied to something else that is synchronized across threads...


> There are times when you know you are safe because there is a future write that is memory barrier protected so even though there is a potential race, before the other threads can read it this thread will write something else.

Often it is difficult to prove this to the compiler.


I don't need to prove it to the compiler. I can prove it by a higher level understanding of my own system. So long as the compiler emits the memory barrier when I need it to, and the other thread stays in the state where it won't be reading the memory in question it doesn't matter what the compiler knows.


>The compiler is free to reorder operations in ways that make what appears to be a safe data race unsafe.

So is the CPU.


Only as documented by the architecture. Memory order can be affirmatively proven by reading assembly as long as you know the rules.


How do you mean? On x86 even if you have the assembly, you don’t know what order loads and stores will execute in. Depending on the architecture there will be some constraints, but usually not enough to guarantee a particular order. That is why barriers exist.


Memory barriers (also cache control mechanisms) are documented by the architecture, and are exposed as instructions or similar hardware interfaces. Obviously out-of-order execution exists, but it's not a magical insurmountable fence (see what I did there?) that blocks all optimization. It has well-specified rules on every architecture. If it didn't, concurrent code couldn't be written correctly at all.

And on x86, the rules amount to "the processor will expend herculean effort to make sure that all memory orderings look identical to all readers in the system" (with a few oddball exceptions, read the SDM, yada yada). Really, on x86 you pretty much don't have to worry about this. If two memory operations happen in a given sequence in the disassembly, literally everyone who reads those locations will agree they happened in the order presented, no matter what optimizations the hardware might have done internally.


Sure, as long as DMA or SMP isn’t involved. And I’m not sure why you are bringing up optimizations here? Out of order memory access is itself a fairly important optimization, why would it preclude optimization?


On x86 the architecture guarantees memory read/write order. They are not allowed to make some important optimizations because it would change observed memory read/write. DMA or SMP makes no difference to this.

On most other architectures the CPU can and will change memory read/write order. As a result there is a lot of multi-threaded code that works correctly on x86 that fails when run elsewhere.


X86 absolutely can make observable reorderings. Here’s a link with citations to Intel docs if anyone wants to read more. https://bartoszmilewski.com/2008/11/05/who-ordered-memory-fe...


FWIW: this gets argued about occasionally, but consensus seems to be that the cited line in the SDM is documenting a misfeature on an older CPU (though the details escape me about which it is). That effect is, IIRC, not observable on current hardware.


Maybe. I’ve implemented a ring buffer that is used between two virtual machine domains. There were a few places where barriers were needed. If they were removed the ring buffer would start corrupting data. These barriers are in addition to the many obviously needed compiler barriers.


On x86_64 this is the only kind of reordering that is allowed; there are other things, but they are presented in a way that guarantees apparent order.


No real world compiler exists for that regime that doesn't provide appropriate control over memory ordering. But yes, the problem is difficult and you have to use those features.



> if I can debug code on my workstation, that does not seem extremely difficult. Time consuming, if anything,

I find debugging saves time overall. I don't have to guess about anything, I can see it working or not working, quickly run one liners to confirm. Sometimes getting an application into a certain state takes a bit of wrangling too, and a debugger helps save time by not having to constantly set up that state to see how things have occurred. You can just sit on a breakpoint while you deduce what's going on. A bug that happens at checkout success is a good example. Going through checkout over and over again is time consuming.


> debugger helps save time by not having to constantly set up that state to see how things have occurred

Absolutely. I found out very quickly in my career that I can either stare into the code for hours to see the tiny mistake, or spend 10 minutes stepping through the code to let it tell me what's wrong.

But, it is of course more difficult if you have to debug a complex code base that you don't know. It might take you hours to even know where to place debuggers. So, it can be time consuming, but of course it's still orders of magnitude faster than staring into the code, trying to see where the issue is.


I don't know, if I can debug code on my workstation, that does not seem extremely difficult.

If you wrote the code, sure. Then you have the context of each step you took to iterate the code to what's there now, with some insight into where the issue might lie.

Take someone else's code that's the final, non-working version and it's a different story. Debugging is hard.


Is that a common issue you’ve hit with K8S?

I had this exact issue in an OpenShift system.


You're ready for a promotion from software engineer to senior software engineer once you've spent hours debugging some piece of software, only to realize it's actually working as intended.


Is this implying the debugging was due to the code appearing incorrect, or that the previously perceived erroneous behavior was actually intended?


The previously perceived erroneous behavior was actually intended. Just code that's extremely difficult to understand and debug.


A possible corollary is that if you are smart enough to make the code simple, you are also smart enough to debug that code.

(Making simple code for complex problems is hard!)


I agree it's hard to build a simple solution to a complex problem, but not necessarily wrt what we normally talk about as problem solving skills in programming.

What I mean is the hard part is stepping back and actually solving the real problem in the best way. That might be utterly trivial code wise, it might be brute force vs optimal. It might be just solving a readily parallelisable problem on a single thread. It might be a unilanguage monolith instead of microservices in the 'right language for every task', etc.

The smartness involved in making code simple (and I mean simple, not elegant) is more about pragmatism, lateral thinking, discipline and focus on end goals than any sort of analytical smartness.


Normally doing the former entails the latter anyway...


The intelligence required to write a piece of code =/= the intelligence required to understand it. Usually the second is a lot easier. Writing a fancy Haskell typeclass to structure a problem might require deep expertise or creativity. Working with such an existing solution isn't as hard.

Even in math or science it's harder to create something new than it is to merely imitate it.


It's easier to write code than to read it. This isn't a proven fact, but it is generally accepted by many. It doesn't seem obvious at first why, but "reading code" isn't like "reading".

https://jonathanfries.net/code-is-still-harder-to-read-than-...


It's also called the law of "Uphill Analysis, Downhill Invention" which states "It is much more difficult to guess the internal structure of an entity just by observing its behavior than it is to actually create the structure that leads to that behavior."

Been reusing and repeating this one quite a lot, since I saw it mentioned in one of this year's RubyConf keynotes: https://blog.jessitron.com/2019/11/05/keynote-collective-pro...

(There is a blog-post version behind this link as well, in case anybody wanted to skim the contents rather than watch video of the entire talk to find out how it fits in.)


Uphill Analysis, Downhill Synthesis is a major point in the book Vehicles. [0]

Fascinating thought experiments.

Here’s an introduction on YT [1], though it really only scratches the surface.

[0] https://www.goodreads.com/book/show/483485.Vehicles

[1] https://m.youtube.com/watch?v=A-fxij3zM7g


Fancy algorithms or one liners are not all that difficult to understand, because all the information is right there. It's when you're writing convoluted code inside a big system that issues of understanding begin to arise, because the incoming developer is unlikely to have all of the context that made the code make sense to the person who wrote it.

It's hard to write a concise example because it really takes a big application to make the point, but consider:

    featureFlag(10, false);
    processOrder();
What's flag 10? I have to go figure it out, wastes time, oh turns out it was to disable tax, why are we doing that? Okay in the function that called this one we checked if we were processing a subscription order.

versus

     if(order->isSubscription)
        processOrderWithoutTax()


Imho this is a fallacy.

I would argue that in order to fully and truly understand some code you have to be able to re-create it from scratch, e.g. to explain weird constants, data structures, design decisions, and estimations of complexity. Hence, in my eyes, the intelligence required to perform those tasks is the same.

But, there is a backdoor. Solving problems needs immense creativity. Not everyone that easily grasps a piece of code (a domain expert, savant, MIT wizard) could have written it. To my eyes, that doesn't mean that the savant is necessarily of inferior intelligence, but likely less creative in this area.


It's not about working with an existing solution, but about spotting the error in it.


Yeah, no. This is not imitating something. This is troubleshooting something that does not behave as expected. The issue is that it works in the normal case/flow but not in other edge cases. You're in trouble when you no longer understand why you did things a certain way back when you thought you were being "clever".


The trick, I believe, is in convincing yourself that finding a simpler way to achieve the same result is a more worthwhile form of cleverness.

You still get to be clever, but only you will ever know how clever.


Kernighan's Law:

Debugging is twice as hard as writing the code in the first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it.

(Brian Kernighan)

The rest of the laws are interesting, so I'd recommend taking a look at those as well.


I liked the optimistic twist on this, Kernighan's Lever: http://www.linusakesson.net/programming/kernighans-lever/

Kernighan's Law means that you're continually forcing yourself to level-up in order to debug your own code. It prevents stagnation.

Use caution in multi-developer environments. :-)


Or it could mean, be too stupid for complex tools, be dumb and choose [simplicity].

[simplicity]: https://github.com/dosyago/dumbass

Personally I find the original law inspiring.


A related interpretation: your debugging tools need to be at least as good as your programming tools. Ideally, better.

Debugging a k8s cluster with print statements is hopeless, but if the cleverest code you can write in a dumb editor is going through a good test suite and profiler, you might be fine. How many as-clever-as-possible optimization tricks have been made viable by Valgrind?


I write every piece of code as if someone else is going to have to be able to use it later, even if it's just for me. And I find remarkable overlap between the clarity that some random person would need and the clarity that future-me ends up needing.


This law is actively biting me in the ass right now as we speak ;)

Gotta love T-SQL, where hard things are easy, easy things are hard, and one must perpetually choose between extreme cleverness and extreme verbosity because there is absolutely no middle ground.


> Debugging is twice as hard as writing the code in the first place. Therefore, if you write the code as cleverly as possible, you are, by definition, not smart enough to debug it.

The expression "by definition" is misplaced here. The first proposition "Debugging is twice as hard as writing the code in the first place" is not a definition. At best, we can treat it as an axiom. In any case, the conclusion "if you write the code as cleverly as possible, you are not smart enough to debug it" does not follow by definition.


I promised myself I was going to stop getting sucked into pedantry.

I believe that the sense in which he meant 'by definition' is that it's tautological to say that more of something is more than less of something.

For n > 0, 2n > n


The meaning of what the quote is saying is clear. It is also clear how the logical reasoning is supposed to be. My - admittedly pedantic - point is that the choice of words is incorrect.


I worked in a technical support department for a long time. I often was the guy between the customer (the technical customers) and the engineering teams.

After a while I figured out pretty good ways to manage my interactions with the engineering teams, despite having no coding experience or really any visibility into the code.

How I worked with Development and Continuation teams was DRAMATICALLY different.

When working with "Development Engineering" (they wrote new code) I always asked them:

"What does X, Y, Z do?"

Effectively I always asked them what they thought the code "should" do and what the results should be. There was no point in presenting them with "OMG it doesn't do the thing" because they would just panic and get defensive / shut down. Rather, I understood that they knew what their code did (well, what they thought it did), and asking them that was the key to getting them talking / sharing.

After I had their words and phrasing I could better present "Hey we see A, B, C under D conditions. I expected to see X, Y, Z." That would get a lot more buy in from those folks.

I also had to avoid some of the more obvious "hey it does G" where G would obviously break the feature entirely... they often didn't understand use cases (one guy I'm pretty sure didn't even know what the product did, but he could program an ASIC for sure..) so you had to be careful about spelling out customer experiences and rope in a program manager if you felt there was a fundamental problem.

When working with Continuation Engineering (bugfixes and etc):

These guys were much more receptive to the customer's story on how the customer was using the code / equipment, and you could much more quickly present "Seeing A, B, C, under D conditions. So then I changed L and got M..." and so on. Continuation engineering grokked the meaning of why someone would change L, and so it was helpful. Continuation often didn't know what the code "should do" (or they weren't confident in their understanding), so just saying "Saw A... that's not right" meant nothing to them. In contrast, development engineering needed to talk about the happy path first.

Note that much of this occurred after I gave them a good writeup / heads up of all the information I had (even if they didn't read it, I never kept anyone in the dark).

It got to the point that if a panicked support call came in and it was end times and someone gave engineering a heads up, they would ask it be assigned to me if I was in the office. Technically I was far from the best tech, but I could talk to engineering.

It was telling how much a difference there was between debugging and initial development.


> Effectively I always asked them what they think the code "should" do and what the results should be.

https://rubberduckdebugging.com/


Pretty much, just warming them up to get them into debug mode ;)

In my case, the customer and I both thought we knew what it should do, but it's always good to spell it out for everyone so we all understand when, or even if, we're seeing an exception.

Lots of "Woah hey is that what the protocol really says?" moments too.


Hit a performance cliff and rewrote a simple divide-and-conquer multithreaded application into a two-thread solution using hand-rolled map and list structures using optimistic concurrency with atomics and paying attention to dining philosophers. Didn't work first time, funnily enough.

For about a day I was terrified that it would never work. Then I found my mistakes (two).

Except in emergencies like this, my time is much better spent writing "inefficient" yet easy to develop code that everyone can understand.


This highlights the importance of writing clean, legible, and well-thought-out code but is also an excellent example of hyperbole and false statistics. I've debugged my code consistently for 15+ years (which if you were to take this as law, would imply I doubled my intelligence with a crazy-high compounding rate, and while I enjoy my ego, that's a bit too much for me).


I'm tickled pink to see Hyrum's Law included here. Hyrum and I were grad school chums. Not too many brighter folks out there.


That's awesome! Hyrum's Law is one of my favourites in the repo and one I share the most often :)


Writing code that can be debugged is a skill that can’t be taught, only learned from experience. Sometimes painful experience.



Such a good read, thanks for sharing this page! (And the authors of course)


This is one of the things I learned to really appreciate about Go. Code readability and comprehensibility is way better than any other language I've ever used.


The output of a machine learning algorithm is code that no one was clever enough to write. In light of Kernighan's law, that should give you pause.


Although curiously it's a lot more difficult to write simple code (if you want it both complete and correct).


Eh... Not denying the law is correct, but this kind of thinking makes more sense when you're not doing some cloud SaaS solution that can have daily changes due to business needs and continuous deployment. Code can get complicated over time because the business moves fast and there isn't time to do everything perfectly. It isn't always people trying to be too clever by half.


The business folks can be too clever too.

Dealing with that at work right now. We got super clever, changed a couple times, tried to be even more clever, and engineering gets to pick up the mess and try to make it all work.


I say: get clever, spend the inordinate amount of time debugging, then level up.


Disappointed that there was no „Greenspun‘s tenth rule“. It‘s HN, after all ;)


title would be less vague as

/s/it/your cleverest code


> title would be less vague as

True, but that's too many characters for HN's title field, which might be why it was submitted in editorialised form.

> /s/it/your cleverest code

massive nitpick but you should be suffixing rather than prefixing the regex with a slash:

    s/it/your cleverest code/
As I said, this is a massive nitpick; literally the only reason I pointed it out was because the submission was about debugging code and I liked the irony of debugging someone's post about debugging code :)

(there is another law somewhere about how people who nitpick are prone to making mistakes in their corrections, thus getting nitpicked themselves -- I'm hoping I'm not the exception hehe)


"there is another law somewhere about people who nitpick are prone to making mistakes in their corrections thus getting nitpicked themselves"

https://en.wikipedia.org/wiki/Muphry%27s_law


haha, oops! can you tell i'm primarily on windows?

something about needing the hn editor to be a REPL...


Another way to think about this: The more steps that exist between the developer and the end product the less likely the developer is to care or identify whether their solution is clever.


And if you write Laws thinking you're Moses, you're not smart enough to code.


You don’t think Brian Kernighan is smart enough to code?


And if you don't know who Brian Kernighan is, you're not informed enough to judge him.


Why do we pay attention to BS like this?


Which brings us to the interesting question: Is God smart enough to debug its own code?



