The hardest program I've ever written (2015)

journal.stuffwithstuff.com

80 points by jacobedawson 4 days ago

Mine was an automated file transfer system that had to be 100% reliable on an insanely unreliable network (~95% uptime). Took about 9 months of bug squashing after development was done. So many edge cases. I would probably never mention this in a job interview because I doubt most people would understand why it was so hard.

bjoli 6 hours ago

I once wrote an inliner. When you have not done it, it seems simple. When you are doing it it is like trying to restrain a large rabid dog with a slippery leash.
Now, I am not a programmer by trade, but I have a hard time thinking anyone would find it nice to write an inliner. At least not if you want the inliner to always make things faster.
greazy 6 hours ago

I'm working on this exact same thing. Was your code ever published or did you blog about it?

PaulKeeble 15 hours ago

About the worst job on any enterprise software project is the PDF output, they always end up doing it for emails or something else and its a never ending list of bugs. Text formatting is a never ending list of problems since its so got a lot of vague inputs and a relatively strict output. Far too many little details go wrong.

vbezhenar 13 hours ago

With PDF, my best approach was to go very low level. I've used PDFKit and PDFBox libraries and both provide a way to output vector operations. It allows to implement extremely performant code. The resulting PDF is tiny and looks gorgeous (because it's vector). And you can implement anything. Code will be verbose, but it's worth it.
I even think that it's viable to output PDF without any libraries. I've investigated that format a bit and it doesn't seem too complicated, at least for relatively dumb output.
mikeday 11 hours ago

We've spent twenty years working on HTML to PDF conversion and I expect we could easily spend another twenty years, so feel free to give Prince a try if you would rather avoid the headache :)
- twotwotwo 5 hours ago
  
  Awesome. From curiosity: is Prince's core still written in Mercury? (Looked at old comments.)
  - mikeday 4 hours ago
    
    Absolutely! The CSS support, layout engine, PDF output, and JavaScript interpreter are all written in Mercury, while the font support that was originally a mix of Mercury and C has now been rewritten as a standalone Rust project, Allsorts.
kbbgl87 12 hours ago

Thinking about phantomjs and rasterize brings back nightmares
huflungdung 12 hours ago

[dead]

sema4hacker 16 hours ago

> That handful of code took me almost a year to write.

Formatting can be tough. See Knuth's extensive bug list for TEX from 1987 at https://yurichev.com/mirrors/knuth1989.pdf to see the kind of tarpit one can get trapped in.

rafabulsing 10 hours ago

In my brief experiments with Flutter, I must admit I didn't enjoy the experience of using the autoformatting. Not knocking the author of the tool at all, I can definitely see how absurdly hard it is to create something that does what it is trying to do. And I'm not against autoformatting in general either. I think gofmt works much better, and that's in large part because it tries to do less.

With dart, I felt that very often, when time I saved a file after editing (which activated the formatter), the code would jump around a lot, even for very small edits sometimes. I actually found myself saving less often, as I found the sudden reorganizing kind of jarring and disorienting.

Most of this, I felt like came from this wish of making the formatter keep lines inside a given width. While that's a goal I appreciate, I've come to think is one of those things that's better done manually. The same goes for the use of whitespace in general, other than trivial stuff like consistent indentation. There's actual, important meaning that can be conveyed in how you arrange things which I think is more important than having it always be exactly mathematically consistent.

It's one of the reasons I still prefer ESLint over Prettier in JS land, also, even for stylistic rules. The fact that Prettier always rewrites the entire file from scratch with the parsed AST often ends up mangling some deliberate way that I'd arrange the code.

fn-mote 7 hours ago

One of the lessons I took from the formatters (Python, Go, Rust) is that enforcing the same style ends all of the drama - indeed, all of the thinking - about how to format code. I like that.
I run my formatters manually, so I can’t comment on the jumps in code. That does seem jarring.

Panzerschrek 6 hours ago

For some time I tried to write a formatting tool for my programming language. After achieving first results I gave up. I have found that writing a formatter is surprisingly hard task. Operating on token-level can't provide good enough result, so a proper parser is necessary. Reusing existing parser isn't possible, since it ignores whitespaces and comments. Even more problems creates the necessity to preserve user-defined formatting in ambiguous cases.

So I thought, that there is no reason trying to solve all these problems, since it requires too much time investments. But without solving all this a semi-working formatter isn't good enough to be useful and not annoying.

b4ckup 15 hours ago

I once wrote a formatter for powerquery that's still in use today. It's a much simpler language and I took a simpler approach. It was a really fun problem to solve.

cjfd 16 hours ago

[flagged]

tom_ 12 hours ago

Interesting comment. I always saw the formatting aspect as the sort of drudgery the computer could do for me. (As a bonus, it will always do it completely consistently.)
chaps 16 hours ago

This might just be the most HN comment I've ever read.
- 7thpower 14 hours ago
  
  There seem to be a lot of posts that attempt to cast seemingly mundane things through an “ethical” lens and I often wonder what the authors of them must be like in real life.
  - kjs3 13 hours ago
    
    You do not want to get cornered by them at a party talking about their pet peeve.
- noir_lord 16 hours ago
  
  It's certainly an interesting take.
o11c 15 hours ago

There are 3 major problems with automated code formatters:
* Handling of multiple newlines to break sections, or none to group related functions (e.g. a getter with a setter). Sometimes it's even best to move functions around for better grouping.
* They don't factor out an expression into a separate variable.
* They destroy `git blame`. This one is avoided if the tooling has always enforced formatting.
- layer8 14 hours ago
  
  Regarding the second one, a formatter shouldn’t be changing the AST. At most inserting parentheses/braces/semicolons for clarity, which doesn’t change the AST structurally (or, depending on one‘s definition of “AST”, doesn’t change it at all).
  - o11c 13 hours ago
    
    Perhaps, but it is absolutely relevant. Most of the horrible formatting results are due to the formatter trying really hard when the user really should introduce a variable.
- dragonwriter 14 hours ago
  
  > * They don't factor out an expression into a separate variable.
  That's...not formatting, and there's probably no good deterministic rule for when to do that, anyhow.
- seanssel 11 hours ago
  
  > They destroy `git blame`
  How is this a problem with auto formatters? This is completely on the person/team, not the tool. Even if the repo doesn’t enforce on pre-commit or something, don’t most formatters have the option to only format lines you’ve actually changed?
- zahlman 12 hours ago
  
  > * They don't factor out an expression into a separate variable.
  * They implicitly enable people to write deeply nested code that lacks such factoring, without feeling like anything has gone wrong.
mjsir911 15 hours ago

yeah, I've come to terms that I mostly do programming-as-an-art and that includes how my code is structured, and I'm on exactly the same page.
In pragmatic business environments it's not worth the fuss but I never feel great about anything I make in those kinds of environments anyways, and I always appreciate being able to shine when there's no enforced code formatting.

nkrisc 14 hours ago

What is the ethical dilemma presented?

kjs3 13 hours ago

The one they invented to justify hating coding standards.

vbezhenar 13 hours ago

The human might use a carefully selected mix of tabs and spaces to output a picture. And formatter will destroy it, thus destroying artist expression.

Compare:

    #define/**/Q(x,y)r;char*q/*                         */=y#x","#y")",*p,x=*p%67;}
      /*-IOCCC2020-*/#include/*                         */<stdio.h>/*-BBQlock--*/
        int(y),x,i,k,Q(s[9<<9/*           12            */];float(f)[3];void(Z)
          (){*f=r<0?r:-r;f[1]/*      11         1       */=42.5;f[2]=22.5;for
           (k=0;++k<39;*f/=(k/*   10               2    */%2?k:-k)*6875.5/i)
            y=f[1+k%2]+=*f;k=/*                         */f[2];p=s+k/2*86+y
             ;}int(main)(){p=/*  9         o-------> 3  */s;for(;i<1978;*q
              >32?k=i++/86-11/*           /             */,y=(750>r*r+k*k
              *4)*4+y/2,*p++=/*    8     L         4    */r<44?y?"+0X+0X"
               "+!"[y]-1:*q++/*       7         5       */:10:*q++)r=-41
               +i%86;r=20;for/*            6            */(x=13;(i=3600*
               --x);*p++="XR"/*                         */"MOQSUWAY"[x%+
               10]-9,*p+=x/10/*                         */*41)Z();sscanf
               (__TIME__,"%d"/*  \       /    -------+  */":%d:%d",&k,&x
              ,&i);r=10;for(i/*   \     /     ------ |  */+=(k*60+x)*60;r
              +18;*p=k%2?*p%2/*    \   /      ------ |  */?59:44:*p>39?59
             :39,i=r--?i:i%(+/*     \ /       ------ |  */3600)*12)Z();for
            (p=s;*p;putchar(k/*      X        ------ |  */%2&&k<14?q="End",
           printf("%c%c",224|/*   __/ \__         |  |  */(21554>>k&3),"gCS"
          "gGMX"[k/2]+65),"E"/*  /  \ /  \        |  |  */"Gh_BrG"[k/2]+64:*p
        ),++p)k,"#define/**/"/*  \__/ \__/        +--+  */"Q(x,y)r;cha""r*q=y#"
      "x\",\"#y\")\",*p,x=*p"/*                         */"%67;}/*-IOCCC2020-*/#"
    "include<stdio.h>/*-BBQl"/*                         */"ock--*/int(y),x,i,k,Q(")

and clang-formatted version:

    #define /**/ Q(x, y)                                                           \
      r;                                                                           \
      char *q /*                         */ = y #x "," #y ")", *p, x = *p % 67;    \
      }
    /*-IOCCC2020-*/ #include /*                         */<stdio.h> /*-BBQlock--*/
            int(y),x,i,k,Q(s[9<<9/*           12            */];float(f)[3];void(Z)
              (){
      *f = r < 0 ? r : -r;
      f[1] /*      11         1       */ = 42.5;
      f[2] = 22.5;
      for (k = 0; ++k < 39;
           *f /= (k /*   10               2    */ % 2 ? k : -k) * 6875.5 / i)
        y = f[1 + k % 2] += *f;
      k = /*                         */ f[2];
      p = s + k / 2 * 86 + y;}int(main)(){
      p = /*  9         o-------> 3  */ s;
      for (; i < 1978; *q > 32 ? k = i++ / 86 - 11 /*           /             */,
                                 y = (750 > r * r + k * k * 4) * 4 + y / 2,
                                 *p++ = /*    8     L         4    */ r < 44
                                            ? y ? "+0X+0X"
                                                  "+!"[y] -
                                                      1
                                                : *q++ /*       7         5       */
                                            : 10
                               : *q++)
        r = -41 + i % 86;
      r = 20;for/*            6            */(x=13;(i=3600*
                   --x);*p++="XR"/*                         */"MOQSUWAY"[x%+
                   10]-9,*p+=x/10/*                         */*41)
        Z();
      sscanf(__TIME__, "%d" /*  \       /    -------+  */ ":%d:%d", &k, &x, &i);
      r = 10;
      for (i /*   \     /     ------ |  */ += (k * 60 + x) * 60; r + 18;
           *p = k % 2     ? *p % 2 /*    \   /      ------ |  */ ? 59 : 44
                : *p > 39 ? 59
                          : 39,
           i = r-- ? i : i % (+/*     \ /       ------ |  */ 3600) * 12)
        Z();
      for (p = s; *p;
           putchar(k /*      X        ------ |  */ % 2 &&k < 14
                   ? q = "End",
                     printf("%c%c",
                            224 | /*   __/ \__         |  |  */ (21554 >> k & 3),
                            "gCS"
                            "gGMX"[k / 2] +
                                65),
                     "E" /*  /  \ /  \        |  |  */ "Gh_BrG"[k / 2] + 64 : *p),
          ++p)k,"#define/**/"/*  \__/ \__/        +--+  */"Q(x,y)r;cha""r*q=y#"
          "x\",\"#y\")\",*p,x=*p"/*                         */"%67;}/*-IOCCC2020-*/#"
        "include<stdio.h>/*-BBQl"/*                         */"ock--*/int(y),x,i,k,Q(")

The beauty is gone!

jbreckmckye 13 hours ago

I agree, actually. Humans are visual creatures and they take cues from the visual design of program code: alignment, grouping, density. All these things are used to signify meaning in product design, including digital design. But we are not allowed to "design" source code.
andrewflnr 15 hours ago

"Ethics" is overdramatizing it. The goal of a code formatter is not greatness, but adequacy, in a context where the code is a means to an end. They're particularly used in contexts where you may be sharing the project with people who don't care about formatting at all. Forcing me to work in or clean up the messes of my lazy co-workers is also, I would suggest, not very humanistic.
Feel free to not use one in your art projects.
mackeye 13 hours ago

semantic code style varies plenty between people; i can often tell who wrote which code, among the members of my research group, even though we format our code. i'd prefer to be recognized by my preference for short lambdas (vs. defs) and partial functions, than my preference to put if statements on one line.
dredmorbius 13 hours ago

Ethical, or aesthetic?
Both are branches of philosophy, but rather distinct from one another.
bartekpacia 15 hours ago

weirdest take on code formatting I've ever read
imho uniformity of what the code looks like > some single person's opinion
it's so satisfying to me when I just run "gofmt" and know the thing is formatted well.
- spc476 12 hours ago
  
  If I didn't want opinions, I'd join a cult.
- ncruces 14 hours ago
  
  Gofmt's style is no one's favorite, yet gofmt is everyone's favorite.
wiseowise 14 hours ago

Humanistic is bikeshedding for weeks over new lines and “I feel like this is more readable”.
- cjfd 4 hours ago
  
  The bike shedding is optional once one accepts that code written by someone else might look a bit different from ones own. _Some_ conventions are beneficial, and one might discuss those, but one does not have to have complete uniformity.
  Bike shedding for weeks might happen and is not helpful. On the other side of this argument there are not-so-very-pleasant things as well. A: We follow this-and-this coding convention and we do not like wasting time on discussing it so we follow it to the letter. B: I don't like this small thing that I would like to write differently. A: We follow this-and-this coding convention and we do not like wasting time on discussing it so we follow it to the letter. B: You actually don't follow this-and-this coding convention because you are not following this-and-that rule. A: Yes, that is a change that I liked.
  One can easily end up with a very old coding convention that only the oldest developer likes so the oldest developer can now be dictator over everybody else.
jibal 14 hours ago

You surely forgot the "/s".
The uniformity is not merely for the sake of uniformity.

mmaunder 14 hours ago

[flagged]

jibal 14 hours ago

No.
- mmaunder 14 hours ago
  
  [flagged]
  - Maxatar 13 hours ago
    
    I'll just speak for myself, but I use Copilot with Visual Studio and C++ and Copilot does an excellent job of interpreting the instructions given to it and ensuring that the code I write is consistent with the coding style and standards specified in the instructions.
    I have it set to check my code live, as I write it, and it will reformat my code on the fly to break lines up, fix typos, insert code snippets, autocomplete documentation, etc...
    I will admit it was very awkward using it at first and I actually kind of hated it because it felt so intrusive, like having it modify code as I write it felt really distracting at first, but after maybe 2-3 days of doing it I got used to it and now I feel a lot more productive having it there. Especially for a language like C++ where sometimes the most menial syntax issue can have significant and hard to track down consequences, it's nice to have an LLM that can focus on these menial details while I focus on the higher level task.
  - jibal 13 hours ago
    
    https://news.ycombinator.com/newsguidelines.html
    "Please don't comment about the voting on comments. It never does any good, and it makes boring reading."
    Other elements also apply.
    
    zahlman 12 hours ago
    
    I agree, but the simple "no" response was not especially productive either.
    
    jibal 6 hours ago
    
    My comment was about the now dead comment I responded to. Yours is not in any way relevant to that.
  - kjs3 13 hours ago
    
    No...it's that "can an LLM do it better" with no other commentary is perhaps the absolute lowest effort, lowest value comment possible on HN these days and mercifully gets an appropriate downvote. The condescending response to downvote is also par for the course from a particular sort of reader.
    
    andrewflnr 13 hours ago
    
    Someone complaining about lack of detailed response, when their own contribution was basically just "but what if LLM?", is very rich