Here is a little speech about great language designers. Once upon a time there was a very great language designer called Niklaus Wirth. He was very brilliant, no question about it, and a very wonderful, kind, charming, and witty person. He designed his first programming language, Euler, at Berkeley. Then came ALGOL W. Then Pascal. After that, Modula, and then Modula-2. Then Oberon. It was such a beautiful language; this is why all the world is programming in it (joke).
He is indeed a very brilliant language designer and a great computer scientist, but he would design a beautiful language, observe that it was limited, throw it away, and design a better one. What about the customer base? Well, who cares. This is why we still have so many people programming in Oberon: by the time he got to Oberon, people had gotten tired.
This is why I claim that, to be honest, Bjarne is literally the greatest language designer, at least after John Backus. John Backus invented the first useful programming language, Fortran. It is still a very useful programming language. Then came C and C++. Dennis was brilliant. He did C and then washed his hands and walked away, which is very wise. I sympathize. But Bjarne started working on C++ in roughly 1978 (first released around 1981), 35 years ago, and he never abandoned us. It was never perfect, but he would work on it, and work on it, and work on it, and go to horrible meetings of the standards committee, and listen to people who know nothing. His hair fell out. It was a terrible, terrible life. But he had this sense of duty to develop the language further, and further, and further, with the most advanced language mechanisms known to humankind. What you can do right now in C++, you cannot really do in any other language. But it requires patience, determination, and genius. Whatever decisions he made in 1979 did not lead to a stalemate later on. There is some ugly stuff, but you can avoid it. Being able to evolve a language for that long is incredible. I have no other example, not just in language design, but in all of computer science. Ken Thompson did UNIX, but do you think he stayed with UNIX? No; in his Turing Award speech he said he had stopped working on UNIX a long time before. It is very difficult.
I am a clear example of a lazy bum. STL was voted into the draft standard in August 1994, 20 years ago. How many times did I attend standards committee meetings after that? None. How many times did I look at proposals related to STL? Did I do anything related to it? Nothing. This is why I have my sanity, but it is also why, compared with Bjarne, I am a failure. I let people do things with STL that should have been prevented. I did not evolve it. I did not grow it. I walked away. I know, it's a free country; you can do that. My advice to most of you: if you want a good life, follow my example, because it is very hard to do what Bjarne does. I literally cannot point to a single other example of a person who keeps working. McCarthy invented Lisp. After Lisp 1.5 he was gone. He did not follow through. He did not contribute, and so on. Everybody does that, because we are weaklings.
So C++ is a great accomplishment, but of course it has flaws. C and C++ are extremely instructive languages. People think that studying Haskell is instructive. I don't know about that, but studying C is very instructive, because Dennis was trying to solve real problems. Even his mistakes are instructive. C and C++ are great precisely because they are works in progress.
Since all STL algorithms are built on fundamental operations, what we can do is write a wrapper, or adapter, class instrumented<T> which takes a type T and behaves exactly like T. You will be able to put instrumented<T> into any algorithm and it will behave normally, except that in addition it will count all the operations applied to it.

Which operations do we count? The ones specified in the concepts we discussed: T will be SemiRegular, Regular, or TotallyOrdered. Our instrumented class will redefine all of those operations (copy constructor, assignment operator, equality, and so on), adding code to count them. Then we can take any STL algorithm, such as sort, unique, or stable_sort, run it, and get a performance measurement.
For example:
std::vector<double> vec;
// ...
my_func(vec.begin(), vec.end());
Could be replaced by:
std::vector<instrumented<double>> vec;
// ...
my_func(vec.begin(), vec.end());
And it will count all operations.
Writing this particular class will teach you once and for all to write Regular classes right.

Exercise: Before continuing on, try to write the instrumented class for yourself and experiment.
We're going to write instrumented using the same technique we use to write all classes: take the singleton.h file we made last time and replace singleton with instrumented.
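For reference, here is a minimal sketch of what such a singleton wrapper looks like; this is a plausible reconstruction, not the exact file from the previous lecture:

template <typename T>
// T is Semiregular, Regular, or TotallyOrdered
struct singleton {
    T value;

    // Conversion from T:
    explicit singleton(const T& x) : value(x) {}

    // Semiregular:
    singleton(const singleton& x) : value(x.value) {}
    singleton() {}
    ~singleton() {}
    singleton& operator=(const singleton& x) {
        value = x.value;
        return *this;
    }

    // Regular:
    friend bool operator==(const singleton& x, const singleton& y) {
        return x.value == y.value;
    }
    friend bool operator!=(const singleton& x, const singleton& y) {
        return !(x == y);
    }

    // TotallyOrdered:
    friend bool operator<(const singleton& x, const singleton& y) {
        return x.value < y.value;
    }
    friend bool operator>(const singleton& x, const singleton& y) {
        return y < x;
    }
    friend bool operator<=(const singleton& x, const singleton& y) {
        return !(y < x);
    }
    friend bool operator>=(const singleton& x, const singleton& y) {
        return !(x < y);
    }
};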
Now we will do some work to count operations. In the copy constructor, we will initialize the value and add a line that bumps up the copy count, like this:
instrumented(const instrumented& x) : value(x.value) {
++counts[copy];
}
Similarly for the default constructor:
instrumented() { ++counts[default_constructor]; }
One line or three? This is a very good question. When I write code I want to do two things. This line is short, so I like one.

I have been changing my programming style depending on the people with whom I work. Paul McJones affected my programming style greatly when I started here. For example, I never used to use x; I avoided short variable names. In my old code everything is called special_variable_x. Paul convinced me that x is just as good.
Continue making similar replacements for the rest of the operations on singleton:
~instrumented() { ++counts[destructor]; }
instrumented& operator=(const instrumented& x) {
++counts[assignment];
value = x.value;
return *this;
}
// Regular
friend
bool operator==(const instrumented& x, const instrumented& y) {
++counts[equality];
return x.value == y.value;
}
// TotallyOrdered
friend
bool operator<(const instrumented& x, const instrumented& y) {
++counts[comparison];
return x.value < y.value;
}
// ... other operators should remain implemented as they were
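For completeness, here is one plausible way those remaining operators can stay as they were: defined in terms of operator== and operator< so they need no counters of their own. This is a sketch; the exact bodies depend on your singleton.h.

friend bool operator!=(const instrumented& x, const instrumented& y) {
    return !(x == y);   // counted once, through operator==
}
friend bool operator>(const instrumented& x, const instrumented& y) {
    return y < x;       // counted once, through operator<
}
friend bool operator<=(const instrumented& x, const instrumented& y) {
    return !(y < x);
}
friend bool operator>=(const instrumented& x, const instrumented& y) {
    return !(x < y);
}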
What do we do with all the counts? Where do they get stored? Every time one of these instrumented operations happens, we want some global count to be incremented. We were told that using global variables is bad. If I were doing it just for me, I would have used globals. Old guys don't mind using global variables. They're actually good. But since you are modern people, we will show you how to do it with inheritance.
We will define a base class to hold this data:
struct instrumented_base
{
enum operations
{
n, copy, assignment, destructor, default_constructor, equality, comparison, construction
};
static const size_t number_ops = 8;
static const char* counter_names[number_ops];
static double counts[number_ops];
static void initialize(size_t);
};
This is a remarkable example of a class containing nothing. Such classes are very useful things; we will use very many of them. It is very cheap to pass around things which contain nothing.

A static member is a member which exists once per class, not once per instance. Static members are useful here because we do not want to keep counts per instance; we want to keep counts per class.
The static members need to be defined and initialized in a .cpp file. At the same time, we provide a table of strings so we can label the counts:
#include "instrumented.h"
#include <algorithm>
double instrumented_base::counts[];
const char* instrumented_base::counter_names[number_ops] = {"n", "copy", "assign", "destruct", "default", "equal", "less", "construct"};
void instrumented_base::initialize(size_t m) {
std::fill(counts, counts + number_ops, 0.0);
counts[n] = double(m);
}
Why store the counts in a double instead of an int? Sometimes I want to normalize by n to compute a ratio.
I'll tell you a great secret. People all over the world spread the rumor that I am absolutely opposed to using inheritance. This is false. Inheritance is very useful when you inherit from a class containing nothing, because it can do no harm. That is what we are going to do here.
template <typename T>
// T is Semiregular or Regular or TotallyOrdered
struct instrumented : instrumented_base
There is a notorious problem in C++ with static members of templates; it is just not good. But we do not need to inherit from a template. All the different instrumented<T>'s inherit from the same base, which contains nothing at all, and we will use it as a counting device.
What is good about this is that we managed not to muck up our nice class. It is basically the same as singleton; it is fundamentally of the same structure, and we pushed all of the statistics-collection stuff out into a helper class.
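Putting the pieces together, a minimal driver might look like the following. This is a sketch, not the chapter's actual harness: it assumes the class lives in instrumented.h and that instrumented, like singleton, has a converting constructor from T.

#include <algorithm>
#include <cstddef>
#include <cstdio>
#include <vector>
#include "instrumented.h"

int main() {
    const size_t n = 1000;
    std::vector<instrumented<double>> vec;
    for (size_t i = 0; i < n; ++i)
        vec.push_back(instrumented<double>(double(n - i)));

    // Zero the counters only after setup, so the copies made while
    // filling the vector are not charged to the algorithm under test.
    instrumented_base::initialize(n);
    std::sort(vec.begin(), vec.end());

    for (size_t i = 0; i < instrumented_base::number_ops; ++i)
        printf("%12s: %.0f\n",
               instrumented_base::counter_names[i],
               instrumented_base::counts[i]);
}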
enum is a mechanism which introduces a bunch of constants. It is a very evil mechanism. I was wondering who invented enum, because it was not in the first edition of K&R1. So I asked Bjarne. He sent me an email: "Dennis. Under duress." Dennis did not quite know how to do it; he wanted to give us the opportunity to name things, but he did not invent the idea. It appeared first in Pascal, and the person who invented it was weird. Whether it worked there correctly or not remains to be seen. Dennis decided to bring it in, but the issue is that it is not really a type. C++ attempts to make it a type, but that does not quite work. You can have a variable typed with an enum which has three different values, then assign a totally different value to it, and nothing happens; the compiler does not prevent you. My recommendation is still to use them. Enums are very good when used in a limited way. But do not depend on any operations on them, and never depend on the value of a given enumerator.
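A small illustration of the weakness being described (the names here are made up for the example):

enum color { red, green, blue };   // introduces three named constants

void example() {
    color c = red;
    c = color(3);       // none of red, green, or blue; the compiler accepts it
    int x = c + green;  // enums silently convert to int and allow arithmetic
    (void) x;
}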
I use inheritance when appropriate. In general, I use any language feature when appropriate. Paul and I even use goto, and we are not ashamed. There is a famous statement attributed to Ken Thompson that the fastest way of going from one place in a program to another is the goto statement, and it is so. If you implement things like state machines it is a wonderful technique, because you have transitions: you go from this state to that state. You could write a loop with some conditional, or you could just goto and write very beautiful code, at least we believe so. Everything has its place, Dijkstra's strictures notwithstanding2.
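As a taste of the state-machine technique, here is a small sketch (not from the lecture): it counts maximal runs of the letter 'a' in a C string. Each label is a state and each goto is a transition.

#include <cstdio>

int count_runs(const char* s) {
    int runs = 0;
outside:                 // state: not currently inside a run of 'a'
    if (*s == '\0') return runs;
    if (*s++ == 'a') { ++runs; goto inside; }
    goto outside;
inside:                  // state: inside a run of 'a'
    if (*s == '\0') return runs;
    if (*s++ == 'a') goto inside;
    goto outside;
}

int main() {
    printf("%d\n", count_runs("baaab aa a"));  // prints 3
}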
To learn how to use instrumented, let's analyze the performance of the sorting routines in STL. There are a few of them, and each uses a distinct algorithm:

std::sort: It uses quicksort. Why? Because it's "quick". It's a wonderful algorithm.
std::stable_sort: A stable sort does not change the order of elements which compare equal. For example, if you sort people alphabetically and then stable sort them by department, the people will remain alphabetically sorted within every department. If you just use an arbitrary sort, they will be all over the place (see the demonstration after this list). To be stable, it uses merge sort.
std::partial_sort: What's the interface? It takes three iterators: first, middle, and last. What does it do? It rearranges the elements so that the range from first to middle contains the smallest elements of the entire range, in sorted order. For example, suppose you give it 100 elements, and you sort from the 1st, to the 10th, to the last: the smallest 10 elements will be moved to the front in sorted order, but the last 90 will be left in some unspecified order (shown in code below). Those of you who work on search know you don't really need to sort everything; you just need to sort a little bit3.
What algorithm do you use for partial sort? I'll tell you, and it's wrong: the solution STL uses was good in 1993, but it is a bad solution in 2013. It uses heap sort. That's what the algorithm books tell you, and it is what I believed was the correct solution4.
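Here is a small demonstration of stability, using made-up (name, department) records: we sort alphabetically, then stable sort by department, and the alphabetical order survives within each department. This is just an illustration, not code from the lecture.

#include <algorithm>
#include <cstddef>
#include <cstdio>
#include <string>
#include <vector>

struct employee {
    std::string name;
    int department;
};

int main() {
    std::vector<employee> v = {
        {"carol", 2}, {"alice", 1}, {"dave", 1}, {"bob", 2}};

    std::sort(v.begin(), v.end(),
              [](const employee& x, const employee& y) { return x.name < y.name; });
    std::stable_sort(v.begin(), v.end(),
                     [](const employee& x, const employee& y) {
                         return x.department < y.department;
                     });

    // Prints: alice 1, dave 1, bob 2, carol 2.
    // Alphabetical order is preserved within each department.
    for (size_t i = 0; i < v.size(); ++i)
        printf("%s %d\n", v[i].name.c_str(), v[i].department);
}

And the hundred-element partial sort example, in code:

#include <algorithm>
#include <cstdio>
#include <vector>

int main() {
    std::vector<int> v(100);
    for (int i = 0; i < 100; ++i) v[i] = 99 - i;  // 99, 98, ..., 0

    std::partial_sort(v.begin(), v.begin() + 10, v.end());

    // v[0] .. v[9] are now 0, 1, ..., 9 in order;
    // v[10] .. v[99] hold the remaining 90 in some unspecified order.
    for (int i = 0; i < 10; ++i) printf("%d ", v[i]);
    printf("\n");
}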
We want to compare how these various sort operations perform, relative to each other.
Exercise: With instrumented, compare the number of operations between these three kinds of sort5.

Refer to the code provided at the end of the chapter. A complete test harness is provided which will randomly shuffle a large list of numbers to test with and print the results in a formatted table.
Here is a sample of the output for heap sort6:
n copy assign destruct default equal less construct
16 69 91 69 0 0 65 0
32 141 211 141 0 0 168 0
64 285 479 285 0 0 402 0
128 573 1102 573 0 0 948 0
256 1149 2430 1149 0 0 2135 0
512 2301 5372 2301 0 0 4779 0
1024 4605 11765 4605 0 0 10617 0
2048 9213 25551 9213 0 0 23252 0
4096 18429 55141 18429 0 0 50602 0
8192 36861 118459 36861 0 0 109348 0
16384 73725 253428 73725 0 0 235274 0
32768 147453 539440 147453 0 0 503173 0
65536 294909 1144418 294909 0 0 1071862 0
131072 589821 2419727 589821 0 0 2274562 0
262144 1179645 5101802 1179645 0 0 4811628 0
524288 2359293 10727118 2359293 0 0 10146937 0
1048576 4718589 22502934 4718589 0 0 21342599 0
2097152 9437181 47102018 9437181 0 0 44781006 0
4194304 18874365 98400338 18874365 0 0 93758166 0
8388608 37748733 205193953 37748733 0 0 195909864 0
Another useful way to study operation counts is by normalizing the data. We know the asymptotic complexity of sorting algorithms should be O(n log(n)). So, what we can do is normalize the data to tell us, for n elements, how many operations were done per n log(n).
Here is an example of such a normalizing function:
#include <cmath>

double normalized_by_nlogn(double x, double n) {
    return x / (n * (log(n) / log(2)));  // log(n) / log(2) is the base-2 logarithm of n
}
After normalizing the data, a value of 2.86 means the algorithm performed 2.86 times as many operations as n log(n) predicts. For n = 16, that means 2.86 * 16 * log2(16) = 183 operations were counted.
Here is a sample of data for heap sort with measurements normalized:
n copy assign destruct default equal less construct
16 1.08 1.42 1.08 0.00 0.00 1.02 0.00
32 0.88 1.32 0.88 0.00 0.00 1.05 0.00
64 0.74 1.25 0.74 0.00 0.00 1.05 0.00
128 0.64 1.23 0.64 0.00 0.00 1.06 0.00
256 0.56 1.19 0.56 0.00 0.00 1.04 0.00
512 0.50 1.17 0.50 0.00 0.00 1.04 0.00
1024 0.45 1.15 0.45 0.00 0.00 1.04 0.00
2048 0.41 1.13 0.41 0.00 0.00 1.03 0.00
4096 0.37 1.12 0.37 0.00 0.00 1.03 0.00
8192 0.35 1.11 0.35 0.00 0.00 1.03 0.00
16384 0.32 1.10 0.32 0.00 0.00 1.03 0.00
32768 0.30 1.10 0.30 0.00 0.00 1.02 0.00
65536 0.28 1.09 0.28 0.00 0.00 1.02 0.00
131072 0.26 1.09 0.26 0.00 0.00 1.02 0.00
262144 0.25 1.08 0.25 0.00 0.00 1.02 0.00
524288 0.24 1.08 0.24 0.00 0.00 1.02 0.00
1048576 0.22 1.07 0.22 0.00 0.00 1.02 0.00
2097152 0.21 1.07 0.21 0.00 0.00 1.02 0.00
4194304 0.20 1.07 0.20 0.00 0.00 1.02 0.00
8388608 0.20 1.06 0.20 0.00 0.00 1.02 0.00
16777216 0.19 1.06 0.19 0.00 0.00 1.01 0.00
You remember Knuth, the author of "The Art of Computer Programming"? At the beginning of the first volume, when he introduces complexity, he tells you how to measure it. He says we measure it as a function in which different operations have different coefficients. This is supposed to allow us to predict timing; as we will learn later, nothing of the sort is true. Whatever Knuth believed when he wrote the first volume (which he did at least three times) is no longer true. We will discover that computers have gotten very strange: they actually do many operations in the same cycle, so often we can do more operations without incurring more time.
Let us talk about possible input shapes. What is a good set of data to test these algorithms on? The most basic is uniformly random data. Another shape to try is a list which is already sorted; as we'll discover later, some sorting algorithms are particularly bad on this configuration. Ascending and descending order will give different results. Another interesting shape is a constant list.

In general, we tend to assume all the elements we test on are unequal. That is neither good nor bad, but eventually we want to define some measure of the ratio of equal to unequal elements.
A random shuffle of uniformly distributed data is very good, but it is not a very realistic distribution. One which is very common in real life is called the Zipf distribution. Let me describe it incorrectly first. Assume that the most probable guy comes with probability 1, the second most probable guy with probability 1/2, the third guy with 1/3, and so on; it is the harmonic series. Of course that would not quite work, because the probabilities all need to add up to 1, so you normalize by the sum of the harmonic series up to n:

P(k) = (1/k) / (sum from j = 1 to n of 1/j)

The denominator is approximately ln(n) + gamma, where gamma is a small number7.
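One straightforward way to generate such test data, sketched here under the assumption that rand() is random enough for benchmarking: precompute the cumulative harmonic weights and sample with a binary search.

#include <algorithm>
#include <cstdlib>
#include <vector>

// Returns how_many values in [0, n), where value k occurs with
// probability proportional to 1/(k + 1).
std::vector<int> zipf_data(size_t n, size_t how_many) {
    std::vector<double> cumulative(n);
    double sum = 0.0;
    for (size_t k = 0; k < n; ++k) {
        sum += 1.0 / double(k + 1);  // harmonic weights 1, 1/2, 1/3, ...
        cumulative[k] = sum;         // unnormalized cumulative distribution
    }
    std::vector<int> result(how_many);
    for (size_t i = 0; i < how_many; ++i) {
        double u = sum * (double(rand()) / RAND_MAX);  // uniform in [0, sum]
        result[i] = int(std::lower_bound(cumulative.begin(),
                                         cumulative.end(), u) -
                        cumulative.begin());
    }
    return result;
}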
Exercise: Introduce variation into the shape of data and compare the sorting algorithms again.
Counting operations is only one measure of performance. If we apply instrumented to our problem from the first chapter of finding the unique elements, we will find that using std::set actually performs fewer of almost every operation than first sorting with std::sort and then calling std::unique. However, it is also many times slower.
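For reference, the two approaches being compared are roughly the following; this is a sketch, and the first chapter has the real versions.

#include <algorithm>
#include <cstddef>
#include <set>
#include <vector>

// Approach 1: sort, then fold out adjacent duplicates.
template <typename T>
size_t unique_count_sort(std::vector<T> v) {   // taken by value, since we reorder it
    std::sort(v.begin(), v.end());
    return size_t(std::unique(v.begin(), v.end()) - v.begin());
}

// Approach 2: insert everything into a std::set and let it deduplicate.
template <typename T>
size_t unique_count_set(const std::vector<T>& v) {
    return std::set<T>(v.begin(), v.end()).size();
}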
unique and sort
n copy assign destruct default equal less construct
16 15 89 15 0 15 81 0
32 43 111 43 0 31 182 0
64 107 357 107 0 63 460 0
128 254 649 254 0 127 1038 0
256 550 1495 550 0 255 2339 0
512 1214 3253 1214 0 511 5431 0
1024 2693 7189 2693 0 1023 12168 0
2048 5903 15203 5903 0 2047 26441 0
4096 12613 32277 12613 0 4095 59443 0
8192 27282 68458 27282 0 8191 125661 0
16384 58938 144763 58938 0 16383 271892 0
32768 124888 304651 124888 0 32767 586855 0
65536 264829 640558 264829 0 65535 1269838 0
131072 560125 1342465 560125 0 131071 2694253 0
262144 1185967 2817120 1185967 0 262143 5601029 0
524288 2496461 5881116 2496461 0 524287 11891710 0
1048576 5223092 12214973 5223092 0 1048575 25679548 0
2097152 10957202 25458959 10957202 0 2097151 52964702 0
4194304 22876934 52830701 22876934 0 4194303 111536616 0
8388608 47627169 109413625 47627169 0 8388607 238823427 0
16777216 99722291 227797832 99722291 0 16777215 487063364 0
set sort
n copy assign destruct default equal less construct
16 16 0 16 0 0 81 0
32 32 0 32 0 0 204 0
64 64 0 64 0 0 479 0
128 128 0 128 0 0 1091 0
256 256 0 256 0 0 2466 0
512 512 0 512 0 0 5498 0
1024 1024 0 1024 0 0 12051 0
2048 2048 0 2048 0 0 26117 0
4096 4096 0 4096 0 0 56529 0
8192 8192 0 8192 0 0 121727 0
16384 16384 0 16384 0 0 259995 0
32768 32768 0 32768 0 0 553227 0
65536 65536 0 65536 0 0 1170563 0
131072 131072 0 131072 0 0 2483786 0
262144 262144 0 262144 0 0 5230032 0
524288 524288 0 524288 0 0 10979385 0
1048576 1048576 0 1048576 0 0 23058567 0
2097152 2097152 0 2097152 0 0 48192395 0
4194304 4194304 0 4194304 0 0 100748922 0
8388608 8388608 0 8388608 0 0 210077288 0
16777216 16777216 0 16777216 0 0 436448413 0
Goto used to be the primary way to express control flow in programs, because it closely resembles how machines and their languages work. For example, to implement a loop, you might write:

START:
DO STUFF
IF CONDITION
    GOTO START
OTHER STUFF

It doesn't look bad there, but if you express a lot of control flow this way (especially with ad hoc patterns beyond a simple while), it becomes "spaghetti code" that is difficult to read and follow.
In a complex program, one must essentially read every statement as if one were the computer, jumping around at each goto statement. Dijkstra heavily criticized this approach in his famous paper "Go To Statement Considered Harmful". Alex is observing that goto is a good solution to many problems, especially when used in a restricted context, and not as the primary way to organize programs. Later on he will give examples.
Common Lisp preserves this idea in its PROG feature, which allows one to use labels and gotos restricted to a specific block. Fast, assembly-like, messy code can be wrapped in nice functional interfaces.
Footnotes:

std::partial_sort(first, last, last).

Many programmers imagine the C++ standard library is a package like sqlite or LaTeX that is centrally developed and deployed to many platforms. This is not the case. Vendors who want to create a C++ compiler and support it on their platform typically develop their own library implementation in agreement with the standard. There is little or no collaboration on library code between platforms. Alex: "If you try this on various computers and operating systems you will find the counts are different. This is because the algorithms are different. I always assume, being the guy who did all of that, that it is the same algorithm wherever I go. Obviously somebody modified them a little bit over these 20 years. For example, people at Apple did something slightly different from people at GNU."

log(n) is called the Stirling number. This does not appear to be correct; he probably meant to refer to the Euler-Mascheroni constant.