The Joy of Pimpls (or, More About the Compiler-Firewall Idiom)

Reckless Fixes and Optimizations, and Why They

The main article shows why using the Pimpl idiom can incur space and performance overheads, and it also shows the right way to minimize or eliminate those overheads. There is also a common, but wrong, way to deal with them.

Here's the reckless, unsafe, might-work-if-you're-lucky, evil, fattening, and high-cholesterol way to eliminate the space and performance overheads, and you didn't hear it from me... the only reason I'm mentioning it at all is because I've seen people try to do this:

     // evil dastardly header file x.h
     class X {
       /* . . . */
       static const size_t sizeofximpl = /*some value*/;
       char pimpl_[sizeofximpl];
     };

     // pernicious depraved implementation file x.cpp
     #include "x.h"
     X::X() {
       assert( sizeofximpl >= sizeof(XImpl) );
       new (&pimpl_[0]) XImpl;
     }
     X::~X() {
       (reinterpret_cast<XImpl*>(&pimpl_[0]))->~XImpl();
     }

DON Yes, it removes the space overhead--it doesn't use so much as a single pointer. ^[8] Yes, it removes the memory allocation overhead--there's nary a malloc or new in sight. Yes, it might even happen to work on the current version of your current compiler.

It's also completely nonportable. Worse, it will completely break your system even if it does appear to work at first. Here are several reasons:

1. Alignment. Any memory that's allocated dynamically via new or malloc is guaranteed to be properly aligned for objects of any type, but buffers that are not allocated dynamically have no such guarantee:

     char* buf1 = (char*)malloc( sizeof(Y) );
     char* buf2 = new char[ sizeof(Y) ];
     char buf3[ sizeof(Y) ];

     new (buf1) Y;     // OK, buf1 allocated dynamically (#1)
     new (buf2) Y;     // OK, buf2 allocated dynamically (#2)
     new (&buf3[0]) Y; // error, buf3 may not be suitably aligned

     (reinterpret_cast<Y*>(buf1))->~Y(); // OK
     (reinterpret_cast<Y*>(buf2))->~Y(); // OK
     (reinterpret_cast<Y*>(&buf3[0]))->~Y(); // error

Just to be clear: I'm not recommending that you do #1 or #2. I'm just pointing out that they're legal, whereas the above attempt to have a pimpl without dynamic allocation is not, even though it may (dangerously) appear to work correctly at first if you happen to get lucky.^[9]

2. Brittleness. The author of X has to be inordinately careful with otherwise-ordinary X functions. For example, X must not use the default assignment operator, but must either suppress assignment or supply its own. (Writing a safe X::operator= isn't too hard, but I'll leave it as an exercise for the reader. Remember to account for exception safety in that and in X::~X.^[10] Once you're finished, I think you'll agree that this is a lot more trouble than it's worth.)

3. Maintenance Cost. When sizeof(XImpl) grows beyond sizeofximpl, the programmer must bump up sizeofximpl. This can be an unattractive maintenance burden. Choosing a larger value for sizeofximpl mitigates this, but at the expense of trading off efficiency (see #4).

4. Inefficiency. Whenever sizeofximpl > sizeof(XImpl), space is being wasted. This can be minimized, but at the expense of maintenance effort (see #3).

5. Just Plain Wrongheadedness. In short, it's obvious that the programmer is trying to do "something unusual." Frankly, in my experience, "unusual" is just about always a synonym for "hack." Whenever you see this kind of subversion--whether it's allocating objects inside character arrays like this programmer is doing, or implementing assignment using explicit destruction and placement new as discussed in Guru of the Week #23--you should Just Say No.^[11]

Bottom line, C++ doesn't support opaque types directly, and this is a brittle attempt to work around that limitation.

Notes

2. J. Coplien. Advanced C++ Programming Styles and Idioms (Addison-Wesley, 1992).

3. Please don't email me jokes about this subheading. I can imagine most of the answers.

5. Making a virtual private is usually not a good idea, anyway. The point of a virtual function is to allow a derived class to redefine it, and a common redefinition technique is to call the base class' version (not possible, if it's private) for most of the functionality.

6. Compared to most other common operations in C++, such as function calls. Note that here I'm specifically talking about the cost of using a general-purpose allocator, which is what you typically get with the built-in operator new and malloc.

7. If the hidden member being accessed itself uses a back pointer to call a function in the visible class, there will be multiple indirections.

8. This completely hides the pimpl class, but of course clients must still be recompiled if sizeofximpl changes.

9. All right, I'll fess up: There actually is a (not very portable, but pretty safe) way to do put the pimpl class right into the main class like this, thus avoiding all space and time overhead. It involves creating a "max_align" struct that guarantees maximal alignment, and defining the pimpl member as union { max_align dummy; char pimpl_[sizeofximpl]; }; -- this will guarantee sufficient alignment. For all the gory details, do a search for "max_align" on the web or on DejaNews. However, I still strongly urge you not to go down this path, because using a "max_align" solves only this first issue #1 and does not address issues #2 through #5. You Have Been Warned.