A serious bug in GCC

This post is to inform you about a bug in GCC that may cause memory (or other resource) leaks in your valid C++ programs.

One of the pillars of C++ philosophy is what we often call RAII: if you use classes to manage resources, use constructors for allocating resources and destructors for releasing them, the language makes sure that whatever happens, however you use such classes the resources will get properly released. We can easily test this guarantee by a fake class that logs resource acquisitions and releases:

struct Resource
{
  explicit Resource(int) { std::puts("create"); }
  Resource(Resource const&) { std::puts("create"); }
  ~Resource() { std::puts("destroy"); }
};

Whatever reasonable program we write (excluding the situations where you use raise, longjmp, exit, abort, etc., or when we cause std::terminate to be called) we expect that "create" and "destroy" are output the same number of times.

This is the contract: I take care that my classes correctly manage resources, and the language takes care that the resources will always be managed correctly regardless of the complexity of the program. This even works for such complex situations, you might not even have thought of:

Resource make_1() { return Resource(1); }
Resource make_2() { throw std::runtime_error("failed"); }

class User
{
  Resource r1;
  Resource r2;

public:
  explicit User()
    : r1{make_1()}
    , r2{make_2()} // what if make_2() throws?
    {}
};

Consider what happens when make_2() throws when executing this constructor. r1 has already been constructed (resources acquired), but object User has not been created yet, and it will never be (because constructor will not run to a successful end). This means that destructor of User will never be called either. But the language is still required to call the destructor of any sub-object that has been successfully created, like r1. Thus, r1’s resources will nonetheless be released, even though no object of type User was ever fully constructed.

You might have not even heard about this guarantee, but it still works to your advantage, preventing memory leaks.

But in one situation GCC will surprise you: namely, when you initialize a temporary using aggregate initialization. Let’s change our type User a bit, so that it is an aggregate:

struct User 
{
  Resource r1;
  Resource r2;
};

It just aggregates members. No constructors, but we can still initialize it with aggregate initialization syntax:

void process (User) {}

int main()
{
  try { 
    User u {make_1(), make_2()};
    process(u);
  }
  catch (...) {}
}

If you test it, it works correctly: the number of constructor calls equals the number of destructor calls, even though make_2() throws and makes the situation complicated. But u is an automatic object. If we change the example, and create a temporary User instead:

int main()
{
  try { 
    process({make_1(), make_2()});
  }
  catch (...) {}
}

This is where the bug manifests. Member r1 is initialized but never destroyed. Admittedly, this is a rare case: it requires an exception in the middle of initialization, a temporary and an aggregate initialization. But usually, leaks manifest in the face of exceptions. And the fact that it is rare makes you less prepared for it.

Here is a full example:

#include <cstdio>
#include <stdexcept>

struct Resource
{
  explicit Resource(int) { std::puts("create"); }
  Resource(Resource const&) { std::puts("create"); }
  ~Resource() { std::puts("destroy"); }
};

Resource make_1() { return Resource(1); }
Resource make_2() { throw std::runtime_error("failed"); }

struct User 
{
  Resource r1;
  Resource r2;
};

void process (User) {}

int main()
{
  try {
    process({make_1(), make_2()});
  }
  catch (...) {}
}

You can test it online here. It is present in GCC 4, 5, and 6. For a more real-life, and somewhat longer, illustration of the problem, see this example provided by Tomasz Kamiński.

A bug report for this already exists.

Maybe your program already leaks because of this surprise?

21 Responses to A serious bug in GCC

Alf P. Steinbach says:

April 27, 2017 at 10:18 pm

Thanks for sharing! Just a nit: <cstdio> is not guaranteed to provide puts in the global namespace.

- Andrzej Krzemieński says:
  
  April 28, 2017 at 6:44 am
  
  Thanks. Fixed.
  
John says:

April 27, 2017 at 11:03 pm

Lmao don’t do this then. I love C++ and I follow RAII.

- Ying says:
  
  April 28, 2017 at 12:46 pm
  
  Completely wrong attitude
  
  - Jason Dusek (@solidsnack) says:
    
    April 28, 2017 at 1:17 pm
    
    Yeah, I think John is missing the point — it’s something that anyone would expect to work, given the promises the language makes.
    
- p0int3r says:
  
  April 28, 2017 at 3:06 pm
  
  Yes I agree with you – better use asembler instead of this buggy C++ language.
  
- Max (@Autious) says:
  
  May 4, 2017 at 1:52 pm
  
  What do you mean “don’t do this”? Don’t follow the C++ standard?
  Should we all just avoid all compiler bugs for all compilers might encounter and not try to fix the compilers? What are you trying to convey?
  
- dv says:
  
  May 11, 2017 at 8:28 am
  
  What is your point supposed to be? If you really loved C++, then you would want people to be aware of bugs and compiler authors to fix them (and be supported in doing so); you wouldn’t give useless responses just saying to sidestep the issues.
  
Jinank Jain says:

April 28, 2017 at 3:33 pm

Good part is I tried out with other compilers like clang it was working as expected

- Andrzej Krzemieński says:
  
  April 28, 2017 at 5:03 pm
  
  It should be noted, however, that Clang is not devoid of similar problems. For instance, see this bug report by Jonathan Wakely. (Also two years old.)
  
tomaszkam says:

April 28, 2017 at 7:01 pm

This bug causes even code using initializer list to leak, for example:
std::vector vs{s1, s2, s3};
Will leak if construction of s2 or s3 will fail. Live example: https://wandbox.org/permlink/VZiD1OtYgggZuLwk.

Jesse Maurais says:

May 1, 2017 at 2:10 pm

A little correction here. The r1 member is not necessarily initialized because the order of construction is not guaranteed. I’d have to double check the standard to be sure, but I believe the order is an implementation detail left the compiler’s discretion.

- tomaszkam says:
  
  May 1, 2017 at 9:34 pm
  
  Elements in brace-initializer ({e1, e2, e3}) are required to be initialized from left-to-right by the standard. For reference: http://stackoverflow.com/questions/14060264/order-of-evaluation-of-elements-in-list-initialization
  
  - Jesse Maurais says:
    
    May 2, 2017 at 2:38 pm
    
    Sorry, I wasn’t referring to the brace initialization. Let me clarify
    
    class User
    {
    Resource r1;
    Resource r2;
    
    public:
    explicit User()
    : r1{make_1()}
    , r2{make_2()} // what if make_2() throws?
    {}
    };
    
    In the User() constructor there is no guarantee that r1 is initialized before r2.
    
    - Anonymous Coward says:
      
      May 3, 2017 at 1:53 am
      
      The standard guarantees initialization order is the declaration order of members (regardless of the order in the initializer list).
    - Chris says:
      
      May 3, 2017 at 7:22 am
      
      Actually, the standard draft (n4296 §12.6.2/13.3) defines that they are constructed in the order they are declared in the class.
      
      So it does not matter that “r1{make_1()}” is before “r2{make_2()}”, but because “Resource r1;” is before “Resource r2;”, r1 will be initialized first.
    - nobody says:
      
      May 3, 2017 at 7:52 am
      
      You are wrong. Standard says class members are constructed in order they are defined. So r1 before r2.
    - Dinka says:
      
      May 3, 2017 at 3:56 pm
      
      Hello,
      In this case, r1 is guaranteed to be initialised before r2.
      
      from N4618 : [class.base.init]/p13
      “In a non-delegating constructor, initialization proceeds in the following order:
      …
      — Then, non-static data members are initialized in the order they were declared in the class definition (again regardless of the order of the mem-initializers).”
      
      Best,
      Dinka
- dv says:
  
  May 11, 2017 at 8:30 am
  
  Of course, you are wrong. If you were right, and initialisation order was not absolutely guaranteed on a basic level, it would make classes in C++ a joke and many programs undefined and impossible. Thankfully, they knew this and competently designed this basic area.
  
Pingback: Dev Digest Episode 93
Pingback: BYB 1×10 – Terratenientes, ñues y campamentos | Birras y Bits