Pratik Parvati

Posted on Jun 26, 2021 • Edited on Jul 6, 2021

Memory management using Smart Pointers in C++ - Part 2

#cpp #smartpointers #memorymanagement #sharedpointer

Memory management using Smart Pointers in C++ - Part 2

In the previous article on Memory management using smart pointers (Part 1), we saw that unique_ptr implements unique ownership - only one smart pointer owns the object at a time. But there might be scenarios where you might need to create multiple pointers of the same object and share them; in that case we should use shared pointer(std::shared_ptr). One such scenario could be - using smart pointers for multithreading; sharing a pointers between threads helps to keep the smart pointer alive within threads when other thread using the same smart pointer goes out of scope (i.e, more than one owner manages the lifetime of the object in memory).

Shared Pointers (`std::shared_ptr<>`)

std::shared_ptr is a smart pointer that retains shared ownership of an object through a pointer. The same object may be owned by multiple shared_ptr objects. The object is destroyed and its memory deallocated when either of the following happens:

the last remaining shared_ptr owning the object is destroyed.
the last remaining shared_ptr owning the object is assigned another pointer via operator= or reset().

Shared pointers keep something called a reference count; which keeps track of the number of pointers to an object (manager object to be specific). As we create more copies of the pointer and they exit scope, the reference count is incremented and decremented accordingly. Finally, when the reference count reaches zero, the underlying memory is freed.

int main()
{
    std::shared_ptr<int> sp1 = std::make_shared<int>(2);
    std::cout << sp1.use_count() << '\n';   // prints 1

    {
        std::shared_ptr<int> sp2 = sp1;  // object copy -> reference count increments to 2
        std::cout << sp1.use_count() << '\n'; // prints 2
    } // sp2 goes out of scope hence reference count decrements to 1

    std::cout << sp1.use_count() << '\n'; // prints 1

    return EXIT_SUCCESS;
} // reference count reaches zero as sp1 goes out of scope; hence object resource is destroyed

How shared pointer works internally?

When the managed object is dynamically allocated using shared_ptr, the first shared_ptr object (lets call it sp1) contains two raw pointers, one pointing to the managed object (returned by get()) and another raw pointer pointing to the control block (also called as manager object). The control block contains a pointer to the managed object, two reference counts (the number of shared_ptrs pointing to the manager object and the number of weak_ptrs pointing to the manager object) and depending on how a shared_ptr is initialized, it can also contain other data, such as a deleter and an allocator.

The shared_ptr constructor creates a manager object (dynamically allocated) and the overloaded member functions like shared_ptr::operator-> access the pointer in the manager object to get the actual pointer to the managed object. If another shared_ptr (sp2) is created by copy or assignment from sp1, then it also points to the same manager object, and the copy constructor or assignment operator increments the shared count to show that 2 shared_ptrs are now pointing to the managed object.

Similarly, when weak pointer is created by copy or assignment from the shared_ptr object or another weak_ptr object, it points to the same manager object and the weak pointer count is incremented. When sp1 and the manager object are first created, the shared count will be 1, and the weak count will be 0

Virtual destructor and `shared_ptr`

The manager object (or control block) contains pointer to the managed object, which is basically used for deleting the object. One interesting fact is that the managed pointer in the manager object may be of a different type (and even have a different value) than the raw pointer in the shared_ptr; that's because shared_ptr stores type-erased deleter in the manager object.

// A shared_ptr<void> managing double
// The raw pointer is void*
// Managed pointer in the manager object is double*
auto sp = std::shared_ptr<void>(new double()); //OK

This leads to an interesting use case where we can avoid incomplete deletion of derived object when derived object pointer is assigned to a base class pointer (of course without virtual destructor). Lets look at an example for incomplete deletion of an object.

struct Base
{
    ~Base()
    {
        std::cout << "Base::~Base()\n";
    }
};


struct Derived: public Base
{
    ~Derived()
    {
        std::cout << "Derived::~Derived()\n";
    }
};

int main()
{
    Base* base = new Derived;
    delete base; // prints Base::~Base(), hence Derived class object is partially destructed
    return EXIT_SUCCESS;
}

Of course the fix for the issue is doing the binding at run time polymorphically (dynamic binding) i.e, using virtual destructor in the base class, which prints

Derived::~Derived()
Base::~Base()

If we use shared_ptr we may not need virtual destructor since the smart pointer will take care of it for us. Changing the above code with shared_ptr

struct Base
{
    ~Base()
    {
        std::cout << "Base::~Base()\n";
    }
};


struct Derived: public Base
{
    ~Derived()
    {
        std::cout << "Derived::~Derived()\n";
    }
};

int main()
{
    std::shared_ptr<Base> base(new Derived);
    return EXIT_SUCCESS;
}

/*
prints
Derived::~Derived()
Base::~Base()
*/

In the above code, despite the destructor not being virtual, the correct derived class (Derived) destructor is invoked when the base class (Base) shared_ptr goes out of scope or reset. This works because the manager object (control block) is destroying the object through Derived* deleter and not through the raw pointer Base*. However, the destructor should be declared virtual in the base class to make it operate polymorphically; the above example is just to show how shared_ptr works.

NOTE: unique_ptr<Base> base(new Derived) do not provide this feature because with unique_ptr deleter function is part of the type and there is no concept of control block or manager object.

Using `shared_ptr`

To use shared_ptrlike a raw pointer the arrow operator(->) and dereferencing (*) operator are overloaded.

sp->myMethod();
(*sp)->myMethod();

You can call a reset() member function, which will decrement the reference count and delete the pointed-to object if appropriate, and result in an empty shared_ptr that is just like a default-constructed one. You can also reset a shared_ptr by assigning it the value nullptr.

sp.reset();
sp = nullptr;

Below is a code sketch to illustrate the basic usage

class MyClass
{
public:
    void classMethod();
};
ostream &operator<<(ostream &, const MyClass &);
// ...

// a function can return a shared_ptr
std::shared_ptr<MyClass> find_some_thing();

// a function can take a shared_ptr parameter by value;
std::shared_ptr<MyClass> do_something_with(std::shared_ptr<MyClass> p);

void foo()
{
    // the new is in the shared_ptr constructor expression:
    std::shared_ptr<MyClass> sp1(new MyClass);
   // ... 
    std::shared_ptr<MyClass> sp2 = sp1;       // sp1 and sp2 now share ownership of the MyClass
   // ... 
    std::shared_ptr<MyClass> p3(new MyClass); // another MyClass
    sp1 = find_some_thing();              // sp1 may no longer point to first MyClass
    do_something_with(sp2);
    sp3->classMethod();  // call a member function like built-in pointer
    cout << *sp2 << endl; // dereference like built-in pointer
    // reset with a member function or assignment to nullptr:
    sp1.reset();   // decrement count, delete if last
    sp2 = nullptr; // convert nullptr to an empty shared_ptr, and decrement count;
}
// sp1, sp2, sp3 go out of scope, decrementing count, delete the MyClass if last

Casting `shared_ptr`

Having two classes one that inherits from another, when trying to assign a smart pointer variable of the derived class to a smart pointer variable of the base class it is required to use std::static_pointer_cast in place of the normal static_cast (used with raw pointers). It will be useful to remove the suggestion to use the static_cast and include the suggestion to use the std::static_pointer_cast.

For example:


#include <memory>
#include <iostream>

class Base{
 public:
    Base(){std::cout << "Base::Base()" << std::endl;}
    ~Base(){std::cout << "Base::~Base()" << std::endl;}
    void print(){std::cout << "Base::print()" << std::endl;}
};

class Derived:public Base{
 public:
    Derived(){std::cout << "Derived::Derived()" << std::endl;}
    ~Derived(){std::cout << "Derived::~Derived()" << std::endl;}
    void print(){std::cout << "Derived::print()" << std::endl;}
};

int main()
{
    std::shared_ptr<Base> b_ptr = std::make_shared<Derived>();
    b_ptr->print();
    auto d_ptr = std::static_pointer_cast<Derived>(b_ptr);
    d_ptr->print();
    return EXIT_SUCCESS;
}


/* OUTPUT
Base::Base()
Derived::Derived()
Base::print()
Derived::print()
Derived::~Derived()
Base::~Base()
*/

Getting better memory allocation performance from `shared_ptr`

When we create an object with new operator in shared_ptr there will be two dynamic memory allocations that happen, one for object from the new and the second is the manager object created by the shared_ptr constructor. Since memory allocations are slow, creating shared_ptr is slow when compared to raw pointer. To solve this problem C++11 has introduced the function template make_shared that does a single memory allocation big enough to hold both manager object and the new object, with the constructor parameters passed, which returns a shared_ptr of required type.

std::shared_ptr<MyClass> sp(new MyClass("some string")) // Bad practice - two allocations
std::shared_ptr<MyClass> sp(std::make_shared<MyClass>("some string")); // Good practice - one allocation

Using make_shared also avoids explicit use of new, promoted in the slogan "no naked new!". Because only a single memory allocation is involved when you use make_shared to initialize a shared_ptr, you can expect improved performance over the separate allocation approach.

Getting `std::shared_ptr` from `this` Object

In the situation where the class member function needs to pass a pointer to this object to another function that accepts the shared_ptr to that class object as argument then we can use std::enable_shared_from_this to create an object that returns shared_ptr from this. Hence the class of the objects has to be derived from std::enable_shared_from_this, which unlocks the special method shared_from_this, which can be used to create shared_ptr from this.

#include <iostream>
#include <memory>

class MyClass;

void someFunction(std::shared_ptr<MyClass> shareMe)
{
    std::cout << "someFunction: shareMe.use_count(): " << shareMe.use_count() << std::endl;
}

class MyClass : public std::enable_shared_from_this<MyClass>
{
public:
    std::shared_ptr<MyClass> getShared()
    {
        return shared_from_this();
    }
    void classMethod()
    {
        //....
        someFunction(getShared());
        //....
    }
};

int main()
{
    std::cout << std::endl;

    std::shared_ptr<MyClass> shareMe(new MyClass);
    std::shared_ptr<MyClass> shareMe1 = shareMe->getShared();
    {
        auto shareMe2(shareMe1);
        std::cout << "main(): shareMe.use_count(): " << shareMe.use_count() << std::endl;
    }
    std::cout << "main(): shareMe.use_count(): " << shareMe.use_count() << std::endl;

    shareMe1.reset();

    std::cout << "main(): shareMe.use_count(): " << shareMe.use_count() << std::endl;

    shareMe->classMethod();

    std::cout << std::endl;
    return EXIT_SUCCESS;
}

`std::shared_ptr` as function argument

From performance perspective taking shared_ptr by copy or by reference make significant difference.

// 
void functionbyReference(std::shared_ptr<int> &refPtr) // Not a good practice
{
    std::cout << "refPtr.use_count(): " << refPtr.use_count() << std::endl;// prints 1
}

void functionbyCopy(std::shared_ptr<int> cpyPtr)
{
    std::cout << "cpyPtr.use_count(): " << cpyPtr.use_count() << std::endl; // prints 2
}

int main()
{

    auto sp(std::make_shared<int>(2021));
    std::cout << "sp.use_count(): " << sp.use_count() << std::endl; // prints 1

    functionbyReference(sp);
    functionbyCopy(sp);

    std::cout << "sp.use_count(): " << sp.use_count() << std::endl; // prints 1
    return EXIT_SUCCESS;
}

The reference count is incremented a when function takes shared_ptr by copy; since incrementing and decrementing the reference count is an expensive operation and hence results in a performance difference. The quick benchmark test states the measurable difference in performance (almost twice).

NOTE: The best practice in C++ is always to have clearly defined ownership semantics for your objects. The pass by reference of the smart pointer defy this rule; Hence, copy the shared pointer when a new function or object needs to share ownership of the pointee.

Cyclic reference problem with `shared_ptr`

The problem with shared_ptr is that if there is a ring, or cycle of an objects that have shared_ptr to each other, they keep each other alive - they won't get deleted as they are holding each other (i.e, still has a shared_ptr pointing to them) leading to memory leak.

//Forward declaration
struct MyClass2;
struct MyClass3;

struct MyClass1 {     
 std::shared_ptr<MyClass3> sp3;
};

struct MyClass2 {
 std::shared_ptr<MyClass1> sp1;
};

struct MyClass3 {
 std::shared_ptr<MyClass2> sp2;
};

Lets look at an example that causes cyclic reference problem

#include <memory>
#include <iostream>

struct MyClassB;
struct MyClassA
{
    std::shared_ptr<MyClassB> b;
    ~MyClassA() { std::cout << "MyClassA::~MyClassA()\n"; }
};

struct MyClassB
{
    std::shared_ptr<MyClassA> a;
    ~MyClassB() { std::cout << "MyClassB::~MyClassB()\n"; }
};

void useClassAnClassB()
{
    auto a = std::make_shared<MyClassA>();
    auto b = std::make_shared<MyClassB>();
    a->b = b;
    b->a = a;
}

int main()
{
    useClassAnClassB();
    std::cout << "Finished using A and B\n";
}

/*OUTPUT
Finished using A and B
*/

If both references are shared_ptr then that says MyClassA has ownership of MyClassB and MyClassB has ownership of MyClassA, which should ring alarm bells. In other words, MyClassA keeps MyClassB alive and MyClassB keeps MyClassA alive.

In this example the instances a and b are only used in the useClassAnClassB() function so we would like them to be destroyed when the function ends but as we can see when we run the program the destructors are not called.

The memory leak summary is as shown below.

==1969133==
==1969133== LEAK SUMMARY:
==1969133==    definitely lost: 32 bytes in 1 blocks
==1969133==    indirectly lost: 32 bytes in 1 blocks
==1969133==      possibly lost: 0 bytes in 0 blocks
==1969133==    still reachable: 0 bytes in 0 blocks
==1969133==         suppressed: 0 bytes in 0 blocks
==1969133==
==1969133== For lists of detected and suppressed errors, rerun with: -s
==1969133== ERROR SUMMARY: 1 errors from 1 contexts (suppressed: 0 from 0)

C++11 includes a solution: weak smart pointers: these only observe an object but do not influence its lifetime.

Weak pointer `std::weak_ptr<>`

A ring of objects can point to each other with weak_ptrs, which point to the managed object but do not keep it in existence. This is shown in the diagram below, where the "observing" relations are shown by the dotted arrows.

If the shared pointer container is emptied, the three objects in the ring are automatically deleted because no other shared pointers point to them; weak pointers, like raw pointers, do not keep the pointed-to object "alive" and hence cycle problem has been resolved.

Solution to the problem
The solution to the cyclic reference problem example is to decide who owns who. Lets say MyClassA owns MyClassB but MyClassB does not own MyClassA then we replace the reference to MyClassA in MyClassB with a weak_ptr like

#include <memory>
#include <iostream>

struct MyClassB;
struct MyClassA
{
    std::shared_ptr<MyClassB> b;
    ~MyClassA() { std::cout << "MyClassA::~MyClassA()\n"; }
};

struct MyClassB
{
    std::weak_ptr<MyClassA> a;
    ~MyClassB() { std::cout << "MyClassB::~MyClassB()\n"; }
};

void useMyClassAnMyClassB()
{
    auto a = std::make_shared<MyClassA>();
    auto b = std::make_shared<MyClassB>();
    a->b = b;
    b->a = a;
}

int main()
{
    useMyClassAnMyClassB();
    std::cout << "Finished using A and B\n";
}

/*OUTPUT
MyClassA::~MyClassA()
MyClassB::~MyClassB()
Finished using A and B
*/

Then if we run the program we see that a and b are destroyed as we expect. And memory leak summary (for the above code) below shows that, no memory is leaked and the objects are destroyed gracefully.

==1969207== HEAP SUMMARY:
==1969207==     in use at exit: 0 bytes in 0 blocks
==1969207==   total heap usage: 4 allocs, 4 frees, 73,792 bytes allocated
==1969207==
==1969207== All heap blocks were freed -- no leaks are possible
==1969207==
==1969207== For lists of detected and suppressed errors, rerun with: -s
==1969207== ERROR SUMMARY: 0 errors from 0 contexts (suppressed: 0 from 0)

The definition of weak_ptr is designed to make it relatively foolproof, so as a result there is very little you can do directly with a weak_ptr. For instance, you can't dereference it; neither operator* nor operator-> is defined for a weak_ptr, you can not get pointer to an object - there is no get() function.

Using `weak_ptr`

During initialization, the weak_ptr is constructed as empty; the weak_ptr object can only be initialized by copy or assignment from the shared_ptr or an existing weak_ptr to the object.

std::shared_ptr<MyClass> sp(new Thing);
std::weak_ptr<MyClass> wp1(sp); // construct wp1 from a shared_ptr
std::weak_ptr<MyClass> wp2; // an empty weak_ptr - points to nothing
wp2 = sp; // wp2 now points to the new MyClass
std::weak_ptr<MyClass> wp3 (wp2); // construct wp3 from a weak_ptr
std::weak_ptr<MyClass> wp4
wp4 = wp2; // wp4 now points to the new MyClass.

You can not refer to an object directly using weak_ptr; you have to get a shared_ptr from it with lock() member function.

std::shared_ptr<MyClass> sp2 = wp2.lock(); // get shared_ptr from weak_ptr

The lock() function checks the state of the manager object to see if the managed object still exists. if it does not exist, returns an empty shared_ptr and if managed object exists, it returns a shared_ptr to the manager object.

std::shared_ptr<MyClass> sp = wp.lock(); // get shared_ptr from weak_ptr
if(sp)
    sp->do_something(); // tell the MyClass to do something
else
    std::cout << "The MyClass is gone!\n";

ask the weak_ptr if it has expired

bool is_sharedptr_alive(std::weak_ptr<MyClass> wp)
{
    if(wp.expired())
    {
        std::cout << "The MyClass is gone!\n";
        return false;
    }
    return true;
}

The weak_ptr can tell looking at the manager Object whether the managed object is still there; if the pointer and/or shared count are zero, the managed object is gone, and no attempt should be made to refer to it. If the pointer and shared count are non-zero, then the managed object is still present, and weak_ptr can make the pointer to it available.

C++11 shared_ptr and weak_ptr work well enough to automate or simplify your memory management. std::weak_ptr is a smart pointer that holds weak reference to an object that is managed by std::shared_ptr. The main intension of using std::weak_ptr is used to break circular references of std::shared_ptr.

Top comments (3)

Sandor Dargo • Jul 2 '21 • Edited

Thanks for the article.

I have two remarks.

1) When you delete a derived class through a base class pointer where the base class has no virtual destructor, that is not a partial deletion. It's undefined behaviour which is much worse. More info here.

2) Passing around a smart pointer by reference to gain on the performance is not a good idea. We've seen this with a colleague of mine and we found that the reason behind is a lack of understanding.

Why?

What's the purpose of using a smart pointer?

It's to handle ownership in a safe way. That's why you use smart pointers. If you pass around a reference, you don't pass/share the ownership.

If ownership is not involved, there is no need for smart pointers. Instead, you should just pass around a pointer.

It will be even faster because there is less indirection. Plus you don't have to make the mental exercise of understanding what a reference of a pointer mean.

QuickBench here.

(You don't need an inner loop in QuickBench. With for (auto _ : state) it takes care of executing things enough)

Pratik Parvati • Jul 6 '21

Thanks for your remarks Sandor.

I understand your point on ownership semantics of the objects; It completely makes sense to pass smart pointers by value. My intension was to point out the overhead added by the atomic operations (increment and decrement reference count) in shared_ptr; perhaps, I should edit and make this point clear.

Sandor Dargo • Jul 6 '21

What should be made clear IMO is that there is no good reason to pass a smart pointer by reference.

If you refer to Herb Sutter, he recommends that you "don’t pass a smart pointer as a function parameter unless you want to use or manipulate the smart pointer itself, such as to share or transfer ownership".

Consider that and knowing that sharing a smart pointer by reference doesn't share or transfer the ownership, we have no reasons to pass a SP by reference.

Some corresponding rules:

isocpp.github.io/CppCoreGuidelines...
isocpp.github.io/CppCoreGuidelines...

DEV Community

Memory management using Smart Pointers in C++ - Part 2

Memory management using Smart Pointers in C++ - Part 2

Shared Pointers (`std::shared_ptr<>`)

How shared pointer works internally?

Virtual destructor and `shared_ptr`

Using `shared_ptr`

Casting `shared_ptr`

Getting better memory allocation performance from `shared_ptr`

Getting `std::shared_ptr` from `this` Object

`std::shared_ptr` as function argument

Cyclic reference problem with `shared_ptr`

Weak pointer `std::weak_ptr<>`

Using `weak_ptr`

Top comments (3)

Read next

JVM Tuning Explained: From Fresh Graduate to Seasoned Performance Jedi

LZ4 – C++ React Native bindings for an extremely fast compression algorithm

Mastering JavaScript Memory Management: Essential Guide to Garbage Collection & Memory Leaks

Finding errors in unit tests

Memory management using Smart Pointers in C++ - Part 2

Shared Pointers (std::shared_ptr<>)

How shared pointer works internally?

Virtual destructor and shared_ptr

Using shared_ptr

Casting shared_ptr

Getting better memory allocation performance from shared_ptr

Getting std::shared_ptr from this Object

std::shared_ptr as function argument

Cyclic reference problem with shared_ptr

Weak pointer std::weak_ptr<>

Using weak_ptr

Read next

JVM Tuning Explained: From Fresh Graduate to Seasoned Performance Jedi

LZ4 – C++ React Native bindings for an extremely fast compression algorithm

Mastering JavaScript Memory Management: Essential Guide to Garbage Collection & Memory Leaks

Finding errors in unit tests

Shared Pointers (`std::shared_ptr<>`)

Virtual destructor and `shared_ptr`

Using `shared_ptr`

Casting `shared_ptr`

Getting better memory allocation performance from `shared_ptr`

Getting `std::shared_ptr` from `this` Object

`std::shared_ptr` as function argument

Cyclic reference problem with `shared_ptr`

Weak pointer `std::weak_ptr<>`

Using `weak_ptr`