A brief guide for versioning symbols in the Mir DSOs

So, what do I have to do?

There are more detailed descriptions below, but as a general rule:

  • If you add a new symbol, add it to a *_NEXTSERIES version stanza, like MIR_CLIENT_0.22, MIR_PLATFORM_0.22, etc representing the next future Mir series in which the new symbol will first be released.

  • If you change the behaviour or signature of a symbol and wish to preserve backward compatibility, see “Change symbols without breaking ABI” below.

Can I have some details?

Sure.

Mir is a set of libraries, one C++ library for writing display- server/compositor/shells and one C library for writing clients (or, more usually, toolkits for clients) that use a Mir display-server for output. Mir also has internal dynamic libraries for platform support - drivers - and may in future allow the same with extensions to the core functionality. As such, the ABI of these interfaces is important to keep in mind.

Mir uses the ELF symbol versioning support. This provides three advantages:

  • Consumers of the Mir libraries can know at load time rather than symbol resolution time whether the library exposes all the symbols they expect.

  • We can drop or change the behaviour of symbols without breaking ABI by exposing multiple different implementations under different versions, and

  • We can safely load multiple different versions of Mir libraries into the same process.

When should I bump SONAME?

There are varying standards for when to bump SONAME. In Mir we choose to bump the SONAME of a library whenever we make a change that could cause a binary linked to the library to fail as long as the binary is using only public interfaces and (where applicable) relying on documented behaviour. In general, changes that make an interface work as described by its documentation will not result in SONAME bumps.

With that explanation, you should bump SONAME when:

  • You remove a public symbol from a library

  • You change the signature of a public symbol without retaining the previous signature exposed under the old versioning.

  • You change the behaviour of a public symbol without retaining the previous behaviour exposed with the old versioning.

If you are changing the behaviour of an interface, think about whether it’s easy to maintain the old interface in parallel. If it is, you should consider providing both under different versions. This should become easier over time as the Mir ABI becomes more stable and also more valuable over time as the Mir libraries become more widely used.

Load-time version detection

When using versioned symbols the linker adds an extra, special symbol containing the version(s) exported from the library. Consumers of the library resolve this on library load. For example:

$ objdump -C -T lib/libmirclient.so
…
00000000002a2080  w   DO .data.rel.ro   0000000000000080  MIR_CLIENT_8 vtable for mir::client::DefaultConnectionConfiguration
0000000000000000 g    DO *ABS*  0000000000000000  MIR_CLIENT_8 MIR_CLIENT_8
0000000000030ed2 g    DF .text  0000000000000098  MIR_CLIENT_8 mir::client::DefaultConnectionConfiguration::the_rpc_report()
…

This shows the special MIR_CLIENT_8 symbol of the current libmirclient, along with a versioned symbol in the read-only data segment (the vtable for mir::client::DefaultConnectionConfiguration) and a versioned symbol in the text segment (the implementation of mir::client::DefaultConnectionConfiguration::the_rpc_report()). If a client needed a symbol versioned with MIR_CLIENT_9, it would try to resolve this at load time and fail, rather than failing when the symbol was first referenced - possibly much later, and more confusingly.

So what do I have to do to make this work?

When you add new symbols, add them to a new version block in the relevant symbols.map file, like so:

MIR_CLIENT_0.17 {
    global:
        mir_connect_sync;
        ...
        /* Other symbols go here */
};

MIR_CLIENT_0.18 {
    global:
        mir_connect_new_symbol;
    local:
        *;
} MIR_CLIENT_0.17;

Note that the script is read top to bottom; wildcards are greedily bound when first encountered, so to avoid surprises you should only have a wildcard in the final stanza.

Change symbols without breaking ABI

ELF DSOs can have multiple implementations for the same symbol with different versions. This means that you can change the signature or behaviour of a symbol without breaking dependants that use the old behaviour. While there can be as many different implementations with different versions as you want, there can only be one default implementation - this is what the linker will resolve to when building a dependant project.

Binding different implementations to the versioned symbol is done with __asm__ directives in the relevant source file(s). The default implementation is specified with symbol_name@@VERSION; other versions are specified with symbol_name@VERSION.

Note that this does not require a change in SONAME. Binaries that have been linked against the old library will continue to work and resolve to the old implementation. Binaries linked against the new library will resolve to the new (default) implementation.

So, what do I have to do to make this work?

For example, if you wanted to change the signature of mir_connection_create_surface to take a new parameter:

mir_connection_api.cpp:

__asm__(".symver old_mir_connection_create_surface,mir_connection_create_surface@MIR_CLIENT_0.17");

extern "C" MirWaitHandle* old_mir_connection_create_surface(...)
/* The old implementation */

/* The @@ specifies that this is the default version */
__asm__(".symver mir_connection_create_surface,mir_connection_create_surface@@@MIR_CLIENT_0.18");
MirWaitHandle* mir_connection_create_surface(...)
/* The new implementation */

symbols.map:

MIR_CLIENT_0.17 {
    global:
        ...
        mir_connection_create_surface;
        ...
};

MIR_CLIENT_0.18 {
    global:
        ...
        mir_connection_create_surface;
        ...
    local:
        *;
} MIR_CLIENT_0.17;

Safely load multiple versions of a library into the same address space

This benefit is currently theoretical, as there seems to be a Protobuf singleton that aborts if we try this. But should that be resolved, it’s theoretically possible and of some benefit…

This situation will come about - the Qtmir plugin links to libmirclient and also libEGL, and libEGL will link to libmirclient itself. There is no guarantee that Qtmir and libEGL will link to the same SONAME, and so a process can end up trying to load both libmirclient.so.8 and libmirclient.so.9 into its address space. Without symbol versioning this is potentially broken - there’s no mechanism for libEGL to only resolve symbols from libmirclient.so.8 and Qtmir to only resolve symbols from libmirclient.so.9, so in cases where symbols have changed use of those symbols will break.

By versioning the symbols we ensure that code always gets exactly the symbol implementation it expects, even when multiple library versions are loaded.

So, what do I have to do to make this work?

Ensure that different implementations of a symbol have different versions.

Additionally, there’s the complication of passing objects between different versions. For the moment, we can not bother trying to make this work.

See also:

Binutils manual

Former glibc maintainer’s DSO guide