How can a compiler differentiate between static data members having same name in different classes in C++?
I had a C++ interview recently where I was asked, how does the compiler differen开发者_JS百科tiate static data members having the same name in two different classes?
Since all static data variables are stored in the data segment, there has to be a way by which the compiler keeps track of which static data belongs to which class especially when they have the same name.
Edit: I answered name mangling, but he refused saying name mangling is used only among the members of the same class.
The names are mangled with their class name in them. An example with the clang compiler
class A {
static int i;
};
int A::i = 0;
Output
$ clang++ -cc1 -emit-llvm main1.cpp -o -
; ModuleID = 'main1.cpp'
target datalayout = "e-p:32:32:32-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:32:64-f32:32:32-f64:32:64-v64:64:64-v128:128:128-a0:0:64-f80:32:32-n8:16:32"
target triple = "i386-pc-linux-gnu"
@_ZN1A1iE = global i32 0, align 4
Where _ZN1A1iE
is
$ c++filt _ZN1A1iE
A::i
It's implementation-defined, so there's no one way it has to be done.
Name mangling is commonly used though.
Well I would assume that through name mangling the name of the class is encoded with the data too. I wouldn't be surprised if it's as simple as prefixing the name:
class one
{
static int data;
};
class two
{
static int data;
};
could have the names
one::data
two::data
in the data segment. Those are unique names.
A name is just something that is used by humans to give a meaning to things.
Once that the compiler knows that they are different items (since they are declared in different classes) it won't use the name to distinguish them inside the binary, it will use pointers and pointers don't care about names :)
Then nothing forbids from prepending the name of the data with the enclosing class..
All the "name mangling" answer address how the debugger finds the storage for each symbol and how the linker knows the difference, but the compiler knows because these entities occupy different entries in the symbol table.
That's it.
The executable doesn't need to know anything about names at all{*}. Just locations. The compiler knows about names, and that knowledge is coded in the symbol table (however it may be implemented).
{*} Well, in the absence of RTTI, anyway.
The name of the class qualifies the name of the member: myclass::mymemb
. The compiler differentiates them the same way you do, when referring to them from outside the class.
Within the implementation, this is done by name mangling; the compiler santizes the name of the class and the name of the member, and catenates them together into a mess such as __i_7myclass_6mymemb
.
The name mangling process (or its necessity) is the reason why we need extern "C"
for compatibility with C interfaces, which define non-mangled names.
static
members of a class will have their names mangled such that the class name is part of the 'global' name seen by the linker.
It attaches the class name to the data member name. (And then name-mangles the whole thing.)
精彩评论