Skip to content
Merged
Show file tree
Hide file tree
Changes from 8 commits
Commits
Show all changes
96 commits
Select commit Hold shift + click to select a range
d2d56f4
[𝘀𝗽𝗿] changes to main this commit is based on
necipfazil Apr 3, 2024
f805007
[𝘀𝗽𝗿] initial version
necipfazil Apr 3, 2024
058a100
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Apr 24, 2024
2a17e59
Rebased on top of main
necipfazil Apr 24, 2024
3065b9b
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Apr 29, 2024
a998707
dyn_cast to isa
necipfazil Apr 29, 2024
c2f27b3
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil May 1, 2024
4a36a0c
Rebased on upstream main.
necipfazil May 1, 2024
58e5ecf
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Nov 14, 2024
851f62e
Rebase patchset
necipfazil Nov 14, 2024
966ac5a
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Nov 14, 2024
460f02a
Update inline comment as suggested.
necipfazil Nov 14, 2024
1a8d810
Fixed the tests and addressed most of the review comments.
necipfazil Nov 19, 2024
a993e6d
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Nov 20, 2024
4f31680
Rebase on top of upstream main. Remove opt has_value, value use.
necipfazil Nov 20, 2024
bb9d714
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Nov 20, 2024
086baf0
Break clang and llvm parts into separate commits.
necipfazil Nov 20, 2024
746e03c
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Dec 10, 2024
5d09cf0
Address review comments. Break llvm and clang patches.
necipfazil Dec 10, 2024
67662e4
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Feb 2, 2025
f38ce99
Rebase on top of main.
necipfazil Feb 2, 2025
cfcb9c4
[𝘀𝗽𝗿] changes introduced through rebase
necipfazil Feb 5, 2025
f80a182
Rename OB_type to OB_callee_type.
necipfazil Feb 5, 2025
8c18c46
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Feb 11, 2025
fe1b3e2
Rebase on top of main
necipfazil Feb 11, 2025
80f879e
Update IR verifier.
necipfazil Feb 11, 2025
f769ce6
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Mar 13, 2025
162e967
Add requested tests part 1.
necipfazil Mar 13, 2025
647db9a
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Mar 13, 2025
ddc8de5
Update comments in tests.
necipfazil Mar 13, 2025
5e97695
Remove unnecessary asserts. Remove autos for better readability.
necipfazil Mar 13, 2025
1429f1d
Add RISC-V support. Clean up test files.
necipfazil Mar 14, 2025
306e08b
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Mar 15, 2025
061fd55
Clean up test files.
necipfazil Mar 15, 2025
9a63148
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Mar 19, 2025
09a933d
Address code refactoring and test cleanup comments.
Prabhuk Mar 19, 2025
4a45618
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 19, 2025
f1be618
Use metadata instead of OB to construct CallSiteInfo.
Prabhuk Apr 19, 2025
3eee130
Address review comments.
Prabhuk Apr 19, 2025
9ceb54c
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 23, 2025
55ea3e2
Add tailcall tests.
Prabhuk Apr 23, 2025
175b5f1
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 23, 2025
4724c6e
Move verifier down to parent change.
Prabhuk Apr 23, 2025
721cb8b
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 23, 2025
f6a71b5
Address review comments.
Prabhuk Apr 23, 2025
7a32b7e
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 24, 2025
a55c857
Address review comments.
Prabhuk Apr 24, 2025
484ad23
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 24, 2025
5e5690a
Rebase on parent.
Prabhuk Apr 24, 2025
11b6b2f
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 24, 2025
baaa763
Rebase on parent.
Prabhuk Apr 24, 2025
dd2c0b8
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Apr 28, 2025
6c0b729
Rebase on parent.
Prabhuk Apr 28, 2025
b813c25
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk May 1, 2025
5d49a8d
Rebase on parent.
Prabhuk May 1, 2025
35ef409
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk May 5, 2025
706478e
Rebase on parent.
Prabhuk May 5, 2025
07b7f6c
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk May 10, 2025
dd7e4b4
Rebase on parent change.
Prabhuk May 10, 2025
d7e6843
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk May 13, 2025
a388287
Remove dso_local and noundef from tests.
Prabhuk May 13, 2025
29cc917
Test direct call sites.
Prabhuk May 13, 2025
7e133e5
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk May 14, 2025
00d17d1
Rebase change stack on top of parent.
Prabhuk May 14, 2025
5ec5e7a
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk May 14, 2025
195cd6e
Rebase on main.
Prabhuk May 14, 2025
f1154ce
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk May 27, 2025
70ca198
Drop local_unnamed_addr.
Prabhuk May 27, 2025
24fd769
Replace not check with positive checks.
Prabhuk May 27, 2025
b8f1934
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jun 11, 2025
5431cd5
Rebase.
Prabhuk Jun 11, 2025
37faf58
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jun 11, 2025
38ac87a
Rebase on parent
Prabhuk Jun 11, 2025
975676e
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 10, 2025
a7ac510
Rebase on top of main.
Prabhuk Jul 10, 2025
9c96517
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 18, 2025
3f9848f
Rebase on top of main.
Prabhuk Jul 18, 2025
3c53c18
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 18, 2025
72f6773
Rebase on parent
Prabhuk Jul 18, 2025
11dc6bf
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 18, 2025
34eb039
Rebase on top of parent change.
Prabhuk Jul 18, 2025
ac406a5
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 18, 2025
fd7299f
Rebase on parent.
Prabhuk Jul 18, 2025
67756fc
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 18, 2025
2065a88
Rebase on parent.
Prabhuk Jul 18, 2025
37e8d19
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 18, 2025
44d4463
Address review comments.
Prabhuk Jul 18, 2025
690ff13
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 21, 2025
3ec7216
Rebase on top of main.
Prabhuk Jul 21, 2025
de6003d
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 22, 2025
0e2eb95
Rebase.
Prabhuk Jul 22, 2025
4f76066
[𝘀𝗽𝗿] changes introduced through rebase
Prabhuk Jul 23, 2025
faf9f8f
Reduce tests. Update comments in tests. Rename functions in tests.
Prabhuk Jul 23, 2025
7b846f0
Rebase on main.
Prabhuk Jul 23, 2025
858480f
Change test var name.
Prabhuk Jul 23, 2025
230e666
Rebase after reapplying parent patch.
Prabhuk Jul 28, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
251 changes: 251 additions & 0 deletions clang/docs/CallGraphSection.rst
Original file line number Diff line number Diff line change
@@ -0,0 +1,251 @@
==================
Call Graph Section
==================

Introduction
============

With ``-fcall-graph-section``, the compiler will create a call graph section
in the object file. It will include type identifiers for indirect calls and
targets. This information can be used to map indirect calls to their receivers
with matching types. A complete and high-precision call graph can be
reconstructed by complementing this information with disassembly
(see ``llvm-objdump --call-graph-info``).

Semantics
=========

A coarse-grained, type-agnostic call graph may allow indirect calls to target
any function in the program. This approach ensures completeness since no
indirect call edge is missing. However, it is generally poor in precision
due to having unneeded edges.

A call graph section provides type identifiers for indirect calls and targets.
This information can be used to restrict the receivers of an indirect target to
indirect calls with matching type. Consequently, the precision for indirect
call edges are improved while maintaining the completeness.

The ``llvm-objdump`` utility provides a ``--call-graph-info`` option to extract
full call graph information by parsing the content of the call graph section
and disassembling the program for complementary information, e.g., direct
calls.

Section layout
==============

A call graph section consists of zero or more call graph entries.
Each entry contains information on a function and its indirect calls.

An entry of a call graph section has the following layout in the binary:

+---------------------+-----------------------------------------------------------------------+
| Element | Content |
+=====================+=======================================================================+
| FormatVersionNumber | Format version number. |
+---------------------+-----------------------------------------------------------------------+
| FunctionEntryPc | Function entry address. |
+---------------------+-----------------------------------+-----------------------------------+
| | A flag whether the function is an | - 0: not an indirect target |
| FunctionKind | indirect target, and if so, | - 1: indirect target, unknown id |
| | whether its type id is known. | - 2: indirect target, known id |
+---------------------+-----------------------------------+-----------------------------------+
| FunctionTypeId | Type id for the indirect target. Present only when FunctionKind is 2. |
+---------------------+-----------------------------------------------------------------------+
| CallSiteCount | Number of type id to indirect call site mappings that follow. |
+---------------------+-----------------------------------------------------------------------+
| CallSiteList | List of type id and indirect call site pc pairs. |
+---------------------+-----------------------------------------------------------------------+

Each element in an entry (including each element of the contained lists and
pairs) occupies 64-bit space.

The format version number is repeated per entry to support concatenation of
call graph sections with different format versions by the linker.

As of now, the only supported format version is described above and has version
number 0.

Type identifiers
================

The type for an indirect call or target is the function signature.
The mapping from a type to an identifier is an ABI detail.
In the current experimental implementation, an identifier of type T is
computed as follows:

- Obtain the generalized mangled name for “typeinfo name for T”.
- Compute MD5 hash of the name as a string.
- Reinterpret the first 8 bytes of the hash as a little-endian 64-bit integer.

To avoid mismatched pointer types, generalizations are applied.
Pointers in return and argument types are treated as equivalent as long as the
qualifiers for the type they point to match.
For example, ``char*``, ``char**``, and ``int*`` are considered equivalent
types. However, ``char*`` and ``const char*`` are considered separate types.

Missing type identifiers
========================

For functions, two cases need to be considered. First, if the compiler cannot
deduce a type id for an indirect target, it will be listed as an indirect target
without a type id. Second, if an object without a call graph section gets
linked, the final call graph section will lack information on functions from
the object. For completeness, these functions need to be taken as receiver to
any indirect call regardless of their type id.
``llvm-objdump --call-graph-info`` lists these functions as indirect targets
with `UNKNOWN` type id.

For indirect calls, current implementation guarantees a type id for each
compiled call. However, if an object without a call graph section gets linked,
no type id will be present for its indirect calls. For completeness, these calls
need to be taken to target any indirect target regardless of their type id. For
indirect calls, ``llvm-objdump --call-graph-info`` prints 1) a complete list of
indirect calls, 2) type id to indirect call mappings. The difference of these
lists allow to deduce the indirect calls with missing type ids.

TODO: measure and report the ratio of missed type ids

Performance
===========

A call graph section does not affect the executable code and does not occupy
memory during process execution. Therefore, there is no performance overhead.

The scheme has not yet been optimized for binary size.

TODO: measure and report the increase in the binary size

Example
=======

For example, consider the following C++ code:

.. code-block:: cpp

namespace {
// Not an indirect target
void foo() {}
}

// Indirect target 1
void bar() {}

// Indirect target 2
int baz(char a, float *b) {
return 0;
}

// Indirect target 3
int main() {
char a;
float b;
void (*fp_bar)() = bar;
int (*fp_baz1)(char, float*) = baz;
int (*fp_baz2)(char, float*) = baz;

// Indirect call site 1
fp_bar();

// Indirect call site 2
fp_baz1(a, &b);

// Indirect call site 3: shares the type id with indirect call site 2
fp_baz2(a, &b);

// Direct call sites
foo();
bar();
baz(a, &b);

return 0;
}

Following will compile it with a call graph section created in the binary:

.. code-block:: bash

$ clang -fcall-graph-section example.cpp

During the construction of the call graph section, the type identifiers are
computed as follows:

+---------------+-----------------------+----------------------------+----------------------------+
| Function name | Generalized signature | Mangled name (itanium ABI) | Numeric type id (md5 hash) |
+===============+=======================+============================+============================+
| bar | void () | _ZTSFvvE.generalized | f85c699bb8ef20a2 |
+---------------+-----------------------+----------------------------+----------------------------+
| baz | int (char, void*) | _ZTSFicPvE.generalized | e3804d2a7f2b03fe |
+---------------+-----------------------+----------------------------+----------------------------+
| main | int () | _ZTSFivE.generalized | a9494def81a01dc |
+---------------+-----------------------+----------------------------+----------------------------+

The call graph section will have the following content:

+---------------+-----------------+--------------+----------------+---------------+--------------------------------------+
| FormatVersion | FunctionEntryPc | FunctionKind | FunctionTypeId | CallSiteCount | CallSiteList |
+===============+=================+==============+================+===============+======================================+
| 0 | EntryPc(foo) | 0 | (empty) | 0 | (empty) |
+---------------+-----------------+--------------+----------------+---------------+--------------------------------------+
| 0 | EntryPc(bar) | 2 | TypeId(bar) | 0 | (empty) |
+---------------+-----------------+--------------+----------------+---------------+--------------------------------------+
| 0 | EntryPc(baz) | 2 | TypeId(baz) | 0 | (empty) |
+---------------+-----------------+--------------+----------------+---------------+--------------------------------------+
| 0 | EntryPc(main) | 2 | TypeId(main) | 3 | * TypeId(bar), CallSitePc(fp_bar()) |
| | | | | | * TypeId(baz), CallSitePc(fp_baz1()) |
| | | | | | * TypeId(baz), CallSitePc(fp_baz2()) |
+---------------+-----------------+--------------+----------------+---------------+--------------------------------------+


The ``llvm-objdump`` utility can parse the call graph section and disassemble
the program to provide complete call graph information. This includes any
additional call sites from the binary:

.. code-block:: bash

$ llvm-objdump --call-graph-info a.out

# Comments are not a part of the llvm-objdump's output but inserted for clarifications.

a.out: file format elf64-x86-64
# These warnings are due to the functions and the indirect calls coming from linked objects.
llvm-objdump: warning: 'a.out': callgraph section does not have type ids for 3 indirect calls
llvm-objdump: warning: 'a.out': callgraph section does not have information for 10 functions

# Unknown targets are the 10 functions the warnings mention.
INDIRECT TARGET TYPES (TYPEID [FUNC_ADDR,])
UNKNOWN 401000 401100 401234 401050 401090 4010d0 4011d0 401020 401060 401230
a9494def81a01dc 401150 # main()
f85c699bb8ef20a2 401120 # bar()
e3804d2a7f2b03fe 401130 # baz()

# Notice that the call sites share the same type id as target functions
INDIRECT CALL TYPES (TYPEID [CALL_SITE_ADDR,])
f85c699bb8ef20a2 401181 # Indirect call site 1 (fp_bar())
e3804d2a7f2b03fe 401191 4011a1 # Indirect call site 2 and 3 (fp_baz1() and fp_baz2())

INDIRECT CALL SITES (CALLER_ADDR [CALL_SITE_ADDR,])
401000 401012 # _init
401150 401181 401191 4011a1 # main calls fp_bar(), fp_baz1(), fp_baz2()
4011d0 401215 # __libc_csu_init
401020 40104a # _start

DIRECT CALL SITES (CALLER_ADDR [(CALL_SITE_ADDR, TARGET_ADDR),])
4010d0 4010e2 401060 # __do_global_dtors_aux
401150 4011a6 401110 4011ab 401120 4011ba 401130 # main calls foo(), bar(), baz()
4011d0 4011fd 401000 # __libc_csu_init

FUNCTIONS (FUNC_ENTRY_ADDR, SYM_NAME)
401000 _init
401100 frame_dummy
401234 _fini
401050 _dl_relocate_static_pie
401090 register_tm_clones
4010d0 __do_global_dtors_aux
401110 _ZN12_GLOBAL__N_13fooEv # (anonymous namespace)::foo()
401150 main # main
4011d0 __libc_csu_init
401020 _start
401060 deregister_tm_clones
401120 _Z3barv # bar()
401130 _Z3bazcPf # baz(char, float*)
401230 __libc_csu_fini
2 changes: 2 additions & 0 deletions clang/include/clang/Basic/CodeGenOptions.def
Original file line number Diff line number Diff line change
Expand Up @@ -76,6 +76,8 @@ CODEGENOPT(EnableNoundefAttrs, 1, 0) ///< Enable emitting `noundef` attributes o
CODEGENOPT(DebugPassManager, 1, 0) ///< Prints debug information for the new
///< pass manager.
CODEGENOPT(DisableRedZone , 1, 0) ///< Set when -mno-red-zone is enabled.
CODEGENOPT(CallGraphSection, 1, 0) ///< Emit a call graph section into the
///< object file.
CODEGENOPT(EmitCallSiteInfo, 1, 0) ///< Emit call site info only in the case of
///< '-g' + 'O>0' level.
CODEGENOPT(IndirectTlsSegRefs, 1, 0) ///< Set when -mno-tls-direct-seg-refs
Expand Down
4 changes: 4 additions & 0 deletions clang/include/clang/Driver/Options.td
Original file line number Diff line number Diff line change
Expand Up @@ -4131,6 +4131,10 @@ defm data_sections : BoolFOption<"data-sections",
PosFlag<SetTrue, [], [ClangOption, CC1Option],
"Place each data in its own section">,
NegFlag<SetFalse>>;
defm call_graph_section : BoolFOption<"call-graph-section",
CodeGenOpts<"CallGraphSection">, DefaultFalse,
PosFlag<SetTrue, [], [CC1Option], "Emit a call graph section">,
NegFlag<SetFalse>>;
defm stack_size_section : BoolFOption<"stack-size-section",
CodeGenOpts<"StackSizeSection">, DefaultFalse,
PosFlag<SetTrue, [], [ClangOption, CC1Option],
Expand Down
1 change: 1 addition & 0 deletions clang/lib/CodeGen/BackendUtil.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -431,6 +431,7 @@ static bool initTargetOptions(DiagnosticsEngine &Diags,
Options.StackUsageOutput = CodeGenOpts.StackUsageOutput;
Options.EmitAddrsig = CodeGenOpts.Addrsig;
Options.ForceDwarfFrameSection = CodeGenOpts.ForceDwarfFrameSection;
Options.EmitCallGraphSection = CodeGenOpts.CallGraphSection;
Options.EmitCallSiteInfo = CodeGenOpts.EmitCallSiteInfo;
Options.EnableAIXExtendedAltivecABI = LangOpts.EnableAIXExtendedAltivecABI;
Options.XRayFunctionIndex = CodeGenOpts.XRayFunctionIndex;
Expand Down
36 changes: 36 additions & 0 deletions clang/lib/CodeGen/CGCall.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -25,6 +25,7 @@
#include "clang/AST/Decl.h"
#include "clang/AST/DeclCXX.h"
#include "clang/AST/DeclObjC.h"
#include "clang/AST/Type.h"
#include "clang/Basic/CodeGenOptions.h"
#include "clang/Basic/TargetInfo.h"
#include "clang/CodeGen/CGFunctionInfo.h"
Expand Down Expand Up @@ -5021,6 +5022,11 @@ static unsigned getMaxVectorWidth(const llvm::Type *Ty) {
return MaxVectorWidth;
}

static bool isCXXDeclType(const FunctionDecl *FD) {
return isa<CXXConstructorDecl>(FD) || isa<CXXMethodDecl>(FD) ||
isa<CXXDestructorDecl>(FD);
}

RValue CodeGenFunction::EmitCall(const CGFunctionInfo &CallInfo,
const CGCallee &Callee,
ReturnValueSlot ReturnValue,
Expand Down Expand Up @@ -5693,6 +5699,36 @@ RValue CodeGenFunction::EmitCall(const CGFunctionInfo &CallInfo,
AllocAlignAttrEmitter AllocAlignAttrEmitter(*this, TargetDecl, CallArgs);
Attrs = AllocAlignAttrEmitter.TryEmitAsCallSiteAttribute(Attrs);

if (CGM.getCodeGenOpts().CallGraphSection) {
// Create operand bundle only for indirect calls, not for all
if (callOrInvoke && *callOrInvoke && (*callOrInvoke)->isIndirectCall()) {
assert((TargetDecl && TargetDecl->getFunctionType() ||
Callee.getAbstractInfo().getCalleeFunctionProtoType()) &&
"cannot find callsite type");
QualType CST;
if (TargetDecl && TargetDecl->getFunctionType())
CST = QualType(TargetDecl->getFunctionType(), 0);
else if (const auto *FPT =
Callee.getAbstractInfo().getCalleeFunctionProtoType())
CST = QualType(FPT, 0);

if (!CST.isNull()) {
auto *TypeIdMD = CGM.CreateMetadataIdentifierGeneralized(CST);
auto *TypeIdMDVal =
llvm::MetadataAsValue::get(getLLVMContext(), TypeIdMD);
BundleList.emplace_back("type", TypeIdMDVal);
}

// Set type identifier metadata of indirect calls for call graph section.
if (const FunctionDecl *FD = dyn_cast_or_null<FunctionDecl>(TargetDecl)) {
// Type id metadata is set only for C/C++ contexts.
if (isCXXDeclType(FD)) {
CGM.CreateFunctionTypeMetadataForIcall(FD->getType(), *callOrInvoke);
}
}
}
}

// Emit the actual call/invoke instruction.
llvm::CallBase *CI;
if (!InvokeDest) {
Expand Down
5 changes: 5 additions & 0 deletions clang/lib/CodeGen/CGExpr.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -6013,6 +6013,11 @@ RValue CodeGenFunction::EmitCall(QualType CalleeType, const CGCallee &OrigCallee
}
}

// Set type identifier metadata of indirect calls for call graph section.
if (CGM.getCodeGenOpts().CallGraphSection && CallOrInvoke &&
CallOrInvoke->isIndirectCall())
CGM.CreateFunctionTypeMetadataForIcall(QualType(FnType, 0), CallOrInvoke);

return Call;
}

Expand Down
Loading