Conversation

@avolkov-intel

Changes

Added an implementation of a profiler that collects and compares activation values of an OpenVINO model, to help investigate the cause of accuracy degradation.

Reason for changes

Related tickets

162317

Tests

Contributor

@ljaljushkin ljaljushkin left a comment


Thank you for the contribution, Anatoly!
Please consider adding visualization functionality and enhancing scalability.

@github-actions github-actions bot added the documentation Improvements or additions to documentation label Dec 3, 2025
@SearchSavior

Wonderful addition!!!!

@avolkov-intel once merged I will test on

Hermes-14B
Wayfarer-2-12B

Thank you for your work.

Comment on lines +302 to +317
# Extract and convert collected statistics to numpy arrays
result: ActivationData = {}
for layer_name, statistic_points_list in statistics_aggregator.statistic_points.items():
# Extract input activations (index 1 in statistic_points_list)
in_container = list(
statistic_points_list[1].algorithm_to_tensor_collectors["collect"][0].aggregators.values()
)[0]._container
in_vals = [np.array(elem.data) for elem in in_container]

# Extract output activations (index 0 in statistic_points_list)
out_container = list(
statistic_points_list[0].algorithm_to_tensor_collectors["collect"][0].aggregators.values()
)[0]._container
out_vals = [np.array(elem.data) for elem in out_container]

result[layer_name] = {"in": in_vals, "out": out_vals}
Collaborator

Statistic collection API

Suggested change
# Extract and convert collected statistics to numpy arrays
result: ActivationData = {}
for layer_name, statistic_points_list in statistics_aggregator.statistic_points.items():
# Extract input activations (index 1 in statistic_points_list)
in_container = list(
statistic_points_list[1].algorithm_to_tensor_collectors["collect"][0].aggregators.values()
)[0]._container
in_vals = [np.array(elem.data) for elem in in_container]
# Extract output activations (index 0 in statistic_points_list)
out_container = list(
statistic_points_list[0].algorithm_to_tensor_collectors["collect"][0].aggregators.values()
)[0]._container
out_vals = [np.array(elem.data) for elem in out_container]
result[layer_name] = {"in": in_vals, "out": out_vals}
# Extract and convert collected statistics to numpy arrays
result: ActivationData = defaultdict(dict)
target_type_to_str = {
TargetType.PRE_LAYER_OPERATION: "in",
TargetType.POST_LAYER_OPERATION: "out",
}
for _, statistic_point, tensor_collector in statistic_points.get_tensor_collectors():
if statistic_point.target_point.type not in target_type_to_str:
msg = f"Unsupported target type: {statistic_point.target_point.type}"
raise RuntimeError(msg)
insert_type = target_type_to_str[statistic_point.target_point.type]
layer_name = statistic_point.target_point.target_node_name
stats = tensor_collector.get_statistics().values
result[layer_name][insert_type] = [np.array(elem.data) for elem in stats]
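The suggested rewrite keys results by layer name and direction through a `defaultdict(dict)` instead of positional indexing into private `_container` attributes. A minimal standalone sketch of that grouping pattern (the flat records below are hypothetical stand-ins for what NNCF's tensor-collector iteration yields):

```python
from collections import defaultdict

# Hypothetical flat stream of (layer_name, direction, values) records,
# standing in for iterating over NNCF's collected statistic points.
records = [
    ("linear_0", "in", [1.0, 2.0]),
    ("linear_0", "out", [3.0]),
    ("linear_1", "in", [4.0]),
]

# Group per layer, then per direction, without pre-creating inner dicts.
result = defaultdict(dict)
for layer_name, direction, values in records:
    result[layer_name][direction] = values

print(dict(result))
# {'linear_0': {'in': [1.0, 2.0], 'out': [3.0]}, 'linear_1': {'in': [4.0]}}
```

Besides avoiding private attributes, this shape tolerates layers that only have one direction collected, instead of assuming fixed list indices 0 and 1.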

msg = f"No layers found matching pattern: {pattern}"
raise ValueError(msg)

target_ops = [graph.get_node_by_key(name) for name in target_names]
Collaborator

Sort nodes in topological order. Currently nodes are sorted in lexicographical order.

Suggested change
target_ops = [graph.get_node_by_key(name) for name in target_names]
target_ops = []
for node in graph.topological_sort():
if len(target_ops) == len(target_names):
break
if node.node_key in target_names:
target_ops.append(node)
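To illustrate why the topological ordering matters, here is a minimal sketch using Python's stdlib `graphlib` (the graph, edges, and node names are made up; NNCF's `topological_sort()` on the model graph is assumed to behave analogously):

```python
from graphlib import TopologicalSorter

# Hypothetical model graph: each node maps to the set of nodes it depends on.
edges = {
    "z_relu": {"a_conv"},  # z_relu consumes a_conv's output
    "m_add": {"z_relu"},   # m_add consumes z_relu's output
}

target_names = {"m_add", "z_relu", "a_conv"}

# Lexicographic order does not match execution order here.
assert sorted(target_names) == ["a_conv", "m_add", "z_relu"]

# Topological order respects data dependencies, mirroring the suggested loop.
target_ops = []
for node in TopologicalSorter(edges).static_order():
    if len(target_ops) == len(target_names):
        break
    if node in target_names:
        target_ops.append(node)

print(target_ops)  # ['a_conv', 'z_relu', 'm_add']
```

With activations collected in execution order, a profiler can report the first layer where values start diverging, which is what matters when chasing accuracy degradation.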

Comment on lines +71 to +89
def transform_fn(data, tokenizer):
    tokenized_text = tokenizer(data["text"], return_tensors="np")
    input_ids = tokenized_text["input_ids"]
    attention_mask = tokenized_text["attention_mask"]

    inputs = {}
    inputs["input_ids"] = input_ids
    inputs["attention_mask"] = tokenized_text["attention_mask"]
    position_ids = np.cumsum(attention_mask, axis=1) - 1
    position_ids[attention_mask == 0] = 1
    inputs["position_ids"] = position_ids

    batch_size = input_ids.shape[0]
    inputs["beam_idx"] = np.arange(batch_size, dtype=int)

    return inputs


quantization_dataset = nncf.Dataset(dataset, partial(transform_fn, tokenizer=tokenizer))
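The `position_ids` arithmetic in that cell can be checked in isolation: the cumulative sum of the attention mask minus one yields 0-based positions for real tokens, and padded positions are then overwritten with 1. A small numpy sketch (the mask values are made up):

```python
import numpy as np

# Hypothetical batch: one sequence of 3 real tokens followed by 2 pad tokens.
attention_mask = np.array([[1, 1, 1, 0, 0]])

position_ids = np.cumsum(attention_mask, axis=1) - 1
position_ids[attention_mask == 0] = 1

print(position_ids)  # [[0 1 2 1 1]]
```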
Collaborator

I suggest using the following model-agnostic snippet here:

from optimum.gptq.data import get_dataset
from optimum.gptq.data import prepare_dataset

dataset = "wikitext2"
seqlen = 50
nsamples = 2
calibration_dataset = get_dataset(dataset, tokenizer, seqlen=seqlen, nsamples=nsamples)
calibration_dataset = prepare_dataset(calibration_dataset)
quantization_dataset = nncf.Dataset(calibration_dataset, lambda x: model.prepare_inputs(**x))
