Skip to content

Conversation

@wangleis
Copy link
Contributor

@wangleis wangleis commented Nov 24, 2025

Details:

  • Revert the LLM identification method to the previous version

Tickets:

@wangleis wangleis requested review from a team as code owners November 24, 2025 13:56
@github-actions github-actions bot added the category: CPU OpenVINO CPU plugin label Nov 24, 2025
@maxnick maxnick added this to the 2026.0 milestone Nov 24, 2025
(ov::get_number_of_blocked_cores() || proc_type_table[0][LP_EFFICIENT_CORE_PROC] > 0) &&
proc_type_table[0][EFFICIENT_CORE_PROC] <= 2 * proc_type_table[0][MAIN_CORE_PROC]) {
if (ov::op::util::is_large_language_model(*model)) {
if (int8_intensive || llm_related) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here we disable E cores for any int8 model, right? Thus probably it does make sense to remove int8_intensive from line 730.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

updated

@maxnick
Copy link
Contributor

maxnick commented Nov 25, 2025

Just a general question. Does this change mean that we disable E cores for all int8 quantized models in the latency mode?

@wangleis wangleis requested a review from maxnick November 26, 2025 01:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

category: CPU OpenVINO CPU plugin

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants