add check total_max_length for generate_func #9338
Conversation
Thanks for your contribution!
    total_max_length = None
    names = [
        "total_max_length",
        "max_seq_len",
        "max_position_embeddings",
        "max_sequence_length",
        "seq_length",
    ]
    for name in names:
        total_max_length = self.config.get(name, None)
        if total_max_length is not None:
            break
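The diff above implements a first-match lookup over several possible config keys. A minimal standalone sketch of that pattern, using a plain dict in place of the model's `self.config` (an assumption for illustration; the function name is hypothetical):

```python
# Hedged sketch of the first-match config lookup from the diff.
# `config` is a plain dict standing in for the model config object.
def resolve_total_max_length(config: dict):
    names = [
        "total_max_length",
        "max_seq_len",
        "max_position_embeddings",
        "max_sequence_length",
        "seq_length",
    ]
    for name in names:
        value = config.get(name, None)
        if value is not None:
            return value  # the first key present in the config wins
    return None  # no length-related key found

print(resolve_total_max_length({"max_position_embeddings": 8192}))  # → 8192
print(resolve_total_max_length({}))  # → None
```

The key order matters: an explicit `total_max_length` takes precedence over model-architecture fields such as `max_position_embeddings`.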
Could this block simply call the llm_utils.get_model_max_position_embedding function directly?
To fix the lint issues, install pre-commit and format the code; the steps are:

# Install
pip install pre-commit
# Register pre-commit in the project folder; the code will be formatted on every commit
pre-commit install
# Run it separately on existing code files
pre-commit run --file XXXX.py
Please prioritize fixing the unit test failures. Failing unit tests block overall project development, and this PR cannot be merged until they are resolved.
Many thanks for pointing out the unit tests; they helped me better reproduce the problem I had encountered. The issue first appeared with the chatglm_v2 model: when input_len + max_length > max_sequence_length, the input cannot be truncated to max_sequence_length, so once the generated length exceeds max_sequence_length the error below occurs (error output omitted in the original). Some time after the error appears, a CUDA error follows. To reproduce it, add max_sequence_length=self.seq_length in chatglm_v2's get_config function instead of using the default 2048, and set the sequence_length variable in _get_input_ids_and_config of the ChatGLMv2Test class directly to the maximum value (screenshot omitted in the original).

I suspect this is caused by insufficient test coverage for chatglm_v2. I further traced it to the CoreAttention class in the model file, but when the error surfaces depends on when query_layer and key_layer are operated on, or even on operations on earlier related tensors; printing those tensors is enough to trigger it.

The main cause of the unit test failure is that for the chatglm model this out-of-bounds access does not raise an error, so the results of the generate function and the sample function diverge: the truncated length is shorter than in the untruncated sample case. For other models, the handling of sequence_length in the unit test functions is also somewhat inconsistent.
This Pull Request is stale because it has been open for 60 days with no activity.
Automatically closed by Paddle-bot. |



PR types
Others
PR changes
Others
Description
This PR adds a check on total_max_length in the generate function, fixing the error that can occur when total_max_length < input_len + max_new_tokens:

Error: ../paddle/phi/kernels/funcs/gather.cu.h:60 Assertion `index_value >= 0 && index_value < input_dims[j]` failed. The index is out of bounds, please check whether the dimensions of index and input meet the requirements. It should be less than [8192] and greater than or equal to 0, but received [8192].