Skip to content

Commit cf659bb

Browse files
authored
mtmd: refactor preprocessing + support max/min pixels (#16878)
* mtmd: refactor preprocessing + support max/min pixels * fix mlp type * implement mix/max pixels * improve hparams * better image preproc for qwen * fix * fix out of bound composite * fix (2) * fix token calculation * get_merge_kernel_size() * fix llama4 and lfm2 * gonna fix them all * use simple resize for qwen * qwen: increase min tokens * no resize if dst size == src size * restore to initial min/max tokens value for qwen
1 parent d8b860a commit cf659bb

File tree

2 files changed

+431
-331
lines changed

2 files changed

+431
-331
lines changed

tools/mtmd/clip-impl.h

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -154,8 +154,8 @@ enum projector_type {
154154
PROJECTOR_TYPE_LFM2,
155155
PROJECTOR_TYPE_KIMIVL,
156156
PROJECTOR_TYPE_LIGHTONOCR,
157-
PROJECTOR_TYPE_UNKNOWN,
158157
PROJECTOR_TYPE_COGVLM,
158+
PROJECTOR_TYPE_UNKNOWN,
159159
};
160160

161161
static std::map<projector_type, std::string> PROJECTOR_TYPE_NAMES = {

0 commit comments

Comments
 (0)