This repository was archived by the owner on Jul 22, 2025. It is now read-only.

Conversation

@romanrizzi
Member

We hit a snag with our hot topic gist strategy: the regex we used to split the content didn't work, so we cannot send the original post separately. This was important for letting the model focus on what's new in the topic.

The algorithm doesn’t give us full control over how prompts are written, and figuring out how to format the content isn't straightforward. This means we're having to use more complicated workarounds, like regex.

To tackle this, I'm suggesting we simplify the approach a bit. Let's focus on summarizing as much as we can upfront, then gradually add new content until there's nothing left to summarize.

Also, the "extend" part is mostly for models with small context windows, which shouldn't pose a problem 99% of the time with the content volume we're dealing with.

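The summarize-upfront-then-extend loop described above might be sketched roughly like this; every method name and the tokens-per-post figure are illustrative stand-ins, not the plugin's actual API:

```ruby
# Hypothetical sketch: summarize as many posts as fit in the context
# window up front, then keep extending the summary with the remaining
# posts until there's nothing left to summarize.
def fold(posts, window_tokens:, tokens_per_post: 100)
  summary = nil
  remaining = posts.dup

  until remaining.empty?
    # How many posts fit in one pass, under a crude per-post token budget.
    batch_size = [window_tokens / tokens_per_post, 1].max
    batch = remaining.shift(batch_size)
    summary = summarize(summary, batch) # one LLM call per pass
  end

  summary
end

# Stand-in for the actual LLM call; just concatenates for illustration.
def summarize(previous_summary, batch)
  [previous_summary, *batch].compact.join(" | ")
end
```

With a large enough window this degenerates into a single pass, which is the common case the thread discusses below.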
@xfalcox
Member

xfalcox commented Oct 24, 2024

> Also, the "extend" part is mostly for models with small context windows, which shouldn't pose a problem 99% of the time with the content volume we're dealing with.

Given that 99% of the time we will only do a single pass, shouldn't we only support that to simplify code?

Like

```
if "topic fits in context"
  summarize with all posts
else
  summarize using best replies mode
end
```

@romanrizzi
Member Author

> > Also, the "extend" part is mostly for models with small context windows, which shouldn't pose a problem 99% of the time with the content volume we're dealing with.
>
> Given that 99% of the time we will only do a single pass, shouldn't we only support that to simplify code?
>
> Like
>
> ```
> if "topic fits in context"
>   summarize with all posts
> else
>   summarize using best replies mode
> end
> ```

This brings up another point: with the average context window being much larger than it was a year ago, are we being overly cautious in how we choose posts to summarize? I think the answer is probably yes, and we could start sending the entire topic to the model.

However, I don’t think we should completely drop the folding. It’s a good safety net for avoiding overwhelming the LLM with too much content. I wouldn't expect this approach to change much, especially now that we've separated the "what" and the "how" into different strategies.

@xfalcox
Member

xfalcox commented Oct 25, 2024

> This brings up another point: with the average context window being much larger than it was a year ago, are we being overly cautious in how we choose posts to summarize? I think the answer is probably yes, and we could start sending the entire topic to the model.

The LLM config gives us the exact context size, so we know whether it fits or not. We should check: if the topic size is < 80% of the context window, we should send it all.
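
A minimal sketch of that check, assuming a naive chars-per-token heuristic; `estimate_tokens`, `single_pass?`, and the 4-chars-per-token figure are all hypothetical, not the actual plugin code:

```ruby
# Send the whole topic in one pass only when its estimated token count
# fits within 80% of the model's context window; otherwise fall back
# to the best-replies selection mode.
CONTEXT_HEADROOM = 0.8

def estimate_tokens(text)
  (text.length / 4.0).ceil # rough heuristic, not a real tokenizer
end

def single_pass?(topic_text, context_window_tokens)
  estimate_tokens(topic_text) < context_window_tokens * CONTEXT_HEADROOM
end
```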

@romanrizzi
Member Author

> > This brings up another point: with the average context window being much larger than it was a year ago, are we being overly cautious in how we choose posts to summarize? I think the answer is probably yes, and we could start sending the entire topic to the model.
>
> The LLM config gives us the exact context size, so we know whether it fits or not. We should check: if the topic size is < 80% of the context window, we should send it all.

Fair enough. I'll follow up in a different PR.

@romanrizzi romanrizzi merged commit ec97996 into main Oct 25, 2024
5 checks passed
@romanrizzi romanrizzi deleted the fold_revamp branch October 25, 2024 14:51