Skip to content

Commit 5806fb3

Browse files
patrickvonplatenyounesbelkadagante
authored
Blog post about how to optimize LLMs for memory and speed (#1473)
* correct (( * [LLMs] Getting most out of LLMS * finish * finish * Fix * finish * finish * finish * finish * Apply suggestions from code review Co-authored-by: Younes Belkada <[email protected]> Co-authored-by: Joao Gante <[email protected]> * improve --------- Co-authored-by: Younes Belkada <[email protected]> Co-authored-by: Joao Gante <[email protected]>
1 parent 6ac46dc commit 5806fb3

File tree

4 files changed

+755
-1
lines changed

4 files changed

+755
-1
lines changed

_blog.yml

Lines changed: 11 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2777,7 +2777,7 @@
27772777
- collaboration
27782778
- diffusers
27792779
- diffusion
2780-
2780+
27812781
- local: safecoder-vs-closed-source-code-assistants
27822782
title: "SafeCoder vs. Closed-source Code Assistants"
27832783
author: julsimon
@@ -2818,3 +2818,13 @@
28182818
- diffusion
28192819
- diffusers
28202820
- text-to-image
2821+
2822+
# - local: optimize-llm
2823+
# title: "Optimizing your LLM in production"
2824+
# author: patrickvonplaten
2825+
# thumbnail: /blog/assets/163_optimize_llm/self_attn_tokens.png
2826+
# date: Sep 18, 2023
2827+
# tags:
2828+
# - nlp
2829+
# - research
2830+
# - LLM

assets/163_optimize_llm/alibi.png

17.6 KB
Loading
17.2 KB
Loading

0 commit comments

Comments
 (0)