Skip to content

Conversation

@oOraph
Copy link
Contributor

@oOraph oOraph commented Mar 13, 2025

  • Update base image to ubuntu 22.04 (20.04 reaches end of support this year anyway)
  • Remove some non required dependencies for the webservice to start
  • Remove unused build arg from docker file

@oOraph
Copy link
Contributor Author

oOraph commented Mar 13, 2025

side note: I may be wrong in removing all the dependencies (openjdk and stuff). I did so because I do not see where they are used, so no need to embed them, but I may miss the whole picture about the image usage. I just built the inf2 image as if I were to deploy it on HF endpoints, for the encoder models it now supports on inf2

@oOraph oOraph requested review from ErikKaum and alvarobartt March 13, 2025 09:43
@oOraph
Copy link
Contributor Author

oOraph commented Mar 13, 2025

for the inf2_env.py helper script, strongly inspired from the script we made on tgi neuronx side

@oOraph oOraph marked this pull request as ready for review March 13, 2025 09:59
@oOraph oOraph force-pushed the dev/inf2_image branch 3 times, most recently from 038e9eb to 78a245c Compare March 13, 2025 15:34
Copy link
Member

@ErikKaum ErikKaum left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

should be good, thank you 👍

oOraph added 4 commits April 2, 2025 16:06
20.04 is reaching end of support this year anyway

remove some unused dependencies for the inference web service to run

Signed-off-by: Raphael Glon <[email protected]>
- + fix broken dep install (kenlm)

Signed-off-by: Raphael Glon <[email protected]>
use the same HF_MODEL_ID env var as the main server

Signed-off-by: Raphael Glon <[email protected]>
@alvarobartt alvarobartt merged commit 8c3562d into main Apr 14, 2025
6 checks passed
@alvarobartt alvarobartt deleted the dev/inf2_image branch April 14, 2025 07:20
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants