Tgi llamacpp by fgbelidji · Pull Request #147 · awslabs/llm-hosting-container

fgbelidji · 2025-04-17T15:46:36Z

Issue #, if available:

Description of changes:

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

malav-shastri · 2025-04-21T12:02:56Z

    "TGI": ["GPU", "INF2"],
    "TEI": ["GPU", "CPU"],
-    "TGILLAMACPP": ["CPU"],
+    "TGILLAMACPP": ["GPU", "CPU"],


interesting, I thought it's just a CPU image.

Yeah and ideally for Graviton instances there should be a different container to maximize the performance, see https://huggingface.co/docs/text-generation-inference/backends/llamacpp#build-docker-image

malav-shastri · 2025-04-21T12:04:40Z

+ENTRYPOINT ["./entrypoint.sh"]
+CMD ["--json-output"]
+
+LABEL dlc_major_version="2"


dlc_major_version = "2" ? shouldn't it be "1" considering that we are contributing it for the first time?

malav-shastri · 2025-04-21T12:06:05Z

+ENTRYPOINT ["./entrypoint.sh"]
+CMD ["--json-output"]
+
+LABEL dlc_major_version="2"


same comment as above

malav-shastri · 2025-04-21T12:08:14Z

@@ -0,0 +1,21 @@
+#!/bin/bash


correct me if I am wrong but this is because of the vulnerability right? I'll take an AI to check with hosting platform if they have got any long term remediation for this vulnerability or not.

my understanding is, this is an overhead every time we are gonna add a new image right?

Indeed that's for the vulnerability, and yes this is a an overhead for all the new cuda based images

malav-shastri · 2025-04-21T12:09:59Z

-        {
-            "framework": "TEI",
+            "framework": "TGILLAMACPP",
            "device": "gpu",


dont you want to add both CPU and GPU images?

Yeah, tested both independently locally, but will ad that to build the two

malav-shastri · 2025-04-21T12:14:35Z

just for sanity, what would be the final image tag here?

…iner into tgi-llamacpp

fgbelidji and others added 8 commits April 15, 2025 17:22

Added TGI llamacpp ressources

3bb20ab

Added new tests for llamacpp backend

a18a1a0

Fix Dockerfile

dded0ef

updated releases.json

f04339a

Changed permitted devices

5e8b18f

Added test for llamacpp backend

42d19fb

Fix cpu instance tests

238021f

Added cpu image for llamacpp

411a8a7

fgbelidji requested a review from a team as a code owner April 17, 2025 15:46

malav-shastri reviewed Apr 21, 2025

View reviewed changes

fgbelidji and others added 15 commits April 22, 2025 15:59

Added cpu version for tgillamacpp in releases.json

4dd4d55

dlc major version fix

d4d7990

Merge branch 'tgi-llamacpp' of github.com:fgbelidji/llm-hosting-conta…

c7e9246

…iner into tgi-llamacpp

fix releases.json

a82bbb4

various fixes gpu

718cb3f

various fixes cpu

11b09a7

merge fix

beec1b0

Merge branch 'main' into tgi-llamacpp

2929320

Changed to smaller model for llamacpp tests

c5561d1

Merge branch 'main' into tgi-llamacpp

afe8d84

changed models for tgillamacpp tests

94d5108

changed instance

7a8a453

changed gguf model

13040f2

Merge branch 'main' into tgi-llamacpp

505c499

change cuda file location

6e86ba6

fgbelidji and others added 9 commits June 6, 2025 12:16

Merge branch 'tgi-llamacpp' of github.com:fgbelidji/llm-hosting-conta…

434854a

…iner into tgi-llamacpp

change cuda version in releases.json

fbd94e9

change cuda version in releases.json - remove cuda compat script

b95b21c

removed container_startup_health_check_timeout

109550a

changed test payload

95b191f

changed instance for cpu

6996a75

changed instance for cpu

70eaac5

Reverted latest changes

6bae751

added missig model-gguf name

371ca77

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tgi llamacpp#147

Tgi llamacpp#147
fgbelidji wants to merge 32 commits intoawslabs:mainfrom
fgbelidji:tgi-llamacpp

fgbelidji commented Apr 17, 2025

Uh oh!

malav-shastri Apr 21, 2025

Uh oh!

fgbelidji Apr 22, 2025

Uh oh!

malav-shastri Apr 21, 2025

Uh oh!

malav-shastri Apr 21, 2025

Uh oh!

malav-shastri Apr 21, 2025

Uh oh!

malav-shastri Apr 21, 2025

Uh oh!

fgbelidji Apr 22, 2025

Uh oh!

malav-shastri Apr 21, 2025

Uh oh!

fgbelidji Apr 22, 2025

Uh oh!

malav-shastri commented Apr 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

fgbelidji commented Apr 17, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

malav-shastri commented Apr 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants