CodeGen on OpenShift #6

polszewska · 2024-10-29T11:49:53Z

That part allows to deploy CodeGen on Red Hat OpenShift. It contains two options: with Red Hat OpenShift AI and without. Both variants were prepared to run on Xeon or using Gaudi accelerators.

* terraform: add AWS/EKS deployment for ChatQnA Signed-off-by: Sakari Poussa <sakari.poussa@intel.com> * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci --------- Signed-off-by: Sakari Poussa <sakari.poussa@intel.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>

ChatQnA: accelerate also teirerank Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

* Add monitoring option for all ChatQnA Helm components Also sync current serviceMonitors with "kubernetes-addons/Observability" manifests content. * Add Helm monitoring option documentation And refactor + fix HPA instructions (HPA setting is not global). --------- Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

* helm-chart: Make nginx service type configurable Fixed issue opea-project#501. * Adapt docsum and ui test-pod for latest changes Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

* ui: support variants for multiple examples * use the single helm chart to support multiple variants of UI * add probes for ui pod Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>

…ct#519) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Fixes issue opea-project#503 Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Add visualqna and the dependent lvm-uservice helm charts. Signed-off-by: Dolpher Du <dolpher.du@intel.com>

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Signed-off-by: Dolpher Du <dolpher.du@intel.com>

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Limit max_tokens to 17 save test time and avoid sporadic timout on busy CI nodes on cpu case in the following helm charts: - llm-uservice - faqgen Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Add the helm chart for gpt-sovits which is used by AudioQnA multilanguage example. Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Signed-off-by: Sakari Poussa <sakari.poussa@intel.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>

Signed-off-by: Paulina Olszewska <paulina.olszewska@intel.com>

for more information, see https://pre-commit.ci

poussa and others added 5 commits October 24, 2024 12:38

ChatQnA: accelerate also teirerank with Gaudi (opea-project#475)

620963f

ChatQnA: accelerate also teirerank Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>

Upgrade tei-gaudi version to 1.5.0 (opea-project#494)

c6a9c90

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

helm-chart: Make nginx service type configurable (opea-project#506)

a5c96ab

* helm-chart: Make nginx service type configurable Fixed issue opea-project#501. * Adapt docsum and ui test-pod for latest changes Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

polszewska force-pushed the openshift branch from 32ed46b to 5dec0dd Compare October 29, 2024 13:09

Ruoyu-y and others added 12 commits October 30, 2024 13:24

ui: support variants for multiple examples (opea-project#464)

96af2ad

* ui: support variants for multiple examples * use the single helm chart to support multiple variants of UI * add probes for ui pod Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>

CI: turn off bash errexit during wait for pod in GMC test (opea-proje…

745dc95

…ct#519) Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Change default model of codegen and codetrans (opea-project#508)

74476b7

Fixes issue opea-project#503 Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Add helm chart for VisualQnA example (opea-project#505)

b077d44

Add visualqna and the dependent lvm-uservice helm charts. Signed-off-by: Dolpher Du <dolpher.du@intel.com>

helm: Add audioQnA e2e helm chart (opea-project#510)

9efacee

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Add FaqGen helm chart (opea-project#513)

f847e05

Signed-off-by: Dolpher Du <dolpher.du@intel.com>

Update tgi cpu image version to 2.4.0-intel-cpu (opea-project#507)

f6c180e

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

Limit max_tokens to save test time (opea-project#525)

d6e5ad1

Limit max_tokens to 17 save test time and avoid sporadic timout on busy CI nodes on cpu case in the following helm charts: - llm-uservice - faqgen Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

helm-charts: Add gpt-sovits support (opea-project#516)

1f55e1a

Add the helm chart for gpt-sovits which is used by AudioQnA multilanguage example. Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>

[csp] add terraform layer (opea-project#520)

c476fde

Signed-off-by: Sakari Poussa <sakari.poussa@intel.com> Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>

Initial commit for CodeGen on OpenShift

1a976bc

Signed-off-by: Paulina Olszewska <paulina.olszewska@intel.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

145403c

for more information, see https://pre-commit.ci

lianhao force-pushed the openshift branch from 60037e7 to 145403c Compare November 6, 2024 05:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CodeGen on OpenShift #6

CodeGen on OpenShift #6

polszewska commented Oct 29, 2024

CodeGen on OpenShift #6

Are you sure you want to change the base?

CodeGen on OpenShift #6

Conversation

polszewska commented Oct 29, 2024