
CodeGen on OpenShift #6

Open
wants to merge 17 commits into
base: main

polszewska
Owner

This part adds support for deploying CodeGen on Red Hat OpenShift. It provides two options: with Red Hat OpenShift AI and without it. Both variants are prepared to run on Xeon processors or with Gaudi accelerators.

poussa and others added 5 commits October 24, 2024 12:38
* terraform: add AWS/EKS deployment for ChatQnA

Signed-off-by: Sakari Poussa <sakari.poussa@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: Sakari Poussa <sakari.poussa@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
ChatQnA: also accelerate teirerank

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
* Add monitoring option for all ChatQnA Helm components

Also sync current serviceMonitors with "kubernetes-addons/Observability"
manifests content.

* Add Helm monitoring option documentation

And refactor + fix HPA instructions (HPA setting is not global).


---------

Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
* helm-chart: Make nginx service type configurable

Fixed issue opea-project#501.

* Adapt docsum and ui test-pod for latest changes

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
Ruoyu-y and others added 12 commits October 30, 2024 13:24
* ui: support variants for multiple examples
* use the single helm chart to support multiple variants of UI
* add probes for ui pod

Signed-off-by: Ruoyu Ying <ruoyu.ying@intel.com>
Fixes issue opea-project#503

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
Add visualqna and the dependent lvm-uservice helm charts.

Signed-off-by: Dolpher Du <dolpher.du@intel.com>
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
Limit max_tokens to 17 to save test time and avoid sporadic timeouts
on busy CI nodes in the CPU case in the following helm charts:

- llm-uservice
- faqgen

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
Add the helm chart for gpt-sovits which is used by AudioQnA
multilanguage example.

Signed-off-by: Lianhao Lu <lianhao.lu@intel.com>
Signed-off-by: Sakari Poussa <sakari.poussa@intel.com>
Co-authored-by: Malini Bhandaru <malini.bhandaru@intel.com>
Signed-off-by: Paulina Olszewska <paulina.olszewska@intel.com>
6 participants