shenlan
|
bc71dc455f
|
Add gpu-k8s script
|
2025-07-02 17:12:28 +08:00 |
|
shenlan
|
b462001499
|
Merge pull request #41 from svc-design/codex/refactor-playbook-roles-into-directories
Add roles for k8s GPU and monitoring charts
|
2025-07-01 11:47:25 +08:00 |
|
shenlan
|
38abe9f368
|
Add roles for GPU operator and monitoring charts
|
2025-07-01 11:47:11 +08:00 |
|
shenlan
|
603cbf6a1a
|
Merge pull request #40 from svc-design/codex/修复sealos_version变量未定义问题
Fix sealos_version fact when no GitHub response
|
2025-06-26 18:37:34 +08:00 |
|
shenlan
|
37cc0b56c1
|
Fix undefined variable when fetching sealos release
|
2025-06-26 18:35:39 +08:00 |
|
Haitao Pan
|
ce86876892
|
Add GPU K8s reset playbook, rename init file, update inventory
|
2025-06-26 18:09:16 +08:00 |
|
shenlan
|
2788c80815
|
Merge pull request #34 from svc-design/codex/rewrite-sealos-command-for-kubernetes-cluster
Add GPU k8s reset role and use script for sealos
|
2025-06-26 18:00:56 +08:00 |
|
shenlan
|
4f68aeee25
|
gpu-k8s role use script and add reset
|
2025-06-26 17:34:24 +08:00 |
|
Haitao Pan
|
f8982f93b5
|
- fix(common): typo in add-apt-repository (universev -> universe -y)
- fix(gpu-k8s): fallback registry changed from ghcr.io/labring to labring
|
2025-06-26 14:39:37 +08:00 |
|
shenlan
|
43e95c4b98
|
Merge pull request #33 from svc-design/codex/修正镜像前缀错误导致403
Fix registry path concatenation
|
2025-06-26 13:50:35 +08:00 |
|
shenlan
|
a87a4c20db
|
fix: trim registry prefix when running sealos
|
2025-06-26 13:50:21 +08:00 |
|
Haitao Pan
|
159d378aaa
|
fix: enable universe repo, use sudo for sealos, fix role order
|
2025-06-26 13:22:07 +08:00 |
|
Haitao Pan
|
5c560c1921
|
fix: update node IPs and correct SSH key path for sealos
|
2025-06-26 13:06:03 +08:00 |
|
shenlan
|
f7e93fbd0c
|
Install fuse-overlayfs and document requirement
|
2025-06-26 11:13:26 +08:00 |
|
shenlan
|
13a56d5c07
|
Merge pull request #26 from svc-design/codex/修复sealos命令中未知标志错误
Fix Sealos ssh flags
|
2025-06-26 10:33:59 +08:00 |
|
shenlan
|
f9fd140ea1
|
fix sealos ssh flags
|
2025-06-26 10:33:48 +08:00 |
|
shenlan
|
e3280c879a
|
Merge pull request #25 from svc-design/codex/支持非-root-用户和-root-用户部署
|
2025-06-26 09:16:58 +08:00 |
|
shenlan
|
74464a22be
|
Add non-root deployment support for gpu-k8s role
|
2025-06-26 09:16:11 +08:00 |
|
shenlan
|
3a6d50db3a
|
Merge pull request #24 from svc-design/codex/fix-ssh-connection-issue-to-ops-1
Allow customizing SSH user for cluster setup
|
2025-06-26 08:53:13 +08:00 |
|
shenlan
|
deadeae892
|
Make ssh user configurable for cluster setup
|
2025-06-26 08:52:37 +08:00 |
|
shenlan
|
e8bd1eefa4
|
Merge pull request #23 from svc-design/codex/分析并修复ssh无密码登录错误
Fix GPU k8s SSH trust
|
2025-06-26 00:07:14 +08:00 |
|
shenlan
|
bf20dd62c3
|
Fix GPU cluster playbook SSH setup
|
2025-06-26 00:07:02 +08:00 |
|
shenlan
|
68830ab067
|
Merge pull request #22 from svc-design/codex/fix-passwordless-ssh-access-issue
Fix GPU k8s ssh precheck user
|
2025-06-25 23:50:18 +08:00 |
|
shenlan
|
556058036f
|
fix(gpu-k8s): use inventory ssh user for precheck
|
2025-06-25 23:50:03 +08:00 |
|
shenlan
|
ceb07b5a4c
|
Merge pull request #21 from svc-design/codex/配置本机-ssh-key
Add SSH precheck for gpu-k8s role
|
2025-06-25 23:39:29 +08:00 |
|
shenlan
|
4131181bc6
|
Authorize ops host key on all cluster nodes
|
2025-06-25 23:39:12 +08:00 |
|
shenlan
|
d1de70b020
|
gpu-k8s: precheck SSH connectivity
|
2025-06-25 23:29:49 +08:00 |
|
shenlan
|
a1c023f216
|
Merge pull request #20 from svc-design/codex/fix-templating-error-in-ip-resolution
Fix IP resolution templating
|
2025-06-25 23:14:18 +08:00 |
|
shenlan
|
310e4aef12
|
Fix IP resolution templating
|
2025-06-25 23:14:00 +08:00 |
|
shenlan
|
214018e607
|
Merge pull request #19 from svc-design/codex/修复---masters---nodes-未获取ip
Fix GPU Kubernetes IP resolution
|
2025-06-25 23:07:52 +08:00 |
|
shenlan
|
8bdf5fb17e
|
fix gpu-k8s role ip resolution
|
2025-06-25 23:07:35 +08:00 |
|
shenlan
|
050d327fc9
|
Merge pull request #18 from svc-design/codex/fix-permission-issue-with-get_labring_registry.sh
Fix GPU role variable checks
|
2025-06-25 22:56:06 +08:00 |
|
shenlan
|
6fd798d4f4
|
support hostnames for gpu k8s role
|
2025-06-25 22:55:46 +08:00 |
|
shenlan
|
17d43001e0
|
Merge pull request #17 from svc-design/codex/fix--sudo--a-password-is-required--error
Fix LabRing registry prefix task sudo issue
|
2025-06-25 22:28:34 +08:00 |
|
shenlan
|
7d76bc170e
|
Move LabRing registry script into role
|
2025-06-25 22:27:46 +08:00 |
|
shenlan
|
4989b26dd6
|
Merge pull request #16 from svc-design/codex/修复labring注册表脚本未找到错误
Fix gpu-k8s role script path
|
2025-06-25 22:13:28 +08:00 |
|
shenlan
|
00ab7a116c
|
fix gpu-k8s role script path
|
2025-06-25 22:13:14 +08:00 |
|
shenlan
|
e9f1337e4d
|
Merge pull request #15 from svc-design/codex/根据节点ip选择镜像地址
Implement automatic LabRing registry selection
|
2025-06-25 22:05:50 +08:00 |
|
shenlan
|
5019dc008c
|
Merge branch 'main' into codex/根据节点ip选择镜像地址
|
2025-06-25 22:04:13 +08:00 |
|
shenlan
|
206f649406
|
feat: auto-select labring registry
|
2025-06-25 21:47:00 +08:00 |
|
shenlan
|
892ed22100
|
Merge pull request #14 from svc-design/codex/fix-invalid-ip-range-format-error
Fix GPU k8s role default version
|
2025-06-25 21:42:29 +08:00 |
|
shenlan
|
35fca24f2e
|
gpu-k8s: separate kubernetes version
|
2025-06-25 21:42:15 +08:00 |
|
shenlan
|
20b6647639
|
Merge pull request #13 from svc-design/codex/fix-sealos-installation-404-error
Update gpu-k8s role to pull latest Sealos
|
2025-06-25 21:33:40 +08:00 |
|
shenlan
|
3933de1764
|
gpu role: fetch latest sealos and install tools
|
2025-06-25 21:33:22 +08:00 |
|
shenlan
|
9d19914dec
|
Merge pull request #12 from svc-design/codex/修正roles/vhosts/gpu-k8s/配置与sealos初始化
Fix GPU K8S role and add ssh trust setup
|
2025-06-25 21:17:42 +08:00 |
|
shenlan
|
b6bbd93cea
|
Add SSH trust role and enhance gpu-k8s setup
|
2025-06-25 21:17:30 +08:00 |
|
shenlan
|
524aaf6d0a
|
Merge pull request #11 from svc-design/codex/更新-readme.md-并创建子文档
Update README with docs reference
|
2025-06-25 20:44:00 +08:00 |
|
shenlan
|
79576fb9b6
|
Merge pull request #10 from svc-design/codex/修复roles/vhosts/gpu-k8s/问题
Fix NVIDIA repo URLs for gpu role
|
2025-06-25 20:41:29 +08:00 |
|
shenlan
|
1664a8cddd
|
docs: add repo structure overview
|
2025-06-25 20:40:54 +08:00 |
|
shenlan
|
b8f6cc7648
|
Fix NVIDIA repository URLs
|
2025-06-25 20:40:38 +08:00 |
|