Commit Graph

234 Commits

Author SHA1 Message Date
shenlan
6bc7cb7ea2 Refactor k8s node role and split microk8s 2025-07-03 20:27:33 +08:00
shenlan
f50f7fe01e
Merge pull request #42 from svc-design/codex/修改-gpu-k8s.sh-脚本
Update gpu install script
2025-07-02 17:13:56 +08:00
shenlan
d7235bdf69 Add gpu-k8s script 2025-07-02 17:12:28 +08:00
shenlan
0590eab15d
Merge pull request #41 from svc-design/codex/refactor-playbook-roles-into-directories
Add roles for k8s GPU and monitoring charts
2025-07-01 11:47:25 +08:00
shenlan
ad0960d036 Add roles for GPU operator and monitoring charts 2025-07-01 11:47:11 +08:00
shenlan
9d6d787c93
Merge pull request #40 from svc-design/codex/修复sealos_version变量未定义问题
Fix sealos_version fact when no GitHub response
2025-06-26 18:37:34 +08:00
shenlan
8b64496aab Fix undefined variable when fetching sealos release 2025-06-26 18:35:39 +08:00
Haitao Pan
4de7b31aa6 Add GPU K8s reset playbook, rename init file, update inventory 2025-06-26 18:09:16 +08:00
shenlan
0fc9bd513e
Merge pull request #34 from svc-design/codex/rewrite-sealos-command-for-kubernetes-cluster
Add GPU k8s reset role and use script for sealos
2025-06-26 18:00:56 +08:00
shenlan
665a39fc62 gpu-k8s role use script and add reset 2025-06-26 17:34:24 +08:00
Haitao Pan
142d9bf93d - fix(common): typo in add-apt-repository (universev -> universe -y)
- fix(gpu-k8s): fallback registry changed from ghcr.io/labring to labring
2025-06-26 14:39:37 +08:00
shenlan
6f1887fc80
Merge pull request #32 from svc-design/codex/安装-fuse-overlayfs-并更新配置
Add fuse-overlayfs install note
2025-06-26 14:31:16 +08:00
shenlan
40f1bbb449
Merge pull request #33 from svc-design/codex/修正镜像前缀错误导致403
Fix registry path concatenation
2025-06-26 13:50:35 +08:00
shenlan
d5649c11d3 fix: trim registry prefix when running sealos 2025-06-26 13:50:21 +08:00
Haitao Pan
9e5ba63dd0 fix: enable universe repo, use sudo for sealos, fix role order 2025-06-26 13:22:07 +08:00
shenlan
6e21a301d8 docs: add note on enabling universe 2025-06-26 13:14:45 +08:00
Haitao Pan
042109c1ed fix: update node IPs and correct SSH key path for sealos 2025-06-26 13:06:03 +08:00
shenlan
da36ace4e5 Install fuse-overlayfs and document requirement 2025-06-26 11:13:26 +08:00
shenlan
15d0db0465
Merge pull request #26 from svc-design/codex/修复sealos命令中未知标志错误
Fix Sealos ssh flags
2025-06-26 10:33:59 +08:00
shenlan
c1330cb814 fix sealos ssh flags 2025-06-26 10:33:48 +08:00
shenlan
47baf6e018
Merge pull request #25 from svc-design/codex/支持非-root-用户和-root-用户部署 2025-06-26 09:16:58 +08:00
shenlan
64094097b5 Add non-root deployment support for gpu-k8s role 2025-06-26 09:16:11 +08:00
shenlan
8dd9346c14
Merge pull request #24 from svc-design/codex/fix-ssh-connection-issue-to-ops-1
Allow customizing SSH user for cluster setup
2025-06-26 08:53:13 +08:00
shenlan
eb0d5ad9d3 Make ssh user configurable for cluster setup 2025-06-26 08:52:37 +08:00
shenlan
8a3732b89a
Merge pull request #23 from svc-design/codex/分析并修复ssh无密码登录错误
Fix GPU k8s SSH trust
2025-06-26 00:07:14 +08:00
shenlan
e9c6e79f50 Fix GPU cluster playbook SSH setup 2025-06-26 00:07:02 +08:00
shenlan
29f65595c0
Merge pull request #22 from svc-design/codex/fix-passwordless-ssh-access-issue
Fix GPU k8s ssh precheck user
2025-06-25 23:50:18 +08:00
shenlan
3f107cd7d1 fix(gpu-k8s): use inventory ssh user for precheck 2025-06-25 23:50:03 +08:00
shenlan
8081c9ae09
Merge pull request #21 from svc-design/codex/配置本机-ssh-key
Add SSH precheck for gpu-k8s role
2025-06-25 23:39:29 +08:00
shenlan
75a08de7c0 Authorize ops host key on all cluster nodes 2025-06-25 23:39:12 +08:00
shenlan
7dfe106aba gpu-k8s: precheck SSH connectivity 2025-06-25 23:29:49 +08:00
shenlan
ebf11cbf57
Merge pull request #20 from svc-design/codex/fix-templating-error-in-ip-resolution
Fix IP resolution templating
2025-06-25 23:14:18 +08:00
shenlan
b10d45da45 Fix IP resolution templating 2025-06-25 23:14:00 +08:00
shenlan
8f444a4584
Merge pull request #19 from svc-design/codex/修复---masters---nodes-未获取ip
Fix GPU Kubernetes IP resolution
2025-06-25 23:07:52 +08:00
shenlan
e9212fded1 fix gpu-k8s role ip resolution 2025-06-25 23:07:35 +08:00
shenlan
70dbd217c9
Merge pull request #18 from svc-design/codex/fix-permission-issue-with-get_labring_registry.sh
Fix GPU role variable checks
2025-06-25 22:56:06 +08:00
shenlan
86ce571d8e support hostnames for gpu k8s role 2025-06-25 22:55:46 +08:00
shenlan
f4860966a1
Merge pull request #17 from svc-design/codex/fix--sudo--a-password-is-required--error
Fix LabRing registry prefix task sudo issue
2025-06-25 22:28:34 +08:00
shenlan
88d88d647b Move LabRing registry script into role 2025-06-25 22:27:46 +08:00
shenlan
8f8ab7a99a
Merge pull request #16 from svc-design/codex/修复labring注册表脚本未找到错误
Fix gpu-k8s role script path
2025-06-25 22:13:28 +08:00
shenlan
f6e8808ee9 fix gpu-k8s role script path 2025-06-25 22:13:14 +08:00
shenlan
42a492813a
Merge pull request #15 from svc-design/codex/根据节点ip选择镜像地址
Implement automatic LabRing registry selection
2025-06-25 22:05:50 +08:00
shenlan
f627d17de9
Merge branch 'main' into codex/根据节点ip选择镜像地址 2025-06-25 22:04:13 +08:00
shenlan
15aae07a57 feat: auto-select labring registry 2025-06-25 21:47:00 +08:00
shenlan
6040b78696
Merge pull request #14 from svc-design/codex/fix-invalid-ip-range-format-error
Fix GPU k8s role default version
2025-06-25 21:42:29 +08:00
shenlan
dc6ac56d4a gpu-k8s: separate kubernetes version 2025-06-25 21:42:15 +08:00
shenlan
1302e5231b
Merge pull request #13 from svc-design/codex/fix-sealos-installation-404-error
Update gpu-k8s role to pull latest Sealos
2025-06-25 21:33:40 +08:00
shenlan
a0436eb71c gpu role: fetch latest sealos and install tools 2025-06-25 21:33:22 +08:00
shenlan
c2a5999794
Merge pull request #12 from svc-design/codex/修正roles/vhosts/gpu-k8s/配置与sealos初始化
Fix GPU K8S role and add ssh trust setup
2025-06-25 21:17:42 +08:00
shenlan
600e05fcf8 Add SSH trust role and enhance gpu-k8s setup 2025-06-25 21:17:30 +08:00