Author: DevOps旭
Source: DevOps探路者
1. What are Kubernetes resources?
When operating a Kubernetes cluster, administrators habitually refer to everything in k8s as a resource: Pods, Deployments, Services, and so on. Kubernetes manages the whole cluster by maintaining and scheduling these resources.
2. Getting to know Pods
A Pod is the smallest unit of management in Kubernetes and wraps a group of containers. In the Kubernetes management philosophy, individual containers are not maintained on their own; Pods are the unit of deployment and operation. The number of containers in a Pod is flexible: it can be one or several.
What is the advantage of this design? First, recall the container management philosophy: one container runs one process (child processes aside). So when an application needs several processes, should you run multiple processes in one container, or multiple containers on the same node?
Running multiple processes in one container is technically possible: a startup script can launch the processes in dependency order. But this creates a problem. A container's liveness is judged by whether its first process is still alive, so keeping every process in the container healthy becomes a real challenge. Solving it makes the container heavier and heavier, which contradicts the lightweight nature of containers, and it also complicates log collection, data persistence, and so on. So this is not a good choice. What about running multiple containers together on the same node? That is exactly the Pod management philosophy: a group of containers is constrained to the same Pod, where they share the Pod's network, UTS, and IPC namespaces (PID namespace sharing can be enabled as well), and the Pod itself only needs to be managed through a single pause container.
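To make the shared-namespace idea concrete, here is a minimal sketch (not from the original article; the Pod name, sidecar name, and sidecar command are hypothetical) of a two-container Pod. Because both containers live in the same network namespace, the sidecar can reach nginx on localhost.

```yaml
# Sketch only: an nginx container plus a hypothetical busybox sidecar in one Pod.
apiVersion: v1
kind: Pod
metadata:
  name: web-with-sidecar
spec:
  containers:
  - name: nginx
    image: nginx
  - name: probe-sidecar        # hypothetical helper container
    image: busybox
    # Poll nginx over the shared network namespace (localhost) every 10 seconds.
    command: ["sh", "-c", "while true; do wget -q -O /dev/null http://localhost:80; sleep 10; done"]
```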
With all that said, how do we actually create a Pod? Kubernetes gives us a very convenient tool, kubectl, a command-line client that talks to the apiserver.

```
kubectl run nginx --image=nginx
```
This one simple command creates a Pod. Alternatively, a Pod can be created from a YAML file; the simplest possible manifest looks like this:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: nginx-demo
spec:
  containers:
  - image: nginx
    imagePullPolicy: Always
    name: nginx-demo
```
This manifest targets the v1 version of the Kubernetes API, declares the resource type as Pod, and names it nginx-demo.
We can create the Pod with the following commands:
```
[root@k8s01 yaml]# kubectl apply -f nginx-demo.yml
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE     IP           NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   0          4m32s   10.244.1.9   k8s02
```
A container's lifetime is short-lived, but we can set a restart policy (restartPolicy) on the Pod so that its containers are restarted:

- Always: restart the container whenever it exits
- OnFailure: restart the container (handled by the kubelet) when it terminates with a non-zero exit code
- Never: never restart
Let's modify the Pod's YAML file accordingly:

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: nginx-demo
spec:
  restartPolicy: Always
  containers:
  - image: nginx
    imagePullPolicy: Always
    name: nginx-demo
```
Then delete the Pod and recreate it.

```
[root@k8s01 yaml]# kubectl delete pod nginx
[root@k8s01 yaml]# kubectl apply -f nginx-demo.yml
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   0          52s   10.244.1.11   k8s02
```
The Pod has been recreated and is running on node k8s02. Since the official nginx image lacks many common commands, we cannot exec into the container and kill the nginx process from inside, so instead we simulate a container failure by killing the process on the node where the Pod runs.
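As an aside, if you want to locate the exact process without exec'ing into the container, one option is to ask the container runtime for the container's host PID. This is a sketch under the assumption that the node runs Docker, as in this cluster; the container ID and PID placeholders are illustrative:

```
# On the worker node: find the container, then its PID in the host PID namespace
docker ps | grep nginx
docker inspect --format '{{.State.Pid}}' <container-id>   # <container-id> is a placeholder
kill <pid>                                                 # same effect as the kill shown below
```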
```
[root@k8s02 ~]# ps -ef | grep nginx
root      58895  58880  0 02:59 ?        00:00:00 nginx: master process nginx -g daemon off;
101       58947  58895  0 02:59 ?        00:00:00 nginx: worker process
root      59071  49835  0 03:00 pts/0    00:00:00 grep --color=auto nginx
[root@k8s02 ~]# kill 58895
```

On node k8s01 we can then see:

```
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS      RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   0/1     Completed   0          53s   10.244.1.11   k8s02
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   1          55s   10.244.1.11   k8s02
```

nginx has been restarted successfully. Let's look at the Pod's events:

```
[root@k8s01 yaml]# kubectl describe pod nginx
Name:         nginx
Namespace:    default
Priority:     0
Node:         k8s02/192.168.1.32
Start Time:   Sun, 06 Sep 2020 02:59:31 +0800
Labels:
Annotations:
Status:       Running
IP:           10.244.1.11
IPs:
  IP:  10.244.1.11
Containers:
  nginx:
    Container ID:   docker://cf21ee868641ba2da52321e16fe7e43a0aca61b7ebcb0c4a4d62ecb4a3f9787a
    Image:          nginx
    Image ID:       docker-pullable://nginx@sha256:b0ad43f7ee5edbc0effbc14645ae7055e21bc1973aee5150745632a24a752661
    Port:
    Host Port:
    State:          Running
      Started:      Sun, 06 Sep 2020 03:00:24 +0800
    Last State:     Terminated
      Reason:       Completed
      Exit Code:    0
      Started:      Sun, 06 Sep 2020 02:59:48 +0800
      Finished:     Sun, 06 Sep 2020 03:00:20 +0800
    Ready:          True
    Restart Count:  1
    Environment:
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-hdhjf (ro)
Conditions:
  Type              Status
  Initialized       True
  Ready             True
  ContainersReady   True
  PodScheduled      True
Volumes:
  default-token-hdhjf:
    Type:        Secret (a volume populated by a Secret)
    SecretName:  default-token-hdhjf
    Optional:    false
QoS Class:       BestEffort
Node-Selectors:
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type    Reason     Age                From            Message
  ----    ------     ----               ----            -------
  Normal  Scheduled  63s                                Successfully assigned default/nginx to k8s02
  Normal  Pulled     47s                kubelet, k8s02  Successfully pulled image "nginx" in 16.098712681s
  Normal  Pulling    14s (x2 over 63s)  kubelet, k8s02  Pulling image "nginx"
  Normal  Created    11s (x2 over 47s)  kubelet, k8s02  Created container nginx
  Normal  Started    11s (x2 over 47s)  kubelet, k8s02  Started container nginx
  Normal  Pulled     11s                kubelet, k8s02  Successfully pulled image "nginx" in 3.162238195s
```

Here we can clearly see the kubelet restarting the nginx container.
The kubelet can restart containers inside a Pod, but what happens to this policy if the node itself fails? Let's simulate a failure of node k8s02 by stopping its kubelet and kube-proxy in turn.

```
[root@k8s02 ~]# systemctl stop kubelet
[root@k8s02 ~]# systemctl stop kube-proxy
[root@k8s02 ~]# systemctl status kube-proxy
● kube-proxy.service - Kubernetes Proxy
   Loaded: loaded (/usr/lib/systemd/system/kube-proxy.service; enabled; vendor preset: disabled)
   Active: inactive (dead) since 日 2020-09-06 03:06:03 CST; 23s ago
  Process: 971 ExecStart=/opt/kubernetes/bin/kube-proxy $KUBE_PROXY_OPTS (code=killed, signal=TERM)
 Main PID: 971 (code=killed, signal=TERM)

9月 05 23:52:04 k8s02 systemd[1]: Ignoring invalid environment assignment '--proxy-mode=ipvs': /opt/kubernetes/cfg/kube-proxy.conf
9月 05 23:52:04 k8s02 systemd[1]: Started Kubernetes Proxy.
9月 05 23:52:16 k8s02 kube-proxy[971]: E0905 23:52:16.561493     971 node.go:125] Failed to retrieve node info: Get "https...timeout
9月 05 23:52:23 k8s02 kube-proxy[971]: E0905 23:52:23.654714     971 node.go:125] Failed to retrieve node info: nodes "k8s...r scope
9月 06 03:06:03 k8s02 systemd[1]: Stopping Kubernetes Proxy...
9月 06 03:06:03 k8s02 systemd[1]: Stopped Kubernetes Proxy.
Hint: Some lines were ellipsized, use -l to show in full.
[root@k8s02 ~]# systemctl status kubelet
● kubelet.service - Kubernetes Kubelet
   Loaded: loaded (/usr/lib/systemd/system/kubelet.service; enabled; vendor preset: disabled)
   Active: inactive (dead) since 日 2020-09-06 03:05:57 CST; 35s ago
  Process: 1183 ExecStart=/opt/kubernetes/bin/kubelet $KUBELET_OPTS (code=exited, status=0/SUCCESS)
 Main PID: 1183 (code=exited, status=0/SUCCESS)

9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.456897    1183 remote_runtime.go:113] RunPodSandbox from runtime service fail...
9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.456938    1183 kuberuntime_sandbox.go:69] CreatePodSandbox for pod "nginx-679...
9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.456951    1183 kuberuntime_manager.go:730] createPodSandbox for pod "nginx-67...
9月 05 23:52:30 k8s02 kubelet[1183]: E0905 23:52:30.457009    1183 pod_workers.go:191] Error syncing pod ee15155c-faab-424...685b)"
9月 06 02:44:26 k8s02 kubelet[1183]: E0906 02:44:26.124263    1183 remote_runtime.go:329] ContainerStatus "4413a8d21a2b72b...68fb93c
9月 06 02:44:26 k8s02 kubelet[1183]: E0906 02:44:26.124934    1183 remote_runtime.go:329] ContainerStatus "35eee7e6a06d70f...91c626b
9月 06 02:51:40 k8s02 kubelet[1183]: E0906 02:51:40.490991    1183 remote_runtime.go:329] ContainerStatus "6489db11518634b...332343e
9月 06 02:51:41 k8s02 kubelet[1183]: E0906 02:51:41.660419    1183 kubelet_pods.go:1250] Failed killing the pod "nginx": f...32343e"
9月 06 03:05:57 k8s02 systemd[1]: Stopping Kubernetes Kubelet...
9月 06 03:05:57 k8s02 systemd[1]: Stopped Kubernetes Kubelet.
Hint: Some lines were ellipsized, use -l to show in full.
```
Then let's observe from node k8s01:

```
[root@k8s01 yaml]# kubectl get node
NAME    STATUS     ROLES   AGE   VERSION
k8s01   Ready              9d    v1.19.0
k8s02   NotReady           9d    v1.19.0
k8s03   Ready              9d    v1.19.0
```
Node k8s02 is now in a failed state. What about the Pod?

```
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Running   1          8m18s   10.244.1.11   k8s02
[root@k8s01 yaml]# kubectl exec -it nginx sh
kubectl exec [POD] [COMMAND] is DEPRECATED and will be removed in a future version. Use kubectl exec [POD] -- [COMMAND] instead.
Error from server: error dialing backend: dial tcp 192.168.1.32:10250: connect: connection refused
```
Now what happens if we kill the Pod's process again?

```
[root@k8s02 ~]# ps -ef | grep nginx
root      59156  59141  0 03:00 ?        00:00:00 nginx: master process nginx -g daemon off;
101       59203  59156  0 03:00 ?        00:00:00 nginx: worker process
root      61301  49835  0 03:10 pts/0    00:00:00 grep --color=auto nginx
[root@k8s02 ~]# kill 59156
```

Checking from k8s01:

```
[root@k8s01 yaml]# kubectl get po -o wide
NAME    READY   STATUS        RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx   1/1     Terminating   1          12m   10.244.1.11   k8s02
```
The Pod is now being deleted. Let's look at its events:

```
[root@k8s01 yaml]# kubectl describe pod nginx
Name:                      nginx
Namespace:                 default
Priority:                  0
Node:                      k8s02/192.168.1.32
Start Time:                Sun, 06 Sep 2020 02:59:31 +0800
Labels:
Annotations:
Status:                    Terminating (lasts 48s)
Termination Grace Period:  30s
IP:                        10.244.1.11
IPs:
  IP:  10.244.1.11
Containers:
  nginx:
    Container ID:   docker://cf21ee868641ba2da52321e16fe7e43a0aca61b7ebcb0c4a4d62ecb4a3f9787a
    Image:          nginx
    Image ID:       docker-pullable://nginx@sha256:b0ad43f7ee5edbc0effbc14645ae7055e21bc1973aee5150745632a24a752661
    Port:
    Host Port:
    State:          Running
      Started:      Sun, 06 Sep 2020 03:00:24 +0800
    Last State:     Terminated
      Reason:       Completed
      Exit Code:    0
      Started:      Sun, 06 Sep 2020 02:59:48 +0800
      Finished:     Sun, 06 Sep 2020 03:00:20 +0800
    Ready:          True
    Restart Count:  1
    Environment:
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-hdhjf (ro)
Conditions:
  Type              Status
  Initialized       True
  Ready             False
  ContainersReady   True
  PodScheduled      True
Volumes:
  default-token-hdhjf:
    Type:        Secret (a volume populated by a Secret)
    SecretName:  default-token-hdhjf
    Optional:    false
QoS Class:       BestEffort
Node-Selectors:
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type     Reason        Age                From             Message
  ----     ------        ----               ----             -------
  Normal   Scheduled     13m                                 Successfully assigned default/nginx to k8s02
  Normal   Pulled        13m                kubelet, k8s02   Successfully pulled image "nginx" in 16.098712681s
  Normal   Pulling       12m (x2 over 13m)  kubelet, k8s02   Pulling image "nginx"
  Normal   Created       12m (x2 over 13m)  kubelet, k8s02   Created container nginx
  Normal   Started       12m (x2 over 13m)  kubelet, k8s02   Started container nginx
  Normal   Pulled        12m                kubelet, k8s02   Successfully pulled image "nginx" in 3.162238195s
  Warning  NodeNotReady  6m23s              node-controller  Node is not ready
```
Now let's restore the node services and see whether the Pod comes back:

```
[root@k8s02 ~]# systemctl start kubelet
[root@k8s02 ~]# systemctl start kube-proxy
[root@k8s01 yaml]# kubectl get po -o wide
No resources found in default namespace.
[root@k8s01 yaml]#
```
As we can see, the node failure left the Pod unable to recover automatically, and even after the node came back, the Pod did not return. A Pod's own fault-recovery capability is clearly limited, which would cause plenty of problems in practice. So how should we deal with this?
3. Getting to know Deployments
3.1 Automatic failover with a Deployment
To handle Pod failover, we need to get to know another key Kubernetes resource: the Deployment. A Deployment is an extremely powerful resource, a controller provided by Kubernetes for managing stateless applications. With it we can schedule Pods, perform rolling upgrades, and scale Pods out and in. So how do we create a Deployment? Let's start with the simplest possible one:
```yaml
apiVersion: apps/v1   # API group
kind: Deployment      # resource type is Deployment
metadata:
  labels:
    app: nginx
  name: nginx
spec:
  replicas: 1         # one replica
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx  # nginx image
        name: nginx
```
Let's create this resource:

```
[root@k8s01 yaml]# kubectl apply -f nginx-deployment.yaml
[root@k8s01 yaml]# kubectl get po -o wide
NAME                     READY   STATUS    RESTARTS   AGE    IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-9lsjl   1/1     Running   0          109s   10.244.0.15   k8s01
```
The Pod was automatically scheduled onto node k8s01. What happens if we again simulate a kubelet failure on that node?

```
[root@k8s01 yaml]# systemctl stop kubelet
[root@k8s01 yaml]# systemctl stop kube-proxy
[root@k8s01 yaml]# ps -ef | grep nginx
root      70693  70678  0 03:36 ?        00:00:00 nginx: master process nginx -g daemon off;
101       70732  70693  0 03:36 ?        00:00:00 nginx: worker process
root      71641  50125  0 03:39 pts/0    00:00:00 grep --color=auto nginx
[root@k8s01 yaml]# kill 70693
```
Now let's observe node k8s01:

```
[root@k8s01 yaml]# kubectl get node
NAME    STATUS     ROLES   AGE   VERSION
k8s01   NotReady           9d    v1.19.0
k8s02   Ready              9d    v1.19.0
k8s03   Ready              9d    v1.19.0
[root@k8s01 yaml]# kubectl describe pod nginx-6799fc88d8-9lsjl
Name:         nginx-6799fc88d8-9lsjl
Namespace:    default
Priority:     0
Node:         k8s01/192.168.1.31
Start Time:   Sun, 06 Sep 2020 03:36:23 +0800
Labels:       app=nginx
              pod-template-hash=6799fc88d8
Annotations:
Status:       Running
IP:           10.244.0.15
IPs:
  IP:  10.244.0.15
Controlled By:  ReplicaSet/nginx-6799fc88d8
Containers:
  nginx:
    Container ID:   docker://f86cb1313c120b7797ac843a17f23a3551de7e868cbfe8fd24ade70de1ede843
    Image:          nginx
    Image ID:       docker-pullable://nginx@sha256:b0ad43f7ee5edbc0effbc14645ae7055e21bc1973aee5150745632a24a752661
    Port:
    Host Port:
    State:          Running
      Started:      Sun, 06 Sep 2020 03:36:26 +0800
    Ready:          True
    Restart Count:  0
    Environment:
    Mounts:
      /var/run/secrets/kubernetes.io/serviceaccount from default-token-hdhjf (ro)
Conditions:
  Type              Status
  Initialized       True
  Ready             False
  ContainersReady   True
  PodScheduled      True
Volumes:
  default-token-hdhjf:
    Type:        Secret (a volume populated by a Secret)
    SecretName:  default-token-hdhjf
    Optional:    false
QoS Class:       BestEffort
Node-Selectors:
Tolerations:     node.kubernetes.io/not-ready:NoExecute op=Exists for 300s
                 node.kubernetes.io/unreachable:NoExecute op=Exists for 300s
Events:
  Type     Reason        Age    From             Message
  ----     ------        ----   ----             -------
  Normal   Scheduled     4m42s                   Successfully assigned default/nginx-6799fc88d8-9lsjl to k8s01
  Normal   Pulling       4m42s  kubelet, k8s01   Pulling image "nginx"
  Normal   Pulled        4m40s  kubelet, k8s01   Successfully pulled image "nginx" in 2.073509979s
  Normal   Created       4m40s  kubelet, k8s01   Created container nginx
  Normal   Started       4m40s  kubelet, k8s01   Started container nginx
  Warning  NodeNotReady  68s    node-controller  Node is not ready
[root@k8s01 yaml]# kubectl get po -o wide
NAME                     READY   STATUS              RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-9lsjl   1/1     Terminating         1          13m   10.244.0.15   k8s01
nginx-6799fc88d8-dvcj7   0/1     ContainerCreating   0          3s                  k8s02
```
To our delight, after about five minutes (controlled by pod-eviction-timeout, default 5m0s) the Pod on k8s01 was automatically deleted, and a new Pod was scheduled onto node k8s02 and started, completing the failover. In production, however, a gap like this is usually unacceptable, so what other strategies do we have?
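As a side note, in v1.19 taint-based eviction is enabled by default, so the five-minute window also corresponds to the default tolerationSeconds of 300 visible in the Pod's describe output above (node.kubernetes.io/not-ready and node.kubernetes.io/unreachable). A sketch, not from the original article, of shortening that window per Pod by overriding the tolerations in the Deployment's Pod template:

```yaml
# Sketch: evict Pods from a failed node after 60s instead of the default 300s
spec:
  template:
    spec:
      tolerations:
      - key: node.kubernetes.io/not-ready
        operator: Exists
        effect: NoExecute
        tolerationSeconds: 60
      - key: node.kubernetes.io/unreachable
        operator: Exists
        effect: NoExecute
        tolerationSeconds: 60
```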
3.2 Running multiple Pod replicas with a Deployment
Looking back at the Deployment's YAML file, there is a line specifying the replica count. What happens if we change it?
```
[root@k8s01 yaml]# vim nginx-deployment.yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: nginx
  name: nginx
spec:
  replicas: 3
  selector:
    matchLabels:
      app: nginx
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx
        name: nginx
[root@k8s01 yaml]# kubectl apply -f nginx-deployment.yaml
deployment.apps/nginx configured
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-dvcj7   1/1     Running   0          7m55s   10.244.1.13   k8s02
nginx-6799fc88d8-j9l4v   1/1     Running   0          22s     10.244.0.16   k8s01
nginx-6799fc88d8-v48rj   1/1     Running   0          22s     10.244.2.15   k8s03
```
Happily, the number of Pod replicas has gone from 1 to 3. How is this achieved?

```
[root@k8s01 yaml]# kubectl describe deployment nginx
Name:                   nginx
Namespace:              default
CreationTimestamp:      Sun, 06 Sep 2020 03:36:23 +0800
Labels:                 app=nginx
Annotations:            deployment.kubernetes.io/revision: 1
Selector:               app=nginx
Replicas:               3 desired | 3 updated | 3 total | 3 available | 0 unavailable
StrategyType:           RollingUpdate
MinReadySeconds:        0
RollingUpdateStrategy:  25% max unavailable, 25% max surge
Pod Template:
  Labels:  app=nginx
  Containers:
   nginx:
    Image:        nginx
    Port:
    Host Port:
    Environment:
    Mounts:
  Volumes:
Conditions:
  Type           Status  Reason
  ----           ------  ------
  Progressing    True    NewReplicaSetAvailable
  Available      True    MinimumReplicasAvailable
OldReplicaSets:
NewReplicaSet:   nginx-6799fc88d8 (3/3 replicas created)
Events:
  Type    Reason             Age   From                   Message
  ----    ------             ----  ----                   -------
  Normal  ScalingReplicaSet  21m   deployment-controller  Scaled up replica set nginx-6799fc88d8 to 1
  Normal  ScalingReplicaSet  77s   deployment-controller  Scaled up replica set nginx-6799fc88d8 to 3
```
In the Deployment's events we can see that deployment-controller scaled the nginx replica set up to 3. That ReplicaSet is itself a Kubernetes controller, which creates Pods according to the template.
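To see that intermediate object yourself, you can list the ReplicaSets the Deployment owns (a sketch; the hash suffix will match whatever your cluster generated, e.g. nginx-6799fc88d8 above):

```
kubectl get rs -l app=nginx           # ReplicaSets created by the nginx Deployment
kubectl describe rs nginx-6799fc88d8  # its events show the individual Pods being created
```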
3.3 Rolling updates with a Deployment
Beyond the above, a Deployment can also perform rolling updates and control the pace of the rollout, mainly through the following parameters:

- maxSurge: the maximum number of Pod instances allowed above the desired replica count in the Deployment spec
- maxUnavailable: the maximum number of Pods that may be unavailable during a rolling update
Let's simulate an upgrade. First create a YAML file for the new version:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  labels:
    app: nginx
  name: nginx
spec:
  strategy:
    type: RollingUpdate
    rollingUpdate:
      maxSurge: 2
      maxUnavailable: 0
  selector:
    matchLabels:
      app: nginx
  replicas: 8
  template:
    metadata:
      labels:
        app: nginx
    spec:
      containers:
      - image: nginx:1.12.1
        name: nginx
```
Now let's start the upgrade:

```
# First scale to 8 replicas to amplify the effect and make the rolling update easier to observe
[root@k8s01 yaml]# kubectl scale deployment nginx --replicas=8
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-72kkv   1/1     Running   0          4m41s   10.244.1.14   k8s02
nginx-6799fc88d8-7tl5d   1/1     Running   0          4m41s   10.244.1.15   k8s02
nginx-6799fc88d8-dvcj7   1/1     Running   0          29m     10.244.1.13   k8s02
nginx-6799fc88d8-j9l4v   1/1     Running   0          22m     10.244.0.16   k8s01
nginx-6799fc88d8-jhwt6   1/1     Running   0          4m41s   10.244.0.17   k8s01
nginx-6799fc88d8-m4wxm   1/1     Running   0          4m41s   10.244.2.16   k8s03
nginx-6799fc88d8-mg6jl   1/1     Running   0          4m41s   10.244.0.18   k8s01
nginx-6799fc88d8-v48rj   1/1     Running   0          22m     10.244.2.15   k8s03
# Apply the upgrade
[root@k8s01 yaml]# kubectl apply -f nginx-deployment-update.yaml
deployment.apps/nginx configured
# The rolling update begins
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS              RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx-599c4c9ccc-4z7nn   0/1     ContainerCreating   0          15s                 k8s02
nginx-599c4c9ccc-kbr6v   0/1     ContainerCreating   0          15s                 k8s01
nginx-6799fc88d8-72kkv   1/1     Running             0          10m   10.244.1.14   k8s02
nginx-6799fc88d8-7tl5d   1/1     Running             0          10m   10.244.1.15   k8s02
nginx-6799fc88d8-dvcj7   1/1     Running             0          35m   10.244.1.13   k8s02
nginx-6799fc88d8-j9l4v   1/1     Running             0          28m   10.244.0.16   k8s01
nginx-6799fc88d8-jhwt6   1/1     Running             0          10m   10.244.0.17   k8s01
nginx-6799fc88d8-m4wxm   1/1     Running             0          10m   10.244.2.16   k8s03
nginx-6799fc88d8-mg6jl   1/1     Running             0          10m   10.244.0.18   k8s01
nginx-6799fc88d8-v48rj   1/1     Running             0          28m   10.244.2.15   k8s03
# Rolling update finished
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE     IP            NODE    NOMINATED NODE   READINESS GATES
nginx-599c4c9ccc-2f4fc   1/1     Running   0          2m15s   10.244.2.17   k8s03
nginx-599c4c9ccc-4cckr   1/1     Running   0          46s     10.244.0.20   k8s01
nginx-599c4c9ccc-4vh5f   1/1     Running   0          32s     10.244.1.18   k8s02
nginx-599c4c9ccc-4z7nn   1/1     Running   0          4m4s    10.244.1.16   k8s02
nginx-599c4c9ccc-87hf7   1/1     Running   0          28s     10.244.0.21   k8s01
nginx-599c4c9ccc-kbr6v   1/1     Running   0          4m4s    10.244.0.19   k8s01
nginx-599c4c9ccc-mk6c2   1/1     Running   0          74s     10.244.1.17   k8s02
nginx-599c4c9ccc-q4wtg   1/1     Running   0          41s     10.244.2.18   k8s03
```
The rolling update of nginx has finished. Let's look at the Deployment's events:

```
[root@k8s01 yaml]# kubectl describe deployment nginx
Name:                   nginx
Namespace:              default
CreationTimestamp:      Sun, 06 Sep 2020 03:36:23 +0800
Labels:                 app=nginx
Annotations:            deployment.kubernetes.io/revision: 2
Selector:               app=nginx
Replicas:               8 desired | 8 updated | 8 total | 8 available | 0 unavailable
StrategyType:           RollingUpdate
MinReadySeconds:        0
RollingUpdateStrategy:  0 max unavailable, 2 max surge
Pod Template:
  Labels:  app=nginx
  Containers:
   nginx:
    Image:        nginx:1.12.1
    Port:
    Host Port:
    Environment:
    Mounts:
  Volumes:
Conditions:
  Type           Status  Reason
  ----           ------  ------
  Available      True    MinimumReplicasAvailable
  Progressing    True    NewReplicaSetAvailable
OldReplicaSets:
NewReplicaSet:   nginx-599c4c9ccc (8/8 replicas created)
Events:
  Type    Reason             Age                From                   Message
  ----    ------             ----               ----                   -------
  Normal  ScalingReplicaSet  53m                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 1
  Normal  ScalingReplicaSet  32m                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 3
  Normal  ScalingReplicaSet  15m                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 8
  Normal  ScalingReplicaSet  5m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 2
  Normal  ScalingReplicaSet  3m11s              deployment-controller  Scaled down replica set nginx-6799fc88d8 to 7
  Normal  ScalingReplicaSet  3m11s              deployment-controller  Scaled up replica set nginx-599c4c9ccc to 3
  Normal  ScalingReplicaSet  2m10s              deployment-controller  Scaled up replica set nginx-599c4c9ccc to 4
  Normal  ScalingReplicaSet  2m10s              deployment-controller  Scaled down replica set nginx-6799fc88d8 to 6
  Normal  ScalingReplicaSet  102s               deployment-controller  Scaled down replica set nginx-6799fc88d8 to 5
  Normal  ScalingReplicaSet  102s               deployment-controller  Scaled up replica set nginx-599c4c9ccc to 5
  Normal  ScalingReplicaSet  97s                deployment-controller  Scaled down replica set nginx-6799fc88d8 to 4
  Normal  ScalingReplicaSet  97s                deployment-controller  Scaled up replica set nginx-599c4c9ccc to 6
  Normal  ScalingReplicaSet  65s (x6 over 88s)  deployment-controller  (combined from similar events): Scaled down replica set nginx-6799fc88d8 to 0
```
As we can see, Kubernetes, through deployment-controller, scaled ReplicaSet nginx-599c4c9ccc up to 2; once those Pods were running, it scaled nginx-6799fc88d8 down to 7 (the pace depends on how quickly the new Pods start), and kept alternating in this way until nginx-599c4c9ccc reached 8 and nginx-6799fc88d8 reached 0, at which point the rolling update was complete.
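To put numbers on it: with replicas: 8, maxSurge: 2, and maxUnavailable: 0, the controller never runs more than 8 + 2 = 10 Pods in total and never lets the available count drop below 8, which is exactly the step-up/step-down cadence in the events above. While an update is in flight you can also follow or pause it; the commands below are standard kubectl subcommands, shown here as a sketch:

```
kubectl rollout status deployment nginx   # block until the rollout finishes
kubectl rollout pause deployment nginx    # temporarily stop the rollout
kubectl rollout resume deployment nginx   # continue where it left off
```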
In addition, a Deployment can also roll a release back:

```
[root@k8s01 yaml]# kubectl rollout history deployment nginx
deployment.apps/nginx
REVISION  CHANGE-CAUSE
1
2
[root@k8s01 yaml]# kubectl rollout undo deployment nginx
deployment.apps/nginx rolled back
[root@k8s01 yaml]# kubectl get pod -o wide
NAME                     READY   STATUS    RESTARTS   AGE   IP            NODE    NOMINATED NODE   READINESS GATES
nginx-6799fc88d8-4wn62   1/1     Running   0          79s   10.244.1.19   k8s02
nginx-6799fc88d8-5rz78   1/1     Running   0          47s   10.244.0.24   k8s01
nginx-6799fc88d8-ckdfx   1/1     Running   0          60s   10.244.2.19   k8s03
nginx-6799fc88d8-f6dr7   1/1     Running   0          51s   10.244.1.21   k8s02
nginx-6799fc88d8-ghhp2   1/1     Running   0          55s   10.244.2.20   k8s03
nginx-6799fc88d8-msl22   1/1     Running   0          55s   10.244.0.23   k8s01
nginx-6799fc88d8-qmcxq   1/1     Running   0          79s   10.244.0.22   k8s01
nginx-6799fc88d8-wvmw9   1/1     Running   0          60s   10.244.1.20   k8s02
```
The Deployment has rolled back to the previous version. Let's look at its events:

```
[root@k8s01 yaml]# kubectl describe deployment nginx
Name:                   nginx
Namespace:              default
CreationTimestamp:      Sun, 06 Sep 2020 03:36:23 +0800
Labels:                 app=nginx
Annotations:            deployment.kubernetes.io/revision: 3
Selector:               app=nginx
Replicas:               8 desired | 8 updated | 8 total | 8 available | 0 unavailable
StrategyType:           RollingUpdate
MinReadySeconds:        0
RollingUpdateStrategy:  0 max unavailable, 2 max surge
Pod Template:
  Labels:  app=nginx
  Containers:
   nginx:
    Image:        nginx
    Port:
    Host Port:
    Environment:
    Mounts:
  Volumes:
Conditions:
  Type           Status  Reason
  ----           ------  ------
  Available      True    MinimumReplicasAvailable
  Progressing    True    NewReplicaSetAvailable
OldReplicaSets:
NewReplicaSet:   nginx-6799fc88d8 (8/8 replicas created)
Events:
  Type    Reason             Age                 From                   Message
  ----    ------             ----                ----                   -------
  Normal  ScalingReplicaSet  41m                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 8
  Normal  ScalingReplicaSet  31m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 2
  Normal  ScalingReplicaSet  29m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 7
  Normal  ScalingReplicaSet  29m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 3
  Normal  ScalingReplicaSet  28m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 4
  Normal  ScalingReplicaSet  28m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 6
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 5
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 5
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled down replica set nginx-6799fc88d8 to 4
  Normal  ScalingReplicaSet  27m                 deployment-controller  Scaled up replica set nginx-599c4c9ccc to 6
  Normal  ScalingReplicaSet  118s                deployment-controller  Scaled up replica set nginx-6799fc88d8 to 2
  Normal  ScalingReplicaSet  99s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 6
  Normal  ScalingReplicaSet  99s                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 4
  Normal  ScalingReplicaSet  99s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 7
  Normal  ScalingReplicaSet  99s (x2 over 58m)   deployment-controller  Scaled up replica set nginx-6799fc88d8 to 3
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 5
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 5
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled down replica set nginx-599c4c9ccc to 4
  Normal  ScalingReplicaSet  94s                 deployment-controller  Scaled up replica set nginx-6799fc88d8 to 6
  Normal  ScalingReplicaSet  73s (x12 over 27m)  deployment-controller  (combined from similar events): Scaled down replica set nginx-599c4c9ccc to 0
```
Using the same strategy as a rolling upgrade, the Deployment rolled back to the previous version.
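If you need to return to a specific revision rather than just the previous one, kubectl rollout supports that as well (a sketch; revision numbers come from the history shown earlier):

```
kubectl rollout history deployment nginx --revision=2   # inspect what revision 2 contained
kubectl rollout undo deployment nginx --to-revision=2   # roll back to that exact revision
```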
The Deployment is clearly one of the most important resources in Kubernetes. Later articles will analyze it in more detail and look for best practices for this controller.