對話阿里雲田濤濤:企業如何用好雲、管好雲?

{"type":"doc","content":[{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"近幾年,數字化轉型帶來了更加複雜的 IT 基礎設施和大量的業務系統,對企業自身的運維能力來說,是一場前所未有的大考。DevOps 出現以後,極大程度地提升了企業的研發效率,縮短了業務從研發到上線的週期。在相近時間誕生的雲計算,其所擁有的“軟件定義一切”的特性,更是與 DevOps、智能運維和基礎設施即代碼(Iac) 等自動化運維趨勢相互促進。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"然而,將傳統的 DevOps 直接搬到雲上,是否真正地釋放了雲的優勢?企業到底應該如何“用好雲、管好雲”?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"帶着這些問題,InfoQ 在 2021 雲上架構與運維峯會舉辦之際,採訪了阿里雲彈性計算管控平臺技術負責人田濤濤。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.geekbang.org\/infoq\/03\/03f356bed3a4547aee989f65d4b4aba2.webp","alt":"圖片","title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"雲時代,運維不重要了?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"雲時代到來以後,運維的門檻被大幅降低。傳統運維需要處理服務器、網絡等硬件設備,而在雲時代,運維工程師不再需要直接操作實體資源,負載均衡、動態伸縮、數據遷移等服務全部可以交由雲平臺廠商來提供。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"因此,與“去運維”相關的言論甚囂塵上,不少人認爲運維崗位會逐漸走向消亡,但事實是否真的如此?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"“雲時代的運維,變得比以前更加迫切、更加重要。”田濤濤認爲,運維不是消亡,而是需要進化,因爲雲原生趨勢的到來,給運維提出了更多挑戰。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"第一,敏捷快速的交付方式給運維和交付帶來了巨大的挑戰。"},{"type":"text","text":"早前,研發團隊交付一款 App 是按照半年時間進行規劃的。如今,App 從研發、交付再到上線,整個過程僅需要7 天。這樣一來,高效地進行運維管理成爲了雲上運維必須思考的問題。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"第二,排查問題的難度持續飆升。"},{"type":"text","text":"無論是傳統設備還是智能化設備,服務化都是大家關注的焦點,但做到服務化之後,系統之間的耦合會使調用關係變得複雜,一旦出現問題,它的影響面非常不可控。如何能快速做好可靠性、可用性觀測、問題排查以及問題診斷,同樣成爲了雲上運維的重大挑戰。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"第三,在線系統數量變多,宕機影響變大。"},{"type":"text","text":"由於在線系統的數量越來越多,出現問題之後影響面是非常大的,甚至可能影響民生的工程。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"不僅如此,雲上運維的範疇也比以往更加廣泛,運維人員需要關注藍圖規劃、上雲交付以及雲上管理整個過程。我們能夠清晰地感知到,身處新技術革命浪潮下,企業想要搶佔市場,做好雲上運維是非常重要的一環。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"多數企業未發揮出雲端 DevOps 潛力"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"幾乎所有企業都十分認可公有云帶來的產品和服務能力,並且大部分企業已經在公有云中使用了 DevOps,打通了開發與運維之間的壁壘,讓團隊從業務需求出發,向一個共同的目標前進。但將傳統的 DevOps 直接搬到雲上,又能否獲得 1+1 等於或者大於 2 的收益呢?"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"答案是否定的。雖然雲廠商屏蔽了底層的基礎設施,讓開發人員無需關注底層資源,使得很多企業認爲上雲其實是一件容易的事情。但實際上,雲本身是一個非常複雜的操作系統,很多企業在傳統線下沒有自動化的基礎設施工具。"},{"type":"text","marks":[{"type":"strong"}],"text":"因此在田濤濤看來,企業沒有轉變觀念、沒有把雲原生運維工具用好,是阻礙其充分發揮雲端 DevOps 優勢的一個重要原因。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"根據 Puppt2021 年度運維報告顯示,只有 20% 的企業認爲自己充分發揮了雲端 DevOps 的潛力。雲上自動化運維的模式和思維與傳統 DevOps 相比,仍然有着不小差異。這也是部分企業上雲之後,建立一套雲原生自動化運維體系的挑戰。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"首先,傳統企業上雲之後需要意識到,操作的主體會從操作資產變成了對可編程的資源"},{"type":"text","text":",這個轉變是非常重要的過程:傳統運維模式操作的都是企業的資產,需要充分壓榨提升單機的利用率和使用率,並需要提前很久規劃資源;而云端運維天然就有彈性的屬性,除了提升單機利用率,還可以 On-demand 地獲取資源和釋放,同時雲平臺把一切都變成了可編程的資源,通過開放 OpenAPI 和應用分組來讓用戶管控資源。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"其次,雲上運維對安全可審計的要求更高。"},{"type":"text","text":"雲端操作會高頻切換很多自動化的任務,操作來源和對象相對複雜,對操作審計和操作來源和報警的時效性要求比較高;雲端提供的服務可以將服務通過一條命令直接暴露在公網之中,需要更多的設計和思考安全和網絡規劃能力來降低系統風險;高頻的可編程自動化運維需要有比較好的審計和問題追蹤能力,避免越權和不容易被追蹤的問題。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"此外,這幾年自助服務已經成爲很多企業的追求目標。"},{"type":"text","text":"在雲上,很多企業都把自己的產品,通過服務的形式暴露給更多的客戶,所以對於系統的可靠性有着更高的要求。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"CloudOps 應運而生"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"“企業想要尋找到一名優秀的 DevOps 工程師,其成本是非常高的。”田濤濤說。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"爲此,阿里云爲企業帶來的破局思路是:幫助企業理解雲上運維,併爲處於不同階段的企業推薦不同的功能,進而簡化他們的學習門檻,提高使用雲原生運維工具的便捷度。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","marks":[{"type":"strong"}],"text":"在 2021 雲上架構與運維峯會中,阿里雲在業界首發了"},{"type":"link","attrs":{"href":"https:\/\/summit.aliyun.com\/cloudbuild2021?utm_content=m_1000311254","title":"xxx","type":null},"content":[{"type":"text","text":"雲上自動化運維(CloudOps)白皮書"}]},{"type":"text","marks":[{"type":"strong"}],"text":",定義並系統性闡釋了一個新的詞彙——CloudOps,着重強調如何在雲平臺上更好地踐行 DevOps。"},{"type":"text","text":"同時,田濤濤也在會上發表了《CloudOps :自動化運維的新思路》的主題演講。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.geekbang.org\/infoq\/97\/978802b399fdb6056af6cc7faf90751e.webp","alt":"圖片","title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"據他介紹,CloudOps 作爲傳統 IT 運維和 DevOps 的延展,可以通過雲原生架構實現運維的再進化,充分幫助企業降低 IT 運維成本、提升交付速度和系統靈活敏捷度、增強系統可靠性,構建更加安全可信開放的業務平臺。"},{"type":"text","marks":[{"type":"strong"}],"text":"在 CloudOps 白皮書中還強調了一點,CloudOps 不等於單純的 Cloud+DevOps 或者 DevOpsonCloud,而需要將 DevOps 和雲有機結合,才能收穫更大價值。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"此外,田濤濤在演講時提到:“雲上運維是一個從簡單到複雜、從成長到成熟的管理過程。”企業根據不同的上雲狀態以及使用規模,其雲上運維的思路都不盡相同,並且隨着業務不斷髮展,運維的思路也日益複雜。創業公司從第一天開始就可以在雲上部署其生產環境服務客戶,而對於已經存在 IT 投入的公司來說,則需要花費更長的時間逐步上雲。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"但可以肯定的是,無論企業身處哪種場景,其運維需求都會持續存在:降低成本、提高效率是企業追求的核心目標。因此,有效地規劃和制定運維策略和方法非常重要。"},{"type":"text","marks":[{"type":"strong"}],"text":"阿里雲在 CloudOps 白皮書中提出了成熟度模型——CARES,分爲自動化能力、彈性能力、高可用能力、安全和合規能力以及成本資源量化管理五個維度進行衡量,幫助企業判斷自己所處的階段,也爲處於不同階段的企業提供運維策略參考與優化方向。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.geekbang.org\/infoq\/26\/26e12b8b0dd3830df9b194c3bfb037dc.webp","alt":"圖片","title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"簡化路徑,讓雲上運維更簡單"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"對於企業來說,如何能夠高效地交付應用已成爲了業界的共識,這就要求企業需要通過自動化、自主化的策略高效工作。對於一名研發人員來說,他們最頭痛的問題就是在基礎設施和應用之間來回切換、適配。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"爲了讓企業在運維階段更省心,田濤濤還在峯會中同步了 ECS 自動化運維套件的全新升級,包括服務器遷移中心、資源編排、運維編排等 15 個工具,可以幫助企業實現從 IT 架構的規劃、遷移、部署、彈性擴縮容到日常管理,覆蓋雲基礎設施全生命週期的自動化運維。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"本次 ECS 自動化運維套件推出了新產品——應用管理 Application Manager,不同於從前的資源視角,應用管理支持從應用視角監控、管理和運維基礎資源,實現更精細化的管理,並與阿里雲 DevOps 平臺雲效集成,支持一鍵完成從代碼編譯構建到部署的全生命週期。"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"image","attrs":{"src":"https:\/\/static001.geekbang.org\/infoq\/8a\/8a7a8bd93554a7d369dbe5b90b363284.webp","alt":"圖片","title":null,"style":[{"key":"width","value":"75%"},{"key":"bordertype","value":"none"}],"href":null,"fromPaste":true,"pastePass":true}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"在接受 InfoQ 採訪時,田濤濤表示:“基於用戶在使用 ECS 過程中反饋的常見工單,我們建了一個集羣模型來幫助用戶快速定義、診斷錯誤的鏈路,這就是我們的智能診斷服務。之前系統出現問題時,企業需要花幾個小時拉人、拉羣去解決,但通過自助化服務的工具,可以做到秒級或者分鐘級就把問題解決掉。”"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"和智能問答、智能機器人一樣,ECS 的升級思路也是優先幫助用戶解決問題。正如田濤濤在演講結束時提到的那樣:"},{"type":"text","marks":[{"type":"strong"}],"text":"未來,傳統的運維需要進化到新的思路,企業應該更少地關注基礎設施和基礎資源,更多地迴歸到應用本身,讓企業運維視角與雲平臺的運維視角緊緊貼合。"}]},{"type":"heading","attrs":{"align":null,"level":2},"content":[{"type":"text","text":"寫在最後"}]},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null}},{"type":"paragraph","attrs":{"indent":0,"number":0,"align":null,"origin":null},"content":[{"type":"text","text":"談及對於雲上運維的未來展望,田濤濤認爲,在巨石應用改造和企業服務化適配的過程中,只有依靠團隊的組織和更強大的自動化能力才能幫助業務提效,幫助客戶構建更加堅實的基礎設施,讓企業更專注於產品的研發。這不僅僅是阿里雲作爲雲平臺的責任與使命,同樣也是行業共同努力的方向。"}]}]}
發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章