Using a single Hive warehouse for all EMR (Hadoop) clusters


2020-02-22 19:58

As EMR/Hadoop clusters are transient, tracking databases and tables across clusters can be difficult. Instead of keeping a separate warehouse directory on each cluster, you can use a single permanent Hive warehouse shared by all EMR clusters. S3 is a good choice because it is persistent storage with a robust architecture that provides redundancy and read-after-write consistency.

For each cluster:

This can be configured using the hive.metastore.warehouse.dir property in hive-site.xml.

<property>
  <name>hive.metastore.warehouse.dir</name>
  <value>s3n://bucket/hive_warehouse</value>
  <description>location of default database for the warehouse</description>
</property>

You may need to update this setting on all nodes.
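On EMR, one way to avoid editing hive-site.xml by hand on every node is to supply the same property as a configuration classification when the cluster is created. The following is a minimal sketch that builds such a configuration object; the helper name `hive_site_classification` is hypothetical, and the bucket path is the placeholder used in this post.

```python
import json


def hive_site_classification(warehouse_dir):
    """Build an EMR configuration list that sets the Hive warehouse directory.

    The "hive-site" classification maps to properties in hive-site.xml.
    The function name is illustrative and not part of any AWS library.
    """
    return [
        {
            "Classification": "hive-site",
            "Properties": {
                "hive.metastore.warehouse.dir": warehouse_dir,
            },
        }
    ]


# Placeholder bucket path taken from the example above.
config = hive_site_classification("s3n://bucket/hive_warehouse")
print(json.dumps(config, indent=2))
```

The printed JSON could be saved to a file and supplied at cluster creation, for example through the `--configurations` option of `aws emr create-cluster` or the `Configurations` parameter of boto3's `run_job_flow`.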

For a single Hive session:

This can be configured with a command like set hive.metastore.warehouse.dir=s3n://bucket/hive_warehouse;

or by starting the Hive CLI with the option -hiveconf hive.metastore.warehouse.dir=s3n://bucket/hive_warehouse

Note that with the above configuration, databases and tables that do not specify their own location will be stored on S3 under a path like s3://bucket/hive_warehouse/myHiveDatabase.db/
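To illustrate how those paths are derived, here is a small sketch of the default layout. It assumes no explicit LOCATION is given for the database or table, and it uses names exactly as passed in (Hive may normalize identifiers to lowercase). The helper names `database_path` and `table_path` are hypothetical.

```python
def database_path(warehouse_dir, database):
    """Return the default directory for a database under the warehouse.

    Assumption: the built-in "default" database lives directly in the
    warehouse directory; other databases get a "<name>.db" subdirectory.
    """
    base = warehouse_dir.rstrip("/")
    if database == "default":
        return base + "/"
    return f"{base}/{database}.db/"


def table_path(warehouse_dir, database, table):
    """Return the default directory for a managed table in a database."""
    return f"{database_path(warehouse_dir, database)}{table}/"


warehouse = "s3://bucket/hive_warehouse"
print(database_path(warehouse, "myHiveDatabase"))
print(table_path(warehouse, "myHiveDatabase", "events"))
print(table_path(warehouse, "default", "events"))
```

Here `events` is only an example table name; tables created in the default database sit directly under the warehouse directory rather than inside a `.db` folder.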
