HBase 官方文檔中文版

原文出自:http://abloz.com/hbase/book.html

Apache HBase™ 參考指南

HBase 官方文檔中文版

Revision History
Revision 0.95-SNAPSHOT 2012-12-03T13:38
中文版翻譯整理 周海漢


譯者:HBase新版 0.95 文檔和0.90版相比,變化較大,文檔補充更新了很多內容,章節調整較大。本翻譯文檔的部分工作基於顏開工作。英文原文地址在此處。舊版0.90版由顏開翻譯文檔在此處。0.95版翻譯最後更新請到此處http://abloz.com/hbase/book.html )瀏覽。反饋和參與請到此處 (https://code.google.com/p/hbasedoc-cn/)或訪問我的blog(http://abloz.com),或給我發email。

最終版生成pdf供下載。

貢獻者:

周海漢郵箱:[email protected], 網址:http://abloz.com/
顏開郵箱: [email protected], 網址:http://www.yankay.com/


摘要

這是 Apache HBase (TM)的官方文檔。 HBase是一個分佈式,版本化,面向列的數據庫,構建在 Apache Hadoop和 Apache ZooKeeper之上。

 


目錄

1. 入門
1.1. 介紹
1.2. 快速開始
2. Apache HBase (TM)配置
2.1. 基礎條件
2.2. HBase 運行模式: 獨立和分佈式
2.3. 配置文件
2.4. 配置示例
2.5. 重要配置
 
3. 升級
3.1. 從 0.94.x 升級到 0.96.x
3.2. 從 0.92.x 升級到 0.94.x
3.3. 從 0.90.x 升級到 0.92.x
3.4. 從0.20x或0.89x升級到0.90.x
 
4. HBase Shell
4.1. 使用腳本
4.2. Shell 技巧
5. 數據模型
5.1. 概念視圖
5.2. 物理視圖
5.3. 表
5.4. 行
5.5. 列族
5.6. Cells
5.7. Data Model Operations
5.8. 版本
5.9. 排序
5.10. 列元數據
5.11. Joins
6. HBase 和 Schema 設計
6.1. Schema 創建
6.2. column families的數量
6.3. Rowkey 設計
6.4. 版本數量
6.5. 支持的數據類型
6.6. Joins
6.7. 生存時間 (TTL)
6.8. 保留刪除的單元
6.9. 第二索引和替代查詢路徑
6.10. 模式設計對決
6.11. 操作和性能配置選項
6.12. 限制
7. HBase 和 MapReduce
7.1. Map-Task 分割
7.2. HBase MapReduce 示例
7.3. 在MapReduce工作中訪問其他 HBase 表
7.4. 推測執行
8. HBase安全
8.1. 安全客戶端訪問 HBase
8.2. 訪問控制
9. 架構
9.1. 概述
9.2. 目錄表
9.3. 客戶端
9.4. 客戶請求過濾器
9.5. Master
9.6. RegionServer
9.7. 分區(Regions)
9.8. 批量加載
9.9. HDFS
10. 外部 APIs
10.1. 非Java語言和 JVM交互
10.2. REST
10.3. Thrift
10.4. C/C++ Apache HBase Client
11. 性能調優
11.1. 操作系統
11.2. 網絡
11.3. Java
11.4. HBase 配置
11.5. ZooKeeper
11.6. Schema 設計
11.7. 寫到 HBase
11.8. 從 HBase讀取
11.9. 從 HBase刪除
11.10. HDFS
11.11. Amazon EC2
11.12. 案例
12. 故障排除和調試 HBase
12.1. 通用指引
12.2. Logs
12.3. 資源
12.4. 工具
12.5. 客戶端
12.6. MapReduce
12.7. NameNode
12.8. 網絡
12.9. RegionServer
12.10. Master
12.11. ZooKeeper
12.12. Amazon EC2
12.13. HBase 和 Hadoop 版本相關
12.14. 案例
13. 案例研究
13.1. 概要
13.2. Schema 設計
13.3. 性能/故障排除
14. HBase 運維管理
14.1. HBase 工具和實用程序
14.2. 分區管理
14.3. 節點管理
14.4. HBase 度量(Metrics)
14.5. HBase 監控
14.6. Cluster 複製
14.7. HBase 備份
14.8. 容量規劃
15. 創建和開發 HBase
15.1. HBase 倉庫
15.2. IDEs
15.3. 創建 HBase
15.4. 添加 Apache HBase 發行版到Apache的 Maven Repository
15.5. 更新 hbase.apache.org
15.6. 測試
15.7. Maven 創建命令
15.8. 加入
15.9. 開發
15.10. 提交補丁
16. ZooKeeper
16.1. Using existing ZooKeeper ensemble
16.2. SASL Authentication with ZooKeeper
17. Community
17.1. Decisions
17.2. Community Roles
A. FAQ
B. 深入hbck
B.1. 運行 hbck 以查找不一致
B.2. 不一致(Inconsistencies)
B.3. 局部修補
B.4. 分區重疊修補
C. HBase中的壓縮
C.1. CompressionTest 工具
C.2. hbase.regionserver.codecs
C.3. LZO
C.4. GZIP
C.5. SNAPPY
C.6. 修改壓縮 Schemes
D. YCSB: Yahoo! 雲服務評估和 HBase
 
E. HFile 格式版本 2
E.1. Motivation
E.2. HFile 格式版本 1 概覽
E.3. HBase 文件格式帶 inline blocks (version 2)
F. HBase的其他信息
F.1. HBase 視頻
F.2. HBase 展示 (Slides)
F.3. HBase 論文
F.4. HBase 網站
F.5. HBase 書籍
F.6. Hadoop 書籍
G. HBase 歷史
 
H. HBase 和 Apache 軟件基金會(ASF)
H.1. ASF開發進程
H.2. ASF 報告板
I. Enabling Dapper-like Tracing in HBase
I.1. SpanReceivers
I.2. Client Modifications
詞彙表
 

這本書是 HBase 的官方指南。 版本爲 0.95-SNAPSHOT 。可以在HBase官網上找到它。也可以在 javadocJIRA 和 wiki 找到更多的資料。

此書正在編輯中。 可以向 HBase 官方提供補丁JIRA.

這個版本系譯者水平限制,沒有理解清楚或不需要翻譯的地方保留英文原文。

最前面的話

若這是你第一次踏入分佈式計算的精彩世界,你會感到這是一個有趣的年代。分佈式計算是很難的,做一個分佈式系統需要很多軟硬件和網絡的技能。你的集羣可能會因爲各式各樣的錯誤發生故障,比如HBase本身的Bug,錯誤的配置(包括操作系統),硬件的故障(網卡和磁盤甚至內存)。如果你一直在寫單機程序的話,你需要重新開始學習。這裏就是一個好的起點: 分佈式計算的謬論.

Chapter 1. 入門

1.1. 介紹

Section 1.2, “快速開始”會介紹如何運行一個單機版的HBase,它運行在本地磁盤上。 Section 2, “配置” 會介紹如何運行一個分佈式的HBase,它運行在HDFS上。

1.2. 快速開始

本指南介紹了在單機安裝HBase的方法。會引導你通過shell創建一個表,插入一行,然後刪除它,最後停止HBase。只要10分鐘就可以完成以下的操作。

1.2.1. 下載解壓最新版本

選擇一個 Apache 下載鏡像,下載 HBase Releases. 點擊 stable目錄,然後下載後綴爲 .tar.gz 的文件; 例如 hbase-0.95-SNAPSHOT.tar.gz.

解壓縮,然後進入到解壓後的目錄.

$ tar xfz hbase-0.95-SNAPSHOT.tar.gz
$ cd hbase-0.95-SNAPSHOT

現在你已經可以啓動HBase了。但是你可能需要先編輯 conf/hbase-site.xml 去配置hbase.rootdir,來選擇HBase將數據寫到哪個目錄 .

<?xml version="1.0"?> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration> <property> <name>hbase.rootdir</name> <value>file:///DIRECTORY/hbase</value> </property> </configuration>

將 DIRECTORY 替換成你期望寫文件的目錄. 默認 hbase.rootdir 是指向 /tmp/hbase-${user.name} ,也就說你會在重啓後丟失數據(重啓的時候操作系統會清理/tmp目錄)

1.2.2. 啓動 HBase

現在啓動HBase:

$ ./bin/start-hbase.sh
starting Master, logging to logs/hbase-user-master-example.org.out

現在你運行的是單機模式的HBase。所有的服務都運行在一個JVM上,包括HBase和ZooKeeper。HBase的日誌放在logs目錄,當你啓動出問題的時候,可以檢查這個日誌。

是否安裝了 java ?

你需要確認安裝了Oracle的1.6 版本的java.如果你在命令行鍵入java有反應說明你安裝了Java。如果沒有裝,你需要先安裝,然後編輯conf/hbase-env.sh,將其中的JAVA_HOME指向到你Java的安裝目錄。

1.2.3. Shell 練習

shell連接你的HBase

$ ./bin/hbase shell
HBase Shell; enter 'help<RETURN>' for list of supported commands.
Type "exit<RETURN>" to leave the HBase Shell
Version: 0.90.0, r1001068, Fri Sep 24 13:55:42 PDT 2010

hbase(main):001:0>

輸入 help 然後 <RETURN> 可以看到一列shell命令。這裏的幫助很詳細,要注意的是表名,行和列需要加引號。

創建一個名爲 test 的表,這個表只有一個 列族 爲 cf。可以列出所有的表來檢查創建情況,然後插入些值。

hbase(main):003:0> create 'test', 'cf'
0 row(s) in 1.2200 seconds
hbase(main):003:0> list 'table'
test
1 row(s) in 0.0550 seconds
hbase(main):004:0> put 'test', 'row1', 'cf:a', 'value1'
0 row(s) in 0.0560 seconds
hbase(main):005:0> put 'test', 'row2', 'cf:b', 'value2'
0 row(s) in 0.0370 seconds
hbase(main):006:0> put 'test', 'row3', 'cf:c', 'value3'
0 row(s) in 0.0450 seconds

以上我們分別插入了3行。第一個行key爲row1, 列爲 cf:a, 值是 value1。HBase中的列是由 列族前綴和列的名字組成的,以冒號間隔。例如這一行的列名就是a.

檢查插入情況.

Scan這個表,操作如下

hbase(main):007:0> scan 'test'
ROW        COLUMN+CELL
row1       column=cf:a, timestamp=1288380727188, value=value1
row2       column=cf:b, timestamp=1288380738440, value=value2
row3       column=cf:c, timestamp=1288380747365, value=value3
3 row(s) in 0.0590 seconds

Get一行,操作如下

hbase(main):008:0> get 'test', 'row1'
COLUMN     CELL
cf:a       timestamp=1288380727188, value=value1
1 row(s) in 0.0400 seconds

disable 再 drop 這張表,可以清除你剛剛的操作

hbase(main):012:0> disable 'test'
0 row(s) in 1.0930 seconds
hbase(main):013:0> drop 'test'
0 row(s) in 0.0770 seconds

關閉shell

hbase(main):014:0> exit

1.2.4. 停止 HBase

運行停止腳本來停止HBase.

$ ./bin/stop-hbase.sh
stopping hbase...............

1.2.5. 下一步該做什麼

以上步驟僅僅適用於實驗和測試。接下來你可以看 Section 2., “配置” ,我們會介紹不同的HBase運行模式,運行分佈式HBase中需要的軟件 和如何配置。

2. 配置

 

相對於前面的快速開始,本章是更爲詳細的配置指導。

HBase有如下需要,請仔細閱讀本章節以確保所有的需要都被滿足。如果需求沒有能滿足,就有可能遇到莫名其妙的錯誤甚至丟失數據。

HBase使用和Hadoop一樣的配置系統。要配置部署,先編輯conf/hbase-env.sh文件中的環境變量——該配置文件主要供啓動腳本在啓動集羣時讀取——然後在XML文件中增加配置,以覆蓋HBase缺省配置,告訴HBase用什麼文件系統、全部ZooKeeper位置 [1] 。

在分佈模式下運行時,在編輯HBase配置文件之後,確認將conf目錄複製到集羣中的每個節點。HBase不會自動同步。使用rsync.

[1] 小心編輯XML。確認關閉所有元素。採用 xmllint 或類似工具確認文檔編輯後是良好格式化的。

2.1. 基礎條件

This section lists required services and some required system configuration.

2.1.1. Java

和Hadoop一樣,HBase需要Oracle版本的Java6.除了那個有問題的u18版本其他的都可以用,最好用最新的。

2.1.2. 操作系統

2.1.2.1. ssh

必須安裝ssh , sshd 也必須運行,這樣Hadoop的腳本纔可以遠程操控其他的Hadoop和HBase進程。ssh之間必須都打通,不用密碼都可以登錄,詳細方法可以Google一下 ("ssh passwordless login").

2.1.2.2. DNS

HBase使用本地 hostname 來獲得IP地址. 正反向的DNS都是可以的.

如果你的機器有多個接口,HBase會使用hostname指向的主接口.

如果還不夠,你可以設置 hbase.regionserver.dns.interface 來指定主接口。當然你的整個集羣的配置文件都必須一致,每個主機都使用相同的網絡接口

還有一種方法是設置 hbase.regionserver.dns.nameserver來指定nameserver,不使用系統帶的.

2.1.2.3. Loopback IP

HBase expects the loopback IP address to be 127.0.0.1. Ubuntu and some other distributions, for example, will default to 127.0.1.1 and this will cause problems for you.

/etc/hosts should look something like this:

127.0.0.1 localhost
127.0.0.1 ubuntu.ubuntu-domain ubuntu

2.1.2.4. NTP

集羣的時鐘要保證基本的一致。稍有不一致是可以容忍的,但是很大的不一致會造成奇怪的行爲。 運行 NTP 或者其他什麼東西來同步你的時間.

如果你查詢的時候或者是遇到奇怪的故障,可以檢查一下系統時間是否正確!

2.1.2.5.  ulimit 和 nproc

HBase是數據庫,會在同一時間使用很多的文件句柄。大多數linux系統使用的默認值1024是不能滿足的,會導致FAQ: Why do I see "java.io.IOException...(Too many open files)" in my logs?異常。還可能會發生這樣的異常

2010-04-06 03:04:37,542 INFO org.apache.hadoop.hdfs.DFSClient: Exception in createBlockOutputStream java.io.EOFException
2010-04-06 03:04:37,542 INFO org.apache.hadoop.hdfs.DFSClient: Abandoning block blk_-6935524980745310745_1391901

所以你需要修改你的最大文件句柄限制。可以設置到10k. 你還需要修改 hbase 用戶的 nproc,如果過低會造成 OutOfMemoryError異常。 [2] [3].

需要澄清的是,這兩個設置是針對操作系統的,不是HBase本身的。有一個常見的錯誤是HBase運行的用戶,和設置最大值的用戶不是一個用戶。在HBase啓動的時候,第一行日誌會顯示ulimit信息,所以你最好檢查一下。 [4]

2.1.2.5.1. 在Ubuntu上設置ulimit

如果你使用的是Ubuntu,你可以這樣設置:

在文件 /etc/security/limits.conf 添加一行,如:

hadoop - nofile 32768

可以把 hadoop 替換成你運行HBase和Hadoop的用戶。如果你用兩個用戶,你就需要配兩個。還有配nproc hard 和 soft limits. 如:

hadoop soft/hard nproc 32000


在 /etc/pam.d/common-session 加上這一行:

session required pam_limits.so

否則在 /etc/security/limits.conf上的配置不會生效.

還要註銷再登錄,這些配置才能生效!

2.1.2.6. Windows

HBase沒有怎麼在Windows下測試過。所以不推薦在Windows下運行.

如果你實在是想運行,需要安裝Cygwin 並虛擬一個unix環境.詳情請看 Windows 安裝指導 . 或者 搜索郵件列表找找最近的關於windows的注意點

2.1.3. hadoop

請完整閱讀本節:

請閱讀本節到末尾. 下面我們要先費些篇幅討論 Hadoop 的各個版本,然後說明 HBase 必須如何配合特定的 Hadoop 版本才能正常工作。

除非運行在實現了持久化同步(sync)的HDFS上,HBase 將丟失所有數據。 Hadoop 0.20.2, Hadoop 0.20.203.0,及 Hadoop 0.20.204.0 不具有上述特性。當前Hadoop僅在Hadoop 0.20.205.x 或更高版本--包含hadoop 1.0.0 --具有持久化sync. Sync 必須顯式開啓。即 dfs.support.append 同時在客戶端和服務器端設爲真,客戶端: hbase-site.xml ,服務器端: hdfs-site.xml ( HBase需要的同步措施是一個附加代碼路徑的子集)

<property>
  <name>dfs.support.append</name>
  <value>true</value>
</property>

修改後必須重啓集羣。忽略部分註釋,可以在 hdfs-default.xml 找到 dfs.support.append 的配置描述;描述說沒有啓用是因爲 “... bugs in the 'append code' and is not supported in any production cluster.”. 該註釋已經過時。我確信有bug,但 sync/append 代碼已經被運行於大容量產品部署,並且很多商業產品的Hadoop缺省是打開的。 [7] [8][9].

你還可以用 Cloudera的 CDH3 或  MapR 。 Cloudera 的CDH3 是Apache hadoop 0.20.x的補丁增強,包含所有  branch-0.20-append  附加的持久化Sync. 使用最新的產品化版的 CDH3.

MapR 包含一個商業的, 重新實現的 HDFS. 具有持久化 sync及一些有趣特性,是現在 Apache Hadoop不具有的.  M3 產品免費並且無限制。

因爲HBase建立在Hadoop之上,所以他用到了hadoop.jar,這個Jar在 lib 裏面。這個jar是hbase自己打了branch-0.20-append 補丁的hadoop.jar. Hadoop使用的hadoop.jar和HBase使用的 必須 一致。所以你需要將 HBase lib 目錄下的hadoop.jar替換成Hadoop裏面的那個,防止版本衝突。比方說CDH的版本沒有HDFS-724而branch-0.20-append裏面有,這個HDFS-724補丁修改了RPC協議。如果不替換,就會有版本衝突,繼而造成嚴重的出錯,Hadoop會看起來掛了。

Packaging and Apache BigTop

Apache Bigtop is an umbrella for packaging and tests of the Apache Hadoop ecosystem, including Apache HBase. Bigtop performs testing at various levels (packaging, platform, runtime, upgrade, etc...), developed by a community, with a focus on the system as a whole, rather than individual projects. We recommend installing Apache HBase packages as provided by a Bigtop release rather than rolling your own piecemeal integration of various component releases.

 

2.1.3.1. Hadoop 安全性

HBase運行在Hadoop 0.20.x上,就可以使用其中的安全特性 -- 只要你用這兩個版本0.20S 和CDH3B3,然後把hadoop.jar替換掉就可以了.

2.1.3.2. dfs.datanode.max.xcievers

一個 Hadoop HDFS Datanode 有一個同時處理文件的上限. 這個參數叫 xcievers (Hadoop的作者把這個單詞拼錯了). 在你加載之前,先確認下你有沒有配置這個文件conf/hdfs-site.xml裏面的xceivers參數,至少要有4096:

<property>
  <name>dfs.datanode.max.xcievers</name>
  <value>4096</value>
</property>

對於HDFS修改配置要記得重啓.

如果沒有這一項配置,你可能會遇到奇怪的失敗。你會在Datanode的日誌中看到xcievers exceeded,但是運行起來會報 missing blocks錯誤。例如:

10/12/08 20:10:31 INFO hdfs.DFSClient: Could not obtain block blk_XXXXXXXXXXXXXXXXXXXXXX_YYYYYYYY from any node: java.io.IOException: No live nodes contain current block. Will get new block locations from namenode and retry... [5]

2.2. HBase運行模式:單機和分佈式

HBase有兩個運行模式: Section 2.4.1, “單機模式” 和 Section 2.4.2, “分佈式模式”. 默認是單機模式,如果要分佈式模式你需要編輯 conf 文件夾中的配置文件.

不管是什麼模式,你都需要編輯 conf/hbase-env.sh來告知HBase java的安裝路徑.在這個文件裏你還可以設置HBase的運行環境,諸如 heapsize和其他 JVM有關的選項, 還有Log文件地址,等等. 設置 JAVA_HOME指向 java安裝的路徑.

2.2.1. 單機模式

這是默認的模式,在 Section 1.2, “快速開始” 一章中介紹的就是這個模式. 在單機模式中,HBase使用本地文件系統,而不是HDFS ,所有的服務和本地的ZooKeeper都運行在同一個JVM中。ZooKeeper監聽一個端口,這樣客戶端就可以連接HBase了。

2.2.2. 分佈式模式

分佈式模式分兩種。僞分佈式模式是把所有進程運行在一臺機器上,但各自運行在獨立的JVM裏;而完全分佈式模式是把整個服務分佈在集羣的各個節點上 [6].

分佈式模式需要使用 Hadoop Distributed File System (HDFS).可以參見 HDFS需求和指導來獲得關於安裝HDFS的指導。在操作HBase之前,你要確認HDFS可以正常運作。

不論是僞分佈式模式還是完全分佈式模式,安裝之後你都需要確認配置是否正確。這兩個模式可以使用同一個驗證腳本Section 2.2.3, “運行和確認你的安裝”。

2.2.2.1. 僞分佈式模式

僞分佈式模式是一個相對簡單的分佈式模式。這個模式是用來測試的。不能把這個模式用於生產環境,也不能用於測試性能。

你確認HDFS安裝成功之後,就可以先編輯 conf/hbase-site.xml。在這個文件你可以加入自己的配置,這個配置會覆蓋 Section 2.6.1.1, “HBase 默認配置” 和 Section 2.2.2.2.3, “HDFS客戶端配置”. 運行HBase需要設置hbase.rootdir 屬性.該屬性是指HBase在HDFS中使用的目錄的位置。例如,要使用 /hbase 目錄,讓namenode 監聽localhost的9000端口,只保留一份數據拷貝(HDFS默認是3份拷貝)。可以在 hbase-site.xml 寫上如下內容

<configuration>
  ...
  <property>
    <name>hbase.rootdir</name>
    <value>hdfs://localhost:9000/hbase</value>
    <description>The directory shared by RegionServers.
    </description>
  </property>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
    <description>The replication count for HLog & HFile storage. Should not be greater than HDFS datanode count.
    </description>
  </property>
  ...
</configuration>

Note

讓HBase自己創建 hbase.rootdir

Note

上面我們綁定到 localhost. 也就是說除了本機,其他機器連不上HBase。所以你需要設置成別的,才能使用它。

現在可以跳到 Section 2.2.3, “運行和確認你的安裝” 來運行和確認你的僞分佈式模式安裝了。 [7]

2.2.2.1.1. 僞分佈模式配置文件

下面是僞分佈模式設置的配置文件示例。

hdfs-site.xml
<configuration>
  ...
  <property>
    <name>dfs.name.dir</name>
    <value>/Users/local/user.name/hdfs-data-name</value>
  </property>
  <property>
    <name>dfs.data.dir</name>
    <value>/Users/local/user.name/hdfs-data</value>
  </property>
  <property>
    <name>dfs.replication</name>
    <value>1</value>
  </property>
  ...
</configuration>
hbase-site.xml
<configuration>
  ...
  <property>
    <name>hbase.rootdir</name>
    <value>hdfs://localhost:8020/hbase</value>
  </property>
  <property>
    <name>hbase.zookeeper.quorum</name>
    <value>localhost</value>
  </property>
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
  ...
</configuration>
2.2.2.1.2. 僞分佈模式附加
2.2.2.1.2.1. 啓動

啓動初始 HBase 集羣...

% bin/start-hbase.sh

在同一服務器啓動額外備份主服務器

% bin/local-master-backup.sh start 1

... '1' 表示使用端口 60001 和 60011,該備份主服務器的log文件放在 logs/hbase-${USER}-1-master-${HOSTNAME}.log.

啓動多個備份主服務器...

% bin/local-master-backup.sh start 2 3

可以啓動到 9 個備份服務器 (總數10 個).

啓動更多 regionservers...

% bin/local-regionservers.sh start 1

'1' 表示使用端口 60201 & 60301 ,log文件在 logs/hbase-${USER}-1-regionserver-${HOSTNAME}.log.

在剛運行的regionserver上增加 4 個額外 regionservers ...

% bin/local-regionservers.sh start 2 3 4 5

支持到 99 個額外regionservers (總100個).

2.2.2.1.2.2. 停止

假設想停止備份主服務器 # 1, 運行...

% cat /tmp/hbase-${USER}-1-master.pid |xargs kill -9

注意,執行 bin/local-master-backup.sh stop 1 會連同主服務器一起嘗試停止整個集羣。

停止單獨 regionserver, 運行...

% bin/local-regionservers.sh stop 1

2.2.2.2. 完全分佈式模式

要想運行完全分佈式模式,你要進行如下配置,先在 hbase-site.xml, 加一個屬性 hbase.cluster.distributed 設置爲 true 然後把 hbase.rootdir 設置爲HDFS的NameNode的位置。 例如,你的namenode運行在namenode.example.org,端口是9000 你期望的目錄是 /hbase,使用如下的配置

<configuration> ... <property> <name>hbase.rootdir</name> <value>hdfs://namenode.example.org:9000/hbase</value> <description>The directory shared by RegionServers. </description> </property> <property> <name>hbase.cluster.distributed</name> <value>true</value> <description>The mode the cluster will be in. Possible values are false: standalone and pseudo-distributed setups with managed Zookeeper true: fully-distributed with unmanaged Zookeeper Quorum (see hbase-env.sh) </description> </property> ... </configuration>
2.2.2.2.1. regionservers

完全分佈式模式還需要修改conf/regionservers. Section 2.7.1.2, “regionservers” 列出了你希望運行的全部 HRegionServer,一行寫一個host (就像Hadoop裏面的 slaves 一樣). 列在這裏的server會隨着集羣的啓動而啓動,集羣的停止而停止.

2.2.2.2.2. ZooKeeper 和 HBase

 

2.2.2.2.3. HDFS客戶端配置

如果你在Hadoop集羣上做了HDFS 客戶端配置,例如你的HDFS客戶端的配置和服務端的不一樣,按照如下的方法之一配置,HBase就能看到你的配置信息:

  • hbase-env.sh裏將HBASE_CLASSPATH環境變量加上HADOOP_CONF_DIR 。

  • ${HBASE_HOME}/conf下面加一個 hdfs-site.xml (或者 hadoop-site.xml) ,最好是軟連接

  • 如果你的HDFS客戶端的配置不多的話,你可以把這些加到 hbase-site.xml上面.

例如HDFS的配置 dfs.replication.你希望複製5份,而不是默認的3份。如果你不照上面的做的話,HBase只會複製3份。

2.2.3. 運行和確認你的安裝

首先確認你的HDFS是運行着的。你可以運行HADOOP_HOME中的 bin/start-hdfs.sh 來啓動HDFS.你可以通過put命令來測試放一個文件,然後用get命令來讀這個文件。通常情況下HBase是不會用到mapreduce的,所以不需要檢查這些。

如果你自己管理ZooKeeper集羣,你需要確認它是運行着的。如果是HBase託管,ZooKeeper會隨HBase啓動。

用如下命令啓動HBase:

bin/start-hbase.sh
這個腳本在HBASE_HOME目錄裏面。

你現在已經啓動HBase了。HBase把log記在 logs 子目錄裏面. 當HBase啓動出問題的時候,可以看看Log.

HBase也有一個界面,上面會列出重要的屬性。默認是在Master的60010端口上 (HBase RegionServers 會默認綁定 60020端口,在端口60030上有一個展示信息的界面)。如果Master運行在 master.example.org,端口是默認的話,你可以用瀏覽器在 http://master.example.org:60010 看到主界面。

一旦HBase啓動,參見Section 1.2.3, “Shell 練習”可以看到如何建表,插入數據,scan你的表,還有disable這個表,最後把它刪掉。

可以在HBase Shell停止HBase

$ ./bin/stop-hbase.sh stopping hbase...............

停止操作需要一些時間,你的集羣越大,停的時間可能會越長。如果你正在運行一個分佈式的操作,要確認在HBase徹底停止之前,Hadoop不能停.



 

2.3. 配置文件

 

HBase的配置系統和Hadoop一樣。在conf/hbase-env.sh配置系統的部署信息和環境變量。 -- 這個配置會被啓動shell使用 -- 然後在XML文件裏配置信息,覆蓋默認的配置。告知HBase使用什麼目錄地址,ZooKeeper的位置等等信息。 [10] .

當你使用分佈式模式的時候,每當編輯完一個文件之後,記得要把這個文件複製到整個集羣的conf 目錄下。HBase不會幫你做這些,你得用 rsync.

2.3.1. hbase-site.xml 和 hbase-default.xml

正如Hadoop放置HDFS的配置文件hdfs-site.xml,HBase的配置文件是 conf/hbase-site.xml. 你可以在 Section 2.3.1.1, “HBase 默認配置”找到配置的屬性列表。你也可以查看代碼裏面的hbase-default.xml文件,它在src/main/resources目錄下。

不是所有的配置項都會出現在 hbase-default.xml 裏。一些很少有人會改動的配置只存在於代碼中,要了解這些配置,唯一的辦法是閱讀源代碼本身。

要注意的是,要重啓集羣才能使配置生效。

2.3.1.1. HBase 默認配置

HBase 默認配置

該文檔是用hbase默認配置文件生成的,文件源是 hbase-default.xml

hbase.rootdir

這個目錄是region server的共享目錄,用來持久化HBase。URL需要是'完全正確'的,還要包含文件系統的scheme。例如,要表示hdfs中的'/hbase'目錄,namenode 運行在namenode.example.org的9000端口,則需要設置爲hdfs://namenode.example.org:9000/hbase。默認情況下HBase是寫到/tmp的。不改這個配置,數據會在重啓的時候丟失。

默認: file:///tmp/hbase-${user.name}/hbase

hbase.master.port

HBase的Master的端口.

默認: 60000

hbase.cluster.distributed

HBase的運行模式。false是單機模式,true是分佈式模式。若爲false,HBase和Zookeeper會運行在同一個JVM裏面。

默認: false

hbase.tmp.dir

本地文件系統的臨時文件夾。可以修改到一個更爲持久的目錄上。(/tmp會在重啓時清楚)

默認: /tmp/hbase-${user.name}

hbase.master.info.port

HBase Master web 界面端口. 設置爲-1 意味着你不想讓他運行。

默認: 60010

hbase.master.info.bindAddress

HBase Master web 界面綁定的地址

默認: 0.0.0.0

hbase.client.write.buffer

HTable客戶端的寫緩衝的默認大小。這個值越大,需要消耗的內存越大。因爲緩衝在客戶端和服務端都有實例,所以需要消耗客戶端和服務端兩個地方的內存。得到的好處是,可以減少RPC的次數。可以這樣估算服務器端被佔用的內存: hbase.client.write.buffer * hbase.regionserver.handler.count

默認: 2097152
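
按缺省值粗略估算一下這個公式:2,097,152 字節的寫緩衝 × 10 個 handler ≈ 20 MB 的服務端內存(這只是一個示意性的估算,實際佔用還取決於併發客戶端的數量)。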

hbase.regionserver.port

HBase RegionServer綁定的端口

默認: 60020

hbase.regionserver.info.port

HBase RegionServer web 界面綁定的端口。設置爲 -1 意味着你不想運行 RegionServer 界面.

默認: 60030

hbase.regionserver.info.port.auto

Master或RegionServer是否要動態搜一個可以用的端口來綁定界面。當hbase.regionserver.info.port已經被佔用的時候,可以搜一個空閒的端口綁定。這個功能在測試的時候很有用。默認關閉。

默認: false

hbase.regionserver.info.bindAddress

HBase RegionServer web 界面的IP地址

默認: 0.0.0.0

hbase.regionserver.class

RegionServer 使用的接口。客戶端打開代理來連接region server的時候會使用到。

默認: org.apache.hadoop.hbase.ipc.HRegionInterface

hbase.client.pause

通常的客戶端暫停時間。最多的用法是客戶端在重試前的等待時間。比如失敗的get操作和region查詢操作等都很可能用到。

默認: 1000

hbase.client.retries.number

最大重試次數。例如 region查詢,Get操作,Update操作等等都可能發生錯誤,需要重試。這是最大重試錯誤的值。

默認: 10

hbase.client.scanner.caching

當調用Scanner的next方法,而值又不在緩存裏的時候,從服務端一次獲取的行數。越大的值意味着Scanner會快一些,但是會佔用更多的內存。當緩衝被佔滿的時候,next方法調用會越來越慢。慢到一定程度,可能會導致超時。例如超過了hbase.regionserver.lease.period。

默認: 1
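
下面是一個簡單的客戶端示例草稿,演示如何只針對某一次掃描調高該值(其中表名 'myTable' 和列族 'cf' 只是假設的名字,僅作說明):

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.util.Bytes;

public class ScanCachingExample {
  public static void main(String[] args) throws IOException {
    Configuration conf = HBaseConfiguration.create();
    HTable htable = new HTable(conf, "myTable");   // 假設的表名
    Scan scan = new Scan();
    scan.addFamily(Bytes.toBytes("cf"));           // 假設的列族
    scan.setCaching(100);                          // 每次 next() 從服務端預取 100 行,覆蓋缺省值 1
    ResultScanner rs = htable.getScanner(scan);
    try {
      for (Result r = rs.next(); r != null; r = rs.next()) {
        // 處理每一行 ...
      }
    } finally {
      rs.close();                                  // 一定要關閉 ResultScanner
      htable.close();
    }
  }
}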

hbase.client.keyvalue.maxsize

一個KeyValue實例的最大size.這個是用來設置存儲文件中的單個entry的大小上界。因爲一個KeyValue是不能分割的,所以可以避免因爲數據過大導致region不可分割。明智的做法是把它設爲可以被最大region size整除的數。如果設置爲0或者更小,就會禁用這個檢查。默認10MB。

默認: 10485760

hbase.regionserver.lease.period

客戶端租用HRegion server 期限,即超時閾值。單位是毫秒。默認情況下,客戶端必須在這個時間內發一條信息,否則視爲死掉。

默認: 60000

hbase.regionserver.handler.count

RegionServers受理的RPC Server實例數量。對於Master來說,這個屬性是Master受理的handler數量

默認: 10

hbase.regionserver.msginterval

RegionServer 發消息給 Master 時間間隔,單位是毫秒

默認: 3000

hbase.regionserver.optionallogflushinterval

將Hlog同步到HDFS的間隔。如果Hlog沒有積累到一定的數量,到了時間,也會觸發同步。默認是1秒,單位毫秒。

默認: 1000

hbase.regionserver.regionSplitLimit

region的數量到了這個值後就不會再分裂了。這不是一個region數量的硬性限制,但是起到了一定指導性的作用,到了這個值就該停止分裂了。默認是MAX_INT,就是說不阻止分裂。

默認: 2147483647

hbase.regionserver.logroll.period

滾動(roll)commit log的週期,不管其中有沒有寫入足夠多的數據。

默認: 3600000

hbase.regionserver.hlog.reader.impl

HLog file reader 的實現.

默認: org.apache.hadoop.hbase.regionserver.wal.SequenceFileLogReader

hbase.regionserver.hlog.writer.impl

HLog file writer 的實現.

默認: org.apache.hadoop.hbase.regionserver.wal.SequenceFileLogWriter

hbase.regionserver.thread.splitcompactcheckfrequency

region server 多久執行一次split/compaction 檢查.

默認: 20000

hbase.regionserver.nbreservationblocks

儲備的內存block的數量(譯者注:就像石油儲備一樣)。當發生out of memory 異常的時候,我們可以用這些內存在RegionServer停止之前做清理操作。

默認: 4

hbase.zookeeper.dns.interface

當使用DNS的時候,Zookeeper用來上報的IP地址的網絡接口名字。

默認: default

hbase.zookeeper.dns.nameserver

當使用DNS的時候,ZooKeeper使用的DNS的域名或者IP 地址,ZooKeeper用它來確定和master用來進行通訊的域名.

默認: default

hbase.regionserver.dns.interface

當使用DNS的時候,RegionServer用來上報的IP地址的網絡接口名字。

默認: default

hbase.regionserver.dns.nameserver

當使用DNS的時候,RegionServer使用的DNS的域名或者IP 地址,RegionServer用它來確定和master用來進行通訊的域名.

默認: default

hbase.master.dns.interface

當使用DNS的時候,Master用來上報的IP地址的網絡接口名字。

默認: default

hbase.master.dns.nameserver

當使用DNS的時候,RegionServer使用的DNS的域名或者IP 地址,Master用它來確定用來進行通訊的域名.

默認: default

hbase.balancer.period

Master執行region balancer的間隔。

默認: 300000

hbase.regions.slop

當任一regionserver有average + (average * slop)個region時會執行Rebalance

默認: 0

hbase.master.logcleaner.ttl

Hlog存在於.oldlogdir 文件夾的最長時間, 超過了就會被 Master 的線程清理掉.

默認: 600000

hbase.master.logcleaner.plugins

LogsCleaner服務會執行的一組LogCleanerDelegat。值用逗號間隔的文本表示。這些WAL/HLog cleaners會按順序調用。可以把先調用的放在前面。你可以實現自己的LogCleanerDelegat,加到Classpath下,然後在這裏寫下類的全稱。一般都是加在默認值的前面。

默認: org.apache.hadoop.hbase.master.TimeToLiveLogCleaner

hbase.regionserver.global.memstore.upperLimit

單個region server的全部memstores的最大值。超過這個值,一個新的update操作會被掛起,強制執行flush操作。

默認: 0.4

hbase.regionserver.global.memstore.lowerLimit

當強制執行flush操作的時候,當低於這個值的時候,flush會停止。默認是堆大小的 35% . 如果這個值和 hbase.regionserver.global.memstore.upperLimit 相同就意味着當update操作因爲內存限制被掛起時,會盡量少的執行flush(譯者注:一旦執行flush,值就會比下限要低,不再執行)

默認: 0.35

hbase.server.thread.wakefrequency

service工作的sleep間隔,單位毫秒。 可以作爲service線程的sleep間隔,比如log roller.

默認: 10000

hbase.hregion.memstore.flush.size

當memstore的大小超過這個值的時候,會flush到磁盤。這個值被一個線程每隔hbase.server.thread.wakefrequency檢查一下。

默認: 67108864

hbase.hregion.preclose.flush.size

當一個region中的memstore的大小大於這個值的時候,我們又觸發了close.會先運行“pre-flush”操作,清理這個需要關閉的memstore,然後將這個region下線。當一個region下線了,我們無法再進行任何寫操作。如果一個memstore很大的時候,flush操作會消耗很多時間。"pre-flush"操作意味着在region下線之前,會先把memstore清空。這樣在最終執行close操作的時候,flush操作會很快。

默認: 5242880

hbase.hregion.memstore.block.multiplier

如果memstore有hbase.hregion.memstore.block.multiplier倍數的hbase.hregion.flush.size的大小,就會阻塞update操作。這是爲了預防在update高峯期會導致的失控。如果不設上界,flush的時候會花很長的時間來合併或者分割,最壞的情況就是引發out of memory異常。(譯者注:內存操作的速度和磁盤不匹配,需要等一等。原文似乎有誤)

默認: 2

hbase.hregion.memstore.mslab.enabled

體驗特性:啓用memStore分配本地緩衝區。這個特性是爲了防止在大量寫負載的時候堆的碎片過多。這可以減少GC操作的頻率。(GC有可能會Stop the world)(譯者注:實現的原理相當於預分配內存,而不是每一個值都要從堆裏分配)

默認: false

hbase.hregion.max.filesize

最大HStoreFile大小。若某個Column families的HStoreFile增長達到這個值,這個HRegion會被切割成兩個。 Default: 256M.

默認: 268435456

hbase.hstore.compactionThreshold

當一個HStore含有多於這個值的HStoreFiles(每一個memstore flush產生一個HStoreFile)的時候,會執行一個合併操作,把這HStoreFiles寫成一個。這個值越大,需要合併的時間就越長。

默認: 3

hbase.hstore.blockingStoreFiles

當一個HStore含有多於這個值的HStoreFiles(每一個memstore flush產生一個HStoreFile)的時候,會執行一個合併操作,update會阻塞直到合併完成,直到超過了hbase.hstore.blockingWaitTime的值

默認: 7

hbase.hstore.blockingWaitTime

hbase.hstore.blockingStoreFiles所限制的StoreFile數量會導致update阻塞,這個時間是來限制阻塞時間的。當超過了這個時間,HRegion會停止阻塞update操作,不過合併還有沒有完成。默認爲90s.

默認: 90000

hbase.hstore.compaction.max

每個“小”合併的HStoreFiles最大數量。

默認: 10

hbase.hregion.majorcompaction

一個Region中的所有HStoreFile的major compactions的時間間隔。默認是1天。 設置爲0就是禁用這個功能。

默認: 86400000

hbase.mapreduce.hfileoutputformat.blocksize

MapReduce中HFileOutputFormat可以寫 storefiles/hfiles. 這個值是hfile的blocksize的最小值。通常在HBase寫Hfile的時候,blocksize是由table schema(HColumnDescriptor)決定的,但是在mapreduce寫的時候,我們無法獲取schema中blocksize。這個值越小,你的索引就越大,你隨機訪問需要獲取的數據就越小。如果你的cell都很小,而且你需要更快的隨機訪問,可以把這個值調低。

默認: 65536

hfile.block.cache.size

分配給HFile/StoreFile的block cache佔最大堆(-Xmx setting)的比例。默認是20%,設置爲0就是不分配。

默認: 0.2

hbase.hash.type

哈希函數使用的哈希算法。可以選擇兩個值:: murmur (MurmurHash) 和 jenkins (JenkinsHash). 這個哈希是給 bloom filters用的.

默認: murmur

hbase.master.keytab.file

HMaster server驗證登錄使用的kerberos keytab 文件路徑。(譯者注:HBase使用Kerberos實現安全)

默認:

hbase.master.kerberos.principal

例如. "hbase/[email protected]". HMaster運行需要使用 kerberos principal name. principal name 可以在: user/hostname@DOMAIN 中獲取. 如果 "_HOST" 被用做hostname portion,需要使用實際運行的hostname來替代它。

默認:

hbase.regionserver.keytab.file

HRegionServer驗證登錄使用的kerberos keytab 文件路徑。

默認:

hbase.regionserver.kerberos.principal

例如. "hbase/[email protected]". HRegionServer運行需要使用 kerberos principal name. principal name 可以在: user/hostname@DOMAIN 中獲取. 如果 "_HOST" 被用做hostname portion,需要使用實際運行的hostname來替代它。在這個文件中必須要有一個entry來描述 hbase.regionserver.keytab.file

默認:

zookeeper.session.timeout

ZooKeeper 會話超時。HBase把這個值傳遞給zk集羣,向它推薦一個會話的最大超時時間。詳見http://hadoop.apache.org/zookeeper/docs/current/zookeeperProgrammers.html#ch_zkSessions "The client sends a requested timeout, the server responds with the timeout that it can give the client. "。 單位是毫秒

默認: 180000

zookeeper.znode.parent

ZooKeeper中的HBase的根ZNode。所有的HBase的ZooKeeper會用這個目錄配置相對路徑。默認情況下,所有的HBase的ZooKeeper文件路徑是用相對路徑,所以他們會都去這個目錄下面。

默認: /hbase

zookeeper.znode.rootserver

ZNode 保存的 根region的路徑. 這個值是由Master來寫,client和regionserver 來讀的。如果設爲一個相對地址,父目錄就是 ${zookeeper.znode.parent}.默認情形下,意味着根region的路徑存儲在/hbase/root-region-server.

默認: root-region-server

hbase.zookeeper.quorum

Zookeeper集羣的地址列表,用逗號分割。例如:"host1.mydomain.com,host2.mydomain.com,host3.mydomain.com".默認是localhost,是給僞分佈式用的。要修改才能在完全分佈式的情況下使用。如果在hbase-env.sh設置了HBASE_MANAGES_ZK,這些ZooKeeper節點就會和HBase一起啓動。

默認: localhost

hbase.zookeeper.peerport

ZooKeeper節點使用的端口。詳細參見:http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperStarted.html#sc_RunningReplicatedZooKeeper

默認: 2888

hbase.zookeeper.leaderport

ZooKeeper用來選擇Leader的端口,詳細參見:http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperStarted.html#sc_RunningReplicatedZooKeeper

默認: 3888

hbase.zookeeper.property.initLimit

ZooKeeper的zoo.conf中的配置。 初始化synchronization階段的ticks數量限制

默認: 10

hbase.zookeeper.property.syncLimit

ZooKeeper的zoo.conf中的配置。 發送一個請求到獲得承認之間的ticks的數量限制

默認: 5

hbase.zookeeper.property.dataDir

ZooKeeper的zoo.conf中的配置。 快照的存儲位置

默認: ${hbase.tmp.dir}/zookeeper

hbase.zookeeper.property.clientPort

ZooKeeper的zoo.conf中的配置。 客戶端連接的端口

默認: 2181

hbase.zookeeper.property.maxClientCnxns

ZooKeeper的zoo.conf中的配置。 ZooKeeper集羣中的單個節點接受的單個Client(以IP區分)的請求的併發數。這個值可以調高一點,防止在單機和僞分佈式模式中出問題。

默認: 2000

hbase.rest.port

HBase REST server的端口

默認: 8080

hbase.rest.readonly

定義REST server的運行模式。可以設置成如下的值: false: 所有的HTTP請求都是被允許的 - GET/PUT/POST/DELETE. true:只有GET請求是被允許的

默認: false

2.3.2. hbase-env.sh

在這個文件裏面設置HBase環境變量。比如可以配置JVM啓動的堆大小或者GC的參數。你還可在這裏配置HBase的參數,如Log位置,niceness(譯者注:優先級),ssh參數還有pid文件的位置等等。打開文件conf/hbase-env.sh細讀其中的內容。每個選項都是有詳盡的註釋的。你可以在此添加自己的環境變量。

這個文件的改動需要重啓HBase才能生效。

2.3.3. log4j.properties

編輯這個文件可以改變HBase的日誌的級別,輪滾策略等等。

這個文件的改動需要重啓HBase才能生效。 日誌級別的更改會影響到HBase UI

2.3.4. 連接HBase集羣的客戶端配置和依賴

因爲HBase的Master有可能轉移,所以客戶端需要訪問ZooKeeper來獲得Master現在的位置。ZooKeeper會保存這些值。因此客戶端必須知道Zookeeper集羣的地址,否則做不了任何事情。通常這個地址存在 hbase-site.xml 裏面,客戶端可以從CLASSPATH取出這個文件.

如果你是使用一個IDE來運行HBase客戶端,你需要將conf/放入你的 classpath,這樣 hbase-site.xml就可以找到了,(或者把hbase-site.xml放到 src/test/resources,這樣測試的時候可以使用).

HBase客戶端最小化的依賴是 hbase, hadoop, log4j, commons-logging, commons-lang, 和 ZooKeeper ,這些jars 需要能在 CLASSPATH 中找到。

下面是一個基本的客戶端 hbase-site.xml 例子:

<?xml version="1.0"?> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration> <property> <name>hbase.zookeeper.quorum</name> <value>example1,example2,example3</value> <description>The directory shared by region servers. </description> </property> </configuration>

2.3.4.1. Java客戶端配置

Java是如何讀到hbase-site.xml 的內容的

Java客戶端使用的配置信息是被映射在一個HBaseConfiguration 實例中. HBaseConfiguration有一個工廠方法, HBaseConfiguration.create();,運行這個方法的時候,它會去CLASSPATH下找hbase-site.xml,讀取它發現的第一個配置文件的內容。 (這個方法還會去找hbase-default.xml ; hbase.X.X.X.jar裏面也會有一個hbase-default.xml). 不使用任何hbase-site.xml文件直接通過Java代碼注入配置信息也是可以的。例如,你可以用編程的方式設置ZooKeeper信息,只要這樣做:

Configuration config = HBaseConfiguration.create();
config.set("hbase.zookeeper.quorum", "localhost");  // Here we are running zookeeper locally

如果有多ZooKeeper實例,你可以使用逗號列表。(就像在hbase-site.xml 文件中做得一樣). 這個 Configuration 實例會被傳遞到 HTable, 之類的實例裏面去.
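
下面是一個簡單的示例草稿,演示用逗號列表設置多個ZooKeeper節點,並把 Configuration 傳給 HTable(其中主機名 example1,example2,example3 和表名 'myTable' 均爲假設,僅作說明):

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;

public class ClientConfigExample {
  public static void main(String[] args) throws IOException {
    Configuration config = HBaseConfiguration.create();
    // 逗號分隔的 ZooKeeper 節點列表(假設的主機名)
    config.set("hbase.zookeeper.quorum", "example1,example2,example3");
    // 該 Configuration 實例會被傳遞到 HTable 之類的實例裏面去
    HTable htable = new HTable(config, "myTable");   // 假設的表名
    try {
      // 在這裏使用 htable 進行 Get/Put/Scan 等操作 ...
    } finally {
      htable.close();
    }
  }
}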



2.4. 配置示例

2.4.1. 簡單的分佈式HBase安裝

這裏是一個10節點的HBase的簡單示例,這裏的配置都是基本的,節點名爲 example0、example1... 一直到 example9 . HBase Master 和 HDFS namenode 運行在同一個節點 example0上. RegionServers 運行在節點 example1-example9. 一個 3-節點 ZooKeeper 集羣運行在example1、example2 和 example3,端口保持默認. ZooKeeper 的數據保存在目錄 /export/zookeeper. 下面我們展示主要的配置文件-- hbase-site.xml、regionservers 和 hbase-env.sh -- 這些文件可以在 conf目錄找到.

2.4.1.1. hbase-site.xml

<?xml version="1.0"?> <?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration> <property> <name>hbase.zookeeper.quorum</name> <value>example1,example2,example3</value> <description>The directory shared by RegionServers. </description> </property> <property> <name>hbase.zookeeper.property.dataDir</name> <value>/export/zookeeper</value> <description>Property from ZooKeeper's config zoo.cfg. The directory where the snapshot is stored. </description> </property> <property> <name>hbase.rootdir</name> <value>hdfs://example0:9000/hbase</value> <description>The directory shared by RegionServers. </description> </property> <property> <name>hbase.cluster.distributed</name> <value>true</value> <description>The mode the cluster will be in. Possible values are false: standalone and pseudo-distributed setups with managed Zookeeper true: fully-distributed with unmanaged Zookeeper Quorum (see hbase-env.sh) </description> </property> </configuration>

2.4.1.2. regionservers

這個文件把RegionServer的節點列了下來。在這個例子裏面我們讓所有的節點都運行RegionServer,除了第一個節點 example0,它要運行 HBase Master 和 HDFS namenode。

example1
example2
example3
example4
example5
example6
example7
example8
example9

2.4.1.3. hbase-env.sh

下面我們用diff 命令來展示 hbase-env.sh 文件相比默認變化的部分. 我們把HBase的堆內存設置爲4G而不是默認的1G.

$ git diff hbase-env.sh
diff --git a/conf/hbase-env.sh b/conf/hbase-env.sh
index e70ebc6..96f8c27 100644
--- a/conf/hbase-env.sh
+++ b/conf/hbase-env.sh
@@ -31,7 +31,7 @@ export JAVA_HOME=/usr/lib//jvm/java-6-sun/
 # export HBASE_CLASSPATH=
 # The maximum amount of heap to use, in MB. Default is 1000.
-# export HBASE_HEAPSIZE=1000
+export HBASE_HEAPSIZE=4096
 # Extra Java runtime options.
 # Below are what we set by default. May only work with SUN JVM.

你可以使用 rsync 來同步 conf 文件夾到你的整個集羣.



2.5. 重要的配置

下面我們會列舉重要的配置. 這個章節講述必須的配置和那些值得一看的配置。(譯者注:淘寶的博客也有本章節的內容,HBase性能調優,很詳盡)。

2.5.1. 必須的配置

參考 Section 2.2, “操作系統” 和 Section 2.3, “Hadoop” 節.

2.5.2. 推薦配置

2.5.2.1. zookeeper.session.timeout

這個默認值是3分鐘。這意味着一旦一個server宕掉了,Master至少需要3分鐘才能察覺到宕機,開始恢復。你可能希望將這個超時調短,這樣Master就能更快的察覺到了。在你調這個值之前,你需要確認你的JVM的GC參數,否則一個長時間的GC操作就可能導致超時。(當一個RegionServer在運行一個長時間的GC的時候,你可能想要重啓並恢復它).

要想改變這個配置,可以編輯 hbase-site.xml, 將配置部署到全部集羣,然後重啓。

我們之所以把這個值調的很高,是因爲我們不想一天到晚在論壇裏回答新手的問題。“爲什麼我在執行一個大規模數據導入的時候Region Server死掉啦”,通常這樣的問題是因爲長時間的GC操作引起的,他們的JVM沒有調優。我們是這樣想的,如果一個人對HBase不很熟悉,不能期望他知道所有,打擊他的自信心。等到他逐漸熟悉了,他就可以自己調這個參數了。

2.5.2.2.  ZooKeeper 實例個數

參考 Section 2.5, “ZooKeeper”.

2.5.2.3. hbase.regionserver.handler.count

這個設置決定了處理用戶請求的線程數量。默認是10,這個值設的比較小,主要是爲了預防用戶用一個比較大的寫緩衝,然後還有很多客戶端併發,這樣region servers會垮掉。有經驗的做法是,當請求內容很大(上MB,如大puts, 使用緩存的scans)的時候,把這個值放低。請求內容較小的時候(gets, 小puts, ICVs, deletes),把這個值放大。

當客戶端的請求內容很小的時候,把這個值設置的和最大客戶端數量一樣是很安全的。一個典型的例子就是一個給網站服務的集羣,put操作一般不會緩衝,絕大多數的操作是get操作。

把這個值放大的危險之處在於,把所有的Put操作緩衝意味着對內存有很大的壓力,甚至會導致OutOfMemory.一個運行在內存不足的機器的RegionServer會頻繁的觸發GC操作,漸漸就能感受到停頓。(因爲所有請求內容所佔用的內存不管GC執行幾遍也是不能回收的)。一段時間後,集羣也會受到影響,因爲所有的指向這個region的請求都會變慢。這樣就會拖累集羣,加劇了這個問題。

要判斷handler數量是太少還是太多,可以參考 Section 12.2.2.1, “啓用 RPC級 日誌”,在單個RegionServer上啓用RPC級日誌,然後觀察日誌末尾(請求隊列也會消耗內存)。

2.5.2.4. 大內存機器的配置

HBase有一個合理的保守的配置,這樣可以運作在所有的機器上。如果你有臺大內存的集羣-HBase有8G或者更大的heap,接下來的配置可能會幫助你. TODO.

2.5.2.5. 壓縮

應該考慮啓用ColumnFamily 壓縮。有好幾個選項,通過降低存儲文件大小以降低IO,降低消耗且大多情況下提高性能。

參考 Appendix C,  HBase壓縮  獲取更多信息.

2.5.2.6. 較大 Regions

更大的Region可以使你集羣上的Region的總數量較少。 一般來言,更少的Region可以使你的集羣運行更加流暢。(你可以自己隨時手工將大Region切割,這樣單個熱點Region就會被分佈在集羣的更多節點上)。

較少的Region較好。一般每個RegionServer在20到小几百之間。 調整Region大小以適合該數字。

 

0.90.x 版本中,默認情況下單個Region是256MB,Region 大小的上界大約是 4Gb。0.92.x 版本由於使用了 HFile v2,單個Region可以支持得大很多 (如 20Gb)。

可能需要實驗,基於硬件和應用需要進行配置。

可以調整hbase-site.xml中的 hbase.hregion.max.filesize屬性. RegionSize 也可以基於每個表設置:  HTableDescriptor.
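
下面是一個在建表時針對單個表設置 region 上限大小的簡單示例草稿(表名 'myTable'、列族 'cf' 以及 10GB 的取值只是假設,僅作說明):

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.HColumnDescriptor;
import org.apache.hadoop.hbase.HTableDescriptor;
import org.apache.hadoop.hbase.client.HBaseAdmin;

public class RegionSizeExample {
  public static void main(String[] args) throws IOException {
    Configuration conf = HBaseConfiguration.create();
    HBaseAdmin admin = new HBaseAdmin(conf);
    HTableDescriptor desc = new HTableDescriptor("myTable");   // 假設的表名
    desc.addFamily(new HColumnDescriptor("cf"));               // 假設的列族
    // 僅對這個表把 region 上限設爲 10GB,效果相當於表級的 hbase.hregion.max.filesize
    desc.setMaxFileSize(10L * 1024 * 1024 * 1024);
    admin.createTable(desc);
  }
}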

2.5.2.7. 管理 Splitting

除了讓HBase自動切割你的Region,你也可以手動切割。 [12] 隨着數據量的增大,split會被持續執行。如果你需要知道你現在有幾個region,比如長時間的debug或者做調優,你需要手動切割。通過跟蹤日誌來了解region級的問題是很難的,因爲它在不停的切割和重命名。data offlining bug和未知數量的region會讓你沒有辦法。如果一個 HLog 或者 StoreFile由於一個奇怪的bug,HBase沒有執行它,等到一天之後你才發現這個問題,你可以確保現在的regions和那個時候的一樣,這樣你就可以restore或者replay這些數據。你還可以調優你的合併算法。如果數據是均勻的,隨着數據增長,很容易導致split / compaction瘋狂的運行,因爲所有的region都是差不多大的。用手動切割,你就可以交錯執行定時的合併和切割操作,降低IO負載。

爲什麼我關閉自動split呢?因爲自動的split是由配置文件中的 hbase.hregion.max.filesize決定的. 把它設置成Long.MAX_VALUE是不推薦的做法,要是你忘記了手工切割怎麼辦.推薦的做法是設置成100GB,一旦到達這樣的值,至少需要一個小時執行 major compactions。

那麼pre-split 時最佳的region數量是多少呢?這取決於你的應用程序。你可以先從低的開始,比如每個server 10個pre-split regions,然後花時間觀察數據增長。region太少至少比region劃分出錯好,而且你可以之後再rolling split。一個更復雜的答案是,這個值取決於你的region中的最大的storefile。隨着數據的增大,這個也會跟着增大。 你可以當這個文件足夠大的時候,用一個定時的操作使用Store的合併選擇算法(compact selection algorithm)來僅合併這一個HStore。如果你不這樣做,這個算法會啓動一個 major compaction,很多region會受到影響,你的集羣會瘋狂的運行。需要注意的是,這樣的瘋狂合併操作是數據增長造成的,而不是手動分割操作決定的。

如果你 pre-split 導致 regions 很小,你可以通過配置HConstants.MAJOR_COMPACTION_PERIOD把你的major compaction參數調大

如果你的數據變得太大,可以使用org.apache.hadoop.hbase.util.RegionSplitter 腳本來執行針對全部集羣的一個網絡IO安全的rolling split操作。

 

2.5.2.8. 管理 Compactions

通常的管理技術是手動管理主壓縮(major compactions), 而不是讓HBase 來做。 缺省HConstants.MAJOR_COMPACTION_PERIOD 是一天。主壓縮可能會在你並不太希望發生的時候強行進行——特別是在一個繁忙的系統上。要關閉自動主壓縮,可以把該值設爲0。

重點強調,主壓縮對存儲文件(StoreFile)清理是絕對必要的。唯一變量是發生的時間。可以通過HBase shell進行管理,或通過 HBaseAdmin.

更多信息關於壓縮和壓縮文件選擇過程,參考 Section 9.7.5.5, “壓縮”

2.5.2.9.  預測執行 (Speculative Execution)

MapReduce任務的預測執行缺省是打開的,HBase集羣一般建議在系統級關閉預測執行,除非在某種特殊情況下需要打開,此時可以每任務配置。設置mapred.map.tasks.speculative.execution 和 mapred.reduce.tasks.speculative.execution 爲 false.
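
如果只想對單個 MapReduce 任務關閉預測執行,可以在提交任務前設置這兩個屬性。下面是一個簡單的示例草稿(假設採用本文所述的老式 mapred.* 屬性名,之後如何創建和提交 Job 由你的應用決定):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;

public class SpeculativeExecutionConfig {
  public static void main(String[] args) {
    Configuration conf = HBaseConfiguration.create();
    // 在提交 MapReduce 任務前,按任務關閉 map 和 reduce 的預測執行
    conf.setBoolean("mapred.map.tasks.speculative.execution", false);
    conf.setBoolean("mapred.reduce.tasks.speculative.execution", false);
    // 之後用這個 conf 創建 Job 並提交 ...
  }
}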

2.5.3. 其他配置

2.5.3.1. 負載均衡

負載均衡器(LoadBalancer)是在主服務器上運行的定期操作,以重新分佈集羣區域。通過hbase.balancer.period 設置,缺省值300000 (5 分鐘).

參考 Section 9.5.4.1, “負載均衡” 獲取關於負載均衡器( LoadBalancer )的更多信息。

2.5.3.2. 禁止塊緩存(Blockcache)

不要關閉塊緩存 (即不要把hbase.block.cache.size 設爲 0)。當前如果關閉塊緩存會很不好,因爲區域服務器會花很多時間不停加載hfile索引。如果你的工作集決定了塊緩存對你沒有好處,那至少也應把塊緩存設置到足以容納hfile索引的大小(可以通過查看區域服務器的UI大致估算需要的大小,網頁的上方會顯示塊索引的統計值)。

2.5.3.3. Nagle's or the small package problem

If a big 40ms or so occasional delay is seen in operations against HBase, try the Nagles' setting. For example, see the user mailing list thread, Inconsistent scan performance with caching set to 1 and the issue cited therein where setting notcpdelay improved scan speeds. You might also see the graphs on the tail of HBASE-7008 Set scanner caching to a better default where our Lars Hofhansl tries various data sizes w/ Nagle's on and off measuring the effect.




 

[1] Be careful editing XML. Make sure you close all elements. Run your file through xmllint or similar to ensure well-formedness of your document after an edit session.

[2] The hadoop-dns-checker tool can be used to verify DNS is working correctly on the cluster. The project README file provides detailed instructions on usage.

[3] 參考 Jack Levin's major hdfs issues note up on the user list.

[4] The requirement that a database requires upping of system limits is not peculiar to HBase. 參考 for example the section Setting Shell Limits for the Oracle User inShort Guide to install Oracle 10 on Linux.

[5] A useful read setting config on you hadoop cluster is Aaron Kimballs' Configuration Parameters: What can you just ignore?

[6] <title>On Hadoop Versions</title>

[6] The Cloudera blog post An update on Apache Hadoop 1.0 by Charles Zedlweski has a nice exposition on how all the Hadoop versions relate. Its worth checking out if you are having trouble making sense of the Hadoop version morass.

[7] Until recently only the branch-0.20-append branch had a working sync but no official release was ever made from this branch. You had to build it yourself. Michael Noll wrote a detailed blog, Building an Hadoop 0.20.x version for HBase 0.90.2, on how to build an Hadoop from branch-0.20-append. Recommended.

[8] Praveen Kumar has written a complimentary article, Building Hadoop and HBase for HBase Maven application development.

[9] dfs.support.append

[10] 參考 Hadoop HDFS: Deceived by Xciever for an informative rant on xceivering.

[11] The pseudo-distributed vs fully-distributed nomenclature comes from Hadoop.

[12] 參考 Section 2.4.2.1.2, “Pseudo-distributed Extras” for notes on how to start extra Masters and RegionServers when running pseudo-distributed.

[13] 對 ZooKeeper 全部配置,參考ZooKeeper 的zoo.cfg. HBase 沒有包含 zoo.cfg ,所以需要瀏覽合適的獨立ZooKeeper下載版本的 conf 目錄找到。

[14] What follows is taken from the javadoc at the head of the org.apache.hadoop.hbase.util.RegionSplitter tool added to HBase post-0.90.0 release.

 

Chapter 3. 升級

You cannot skip major verisons upgrading. If you are upgrading from version 0.20.x to 0.92.x, you must first go from 0.20.x to 0.90.x and then go from 0.90.x to 0.92.x.

參見 Section 2, “配置”, 需要特別注意有關Hadoop 版本的信息.

3.1. 從 0.94.x 升級到 0.96.x

The Singularity

You will have to stop your old 0.94 cluster completely to upgrade. If you are replicating between clusters, both clusters will have to go down to upgrade. Make sure it is a clean shutdown so there are no WAL files laying around (TODO: Can 0.96 read 0.94 WAL files?). Make sure zookeeper is cleared of state. All clients must be upgraded to 0.96 too.

The API has changed in a few areas; in particular how you use coprocessors (TODO: MapReduce too?)

3.2. 從 0.92.x 升級到 0.94.x

0.92 和 0.94 接口兼容,可平滑升級。

 

3.3. 從 0.90.x 到 0.92.x 升級

升級指引

You will find that 0.92.0 runs a little differently to 0.90.x releases. Here are a few things to watch out for upgrading from 0.90.x to 0.92.0.

 

If you've not patience, here are the important things to know upgrading.

  1. Once you upgrade, you can’t go back.
  2. MSLAB is on by default. Watch that heap usage if you have a lot of regions.
  3. Distributed splitting is on by default. It should make region server failover faster.
  4. There’s a separate tarball for security.
  5. If -XX:MaxDirectMemorySize is set in your hbase-env.sh, it’s going to enable the experimental off-heap cache (You may not want this).

3.3.1. 不可回退!

To move to 0.92.0, all you need to do is shutdown your cluster, replace your hbase 0.90.x with hbase 0.92.0 binaries (be sure you clear out all 0.90.x instances) and restart (You cannot do a rolling restart from 0.90.x to 0.92.x -- you must restart). On startup, the .META. table content is rewritten removing the table schema from the info:regioninfo column. Also, any flushes done post first startup will write out data in the new 0.92.0 file format, HFile V2. This means you cannot go back to 0.90.x once you’ve started HBase 0.92.0 over your HBase data directory.

3.3.2. MSLAB 缺省啓用

In 0.92.0, the hbase.hregion.memstore.mslab.enabled flag is set to true (參考 Section 11.3.1.1, “Long GC pauses”). In 0.90.x it was false. When it is enabled, memstores will step allocate memory in MSLAB 2MB chunks even if the memstore has zero or just a few small elements. This is fine usually but if you had lots of regions per regionserver in a 0.90.x cluster (and MSLAB was off), you may find yourself OOME'ing on upgrade because the thousands of regions * number of column families * 2MB MSLAB (at a minimum) puts your heap over the top. Set hbase.hregion.memstore.mslab.enabled to false or set the MSLAB size down from 2MB by setting hbase.hregion.memstore.mslab.chunksize to something less.

3.3.3. 分佈式分割缺省啓用

Previous, WAL logs on crash were split by the Master alone. In 0.92.0, log splitting is done by the cluster (參考 “HBASE-1364 [performance] Distributed splitting of regionserver commit logs”). This should cut down significantly on the amount of time it takes splitting logs and getting regions back online again.

3.3.4. 內存計算改變

In 0.92.0, Appendix E, HFile format version 2 indices and bloom filters take up residence in the same LRU used caching blocks that come from the filesystem. In 0.90.x, the HFile v1 indices lived outside of the LRU so they took up space even if the index was on a ‘cold’ file, one that wasn’t being actively used. With the indices now in the LRU, you may find you have less space for block caching. Adjust your block cache accordingly. 參考 the Section 9.6.4, “Block Cache” for more detail. The block size default size has been changed in 0.92.0 from 0.2 (20 percent of heap) to 0.25.

3.3.5.  可用 Hadoop 版本

Run 0.92.0 on Hadoop 1.0.x (or CDH3u3 when it ships). The performance benefits are worth making the move. Otherwise, our Hadoop prescription is as it has been; you need an Hadoop that supports a working sync. 參考 Section 2.3, “Hadoop”.

If running on Hadoop 1.0.x (or CDH3u3), enable local read. 參考 Practical Caching presentation for ruminations on the performance benefits ‘going local’ (and for how to enable local reads).

3.3.6. HBase 0.92.0 帶 ZooKeeper 3.4.2

If you can, upgrade your zookeeper. If you can’t, 3.4.2 clients should work against 3.3.X ensembles (HBase makes use of 3.4.2 API).

3.3.7. 在線切換缺省關閉

In 0.92.0, we’ve added an experimental online schema alter facility (參考 hbase.online.schema.update.enable). Its off by default. Enable it at your own risk. Online alter and splitting tables do not play well together so be sure your cluster quiescent using this feature (for now).

3.3.8. WebUI

The webui has had a few additions made in 0.92.0. It now shows a list of the regions currently transitioning, recent compactions/flushes, and a process list of running processes (usually empty if all is well and requests are being handled promptly). Other additions including requests by region, a debugging servlet dump, etc.

3.3.9. 安全 tarball

我們發佈兩個tarball: 安全和非安全 HBase. 如何設置安全HBase的文檔正在制定中。

3.3.10. 試驗離堆(off-heap)緩存

(譯者注:on-heap和off-heap是Terracotta 公司提出的概念。on-heap指java對象在GC內存儲管理,效率較高,但GC只能管理2G內存,有時成爲性能瓶頸。off-heap又叫BigMemory ,是JVM的GC機制的替代,在GC外存儲,100倍速於DiskStore,cache量目前(2012年底)達到350GB)

A new cache was contributed to 0.92.0 to act as a solution between using the “on-heap” cache which is the current LRU cache the region servers have and the operating system cache which is out of our control. To enable, set “-XX:MaxDirectMemorySize” in hbase-env.sh to the value for maximum direct memory size and specify hbase.offheapcache.percentage in hbase-site.xml with the percentage that you want to dedicate to off-heap cache. This should only be set for servers and not for clients. Use at your own risk. See this blog post for additional information on this new experimental feature: http://www.cloudera.com/blog/2012/01/caching-in-hbase-slabcache/

3.3.11. HBase 複製的變動

0.92.0 adds two new features: multi-slave and multi-master replication. The way to enable this is the same as adding a new peer, so in order to have multi-master you would just run add_peer for each cluster that acts as a master to the other slave clusters. Collisions are handled at the timestamp level which may or may not be what you want, this needs to be evaluated on a per use case basis. Replication is still experimental in 0.92 and is disabled by default, run it at your own risk.

3.3.12. 對OOME ,RegionServer 現在退出

If an OOME, we now have the JVM kill -9 the regionserver process so it goes down fast. Previous, a RegionServer might stick around after incurring an OOME limping along in some wounded state. To disable this facility, and recommend you leave it in place, you’d need to edit the bin/hbase file. Look for the addition of the -XX:OnOutOfMemoryError="kill -9 %p" arguments (參考 [HBASE-4769] - ‘Abort RegionServer Immediately on OOME’)

3.3.13. HFile V2 和 “更大, 更少” 趨勢

0.92.0 stores data in a new format, Appendix E, HFile format version 2. As HBase runs, it will move all your data from HFile v1 to HFile v2 format. This auto-migration will run in the background as flushes and compactions run. HFile V2 allows HBase run with larger regions/files. In fact, we encourage that all HBasers going forward tend toward Facebook axiom #1, run with larger, fewer regions. If you have lots of regions now -- more than 100s per host -- you should look into setting your region size up after you move to 0.92.0 (In 0.92.0, default size is not 1G, up from 256M), and then running online merge tool (參考 “HBASE-1621 merge tool should work on online cluster, but disabled table”).

 

3.4. 從HBase 0.20.x or 0.89.x 升級到 HBase 0.90.x

0.90.x 版本的HBase可以在 HBase 0.20.x 或者 HBase 0.89.x 寫出的數據上啓動,不需要轉換數據文件。但是 HBase 0.89.x 和 0.90.x 的region目錄名和舊版是不一樣的 -- 新版本用region名的md5 hash 而不是jenkins hash 來命名region目錄 -- 這就意味着,一旦啓動,再也不能回退到 HBase 0.20.x。

在升級的時候,一定要將hbase-default.xml 從你的 conf目錄刪掉。 0.20.x 版本的配置對於 0.90.x HBase不是最佳的. hbase-default.xml 現在已經被打包在 HBase jar 裏面了. 如果你想看看這個文件內容,你可以在src目錄下的 src/main/resources/hbase-default.xml 或者在 Section 2.3.1.1, “HBase 默認配置”看到.

最後,如果從0.20.x升級,需要在shell裏檢查 .META. schema . 過去,我們推薦用戶使用16KB的 MEMSTORE_FLUSHSIZE. 在shell中運行 hbase> scan '-ROOT-'. 會顯示當前的.META. schema. 檢查 MEMSTORE_FLUSHSIZE 的大小. 看看是不是 16KB (16384)? 如果是的話,你需要修改它(默認的值是 64MB (67108864)) 運行腳本bin/set_meta_memstore_size.rb. 這個腳本會修改 .META. schema. 如果不運行的話,集羣會比較慢[15] .



 

Chapter 4.  HBase Shell

HBase Shell 是在(J)Ruby的IRB的基礎上加上了HBase的命令。任何你可以在IRB裏做的事情都可以在HBase Shell中做。

你可以這樣來運行HBase Shell:

$ ./bin/hbase shell

輸入 help 就會返回Shell的命令列表和選項。可以看看在Help文檔尾部的關於如何輸入變量和選項。尤其要注意的是表名,行,列名必須要加引號。

參見 Section 1.2.3, “Shell 練習”可以看到Shell的基本使用例子。

4.1. 使用腳本

如果要使用腳本,可以看HBase的bin 目錄.在裏面找到後綴爲 *.rb的腳本.要想運行這個腳本,要這樣

$ ./bin/hbase org.jruby.Main PATH_TO_SCRIPT

就可以了

4.2. Shell 技巧

4.2.1. irbrc

可以在你自己的Home目錄下創建一個.irbrc文件. 在這個文件里加入自定義的命令。有一個有用的命令就是記錄命令歷史,這樣你就可以把你的命令保存起來。

$ more .irbrc
require 'irb/ext/save-history'
IRB.conf[:SAVE_HISTORY] = 100
IRB.conf[:HISTORY_FILE] = "#{ENV['HOME']}/.irb-save-history"

可以參見 ruby 關於 .irbrc 的文檔來學習更多的關於IRB的配置方法。

4.2.2. LOG 時間轉換

可以將日期'08/08/16 20:56:29'從hbase log 轉換成一個 timestamp, 操作如下:

hbase(main):021:0> import java.text.SimpleDateFormat
hbase(main):022:0> import java.text.ParsePosition
hbase(main):023:0> SimpleDateFormat.new("yy/MM/dd HH:mm:ss").parse("08/08/16 20:56:29", ParsePosition.new(0)).getTime()
=> 1218920189000

也可以逆過來操作。

hbase(main):021:0> import java.util.Date
hbase(main):022:0> Date.new(1218920189000).toString()
=> "Sat Aug 16 20:56:29 UTC 2008"

要想把日期格式和HBase log格式完全相同,可以參見文檔 SimpleDateFormat.

4.2.3. 調試

4.2.3.1. Shell 切換成debug 模式

你可以將shell切換成debug模式。這樣可以看到更多的信息。 -- 例如可以看到命令異常的stack trace:

hbase> debug <RETURN>

4.2.3.2. DEBUG log level

想要在shell中看到 DEBUG 級別的 logging ,可以在啓動的時候加上 -d 參數.

$ ./bin/hbase shell -d

Chapter 5. 數據模型

簡單來說,應用程序是以表的方式在HBase存儲數據的。表是由行和列構成的,所有的列是從屬於某一個列族的。行和列的交叉點稱之爲cell,cell是版本化的。cell的內容是不可分割的字節數組。

表的行鍵也是一段字節數組,所以任何東西都可以保存進去,不論是字符串或者數字。HBase的表是按key排序的,排序方式只針對字節。所有的表都必須要有主鍵-key.

5.1. 概念視圖

下面是根據BigTable 論文稍加修改的例子。 有一個名爲webtable的表,包含兩個列族:contents和anchor.在這個例子裏面,anchor有兩個列 (anchor:cnnsi.com,anchor:my.look.ca),contents僅有一列(contents:html)

列名

一個列名是由它的列族前綴和修飾符(qualifier)連接而成。例如列contents:html是列族 contents加冒號(:)加 修飾符 html組成的。

Table 5.1. 表 webtable

Row Key Time Stamp ColumnFamily contents ColumnFamily anchor
"com.cnn.www" t9   anchor:cnnsi.com = "CNN"
"com.cnn.www" t8   anchor:my.look.ca = "CNN.com"
"com.cnn.www" t6 contents:html = "<html>..."  
"com.cnn.www" t5 contents:html = "<html>..."  
"com.cnn.www" t3 contents:html = "<html>..."  


5.2. 物理視圖

儘管在概念視圖裏,表可以被看成是一個稀疏的行的集合。但在物理上,它的是區分列族 存儲的。新的columns可以不經過聲明直接加入一個列族.

Table 5.2. ColumnFamily anchor

Row Key Time Stamp Column Family anchor
"com.cnn.www" t9 anchor:cnnsi.com = "CNN"
"com.cnn.www" t8 anchor:my.look.ca = "CNN.com"


Table 5.3. ColumnFamily contents

Row Key Time Stamp ColumnFamily "contents:"
"com.cnn.www" t6 contents:html = "<html>..."
"com.cnn.www" t5 contents:html = "<html>..."
"com.cnn.www" t3 contents:html = "<html>..."


值得注意的是在上面的概念視圖中空白cell在物理上是不存儲的,因爲根本沒有必要存儲。因此若一個請求爲要獲取t8時間的contents:html,它的結果就是空。相似的,若請求爲獲取t9時間的anchor:my.look.ca,結果也是空。但是,如果不指明時間,將會返回每列最新時間的值。例如,如果請求爲獲取行鍵爲"com.cnn.www",沒有指明時間戳的話,返回的結果是t6下的contents:html,t9下的anchor:cnnsi.com和t8下的anchor:my.look.ca。

For more information about the internals of how HBase stores data, see Section 9.7, “Regions”.

5.3. 表

表是在schema聲明的時候定義的。

5.4. 行

行鍵是不可分割的字節數組。行是按字典排序由低到高存儲在表中的。一個空的數組是用來標識表空間的起始或者結尾。

5.5. 列族

在HBase中,列族是一些列的集合。一個列族的所有列成員有着相同的前綴。比如,列courses:history 和 courses:math都是列族 courses的成員.冒號(:)是列族的分隔符,用來區分前綴和列名。column 前綴必須是可打印的字符,剩下的部分(稱爲qualifier),可以由任意字節數組組成。列族必須在表建立的時候聲明,column則不需要,隨時可以新建。

在物理上,同一個列族的成員在文件系統上都是存儲在一起的。因爲存儲優化都是針對列族級別的,這就意味着,一個column family的所有成員是用相同的方式訪問的。

5.6. Cells

{row, column, version} 元組就是一個HBase中的一個 cell。Cell的內容是不可分割的字節數組。

5.7. 數據模型操作

四個主要的數據模型操作是 Get, Put, Scan, 和 Delete. 通過 HTable 實例進行操作.

5.7.1. Get

Get 返回特定行的屬性。 Gets 通過 HTable.get 執行。

5.7.2. Put

Put 要麼向表增加新行 (如果key是新的) 或更新行 (如果key已經存在)。 Puts 通過 HTable.put (writeBuffer) 或 HTable.batch (non-writeBuffer)執行。

5.7.3. Scans

Scan 允許在多行上對指定屬性進行迭代。

下面是一個在 HTable 表實例上的示例。 假設表有幾行鍵值爲 "row1", "row2", "row3", 還有一些行有鍵值 "abc1", "abc2", 和 "abc3". 下面的示例展示startRow 和 stopRow 可以應用到一個Scan 實例,以返回"row"打頭的行。

HTable htable = ...      // instantiate HTable
Scan scan = new Scan();
scan.addColumn(Bytes.toBytes("cf"), Bytes.toBytes("attr"));
scan.setStartRow(Bytes.toBytes("row"));             // start key is inclusive
scan.setStopRow(Bytes.toBytes("row" + (char)0));    // stop key is exclusive
ResultScanner rs = htable.getScanner(scan);
try {
  for (Result r = rs.next(); r != null; r = rs.next()) {
    // process result...
  }
} finally {
  rs.close();  // always close the ResultScanner!
}

5.7.4. Delete

Delete 從表中刪除一行. 刪除通過HTable.delete 執行。
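
下面是一個簡單的示例草稿(表名 'myTable'、行鍵 'row1'、列族 'cf' 和列 'attr' 均爲假設,僅作說明):

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Delete;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.util.Bytes;

public class DeleteExample {
  public static void main(String[] args) throws IOException {
    Configuration conf = HBaseConfiguration.create();
    HTable htable = new HTable(conf, "myTable");           // 假設的表名
    Delete delete = new Delete(Bytes.toBytes("row1"));     // 刪除整行
    // 也可以只刪除某一列的所有版本:
    // delete.deleteColumns(Bytes.toBytes("cf"), Bytes.toBytes("attr"));
    htable.delete(delete);
    htable.close();
  }
}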

HBase 不會就地修改數據,所以刪除是通過創建名爲墓碑(tombstone)的新標記來處理的。這些墓碑,連同被它們屏蔽的死去的值,會在主壓縮時清除。

參考 Section 5.8.1.5, “Delete” 獲取刪除列版本的更多信息。參考Section 9.7.5.5, “Compaction” 獲取更多有關壓縮的信息。

5.8. 版本

一個 {row, column, version} 元組是HBase中的一個單元(cell).但是有可能會有很多的單元的行和列是相同的,可以使用版本來區分不同的單元.

rows和column key是用字節數組表示的,version則是用一個長整型表示。這個long的值使用 java.util.Date.getTime() 或者 System.currentTimeMillis()產生的。這就意味着他的含義是“當前時間和1970-01-01 UTC的時間差,單位毫秒。”

在HBase中,版本是按倒序排列的,因此當讀取這個文件的時候,最先找到的是最近的版本。

有些人不是很理解HBase單元(cell)的意思。一個常見的問題是:

  • 如果有多個帶有版本的寫操作同時發起,HBase會保存全部還是隻保留最新的一個?[16]

  • 可以發起包含版本的寫操作,但是他們的版本順序和操作順序相反嗎?[17]

下面我們介紹下在HBase中版本是如何工作的。[18].

5.8.1. HBase的操作(包含版本操作)

在這一章我們來仔細看看在HBase的各個主要操作中版本起到了什麼作用。

5.8.1.1. Get/Scan

Get是在Scan的基礎上實現的。下面關於Get的討論同樣適用於Scan。

默認情況下,如果你沒有指定版本,當你使用Get操作的時候,會返回最近版本的Cell(該Cell可能是最新寫入的,但不能保證)。默認的操作可以這樣修改:

  • 如果想要返回兩個以上的版本,參見Get.setMaxVersions()

  • 如果想要返回的版本不只是最近的,參見 Get.setTimeRange()

    要想讓查詢返回的最新版本小於或等於給定的值(也就是說,給定的'最近'時間點可以是過去的某一時刻),可以把時間範圍設置爲0到你想要的時間上限,同時把max versions設置爲1(參見下面的示例)。
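
下面是一個按上述方式查詢的簡單示例草稿(表名 'myTable'、列族 'cf'、列 'attr' 和時間上限均爲假設,僅作說明):

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.util.Bytes;

public class GetAsOfTimeExample {
  public static void main(String[] args) throws IOException {
    Configuration conf = HBaseConfiguration.create();
    HTable htable = new HTable(conf, "myTable");     // 假設的表名
    long asOf = 1288380727188L;                      // 假設的時間上限(毫秒)
    Get get = new Get(Bytes.toBytes("row1"));
    get.setTimeRange(0, asOf + 1);                   // 只看 [0, asOf] 範圍內的版本(上界是排他的)
    get.setMaxVersions(1);                           // 只要該範圍內最新的一個版本
    Result r = htable.get(get);
    byte[] b = r.getValue(Bytes.toBytes("cf"), Bytes.toBytes("attr"));
    htable.close();
  }
}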

5.8.1.2. 默認 Get 例子

下面的Get操作會只獲得最新的一個版本。

Get get = new Get(Bytes.toBytes("row1")); Result r = htable.get(get); byte[] b = r.getValue(Bytes.toBytes("cf"), Bytes.toBytes("attr")); // returns current version of value

5.8.1.3. 含有的版本的Get例子

下面的Get操作會獲得最近的3個版本。

Get get = new Get(Bytes.toBytes("row1")); get.setMaxVersions(3); // will return last 3 versions of row Result r = htable.get(get); byte[] b = r.getValue(Bytes.toBytes("cf"), Bytes.toBytes("attr")); // returns current version of value List<KeyValue> kv = r.getColumn(Bytes.toBytes("cf"), Bytes.toBytes("attr")); // returns all versions of this column

5.8.1.4. Put

一個Put操作會給一個cell,創建一個版本,默認使用當前時間戳,當然你也可以自己設置時間戳。這就意味着你可以把時間設置在過去或者未來,或者隨意使用一個Long值。

要想覆蓋一個現有的值,就意味着你的row,column和版本必須完全相等。

5.8.1.4.1. 不指明版本的例子

下面的Put操作不指明版本,所以HBase會用當前時間作爲版本。

Put put = new Put(Bytes.toBytes(row));
put.add(Bytes.toBytes("cf"), Bytes.toBytes("attr1"), Bytes.toBytes(data));
htable.put(put);

5.8.1.4.2. 指明版本的例子

下面的Put操作,指明瞭版本。

Put put = new Put(Bytes.toBytes(row));
long explicitTimeInMs = 555;  // just an example
put.add(Bytes.toBytes("cf"), Bytes.toBytes("attr1"), explicitTimeInMs, Bytes.toBytes(data));
htable.put(put);

5.8.1.5. Delete

有三種不同類型的內部刪除標記  [19]:

  • Delete: 刪除列的指定版本.

  • Delete column: 刪除列的所有版本.

  • Delete family: 刪除特定列族所有列

當刪除一行,HBase將內部對每個列族創建墓碑(非每個單獨列)。

刪除操作的實現是創建一個墓碑標記。刪除時可以指定一個版本,否則默認使用 currentTimeMillis,其含義是"刪除所有版本小於或等於這個版本的單元"。HBase不會就地修改數據,數據也不會立即從文件中刪除,而是使用刪除標記來屏蔽掉這些值。[20]若你指定的版本比這一行中所有數據的版本都晚,就意味着這一行中的所有數據都會被刪除。

參考 Section 9.7.5.4, “KeyValue” 獲取內部 KeyValue 格式更多信息。

5.8.2. 現有的限制

關於版本還有一些bug(或者稱之爲未實現的功能),計劃在下個版本實現。

5.8.2.1. 刪除標記誤標新Put 的數據

刪除標記操作可能會標記其後put的數據。[21]記住,當寫下一個墓碑標記後,只有下一個主壓縮操作發起之後,墓碑纔會清除。假設你刪除所有<= 時間T的數據。但之後,你又執行了一個Put操作,時間戳<= T。就算這個Put發生在刪除操作之後,他的數據也打上了墓碑標記。這個Put並不會失敗,但你做Get操作時,會注意到Put沒有產生影響。只有一個主壓縮執行後,一切纔會恢復正常。如果你的Put操作一直使用升序的版本,這個問題不會有影響。但是即使你不關心時間,也可能出現該情況。只需刪除和插入迅速相互跟隨,就有機會在同一毫秒中遇到。

5.8.2.2. 主壓縮改變查詢的結果

設想一下,你的一個cell有三個版本t1,t2和t3。你的maximum-version設置是2。當你請求獲取全部版本的時候,只會返回兩個,t2和t3。如果你將t2和t3刪除,就會返回t1。但是如果在刪除之前,發生了major compaction操作,那麼什麼值都不會返回了。[22]


5.9. 排序

所有數據模型操作 HBase 返回排序的數據。先是行,再是列族,然後是列修飾(column qualifier), 最後是時間戳(反向排序,所以最新的在前).

5.10. 列的元數據

對列族,沒有內部的KeyValue之外的元數據保存。這樣,HBase不僅在一行中支持很多列,而且支持行之間不同的列。 由你自己負責跟蹤列名。

唯一獲取列族的完整列名的方法是處理所有行。HBase內部保存數據更多信息,請參考 Section 9.7.5.4, “KeyValue”.

5.11. 聯合查詢(Join)

HBase是否支持聯合是一個網上常問問題。簡單來說 : 不支持。至少不像傳統RDBMS那樣支持(如 SQL中帶 equi-joins 或 outer-joins). 正如本章描述的,讀數據模型是 Get 和 Scan.

但並不表示等價聯合不能在應用程序中支持,只是必須自己做。 兩種主要方法:要麼在寫入HBase時對數據做反規範化(denormalize),要麼建立查找表,並在應用或MapReduce代碼中在HBase表之間做聯合(如 RDBMS所展示的,依賴於表的大小,有幾種實現策略,如 nested loops vs. hash-joins)。哪個更好?依賴於你準備做什麼,所以沒有一個單一的回答適合所有方面。


[16] 目前,只有最新的那個是可以獲取到的。

[17] 可以。

[18] 參考 HBASE-2406 for discussion of HBase versions. Bending time in HBase makes for a good read on the version, or time, dimension in HBase. It has more detail on versioning than is provided here. As of this writing, the limitation Overwriting values at existing timestamps mentioned in the article no longer holds in HBase. This section is basically a synopsis of this article by Bruno Dumon.

[19] 參考 Lars Hofhansl's blog for discussion of his attempt adding another, Scanning in HBase: Prefix Delete Marker

[20] 當HBase執行一次major compaction,標記刪除的數據會被實際的刪除,刪除標記也會被刪除。

[22] 參考垃圾收集: Bending time in HBase

 

Chapter 6. HBase 和 Schema 設計

6.1. Schema 創建

可以使用HBaseAdmin或者Chapter 4, HBase Shell 來創建和編輯HBase的schemas

表必須禁用以修改列族,如:

Configuration config = HBaseConfiguration.create();
HBaseAdmin admin = new HBaseAdmin(config);
String table = "myTable";

admin.disableTable(table);

HColumnDescriptor cf1 = ...;
admin.addColumn(table, cf1);      // adding new ColumnFamily
HColumnDescriptor cf2 = ...;
admin.modifyColumn(table, cf2);   // modifying existing ColumnFamily

admin.enableTable(table);
參考 Section 2.6.4, “Client configuration and dependencies connecting to an HBase cluster” ,獲取更多配置客戶端連接的信息。

注意: 0.92.x 支持在線修改模式, 但 0.90.x 需要禁用表。

6.1.1. 模式更新

當表或列族改變時(如 region size, block size), 當下次存在主壓縮及存儲文件重寫時起作用。

參考 Section 9.7.5, “Store” 獲取存儲文件的更多信息。


6.2.  列族的數量

現在HBase並不能很好的處理兩個或者三個以上的列族,所以儘量讓你的列族數量少一些。目前,flush和compaction操作是針對一個Region的。所以當一個列族操作大量數據的時候會引發一個flush,那些相鄰的列族即使數據量很小,也會被一起flush。Compaction操作現在是根據一個列族下的全部文件的數量觸發的,而不是根據文件大小觸發的。當很多的列族在flush和compaction時,會造成很多沒用的I/O負載(要想解決這個問題,需要將flush和compaction操作改爲只針對一個列族) 。 更多壓縮信息, 參考Section 9.7.5.5, “Compaction”.

儘量在你的應用中使用一個列族。只有你的所有查詢操作只訪問一個列族的時候,可以引入第二個和第三個列族.例如,你有兩個列族,但你查詢的時候總是訪問其中的一個,從來不會兩個一起訪問。

6.2.1. 列族的基數

一個表存在多列族,注意基數(如, 行數). 如果列族A有100萬行,列族B有10億行,列族A可能被分散到很多很多區(及區服務器)。這導致掃描列族A低效。

 

6.3.  行鍵(RowKey)設計

6.3.1. 單調遞增行鍵/時序數據

在Tom White的Hadoop: The Definitive Guide一書中,有一個章節描述了一個值得注意的問題:在一個集羣中,一個導入數據的進程一動不動,所有的client都在等待一個region(就是一個節點),過了一會後,變成了下一個region...如果使用了單調遞增或者時序的key就會造成這樣的問題。詳情可以參見IKai畫的漫畫monotonically increasing values are bad。使用了順序的key會將本沒有順序的數據變得有順序,把負載壓在一臺機器上。所以要儘量避免時間戳或者(e.g. 1, 2, 3)這樣的key。

如果你需要導入時間順序的文件(如log)到HBase中,可以學習OpenTSDB的做法。他有一個頁面來描述他的schema.OpenTSDB的Key的格式是[metric_type][event_timestamp],乍一看,似乎違背了不將timestamp做key的建議,但是他並沒有將timestamp作爲key的一個關鍵位置,有成百上千的metric_type就足夠將壓力分散到各個region了。

6.3.2. 儘量最小化行和列的大小(爲何我的存儲文件指示很大?)

在HBase中,值是作爲一個單元(Cell)保存在系統中的,要定位一個單元,需要行,列名和時間戳。通常情況下,如果你的行和列的名字太大(甚至比value的大小還要大)的話,你可能會遇到一些有趣的情況。例如Marc Limotte 在 HBASE-3551(推薦!)尾部提到的現象。在HBase的存儲文件Section 9.7.5.2, “StoreFile (HFile)”中,有一個索引用來方便值的隨機訪問,但是單元座標太大的話,這個索引會佔用HBase分配的大量內存。所以要想解決,可以設置一個更大的塊大小,當然也可以使用更小的列名。壓縮也會使索引相對變大。參考話題 a question storefileIndexSize 用戶郵件列表.

大部分時候,小的低效不會影響很大。不幸的是,這裏會是個問題。無論是列族,屬性和行鍵都會在數據中重複上億次。參考 Section 9.7.5.4, “KeyValue” 獲取更多信息,關於HBase 內部保存數據,瞭解爲什麼這很重要。

6.3.2.1. 列族

儘量使列族名小,最好一個字符。(如 "d" 表示 data/default).

參考 Section 9.7.5.4, “KeyValue” 獲取更多信息,關於HBase 內部保存數據,瞭解爲什麼這很重要。

6.3.2.2. 屬性

詳細屬性名 (如, "myVeryImportantAttribute") 易讀,最好還是用短屬性名 (e.g., "via") 保存到HBase.

參考 Section 9.7.5.4, “KeyValue” 獲取更多信息,關於HBase 內部保存數據,瞭解爲什麼這很重要。t.

6.3.2.3. 行鍵長度

讓行鍵短到仍然可讀即可,這樣對獲取數據有用(e.g., Get vs. Scan)。 一個對數據訪問沒有用處的短鍵,並不比一個具有更好get/scan特性的長鍵更好。設計行鍵需要權衡。

6.3.2.4. 字節模式

long 類型有 8 字節. 8字節內可以保存無符號數字到18,446,744,073,709,551,615. 如果用字符串保存--假設一個字節一個字符--,需要將近3倍的字節數。

不信? 下面是示例代碼,可以自己運行一下。

// long
//
long l = 1234567890L;
byte[] lb = Bytes.toBytes(l);
System.out.println("long bytes length: " + lb.length);   // returns 8

String s = "" + l;
byte[] sb = Bytes.toBytes(s);
System.out.println("long as string length: " + sb.length);    // returns 10

// hash
//
MessageDigest md = MessageDigest.getInstance("MD5");
byte[] digest = md.digest(Bytes.toBytes(s));
System.out.println("md5 digest bytes length: " + digest.length);    // returns 16

String sDigest = new String(digest);
byte[] sbDigest = Bytes.toBytes(sDigest);
System.out.println("md5 digest as string length: " + sbDigest.length);    // returns 26(譯者注:實測值爲22)

6.3.3. 倒序時間戳

一個數據庫處理的通常問題是找到最近版本的值。採用倒序時間戳作爲鍵的一部分可以對此特定情況有很大幫助。也在Tom White的Hadoop書籍的HBase 章節能找到: The Definitive Guide (O'Reilly), 該技術包含追加(Long.MAX_VALUE - timestamp) 到key的後面,如 [key][reverse_timestamp].

表內[key]的最近的值可以用[key]進行 Scan 找到並獲取第一個記錄。由於 HBase 行鍵是排序的,該鍵排在任何比它老的行鍵的前面,所以必然是第一個。

該技術可以用於代替Section 6.4, “ 版本的數量 ” ,其目的是保存所有版本到“永遠”(或一段很長時間) 。同時,採用同樣的Scan技術,可以很快獲取其他版本。
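
下面是一個構造這種鍵並讀取最新一條記錄的簡單示例草稿(表名 'myTable'、鍵前綴 'user123'、列族 'cf' 和列名均爲假設,僅作說明):

import java.io.IOException;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Put;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.util.Bytes;

public class ReverseTimestampExample {
  public static void main(String[] args) throws IOException {
    Configuration conf = HBaseConfiguration.create();
    HTable htable = new HTable(conf, "myTable");                     // 假設的表名
    byte[] keyPrefix = Bytes.toBytes("user123");                     // 假設的 [key] 部分
    long reverseTs = Long.MAX_VALUE - System.currentTimeMillis();    // 倒序時間戳
    byte[] rowKey = Bytes.add(keyPrefix, Bytes.toBytes(reverseTs));  // [key][reverse_timestamp]
    Put put = new Put(rowKey);
    put.add(Bytes.toBytes("cf"), Bytes.toBytes("attr"), Bytes.toBytes("value"));  // 假設的列族/列
    htable.put(put);

    // 取 [key] 的最新值:從 keyPrefix 開始 Scan,第一條就是最新寫入的記錄
    Scan scan = new Scan(keyPrefix);
    ResultScanner rs = htable.getScanner(scan);
    try {
      Result latest = rs.next();
      // 處理 latest ...
    } finally {
      rs.close();
      htable.close();
    }
  }
}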

6.3.4. 行鍵和列族

行鍵在列族範圍內。所以同樣的行鍵可以在同一個表的每個列族中存在而不會衝突。

6.3.5. 行鍵永遠不變

行鍵不能改變。唯一可以“改變”的方式是刪除然後再插入。這是一個網上常問問題,所以要注意開始就要讓行鍵正確(且/或在插入很多數據之前)。

6.4.  版本數量

6.4.1. 最大版本數

行的版本數量是在HColumnDescriptor中設置的,每個列族可以單獨設置,默認是3。這個設置很重要,在Chapter 5, 數據模型中有描述,因爲HBase不會覆蓋已有的值,而是按時間戳在後面追加寫入,過早的版本會在執行主壓縮的時候刪除。最大版本數可以根據具體應用增加或減少。

不推薦將版本最大值設到一個很高的水平 (如, 成百或更多),除非老數據對你很重要。因爲這會導致存儲文件變得極大。

6.4.2.  最小版本數

和最大版本數一樣,最小版本數也是通過HColumnDescriptor 在每個列族中設置的。最小版本數缺省值是0,表示該特性禁用。 最小版本數參數和存活時間(TTL)一起使用,可以實現諸如“保存最近T秒內的數據,最多N個版本,但至少保留M個版本”(M是最小版本數,M<N)這樣的配置。 該參數僅在列族啓用了存活時間(TTL)時有效,且必須小於最大版本數。
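下面是通過 HColumnDescriptor 設置最大/最小版本數和存活時間的簡要示意(假設 admin 爲已創建的 HBaseAdmin,列族名 "cf"、表名 "myTable" 僅作示例;setMinVersions 需要所用版本支持最小版本數特性):

HColumnDescriptor cf = new HColumnDescriptor("cf");
cf.setMaxVersions(5);               // 最大版本數,缺省爲 3
cf.setTimeToLive(7 * 24 * 3600);    // 存活時間(TTL),單位爲秒
cf.setMinVersions(1);               // 最小版本數,僅在啓用 TTL 時有意義

HTableDescriptor desc = new HTableDescriptor("myTable");
desc.addFamily(cf);
admin.createTable(desc);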

6.5.  支持數據類型

HBase 通過 Put 和 Result支持 "bytes-in/bytes-out" 接口,所以任何可被轉爲字節數組的東西可以作爲值存入。輸入可以是字符串,數字,複雜對象,甚至圖像,只要他們能轉爲字節。

存在值的實際長度限制 (如 保存 10-50MB 對象到 HBase 可能對查詢來說太長); 搜索郵件列表獲取本話題的對話。 HBase的所有行都遵循 Chapter 5, 數據模型, 包括版本化。 設計時需考慮到這些,以及列族的塊大小。

6.5.1. 計數器

一種支持的數據類型,值得一提的是“計數器”(如, 具有原子遞增能力的數值)。參考 HTable的 Increment .

同步計數器在區域服務器中完成,不是客戶端。
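下面是計數器用法的簡要示意(假設 conf 爲已有配置,表 "counters"、列族 "cf"、列 "hits" 均爲示例):

HTable table = new HTable(conf, "counters");
long newValue = table.incrementColumnValue(
    Bytes.toBytes("row1"),   // 行鍵
    Bytes.toBytes("cf"),     // 列族
    Bytes.toBytes("hits"),   // 列名
    1L);                     // 原子遞增的增量
table.close();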

6.6. 聯合

如果有多個表,不要在模式設計中忘了 Section 5.11, “Joins” 的潛在因素。

6.7. 存活時間 (TTL)

列族可以設置TTL秒數,HBase 在超時後將自動刪除數據。影響 全部 行的全部版本 - 甚至當前版本。HBase裏面TTL 時間時區是 UTC.

參考 HColumnDescriptor 獲取更多信息。

6.8.  保留刪除的單元

列族可以選擇是否保留已刪除的單元。這就是說,只要 Get 或 Scan 操作指定的時間範圍在刪除生效的時間點之前結束,仍然可以讀到已刪除的單元。這樣即便存在刪除,也可以進行“時間點(point in time)”查詢。

已刪除的單元仍然受TTL控制,並且已刪除單元的數量不會超過“最大版本數”。新的 "raw" scan 選項可以返回所有已刪除的單元和刪除標記。

參考 HColumnDescriptor 獲取更多信息
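下面是一個簡要示意(假設所用版本的 HColumnDescriptor 提供 setKeepDeletedCells 方法、Scan 提供 setRaw 方法,0.92 之後大體如此):

// 建表時讓列族保留已刪除的單元
HColumnDescriptor cf = new HColumnDescriptor("cf");
cf.setKeepDeletedCells(true);

// "raw" scan:返回所有已刪除的單元及刪除標記
Scan scan = new Scan();
scan.setRaw(true);
scan.setMaxVersions();   // 同時取回所有版本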

6.9.  第二索引和替代查詢路徑

This section could also be titled "what if my table rowkey looks like this but I also want to query my table like that." A common example on the dist-list is where a row-key is of the format "user-timestamp" but there are reporting requirements on activity across users for certain time ranges. Thus, selecting by user is easy because it is in the lead position of the key, but time is not.

There is no single answer on the best way to handle this because it depends on...

  • Number of users
  • Data size and data arrival rate
  • Flexibility of reporting requirements (e.g., completely ad-hoc date selection vs. pre-configured ranges)
  • Desired execution speed of query (e.g., 90 seconds may be reasonable to some for an ad-hoc report, whereas it may be too long for others)

... and solutions are also influenced by the size of the cluster and how much processing power you have to throw at the solution. Common techniques are in sub-sections below. This is a comprehensive, but not exhaustive, list of approaches.

It should not be a surprise that secondary indexes require additional cluster space and processing. This is precisely what happens in an RDBMS because the act of creating an alternate index requires both space and processing cycles to update. RDBMS products are more advanced in this regard to handle alternative index management out of the box. However, HBase scales better at larger data volumes, so this is a feature trade-off.

Pay attention to Chapter 11, Performance Tuning when implementing any of these approaches.

Additionally, see the David Butler response in this dist-list thread HBase, mail # user - Stargate+hbase

6.9.1.  過濾查詢

根據具體應用,採用 Section 9.4, “Client Request Filters” 也許就足夠了。在這種情況下,不會創建任何第二索引。但是,不要從應用(如單線程客戶端)對大表執行這種全表掃描。

6.9.2.  定期更新第二索引

第二索引可以在另一個表中創建,並通過MapReduce任務定期更新。任務可以在一天內執行多次,但依賴於加載策略,仍可能與主表失去同步。

參考 Section 7.2.2, “HBase MapReduce Read/Write Example” 獲取更多信息.

6.9.3.  雙寫第二索引

Another strategy is to build the secondary index while publishing data to the cluster (e.g., write to data table, write to index table). If this is approach is taken after a data table already exists, then bootstrapping will be needed for the secondary index with a MapReduce job (see Section 6.9.2, “ Periodic-Update Secondary Index ”).

6.9.4.  總結表(Summary Tables)

對於時間跨度很長(e.g., 年度報表)且數據量巨大的場景,總結表(Summary Table)是常用的做法。可以通過MapReduce任務把總結結果生成到另一個表中。

參考 Section 7.2.4, “HBase MapReduce Summary to HBase Example” 獲取更多信息。

6.9.5.  協處理第二索引

協處理器的行爲類似 RDBMS 的觸發器。該特性在 0.92 中添加。更多信息參考 Section 9.6.3, “Coprocessors”

6.10. 模式(schema)設計對決

This section will describe common schema design questions that appear on the dist-list. These are general guidelines and not laws - each application must consider its own needs.

6.10.1. Rows vs. Versions

A common question is whether one should prefer rows or HBase's built-in-versioning. The context is typically where there are "a lot" of versions of a row to be retained (e.g., where it is significantly above the HBase default of 3 max versions). The rows-approach would require storing a timestamp in some portion of the rowkey so that they would not overwrite with each successive update.

Preference: Rows (generally speaking).

6.10.2. Rows vs. Columns

Another common question is whether one should prefer rows or columns. The context is typically in extreme cases of wide tables, such as having 1 row with 1 million attributes, or 1 million rows with 1 column apiece.

Preference: Rows (generally speaking). To be clear, this guideline is in the context of extremely wide cases, not in the standard use-case where one needs to store a few dozen or hundred columns.

6.11. 操作和性能配置選項

參考性能章節的 Section 11.6, “Schema Design”,獲取更多與操作和性能相關的模式設計選項,如布隆過濾器(Bloom Filters)、按表配置的分區大小(region size)、壓縮和塊大小(blocksize)等。

6.12. 限制

HBase currently supports 'constraints' in traditional (SQL) database parlance. The advised usage for Constraints is in enforcing business rules for attributes in the table (eg. make sure values are in the range 1-10). Constraints could also be used to enforce referential integrity, but this is strongly discouraged as it will dramatically decrease the write throughput of the tables where integrity checking is enabled. Extensive documentation on using Constraints can be found at: Constraint since version 0.94.

 

 

 

Chapter 7. HBase 和 MapReduce

關於 HBase 和 MapReduce詳見 javadocs. 下面是一些附加的幫助文檔. MapReduce的更多信息 (如,通用框架), 參考 Hadoop MapReduce Tutorial.

7.1. Map-Task 分割

7.1.1. 默認 HBase MapReduce 分割器(Splitter)

當 MapReduce 任務的HBase 表使用TableInputFormat爲數據源格式的時候,他的splitter會給這個table的每個region一個map。因此,如果一個table有100個region,就有100個map-tasks,不論需要scan多少個列族 。

7.1.2. 自定義分割器

For those interested in implementing custom splitters, see the method getSplits in TableInputFormatBase. That is where the logic for map-task assignment resides.

 

7.2. HBase MapReduce 例子

7.2.1. HBase MapReduce 讀取例子

下面是使用HBase 作爲源的MapReduce讀取示例。特別是僅有Mapper實例,沒有Reducer。Mapper什麼也不產生。

如下所示...

Configuration config = HBaseConfiguration.create();
Job job = new Job(config, "ExampleRead");
job.setJarByClass(MyReadJob.class); // class that contains mapper

Scan scan = new Scan();
scan.setCaching(500); // 1 is the default in Scan, which will be bad for MapReduce jobs
scan.setCacheBlocks(false); // don't set to true for MR jobs
// set other scan attrs
...

TableMapReduceUtil.initTableMapperJob(
tableName, // input HBase table name
scan, // Scan instance to control CF and attribute selection
MyMapper.class, // mapper
null, // mapper output key 
null, // mapper output value
job);
job.setOutputFormatClass(NullOutputFormat.class); // because we aren't emitting anything from mapper

boolean b = job.waitForCompletion(true);
if (!b) {
throw new IOException("error with job!");
}

...mapper需要繼承於TableMapper...

public class MyMapper extends TableMapper<Text, LongWritable> {

  public void map(ImmutableBytesWritable row, Result value, Context context)
      throws InterruptedException, IOException {
    // process data for the row from the Result instance.
  }
}

7.2.2. HBase MapReduce 讀/寫 示例

下面是使用HBase 作爲源和目標的MapReduce示例. 本示例簡單從一個表複製到另一個表。

Configuration config = HBaseConfiguration.create(); Job job = new Job(config,"ExampleReadWrite"); job.setJarByClass(MyReadWriteJob.class); // class that contains mapper Scan scan = new Scan(); scan.setCaching(500); // 1 is the default in Scan, which will be bad for MapReduce jobs scan.setCacheBlocks(false); // don't set to true for MR jobs // set other scan attrs TableMapReduceUtil.initTableMapperJob( sourceTable, // input table scan,	 // Scan instance to control CF and attribute selection MyMapper.class, // mapper class null,	 // mapper output key null,	 // mapper output value job); TableMapReduceUtil.initTableReducerJob( targetTable, // output table null, // reducer class job); job.setNumReduceTasks(0); boolean b = job.waitForCompletion(true); if (!b) { throw new IOException("error with job!"); }

需要說明 TableMapReduceUtil 在這裏做了什麼,特別是對 reducer:它把 TableOutputFormat 設置爲 outputFormat 類,並在 config 中設置了若干參數(e.g., TableOutputFormat.OUTPUT_TABLE),同時把 reducer 的輸出 key 設置爲 ImmutableBytesWritable,輸出 value 設置爲 Writable。這些都可以由程序員在 job 和 conf 中自行設置,但 TableMapReduceUtil 讓這件事變得更簡單。

下面是 mapper示例, 創建一個 Put,匹配輸入的 Result 並提交. Note: 這是 CopyTable 工具做的.

public static class MyMapper extends TableMapper<ImmutableBytesWritable, Put> {

  public void map(ImmutableBytesWritable row, Result value, Context context)
      throws IOException, InterruptedException {
    // this example is just copying the data from the source table...
    context.write(row, resultToPut(row, value));
  }

  private static Put resultToPut(ImmutableBytesWritable key, Result result) throws IOException {
    Put put = new Put(key.get());
    for (KeyValue kv : result.raw()) {
      put.add(kv);
    }
    return put;
  }
}

這裏並沒有真正的 reducer 步驟,由 TableOutputFormat 負責把 Put 發送到目標表。

這僅是示例, 開發者可以選擇不使用TableOutputFormat並自己連接到目標表。

7.2.3. HBase MapReduce Read/Write 多表輸出示例

TODO: MultiTableOutputFormat 示例.

7.2.4. HBase MapReduce 總結到 HBase 示例

下面是使用HBase 作爲源和目標的MapReduce示例,具有總結步驟。本示例計算一個表中值的個數,並將總結的計數輸出到另一個表。

Configuration config = HBaseConfiguration.create(); Job job = new Job(config,"ExampleSummary"); job.setJarByClass(MySummaryJob.class); // class that contains mapper and reducer Scan scan = new Scan(); scan.setCaching(500); // 1 is the default in Scan, which will be bad for MapReduce jobs scan.setCacheBlocks(false); // don't set to true for MR jobs // set other scan attrs TableMapReduceUtil.initTableMapperJob( sourceTable, // input table scan, // Scan instance to control CF and attribute selection MyMapper.class, // mapper class Text.class, // mapper output key IntWritable.class, // mapper output value job); TableMapReduceUtil.initTableReducerJob( targetTable, // output table MyTableReducer.class, // reducer class job); job.setNumReduceTasks(1); // at least one, adjust as required boolean b = job.waitForCompletion(true); if (!b) { throw new IOException("error with job!"); }

在本示例的mapper中,選取某一列的字符串值作爲需要總結的值。該值作爲 mapper 輸出的 key,IntWritable 則表示一次實例計數。

public static class MyMapper extends TableMapper<Text, IntWritable> {

  private final IntWritable ONE = new IntWritable(1);
  private Text text = new Text();

  public void map(ImmutableBytesWritable row, Result value, Context context)
      throws IOException, InterruptedException {
    String val = new String(value.getValue(Bytes.toBytes("cf"), Bytes.toBytes("attr1")));
    text.set(val);        // we can only emit Writables...
    context.write(text, ONE);
  }
}

在 reducer, "ones" 被統計 (和其他 MR 示例一樣), 產生一個 Put.

public static class MyTableReducer extends TableReducer<Text, IntWritable, ImmutableBytesWritable> {

  public void reduce(Text key, Iterable<IntWritable> values, Context context)
      throws IOException, InterruptedException {
    int i = 0;
    for (IntWritable val : values) {
      i += val.get();
    }
    Put put = new Put(Bytes.toBytes(key.toString()));
    put.add(Bytes.toBytes("cf"), Bytes.toBytes("count"), Bytes.toBytes(i));

    context.write(null, put);
  }
}

7.2.5. HBase MapReduce 總結到文件示例

This is very similar to the summary example above, with the exception that this example uses HBase as a MapReduce source but HDFS as the sink. The differences are in the job setup and in the reducer. The mapper remains the same.

Configuration config = HBaseConfiguration.create(); Job job = new Job(config,"ExampleSummaryToFile"); job.setJarByClass(MySummaryFileJob.class); // class that contains mapper and reducer Scan scan = new Scan(); scan.setCaching(500); // 1 is the default in Scan, which will be bad for MapReduce jobs scan.setCacheBlocks(false); // don't set to true for MR jobs // set other scan attrs TableMapReduceUtil.initTableMapperJob( sourceTable, // input table scan, // Scan instance to control CF and attribute selection MyMapper.class, // mapper class Text.class, // mapper output key IntWritable.class, // mapper output value job); job.setReducerClass(MyReducer.class); // reducer class job.setNumReduceTasks(1); // at least one, adjust as required FileOutputFormat.setOutputPath(job, new Path("/tmp/mr/mySummaryFile")); // adjust directories as required boolean b = job.waitForCompletion(true); if (!b) { throw new IOException("error with job!"); }
As stated above, the previous Mapper can run unchanged with this example. As for the Reducer, it is a "generic" Reducer instead of extending TableReducer and emitting Puts.
public static class MyReducer extends Reducer<Text, IntWritable, Text, IntWritable> {

  public void reduce(Text key, Iterable<IntWritable> values, Context context)
      throws IOException, InterruptedException {
    int i = 0;
    for (IntWritable val : values) {
      i += val.get();
    }
    context.write(key, new IntWritable(i));
  }
}

7.2.6. HBase MapReduce 無Reducer總結到 HBase

It is also possible to perform summaries without a reducer - if you use HBase as the reducer.

An HBase target table would need to exist for the job summary. The HTable method incrementColumnValue would be used to atomically increment values. From a performance perspective, it might make sense to keep a Map of values with their values to be incremented for each map-task, and make one update per key during the cleanup method of the mapper. However, your mileage may vary depending on the number of rows to be processed and unique keys.
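下面是在 mapper 中做本地聚合、並在 cleanup 中通過 incrementColumnValue 一次性寫入的簡要示意(目標表 "summaryTable"、列族 "cf"、計數列 "count"、源列 "attr1" 均爲假設,需配合 job.setNumReduceTasks(0) 使用):

public static class MySummingMapper extends TableMapper<Text, IntWritable> {

  private Map<String, Long> counts = new HashMap<String, Long>();
  private HTable summaryTable;

  @Override
  public void setup(Context context) throws IOException {
    summaryTable = new HTable(HBaseConfiguration.create(), "summaryTable");  // 假設的目標表
  }

  public void map(ImmutableBytesWritable row, Result value, Context context)
      throws IOException, InterruptedException {
    // 在內存 Map 中累加,避免每行都發一次 RPC
    String key = new String(value.getValue(Bytes.toBytes("cf"), Bytes.toBytes("attr1")));
    Long current = counts.get(key);
    counts.put(key, current == null ? 1L : current + 1);
  }

  @Override
  public void cleanup(Context context) throws IOException {
    // 每個 key 只做一次原子遞增
    for (Map.Entry<String, Long> e : counts.entrySet()) {
      summaryTable.incrementColumnValue(Bytes.toBytes(e.getKey()),
          Bytes.toBytes("cf"), Bytes.toBytes("count"), e.getValue());
    }
    summaryTable.close();
  }
}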

In the end, the summary results are in HBase.

7.2.7. HBase MapReduce 總結到 RDBMS

有時,把總結結果輸出到 RDBMS 更合適。這種情況下,可以通過一個自定義的 reducer 把總結直接寫入 RDBMS。setup 方法可以建立到 RDBMS 的連接(連接信息可以通過 context 的自定義參數傳遞),cleanup 方法可以關閉連接。

關鍵是要理解 job 的 reducer 數量會影響總結的實現,必須在 reducer 中對此進行設計:無論是一個 reducer 還是多個 reducer,這沒有對錯之分,取決於你的用例。要認識到,分配給 job 的 reducer 越多,建立的併發 RDBMS 連接就越多,這可以擴展,但只能擴展到一定程度。

public static class MyRdbmsReducer extends Reducer<Text, IntWritable, Text, IntWritable> {

  private Connection c = null;

  public void setup(Context context) {
    // create DB connection...
  }

  public void reduce(Text key, Iterable<IntWritable> values, Context context)
      throws IOException, InterruptedException {
    // do summarization
    // in this example the keys are Text, but this is just an example
  }

  public void cleanup(Context context) {
    // close db connection
  }
}

最後,總結的結果被寫入到 RDBMS 表.

7.3. 在一個MapReduce Job中訪問其他的HBase Tables

儘管現有的框架允許一個HBase table作爲一個MapReduce job的輸入,其他的HBase table可以同時作爲普通的表被訪問。例如在一個MapReduce的job中,可以在Mapper的setup方法中創建HTable實例。

public class MyMapper extends TableMapper<Text, LongWritable> {

  private HTable myOtherTable;

  @Override
  public void setup(Context context) {
    myOtherTable = new HTable("myOtherTable");
  }

  // ... 在 map() 中即可同時讀寫 myOtherTable ...
}

7.4. 預測執行

通常建議關閉針對HBase的MapReduce job的預測執行(speculative execution)功能。既可以按每個Job通過屬性配置,也可以對整個集羣配置。使用預測執行意味着重複的運算量,這通常不是你所希望的。

參考 Section 2.8.2.9, “Speculative Execution” for more information.

Chapter 8.  HBase安全

8.1. 安全客戶端訪問HBase

新版 HBase (>= 0.92) 支持客戶端可選 SASL 認證.

這裏描述如何設置HBase 和 HBase 客戶端,以安全連接到HBase 資源.

8.1.1. 先決條件

HBase 必須使用支持安全 Hadoop/HBase 的新 maven profile 來構建: -P security。Secure Hadoop dependent classes are separated under a pseudo-module in the security/ directory and are only included if built with the secure Hadoop profile.

You need to have a working Kerberos KDC.

A HBase configured for secure client access is expected to be running on top of a secured HDFS cluster. HBase must be able to authenticate to HDFS services. HBase needs Kerberos credentials to interact with the Kerberos-enabled HDFS daemons. Authenticating a service should be done using a keytab file. The procedure for creating keytabs for HBase service is the same as for creating keytabs for Hadoop. Those steps are omitted here. Copy the resulting keytab files to wherever HBase Master and RegionServer processes are deployed and make them readable only to the user account under which the HBase daemons will run.

A Kerberos principal has three parts, with the form username/fully.qualified.domain.name@YOUR-REALM.COM. We recommend using hbase as the username portion.

The following is an example of the configuration properties for Kerberos operation that must be added to the hbase-site.xml file on every server machine in the cluster. Required for even the most basic interactions with a secure Hadoop configuration, independent of HBase security.

<property>
  <name>hbase.regionserver.kerberos.principal</name>
  <value>hbase/_HOST@YOUR-REALM.COM</value>
</property>
<property>
  <name>hbase.regionserver.keytab.file</name>
  <value>/etc/hbase/conf/keytab.krb5</value>
</property>
<property>
  <name>hbase.master.kerberos.principal</name>
  <value>hbase/_HOST@YOUR-REALM.COM</value>
</property>
<property>
  <name>hbase.master.keytab.file</name>
  <value>/etc/hbase/conf/keytab.krb5</value>
</property>

Each HBase client user should also be given a Kerberos principal. This principal should have a password assigned to it (as opposed to a keytab file). The client principal's maxrenewlife should be set so that it can be renewed enough times for the HBase client process to complete. For example, if a user runs a long-running HBase client process that takes at most 3 days, we might create this user's principal within kadmin with: addprinc -maxrenewlife 3days

Long running daemons with indefinite lifetimes that require client access to HBase can instead be configured to log in from a keytab. For each host running such daemons, create a keytab with kadmin or kadmin.local. The procedure for creating keytabs for HBase service is the same as for creating keytabs for Hadoop. Those steps are omitted here. Copy the resulting keytab files to where the client daemon will execute and make them readable only to the user account under which the daemon will run.

8.1.2. 安全操作的服務器端配置

在集羣中每臺服務器的 hbase-site.xml 文件中增加下列內容:

<property>
  <name>hbase.security.authentication</name>
  <value>kerberos</value>
</property>
<property>
  <name>hbase.security.authorization</name>
  <value>true</value>
</property>
<property>
  <name>hbase.rpc.engine</name>
  <value>org.apache.hadoop.hbase.ipc.SecureRpcEngine</value>
</property>
<property>
  <name>hbase.coprocessor.region.classes</name>
  <value>org.apache.hadoop.hbase.security.token.TokenProvider</value>
</property>

A full shutdown and restart of HBase service is required when deploying these configuration changes.

8.1.3. 客戶端安全操作配置

每個客戶端增加下列內容到 hbase-site.xml:

<property>
  <name>hbase.security.authentication</name>
  <value>kerberos</value>
</property>
<property>
  <name>hbase.rpc.engine</name>
  <value>org.apache.hadoop.hbase.ipc.SecureRpcEngine</value>
</property>

The client environment must be logged in to Kerberos from KDC or keytab via the kinit command before communication with the HBase cluster will be possible.

Be advised that if the hbase.security.authentication and hbase.rpc.engine properties in the client- and server-side site files do not match, the client will not be able to communicate with the cluster.

Once HBase is configured for secure RPC it is possible to optionally configure encrypted communication. To do so, 在每個客戶端的 hbase-site.xml 文件中增加下列內容:

<property>
  <name>hbase.rpc.protection</name>
  <value>privacy</value>
</property>

This configuration property can also be set on a per connection basis. Set it in the Configuration supplied to HTable:

Configuration conf = HBaseConfiguration.create();
conf.set("hbase.rpc.protection", "privacy");
HTable table = new HTable(conf, tablename);

Expect a ~10% performance penalty for encrypted communication.

8.1.4. 客戶端安全操作配置 - Thrift 網關

每個Thrift網關增加下列內容到 hbase-site.xml:

<property>
  <name>hbase.thrift.keytab.file</name>
  <value>/etc/hbase/conf/hbase.keytab</value>
</property>
<property>
  <name>hbase.thrift.kerberos.principal</name>
  <value>$USER/_HOST@HADOOP.LOCALDOMAIN</value>
</property>

Substitute the appropriate credential and keytab for $USER and $KEYTAB respectively.

The Thrift gateway will authenticate with HBase using the supplied credential. No authentication will be performed by the Thrift gateway itself. All client access via the Thrift gateway will use the Thrift gateway's credential and have its privilege.

8.1.5. 客戶端安全操作配置 - REST 網關

每個REST網關增加下列內容到 hbase-site.xml:

<property>
  <name>hbase.rest.keytab.file</name>
  <value>$KEYTAB</value>
</property>
<property>
  <name>hbase.rest.kerberos.principal</name>
  <value>$USER/_HOST@HADOOP.LOCALDOMAIN</value>
</property>

Substitute the appropriate credential and keytab for $USER and $KEYTAB respectively.

The REST gateway will authenticate with HBase using the supplied credential. No authentication will be performed by the REST gateway itself. All client access via the REST gateway will use the REST gateway's credential and have its privilege.

It should be possible for clients to authenticate with the HBase cluster through the REST gateway in a pass-through manner via SPNEGO HTTP authentication. This is future work.

8.2. 訪問控制

Newer releases of HBase (>= 0.92) support optional access control list (ACL-) based protection of resources on a column family and/or table basis.

This describes how to set up Secure HBase for access control, with an example of granting and revoking user permission on table resources provided.

8.2.1. 先決條件

You must configure HBase for secure operation. Refer to the section "安全客戶端訪問HBase" and complete all of the steps described there.

You must also configure ZooKeeper for secure operation. Changes to ACLs are synchronized throughout the cluster using ZooKeeper. Secure authentication to ZooKeeper must be enabled or otherwise it will be possible to subvert HBase access control via direct client access to ZooKeeper. Refer to the section on secure ZooKeeper configuration and complete all of the steps described there.

8.2.2. 概述

With Secure RPC and Access Control enabled, client access to HBase is authenticated and user data is private unless access has been explicitly granted. Access to data can be granted at a table or per column family basis.

However, the following items have been left out of the initial implementation for simplicity:

  1. Row-level or per value (cell): This would require broader changes for storing the ACLs inline with rows. It is a future goal.

  2. Push down of file ownership to HDFS: HBase is not designed for the case where files may have different permissions than the HBase system principal. Pushing file ownership down into HDFS would necessitate changes to core code. Also, while HDFS file ownership would make applying quotas easy, and possibly make bulk imports more straightforward, it is not clear that it would offer a more secure setup.

  3. HBase managed "roles" as collections of permissions: We will not model "roles" internally in HBase to begin with. We instead allow group names to be granted permissions, which allows external modeling of roles via group membership. Groups are created and manipulated externally to HBase, via the Hadoop group mapping service.

Access control mechanisms are mature and fairly standardized in the relational database world. The HBase implementation approximates current convention, but HBase has a simpler feature set than relational databases, especially in terms of client operations. We don't distinguish between an insert (new record) and update (of existing record), for example, as both collapse down into a Put. Accordingly, the important operations condense to four permissions: READ, WRITE, CREATE, and ADMIN.

Operation To Permission Mapping

Permission    Operation
Read          Get, Exists, Scan
Write         Put, Delete, Lock/UnlockRow, IncrementColumnValue, CheckAndDelete/Put, Flush, Compact
Create        Create, Alter, Drop
Admin         Enable/Disable, Split, Major Compact, Grant, Revoke, Shutdown

Permissions can be granted in any of the following scopes, though CREATE and ADMIN permissions are effective only at table scope.

  • Table

    • Read: User can read from any column family in table

    • Write: User can write to any column family in table

    • Create: User can alter table attributes; add, alter, or drop column families; and drop the table.

    • Admin: User can alter table attributes; add, alter, or drop column families; and enable, disable, or drop the table. User can also trigger region (re)assignments or relocation.

  • Column Family

    • Read: User can read from the column family

    • Write: User can write to the column family

There is also an implicit global scope for the superuser.

The superuser is a principal, specified in the HBase site configuration file, that has equivalent access to HBase as the 'root' user would on a UNIX derived system. Normally this is the principal that the HBase processes themselves authenticate as. Although future versions of HBase Access Control may support multiple superusers, the superuser privilege will always include the principal used to run the HMaster process. Only the superuser is allowed to create tables, switch the balancer on or off, or take other actions with global consequence. Furthermore, the superuser has an implicit grant of all permissions to all resources.

Tables have a new metadata attribute: OWNER, the user principal who owns the table. By default this will be set to the user principal who creates the table, though it may be changed at table creation time or during an alter operation by setting or changing the OWNER table attribute. Only a single user principal can own a table at a given time. A table owner will have all permissions over a given table.

8.2.3. Server-side Configuration for Access Control

Enable the AccessController coprocessor in the cluster configuration and restart HBase. The restart can be a rolling one. Complete the restart of all Master and RegionServer processes before setting up ACLs.

To enable the AccessController, modify the hbase-site.xml file on every server machine in the cluster to look like:

<property>
  <name>hbase.coprocessor.master.classes</name>
  <value>org.apache.hadoop.hbase.security.access.AccessController</value>
</property>
<property>
  <name>hbase.coprocessor.region.classes</name>
  <value>org.apache.hadoop.hbase.security.token.TokenProvider,
  org.apache.hadoop.hbase.security.access.AccessController</value>
</property>

8.2.4. Shell Enhancements for Access Control

The HBase shell has been extended to provide simple commands for editing and updating user permissions. The following commands have been added for access control list management:

Grant

grant <user> <permissions> <table> [ <column family> [ <column qualifier> ] ]

<permissions> is zero or more letters from the set "RWCA": READ('R'), WRITE('W'), CREATE('C'), ADMIN('A').

Note: Grants and revocations of individual permissions on a resource are both accomplished using the grant command. A separate revoke command is also provided by the shell, but this is for fast revocation of all of a user's access rights to a given resource only.

Revoke

revoke <user> <table> [ <column family> [ <column qualifier> ] ]

Alter

The alter command has been extended to allow ownership assignment:

alter 'tablename', {OWNER => 'username'}

User Permission

The user_permission command shows all access permissions for the current user for a given table:

user_permission <table>

Chapter 9. 架構

9.1. 概述

9.1.1. NoSQL?

HBase是一種 "NoSQL" 數據庫. "NoSQL"是一個通用詞表示數據庫不是RDBMS ,後者支持 SQL 作爲主要訪問手段。有許多種 NoSQL 數據庫: BerkeleyDB 是本地 NoSQL 數據庫例子, 而 HBase 是大型分佈式數據庫。 技術上來說, HBase 更像是"數據存儲(Data Store)" 多於 "數據庫(Data Base)"。因爲缺少很多RDBMS特性, 如列類型,第二索引,觸發器,高級查詢語言等.

然而, HBase 有許多特徵同時支持線性化和模塊化擴充。 HBase 集羣通過增加RegionServer進行擴充,而RegionServer可以部署在普通的商用服務器上。例如,如果集羣從10個擴充到20個RegionServer,存儲空間和處理容量都同時翻倍。 RDBMS 也能很好擴充,但只能擴展到一定程度 - 特別是單個數據庫服務器的大小 - 同時,爲了更好的性能,需要特殊的硬件和存儲設備。 HBase 特性:

  • 強一致性讀寫: HBase 不是 "最終一致性(eventually consistent)" 數據存儲. 這讓它很適合高速計數聚合類任務。
  • 自動分片(Automatic sharding): HBase 表通過region分佈在集羣中。數據增長時,region會自動分割並重新分佈。
  • RegionServer 自動故障轉移
  • Hadoop/HDFS 集成: HBase 開箱即用地支持 HDFS 作爲它的分佈式文件系統。
  • MapReduce: HBase 通過MapReduce支持大併發處理, HBase 可以同時做源和目標.
  • Java 客戶端 API: HBase 支持易於使用的 Java API 進行編程訪問.
  • Thrift/REST API: HBase 也支持Thrift 和 REST 作爲非Java 前端.
  • Block Cache 和 Bloom Filters: 對於大容量查詢優化, HBase支持 Block Cache 和 Bloom Filters。
  • 運維管理: HBase提供內置網頁用於運維視角和JMX 度量.

9.1.2. 什麼時候用 HBase?

HBase不適合所有問題.

首先,確信有足夠多數據,如果有上億或上千億行數據,HBase是很好的備選。 如果只有上千或上百萬行,則用傳統的RDBMS可能是更好的選擇。因爲所有數據可以在一兩個節點保存,集羣其他節點可能閒置。

其次,確信可以不依賴所有RDBMS的額外特性 (e.g., 列數據類型, 第二索引, 事務,高級查詢語言等)。 一個建立在RDBMS上的應用,不能僅通過更換一個JDBC驅動就移植到HBase。相對於移植,需要把從RDBMS 到 HBase當成一次完全的重新設計。

第三, 確信你有足夠硬件。甚至 HDFS 在小於5個數據節點時,幹不好什麼事情 (根據如 HDFS 塊複製具有缺省值 3), 還要加上一個 NameNode.

HBase 能在單獨的筆記本上運行良好。但這應僅當成開發配置。

9.1.3.  HBase 和 Hadoop/HDFS 的區別?

HDFS 是分佈式文件系統,適合保存大文件。官方宣稱它並非普通用途文件系統,不提供文件的個別記錄的快速查詢。 另一方面,HBase基於HDFS且提供大表的記錄快速查找(和更新)。這有時可能引起概念混亂。 HBase 內部將數據放到索引好的 "存儲文件(StoreFiles)" ,以便高速查詢。存儲文件位於 HDFS中。參考Chapter 5,數據模型 和該章其他內容獲取更多HBase如何歸檔的信息。

9.2. 目錄表(Catalog Tables)

目錄表 -ROOT- 和 .META. 作爲 HBase 表存在。他們被HBase shell的 list 命令過濾掉了, 但他們和其他表一樣存在。

9.2.1. ROOT

-ROOT- 保存 .META. 表存在哪裏的蹤跡. -ROOT- 表結構如下:

Key:

  • .META. region key (.META.,,1)

Values:

  • info:regioninfo (序列化.META.的 HRegionInfo 實例 )
  • info:server ( 保存 .META.的RegionServer的server:port)
  • info:serverstartcode ( 保存 .META.的RegionServer進程的啓動時間)

9.2.2. META

.META. 保存系統中所有region列表。 .META.表結構如下:

Key:

  • Region key 格式 ([table],[region start key],[region id])

Values:

  • info:regioninfo (本 region 的序列化 HRegionInfo 實例)
  • info:server ( 保存本 region 的RegionServer的server:port)
  • info:serverstartcode ( 保存本 region 的RegionServer進程的啓動時間)

當表在分割過程中,會創建額外的兩列, info:splitA 和 info:splitB,代表兩個子(daughter) region。這兩列的值同樣是序列化的HRegionInfo 實例。region分割完畢後,這行會被刪除。

HRegionInfo的備註: 空 key 用於指示表的開始和結束。具有空開始鍵值的region是表內的首region。 如果 region 同時有空起始和結束key,說明它是表內的唯一region。

在需要編程訪問(希望不要)目錄元數據時,參考 Writables 工具.

9.2.3. 啓動時序

META 地址首先在ROOT 中設置。META 會更新 server 和 startcode 的值.

需要 region-RegionServer 分配信息, 參考 Section 9.7.2, “Region-RegionServer 分配”.

9.3. 客戶端

HBase客戶端的 HTable類負責尋找相應的RegionServer來處理行。它會先查詢 .META. 和 -ROOT- 目錄表,然後確定region的位置。定位到所需要的region後,客戶端會直接去訪問相應的region(不經過master),發起讀寫請求。這些信息會緩存在客戶端,這樣後續請求就不必每次都去查詢。如果一個region已經失效(原因可能是master做了load balance或者RegionServer掛了),客戶端就會重新執行這個步驟,確定要去訪問的新的地址。

參考 Section 9.5.2, “Runtime Impact” for more information about the impact of the Master on HBase Client communication.

管理集羣操作是經由HBaseAdmin發起的

9.3.1. 連接

關於連接的配置信息,參見Section 3.7, “連接HBase集羣的客戶端配置和依賴”.

HTable不是線程安全的。建議使用同一個HBaseConfiguration實例來創建HTable實例。這樣可以共享ZooKeeper和socket實例。例如,最好這樣做:

HBaseConfiguration conf = HBaseConfiguration.create();
HTable table1 = new HTable(conf, "myTable");
HTable table2 = new HTable(conf, "myTable");

而不是這樣:

HBaseConfiguration conf1 = HBaseConfiguration.create();
HTable table1 = new HTable(conf1, "myTable");
HBaseConfiguration conf2 = HBaseConfiguration.create();
HTable table2 = new HTable(conf2, "myTable");

如果你想知道的更多的關於HBase客戶端connection的知識,可以參照: HConnectionManager.

9.3.1.1. 連接池

對需要高端多線程訪問的應用 (如網頁服務器或應用服務器需要在一個JVM服務很多應用線程),參考 HTablePool.
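下面是 HTablePool 用法的簡要示意(假設使用 0.92/0.94 的 API,conf 爲已有配置,表名 "myTable" 僅作示例):

HTablePool pool = new HTablePool(conf, 10);            // 池中最多緩存 10 個 HTable 實例
HTableInterface table = pool.getTable("myTable");
try {
  Result r = table.get(new Get(Bytes.toBytes("row1")));
  // ... 使用結果 ...
} finally {
  table.close();   // 0.92+ 中 close() 會把實例歸還到池中;較老版本則使用 pool.putTable(table)
}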

 

9.3.2. 寫緩衝和批量操作

若關閉了HTable中的 Section 11.7.4, “AutoFlush”,Put操作會在寫緩衝填滿的時候向RegionServer發起請求。默認情況下,寫緩衝是2MB。在HTable實例被丟棄之前,要調用close()或flushCommits()操作,這樣寫緩衝中的數據就不會丟失。

要想更好地細粒度控制 Put和Delete的批量操作,可以參考HTable中的batch 方法。
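下面是寫緩衝與顯式提交的簡要示意(假設 conf 爲已有配置,表 "myTable"、列族 "cf" 僅作示例):

HTable table = new HTable(conf, "myTable");
table.setAutoFlush(false);                   // 關閉自動 flush,啓用客戶端寫緩衝
table.setWriteBufferSize(4 * 1024 * 1024);   // 可選:把寫緩衝從默認 2MB 調到 4MB

Put put = new Put(Bytes.toBytes("row1"));
put.add(Bytes.toBytes("cf"), Bytes.toBytes("attr1"), Bytes.toBytes("value1"));
table.put(put);        // 此時可能只寫入了客戶端緩衝

table.flushCommits();  // 顯式把緩衝中的修改發送到 RegionServer
table.close();         // close() 同樣會先執行 flushCommits()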

9.3.3. 外部客戶端

關於非Java客戶端和定製協議信息,在 Chapter 10, 外部 API

9.3.4. 行鎖

行鎖 在客戶端 API中仍然存在, 但是 不鼓勵使用,因爲管理不好,會鎖定整個RegionServer.

參考 ticket HBASE-2332,該ticket旨在從客戶端移除此特性。

 

9.4. 客戶端請求過濾器

Get 和 Scan 實例可以用 filters 配置,以應用於 RegionServer.

過濾器可能會搞混,因爲有很多類型的過濾器, 最好通過理解過濾器功能組來了解他們。

9.4.1. 結構(Structural)過濾器

結構過濾器包含其他過濾器

9.4.1.1. FilterList

FilterList 代表一個過濾器列表,過濾器間具有 FilterList.Operator.MUST_PASS_ALL 或 FilterList.Operator.MUST_PASS_ONE 關係。下面示例展示兩個過濾器的'或'關係(檢查同一屬性的'my value' 或'my other value' ).

FilterList list = new FilterList(FilterList.Operator.MUST_PASS_ONE);
SingleColumnValueFilter filter1 = new SingleColumnValueFilter(
    cf,
    column,
    CompareOp.EQUAL,
    Bytes.toBytes("my value")
    );
list.add(filter1);
SingleColumnValueFilter filter2 = new SingleColumnValueFilter(
    cf,
    column,
    CompareOp.EQUAL,
    Bytes.toBytes("my other value")
    );
list.add(filter2);
scan.setFilter(list);

9.4.2. 列值

9.4.2.1. SingleColumnValueFilter

SingleColumnValueFilter 用於測試列值相等 (CompareOp.EQUAL ), 不等 (CompareOp.NOT_EQUAL),或範圍 (e.g., CompareOp.GREATER). 下面示例檢查列值和字符串'my value' 相等...

SingleColumnValueFilter filter = new SingleColumnValueFilter(
    cf,
    column,
    CompareOp.EQUAL,
    Bytes.toBytes("my value")
    );
scan.setFilter(filter);

9.4.3. 列值比較器

過濾器包內有好幾種比較器類需要特別提及。這些比較器和其他過濾器一起使用, 如 Section 9.4.2.1, “SingleColumnValueFilter”.

9.4.3.1. RegexStringComparator

RegexStringComparator 支持值比較的正則表達式 。

RegexStringComparator comp = new RegexStringComparator("my.");   // any value that starts with 'my'
SingleColumnValueFilter filter = new SingleColumnValueFilter(
    cf,
    column,
    CompareOp.EQUAL,
    comp
    );
scan.setFilter(filter);

參考 Oracle JavaDoc 瞭解 supported RegEx patterns in Java.

9.4.3.2. SubstringComparator

SubstringComparator 用於檢測一個子串是否存在於值中。大小寫不敏感。

SubstringComparator comp = new SubstringComparator("y val");   // looking for 'my value'
SingleColumnValueFilter filter = new SingleColumnValueFilter(
    cf,
    column,
    CompareOp.EQUAL,
    comp
    );
scan.setFilter(filter);

9.4.3.3. BinaryPrefixComparator

參考 BinaryPrefixComparator.

9.4.3.4. BinaryComparator

參考 BinaryComparator.

9.4.4. 鍵值元數據

由於HBase 內部以鍵值對(KeyValue)的形式保存數據,鍵值元數據過濾器用於評估一行中某個鍵(即 ColumnFamily:Column qualifier)是否存在,而上一節的過濾器針對的是值。

9.4.4.1. FamilyFilter

FamilyFilter 用於過濾列族。 通常,在Scan中選擇ColumnFamily優於在過濾器中做。

9.4.4.2. QualifierFilter

QualifierFilter 用於基於列名(即 Qualifier)過濾.
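下面是 QualifierFilter 的簡要示意(列名 "myQualifier" 僅爲假設,scan 爲已有的 Scan 實例):

QualifierFilter filter = new QualifierFilter(
    CompareOp.EQUAL,
    new BinaryComparator(Bytes.toBytes("myQualifier")));
scan.setFilter(filter);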

9.4.4.3. ColumnPrefixFilter

ColumnPrefixFilter 可基於列名(即Qualifier)前綴過濾。

A ColumnPrefixFilter seeks ahead to the first column matching the prefix in each row and for each involved column family. It can be used to efficiently get a subset of the columns in very wide rows.

Note: The same column qualifier can be used in different column families. This filter returns all matching columns.

Example: Find all columns in a row and family that start with "abc"

HTableInterface t = ...;
byte[] row = ...;
byte[] family = ...;
byte[] prefix = Bytes.toBytes("abc");
Scan scan = new Scan(row, row); // (optional) limit to one row
scan.addFamily(family);         // (optional) limit to one family
Filter f = new ColumnPrefixFilter(prefix);
scan.setFilter(f);
scan.setBatch(10);              // set this if there could be many columns returned
ResultScanner rs = t.getScanner(scan);
for (Result r = rs.next(); r != null; r = rs.next()) {
  for (KeyValue kv : r.raw()) {
    // each kv represents a column
  }
}
rs.close();

9.4.4.4. MultipleColumnPrefixFilter

MultipleColumnPrefixFilter 和 ColumnPrefixFilter 行爲差不多,但可以指定多個前綴。

Like ColumnPrefixFilter, MultipleColumnPrefixFilter efficiently seeks ahead to the first column matching the lowest prefix and also seeks past ranges of columns between prefixes. It can be used to efficiently get discontinuous sets of columns from very wide rows.

Example: Find all columns in a row and family that start with "abc" or "xyz"

HTableInterface t = ...;
byte[] row = ...;
byte[] family = ...;
byte[][] prefixes = new byte[][] {Bytes.toBytes("abc"), Bytes.toBytes("xyz")};
Scan scan = new Scan(row, row); // (optional) limit to one row
scan.addFamily(family);         // (optional) limit to one family
Filter f = new MultipleColumnPrefixFilter(prefixes);
scan.setFilter(f);
scan.setBatch(10);              // set this if there could be many columns returned
ResultScanner rs = t.getScanner(scan);
for (Result r = rs.next(); r != null; r = rs.next()) {
  for (KeyValue kv : r.raw()) {
    // each kv represents a column
  }
}
rs.close();

9.4.4.5. ColumnRangeFilter

ColumnRangeFilter 可以對一行內的列進行高效掃描。

A ColumnRangeFilter can seek ahead to the first matching column for each involved column family. It can be used to efficiently get a 'slice' of the columns of a very wide row. i.e. you have a million columns in a row but you only want to look at columns bbbb-bbdd.

Note: The same column qualifier can be used in different column families. This filter returns all matching columns.

Example: Find all columns in a row and family between "bbbb" (inclusive) and "bbdd" (inclusive)

HTableInterface t = ...;
byte[] row = ...;
byte[] family = ...;
byte[] startColumn = Bytes.toBytes("bbbb");
byte[] endColumn = Bytes.toBytes("bbdd");
Scan scan = new Scan(row, row); // (optional) limit to one row
scan.addFamily(family);         // (optional) limit to one family
Filter f = new ColumnRangeFilter(startColumn, true, endColumn, true);
scan.setFilter(f);
scan.setBatch(10);              // set this if there could be many columns returned
ResultScanner rs = t.getScanner(scan);
for (Result r = rs.next(); r != null; r = rs.next()) {
  for (KeyValue kv : r.raw()) {
    // each kv represents a column
  }
}
rs.close();

Note: HBase 0.92 引入

9.4.5. RowKey

9.4.5.1. RowFilter

通常認爲行選擇時Scan採用 startRow/stopRow 方法比較好。然而 RowFilter 也可以用。
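下面是 RowFilter 配合 RegexStringComparator 的簡要示意(行鍵前綴 "user-" 僅爲假設,scan 爲已有的 Scan 實例):

RowFilter filter = new RowFilter(
    CompareOp.EQUAL,
    new RegexStringComparator("user-.*"));   // 匹配以 "user-" 開頭的行鍵
scan.setFilter(filter);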

9.4.6. Utility

9.4.6.1. FirstKeyOnlyFilter

This is primarily used for rowcount jobs. 參考 FirstKeyOnlyFilter.

9.5. 主服務器

HMaster is the implementation of the Master Server. The Master server is responsible for monitoring all RegionServer instances in the cluster, and is the interface for all metadata changes. In a distributed cluster, the Master typically runs on the Section 9.9.1, “NameNode”.

9.5.1. Startup Behavior

If run in a multi-Master environment, all Masters compete to run the cluster. If the active Master loses its lease in ZooKeeper (or the Master shuts down), then the remaining Masters jostle to take over the Master role.

9.5.2. Runtime Impact

A common dist-list question is what happens to an HBase cluster when the Master goes down. Because the HBase client talks directly to the RegionServers, the cluster can still function in a "steady state." Additionally, per Section 9.2, “Catalog Tables” ROOT and META exist as HBase tables (i.e., are not resident in the Master). However, the Master controls critical functions such as RegionServer failover and completing region splits. So while the cluster can still run for a time without the Master, the Master should be restarted as soon as possible.

9.5.3. Interface

The methods exposed by HMasterInterface are primarily metadata-oriented methods:

  • Table (createTable, modifyTable, removeTable, enable, disable)
  • ColumnFamily (addColumn, modifyColumn, removeColumn)
  • Region (move, assign, unassign)

For example, when the HBaseAdmin method disableTable is invoked, it is serviced by the Master server.

9.5.4. 進程

Master 後臺運行幾種線程:

9.5.4.1. LoadBalancer

Periodically, and when there are not any regions in transition, a load balancer will run and move regions around to balance cluster load. 參考Section 2.8.3.1, “Balancer” for configuring this property.

參考 Section 9.7.2, “Region-RegionServer Assignment” for more information on region assignment.

9.5.4.2. CatalogJanitor

Periodically checks and cleans up the .META. table. 參考 Section 9.2.2, “META” for more information on META.

9.6. RegionServer

HRegionServer is the RegionServer implementation. It is responsible for serving and managing regions. In a distributed cluster, a RegionServer runs on a Section 9.9.2, “DataNode”.

9.6.1. 接口

The methods exposed by HRegionInterface contain both data-oriented and region-maintenance methods:

  • Data (get, put, delete, next, etc.)
  • Region (splitRegion, compactRegion, etc.)

For example, when the HBaseAdmin method majorCompact is invoked on a table, the client is actually iterating through all regions for the specified table and requesting a major compaction directly to each region.

9.6.2. 進程

RegionServer 後臺運行幾種線程:

9.6.2.1. CompactSplitThread

檢查分割並處理次(minor)壓縮。

9.6.2.2. MajorCompactionChecker

檢查主壓縮。

9.6.2.3. MemStoreFlusher

週期將寫到內存存儲的內容刷到文件存儲。

9.6.2.4. LogRoller

週期檢查RegionServer 的 HLog.

9.6.3. 協處理器

協處理器在0.92版添加。 有一個詳細帖子 Blog Overview of CoProcessors 供參考。文檔最終會放到本參考手冊,但該blog是當前能獲取的大部分信息。

9.6.4. 塊緩存

9.6.4.1. 設計

The Block Cache is an LRU cache that contains three levels of block priority to allow for scan-resistance and in-memory ColumnFamilies:

  • Single access priority: The first time a block is loaded from HDFS it normally has this priority and it will be part of the first group to be considered during evictions. The advantage is that scanned blocks are more likely to get evicted than blocks that are getting more usage.
  • Multi access priority: If a block in the previous priority group is accessed again, it upgrades to this priority. It is thus part of the second group considered during evictions.
  • In-memory access priority: If the block's family was configured to be "in-memory", it will be part of this priority disregarding the number of times it was accessed. Catalog tables are configured like this. This group is the last one considered during evictions.

For more information, see the LruBlockCache source

9.6.4.2. 使用

Block caching is enabled by default for all the user tables which means that any read operation will load the LRU cache. This might be good for a large number of use cases, but further tunings are usually required in order to achieve better performance. An important concept is the working set size, or WSS, which is: "the amount of memory needed to compute the answer to a problem". For a website, this would be the data that's needed to answer the queries over a short amount of time.

The way to calculate how much memory is available in HBase for caching is:

number of region servers * heap size * hfile.block.cache.size * 0.85

The default value for the block cache is 0.25 which represents 25% of the available heap. The last value (85%) is the default acceptable loading factor in the LRU cache after which eviction is started. The reason it is included in this equation is that it would be unrealistic to say that it is possible to use 100% of the available memory since this would make the process blocking from the point where it loads new blocks. Here are some examples:

  • One region server with the default heap size (1GB) and the default block cache size will have 217MB of block cache available.
  • 20 region servers with the heap size set to 8GB and a default block cache size will have 34GB of block cache.
  • 100 region servers with the heap size set to 24GB and a block cache size of 0.5 will have about 1TB of block cache.

Your data isn't the only resident of the block cache, here are others that you may have to take into account:

  • Catalog tables: The -ROOT- and .META. tables are forced into the block cache and have the in-memory priority which means that they are harder to evict. The former never uses more than a few hundreds of bytes while the latter can occupy a few MBs (depending on the number of regions).
  • HFiles indexes: HFile is the file format that HBase uses to store data in HDFS and it contains a multi-layered index in order to seek to the data without having to read the whole file. The size of those indexes is a factor of the block size (64KB by default), the size of your keys and the amount of data you are storing. For big data sets it's not unusual to see numbers around 1GB per region server, although not all of it will be in cache because the LRU will evict indexes that aren't used.
  • Keys: Taking into account only the values that are being stored is missing half the picture since every value is stored along with its keys (row key, family, qualifier, and timestamp). 參考 Section 6.3.2, “Try to minimize row and column sizes”.
  • Bloom filters: Just like the HFile indexes, those data structures (when enabled) are stored in the LRU.

Currently the recommended way to measure HFile indexes and bloom filters sizes is to look at the region server web UI and checkout the relevant metrics. For keys, sampling can be done by using the HFile command line tool and look for the average key size metric.

It's generally bad to use block caching when the WSS doesn't fit in memory. This is the case when you have for example 40GB available across all your region servers' block caches but you need to process 1TB of data. One of the reasons is that the churn generated by the evictions will trigger more garbage collections unnecessarily. Here are two use cases:

  • Fully random reading pattern: This is a case where you almost never access the same row twice within a short amount of time such that the chance of hitting a cached block is close to 0. Setting block caching on such a table is a waste of memory and CPU cycles, more so that it will generate more garbage to pick up by the JVM. For more information on monitoring GC, see Section 12.2.3, “JVM Garbage Collection Logs”.
  • Mapping a table: In a typical MapReduce job that takes a table in input, every row will be read only once so there's no need to put them into the block cache. The Scan object has the option of turning this off via the setCacheBlocks method (set it to false). You can still keep block caching turned on on this table if you need fast random read access. An example would be counting the number of rows in a table that serves live traffic, caching every block of that table would create massive churn and would surely evict data that's currently in use.

9.6.5. 預寫日誌 (WAL)

9.6.5.1. Purpose

每個RegionServer會將更新(Puts, Deletes)先記錄到預寫日誌(WAL)中,然後再將其寫入Section 9.7.5, “Store”的Section 9.7.5.1, “MemStore”裏面。這樣就保證了HBase寫操作的可靠性。如果沒有WAL,當RegionServer宕掉的時候,MemStore還沒有flush,StoreFile還沒有保存,數據就會丟失。HLog 是HBase的一個WAL實現,一個RegionServer有一個HLog實例。

WAL 保存在HDFS 的 /hbase/.logs/ 目錄裏面,按RegionServer劃分子目錄。

要想知道更多的信息,可以訪問維基百科 Write-Ahead Log 的文章.

 

9.6.5.2. WAL Flushing

TODO (describe).

9.6.5.3. WAL Splitting

9.6.5.3.1. 當RegionServer宕掉的時候,如何恢復

When a RegionServer crashes, it will lose its ephemeral lease in ZooKeeper...TODO

9.6.5.3.2. hbase.hlog.split.skip.errors

默認設置爲 true,在split執行中發生的任何錯誤會被記錄,有問題的WAL會被移動到HBase rootdir目錄下的.corrupt目錄,接着進行處理。如果設置爲 false,異常會被拋出,split會記錄錯誤。[23]

9.6.5.3.3. 如何處理一個髮生在當RegionServers' WALs 分割時候的EOFExceptions異常

如果我們在分割日誌的時候發生EOF,就是hbase.hlog.split.skip.errors設置爲 false,我們也會進行處理。一個EOF會發生在一行一行讀取Log,但是Log中最後一行似乎只寫了一半就停止了。如果在處理過程中發生了EOF,我們還會繼續處理,除非這個文件是要處理的最後一個文件。[24]



[23] 參考 HBASE-2958 When hbase.hlog.split.skip.errors is set to false, we fail the split but thats it. We need to do more than just fail split if this flag is set.

[24] 要想知道背景知識, 參見HBASE-2643 Figure how to deal with eof splitting logs

9.7. 區域

區域(Region)是表可用性和分佈的基本元素,由每個列族的一個存儲(Store)組成。對象層級圖如下:

Table       (HBase table)
    Region       (Regions for the table)
        Store         (Store per ColumnFamily for each Region for the table)
            MemStore      (MemStore for each Store for each Region for the table)
            StoreFile     (StoreFiles for each Store for each Region for the table)
                Block         (Blocks within a StoreFile within a Store for each Region for the table)

關於HBase文件寫到HDFS的描述,參考 Section 12.7.2, “瀏覽 HDFS的 HBase 對象”.

9.7.1. Region 大小

Region的大小是一個棘手的問題,需要考量如下幾個因素。

  • Regions是可用性和分佈式的最基本單位

  • HBase通過將region切分在許多機器上實現分佈式。也就是說,你如果有16GB的數據,只分了2個region, 你卻有20臺機器,有18臺就浪費了。

  • region數目太多就會造成性能下降,現在比以前好多了。但是對於同樣大小的數據,700個region比3000個要好。

  • region數目太少就會妨礙可擴展性,降低並行能力。有的時候導致壓力不夠分散。這就是爲什麼,你向一個10節點的HBase集羣導入200MB的數據,大部分的節點是idle的。

  • RegionServer中1個region和10個region索引需要的內存量沒有太多的差別。

最好是使用默認的配置,可以把熱的表配小一點(或者受到split熱點的region把壓力分散到集羣中)。如果你的cell的大小比較大(100KB或更大),就可以把region的大小調到1GB。

參考 Section 2.5.2.6, “更大區域” 獲取配置更多信息.

9.7.2. 區域-區域服務器分配

本節描述區域如何分配到區域服務器。

9.7.2.1. 啓動

當HBase啓動時,區域分配如下(短版本):

  1. 啓動時主服務器調用AssignmentManager.
  2. AssignmentManager 在META 中查找已經存在的區域分配。
  3. 如果區域分配還有效(如 RegionServer 還在線) ,那麼分配繼續保持。
  4. 如果區域分配失效,LoadBalancerFactory 被調用來分配區域。 DefaultLoadBalancer 將隨機分配區域到RegionServer.
  5. META 隨 RegionServer 分配更新(如果需要) , RegionServer 啓動區域開啓代碼(RegionServer 啓動時進程)

9.7.2.2. 故障轉移

當區域服務器出故障退出時 (短版本):

  1. 區域立即不可獲取,因爲區域服務器退出。
  2. 主服務器會檢測到區域服務器退出。
  3. 區域分配會失效並被重新分配,如同啓動時序。

9.7.2.3. 區域負載均衡

區域可以定期移動,見 Section 9.5.4.1, “LoadBalancer”.

9.7.3. 區域-區域服務器本地化

Over time, Region-RegionServer locality is achieved via HDFS block replication. The HDFS client does the following by default when choosing locations to write replicas:

  1. First replica is written to local node
  2. Second replica is written to another node in same rack
  3. Third replica is written to a node in another rack (if sufficient nodes)

Thus, HBase eventually achieves locality for a region after a flush or a compaction. In a RegionServer failover situation a RegionServer may be assigned regions with non-local StoreFiles (because none of the replicas are local), however as new data is written in the region, or the table is compacted and StoreFiles are re-written, they will become "local" to the RegionServer.

For more information, see HDFS Design on Replica Placement and also Lars George's blog on HBase and HDFS locality.

9.7.4. 區域分割

區域服務器的分割操作是不可見的,因爲Master不會參與其中。區域服務器切割region的步驟是,先將該region下線,然後切割,將其子region加入到META元信息中,再將他們加入到原本的區域服務器中,最後彙報Master.參見Section 2.8.2.7, “管理 Splitting”來手動管理切割操作(以及爲何這麼做)。

9.7.4.1. 自定義分割策略

缺省分割策略可以被重寫,採用自定義RegionSplitPolicy (HBase 0.94+).一般自定義分割策略應該擴展HBase的缺省分割策略: ConstantSizeRegionSplitPolicy.

策略可以HBaseConfiguration 全局使用,或基於每張表:

HTableDescriptor myHtd = ...;
myHtd.setValue(HTableDescriptor.SPLIT_POLICY, MyCustomSplitPolicy.class.getName());

9.7.5. 存儲

一個存儲(Store)包含一個內存存儲(MemStore)和若干文件存儲(StoreFile,即HFile)。一個Store對應某個region中的一個列族。

9.7.5.1. MemStore

MemStore是Store中的內存存儲,保存待寫入的修改內容,修改的內容是KeyValue。當flush的時候,現有的MemStore會生成快照,然後清空。在執行快照期間,HBase會繼續接收修改操作,寫入新的MemStore,直到flush完成。

9.7.5.2. StoreFile (HFile)

文件存儲是數據存在的地方。

9.7.5.2.1. HFile Format

hfile文件格式是基於BigTable [2006]論文中的SSTable。構建在Hadoop的tfile上面(直接使用了tfile的單元測試和壓縮工具)。 Schubert Zhang 的博客HFile: A Block-Indexed File Format to Store Sorted Key-Value Pairs詳細介紹了HBases的hfile。Matteo Bertozzi也做了詳細的介紹HBase I/O: HFile

For more information, see the HFile source code. Also see Appendix E, HFile format version 2 for information about the HFile v2 format that was included in 0.92.

9.7.5.2.2. HFile 工具

要想看到hfile內容的文本化版本,你可以使用org.apache.hadoop.hbase.io.hfile.HFile 工具。可以這樣用:

$ ${HBASE_HOME}/bin/hbase org.apache.hadoop.hbase.io.hfile.HFile

例如,你想看文件 hdfs://10.81.47.41:9000/hbase/TEST/1418428042/DSMP/4759508618286845475的內容, 就執行如下的命令:

$ ${HBASE_HOME}/bin/hbase org.apache.hadoop.hbase.io.hfile.HFile -v -f hdfs://10.81.47.41:9000/hbase/TEST/1418428042/DSMP/4759508618286845475

如果你沒有輸入-v,就僅僅能看到一個hfile的彙總信息。其他功能的用法可以看HFile的文檔。

9.7.5.2.3. StoreFile Directory Structure on HDFS

For more information of what StoreFiles look like on HDFS with respect to the directory structure, see Section 12.7.2, “Browsing HDFS for HBase Objects”.

9.7.5.3. Blocks

StoreFiles are composed of blocks. The blocksize is configured on a per-ColumnFamily basis.

Compression happens at the block level within StoreFiles. For more information on compression, see Appendix C, Compression In HBase.

For more information on blocks, see the HFileBlock source code.

9.7.5.4. KeyValue

The KeyValue class is the heart of data storage in HBase. KeyValue wraps a byte array and takes offsets and lengths into passed array at where to start interpreting the content as KeyValue.

The KeyValue format inside a byte array is:

  • keylength
  • valuelength
  • key
  • value

The Key is further decomposed as:

  • rowlength
  • row (i.e., the rowkey)
  • columnfamilylength
  • columnfamily
  • columnqualifier
  • timestamp
  • keytype (e.g., Put, Delete, DeleteColumn, DeleteFamily)

KeyValue instances are not split across blocks. For example, if there is an 8 MB KeyValue, even if the block-size is 64kb this KeyValue will be read in as a coherent block. For more information, see the KeyValue source code.

9.7.5.4.1. Example

To emphasize the points above, examine what happens with two Puts for two different columns for the same row:

  • Put #1: rowkey=row1, cf:attr1=value1
  • Put #2: rowkey=row1, cf:attr2=value2

Even though these are for the same row, a KeyValue is created for each column:

Key portion for Put #1:

  • rowlength ------------> 4
  • row -----------------> row1
  • columnfamilylength ---> 2
  • columnfamily --------> cf
  • columnqualifier ------> attr1
  • timestamp -----------> server time of Put
  • keytype -------------> Put

Key portion for Put #2:

  • rowlength ------------> 4
  • row -----------------> row1
  • columnfamilylength ---> 2
  • columnfamily --------> cf
  • columnqualifier ------> attr2
  • timestamp -----------> server time of Put
  • keytype -------------> Put

It is critical to understand that the rowkey, ColumnFamily, and column (aka columnqualifier) are embedded within the KeyValue instance. The longer these identifiers are, the bigger the KeyValue is.

9.7.5.5. 壓縮

有兩種類型的壓縮:次(minor)壓縮和主(major)壓縮。minor壓縮通常會將數個小的相鄰的存儲文件合併成一個大的。minor壓縮不會刪除打上刪除標記的數據,也不會刪除過期的數據,major壓縮纔會刪除這些數據。有些時候minor壓縮會選中一個store中的全部文件,這時它實際上就是一次major壓縮。關於minor壓縮如何選擇文件,可以參見ascii diagram in the Store source code.

在執行一次major壓縮之後,一個store只會有一個storefile,通常情況下這樣可以提升性能。注意:major壓縮會將store中的數據全部重寫,在一個負載很大的系統中,這個開銷是很大的。所以在大型系統中,通常會自己Section 2.8.2.8, “管理 Compaction”。

壓縮 不會 進行分區合併。參考 Section 14.2.2, “Merge” 獲取更多合併的信息。

9.7.5.5.1. Compaction File Selection

To understand the core algorithm for StoreFile selection, there is some ASCII-art in the Store source code that will serve as useful reference. It has been copied below:

/* normal skew:
 *
 *         older ----> newer
 *     _
 *    | |   _
 *    | |  | |   _
 *  --|-|- |-|- |-|---_-------_-------  minCompactSize
 *    | |  | |  | |  | |  _  | |
 *    | |  | |  | |  | | | | | |
 *    | |  | |  | |  | | | | | | */

Important knobs:

  • hbase.store.compaction.ratio Ratio used in compaction file selection algorithm (default 1.2f).
  • hbase.hstore.compaction.min (.90 hbase.hstore.compactionThreshold) (files) Minimum number of StoreFiles per Store to be selected for a compaction to occur (default 2).
  • hbase.hstore.compaction.max (files) Maximum number of StoreFiles to compact per minor compaction (default 10).
  • hbase.hstore.compaction.min.size (bytes) Any StoreFile smaller than this setting will automatically be a candidate for compaction. Defaults to hbase.hregion.memstore.flush.size (128 mb).
  • hbase.hstore.compaction.max.size (.92) (bytes) Any StoreFile larger than this setting will automatically be excluded from compaction (default Long.MAX_VALUE).

The minor compaction StoreFile selection logic is size based, and selects a file for compaction when the file <= sum(smaller_files) * hbase.hstore.compaction.ratio.

9.7.5.5.2. Minor Compaction File Selection - Example #1 (Basic Example)

This example mirrors an example from the unit test TestCompactSelection.

  • hbase.store.compaction.ratio = 1.0f
  • hbase.hstore.compaction.min = 3 (files)
  • hbase.hstore.compaction.max = 5 (files)
  • hbase.hstore.compaction.min.size = 10 (bytes)
  • hbase.hstore.compaction.max.size = 1000 (bytes)

The following StoreFiles exist: 100, 50, 23, 12, and 12 bytes apiece (oldest to newest). With the above parameters, the files that would be selected for minor compaction are 23, 12, and 12.

Why?

  • 100 --> No, because sum(50, 23, 12, 12) * 1.0 = 97.
  • 50 --> No, because sum(23, 12, 12) * 1.0 = 47.
  • 23 --> Yes, because sum(12, 12) * 1.0 = 24.
  • 12 --> Yes, because the previous file has been included, and because this does not exceed the max-file limit of 5
  • 12 --> Yes, because the previous file had been included, and because this does not exceed the max-file limit of 5.

9.7.5.5.3. Minor Compaction File Selection - Example #2 (Not Enough Files To Compact)

This example mirrors an example from the unit test TestCompactSelection.

  • hbase.store.compaction.ratio = 1.0f
  • hbase.hstore.compaction.min = 3 (files)
  • hbase.hstore.compaction.max = 5 (files)
  • hbase.hstore.compaction.min.size = 10 (bytes)
  • hbase.hstore.compaction.max.size = 1000 (bytes)

The following StoreFiles exist: 100, 25, 12, and 12 bytes apiece (oldest to newest). With the above parameters, no compaction will be started.

Why?

  • 100 --> No, because sum(25, 12, 12) * 1.0 = 47
  • 25 --> No, because sum(12, 12) * 1.0 = 24
  • 12 --> No. Candidate because sum(12) * 1.0 = 12, there are only 2 files to compact and that is less than the threshold of 3
  • 12 --> No. Candidate because the previous StoreFile was, but there are not enough files to compact

9.7.5.5.4. Minor Compaction File Selection - Example #3 (Limiting Files To Compact)

This example mirrors an example from the unit test TestCompactSelection.

  • hbase.store.compaction.ratio = 1.0f
  • hbase.hstore.compaction.min = 3 (files)
  • hbase.hstore.compaction.max = 5 (files)
  • hbase.hstore.compaction.min.size = 10 (bytes)
  • hbase.hstore.compaction.max.size = 1000 (bytes)

The following StoreFiles exist: 7, 6, 5, 4, 3, 2, and 1 bytes apiece (oldest to newest). With the above parameters, the files that would be selected for minor compaction are 7, 6, 5, 4, 3.

Why?

  • 7 --> Yes, because sum(6, 5, 4, 3, 2, 1) * 1.0 = 21. Also, 7 is less than the min-size
  • 6 --> Yes, because sum(5, 4, 3, 2, 1) * 1.0 = 15. Also, 6 is less than the min-size.
  • 5 --> Yes, because sum(4, 3, 2, 1) * 1.0 = 10. Also, 5 is less than the min-size.
  • 4 --> Yes, because sum(3, 2, 1) * 1.0 = 6. Also, 4 is less than the min-size.
  • 3 --> Yes, because sum(2, 1) * 1.0 = 3. Also, 3 is less than the min-size.
  • 2 --> No. Candidate because previous file was selected and 2 is less than the min-size, but the max-number of files to compact has been reached.
  • 1 --> No. Candidate because previous file was selected and 1 is less than the min-size, but max-number of files to compact has been reached.

9.7.5.5.5. Impact of Key Configuration Options

hbase.hstore.compaction.ratio. A large ratio (e.g., 10) will produce a single giant file. Conversely, a value of .25 will produce behavior similar to the BigTable compaction algorithm, resulting in 4 StoreFiles.

hbase.hstore.compaction.min.size. Because this limit represents the "automatic include" limit for all StoreFiles smaller than this value, this value may need to be adjusted downwards in write-heavy environments where many 1 or 2 mb StoreFiles are being flushed, because every file will be targeted for compaction and the resulting files may still be under the min-size and require further compaction, etc.




9.8. 批量裝載(Bulk Loading)

9.8.1. 概述

HBase 有好幾種方法將數據裝載到表。最直接的方式既可以通過MapReduce任務,也可以使用普通的客戶端API。但這些都不是高效的方法。

批量裝載特性採用 MapReduce 任務,將表數據輸出爲HBase的內部數據格式,然後可以將產生的存儲文件直接裝載到運行的集羣中。批量裝載比簡單使用 HBase API 消耗更少的CPU和網絡資源。

9.8.2. 批量裝載架構

HBase 批量裝載過程包含兩個主要步驟。

9.8.2.1. 通過MapReduce 任務準備數據

批量裝載第一步,從MapReduce任務通過HFileOutputFormat產生HBase數據文件(StoreFiles) 。輸出數據爲HBase的內部數據格式,以便隨後裝載到集羣更高效。

爲了高效裝載,HFileOutputFormat 必須被配置爲使每個輸出的HFile都落在單個分區(region)之內。爲此,輸出將被批量裝載到HBase的作業會使用Hadoop 的TotalOrderPartitioner 類,把map輸出劃分爲互不相交的鍵空間區間,對應於表中各個分區(region)的鍵範圍。

HFileOutputFormat 包含一個方便的函數 configureIncrementalLoad(),可以基於表當前的分區邊界自動設置 TotalOrderPartitioner。
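下面是一個示意性的作業配置片段(僅爲說明用法;其中的 MyBulkLoadMapper、輸入/輸出路徑和表名 mytable 均爲假設,需按實際情況替換):

Configuration conf = HBaseConfiguration.create();
Job job = new Job(conf, "prepare-bulkload");              // 作業名爲假設
job.setJarByClass(MyBulkLoadMapper.class);
job.setMapperClass(MyBulkLoadMapper.class);               // 假設的 Mapper,輸出 (ImmutableBytesWritable, KeyValue)
job.setMapOutputKeyClass(ImmutableBytesWritable.class);
job.setMapOutputValueClass(KeyValue.class);
FileInputFormat.addInputPath(job, new Path("/user/todd/input"));       // 輸入路徑(假設)
FileOutputFormat.setOutputPath(job, new Path("/user/todd/myoutput"));  // 產生 StoreFiles 的輸出目錄
HTable table = new HTable(conf, "mytable");               // 目標表
// 基於表當前的分區邊界,自動設置 TotalOrderPartitioner 及相應的輸出格式
HFileOutputFormat.configureIncrementalLoad(job, table);
job.waitForCompletion(true);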

9.8.2.2. 完成數據裝載

After the data has been prepared using HFileOutputFormat, it is loaded into the cluster using completebulkload. This command line tool iterates through the prepared data files, and for each one determines the region the file belongs to. It then contacts the appropriate Region Server which adopts the HFile, moving it into its storage directory and making the data available to clients.

If the region boundaries have changed during the course of bulk load preparation, or between the preparation and completion steps, the completebulkload utility will automatically split the data files into pieces corresponding to the new boundaries. This process is not optimally efficient, so users should take care to minimize the delay between preparing a bulk load and importing it into the cluster, especially if other clients are simultaneously loading data through other means.

9.8.3. 採用completebulkload 工具導入準備的數據

After a data import has been prepared, either by using the importtsv tool with the "importtsv.bulk.output" option or by some other MapReduce job using the HFileOutputFormat, the completebulkload tool is used to import the data into the running cluster.

The completebulkload tool simply takes the output path where importtsv or your MapReduce job put its results, and the table name to import into. For example:

$ hadoop jar hbase-VERSION.jar completebulkload [-c /path/to/hbase/config/hbase-site.xml] /user/todd/myoutput mytable

The -c config-file option can be used to specify a file containing the appropriate hbase parameters (e.g., hbase-site.xml) if not supplied already on the CLASSPATH (In addition, the CLASSPATH must contain the directory that has the zookeeper configuration file if zookeeper is NOT managed by HBase).

Note: If the target table does not already exist in HBase, this tool will create the table automatically.

This tool will run quickly, after which point the new data will be visible in the cluster.

9.8.4. 參考

For more information about the referenced utilities, see Section 14.1.9, “ImportTsv” and Section 14.1.10, “CompleteBulkLoad”.

9.8.5. 高級使用

Although the importtsv tool is useful in many cases, advanced users may want to generate data programmatically, or import data from other formats. To get started doing so, dig into ImportTsv.java and check the JavaDoc for HFileOutputFormat.

The import step of the bulk load can also be done programmatically. 參考 the LoadIncrementalHFiles class for more information.
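For illustration, a minimal sketch of the programmatic import step, assuming the LoadIncrementalHFiles API of the 0.92/0.94 line (the table name and output path are placeholders):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles;

public class BulkLoadSketch {
  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    HTable table = new HTable(conf, "mytable");              // target table (placeholder name)
    LoadIncrementalHFiles loader = new LoadIncrementalHFiles(conf);
    // Walks the prepared HFiles under the output directory and hands each one
    // to the RegionServer hosting the corresponding region.
    loader.doBulkLoad(new Path("/user/todd/myoutput"), table);
    table.close();
  }
}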

9.9. HDFS

由於 HBase 在 HDFS 上運行(每個存儲文件也被寫爲HDFS的文件),必須理解 HDFS 結構,特別是它如何存儲文件、如何處理故障轉移以及如何複製數據塊。

參考 Hadoop 文檔 HDFS Architecture 獲取更多信息。

9.9.1. NameNode

NameNode 負責維護文件系統元數據。參考上述HDFS結構鏈接獲取更多信息。

9.9.2. DataNode

DataNode 負責存儲HDFS 塊。 參考上述HDFS結構鏈接獲取更多信息。

 


[23] 參考 HBASE-2958 When hbase.hlog.split.skip.errors is set to false, we fail the split but thats it. We need to do more than just fail split if this flag is set.

[25] For description of the development process -- why static blooms rather than dynamic -- and for an overview of the unique properties that pertain to blooms in HBase, as well as possible future directions, see the Development Process section of the document BloomFilters in HBase attached to HBase-1200.

[26] The bloom filters described here are actually version two of blooms in HBase. In versions up to 0.19.x, HBase had a dynamic bloom option based on work done by the European Commission One-Lab Project 034819. The core of the HBase bloom work was later pulled up into Hadoop to implement org.apache.hadoop.io.BloomMapFile. Version 1 of HBase blooms never worked that well. Version 2 is a rewrite from scratch though again it starts with the one-lab work.

Chapter 10. 外部 API

This chapter will cover access to HBase either through non-Java languages, or through custom protocols.

10.1. 非Java 語言和 JVM 交互

當前本話題大部分文檔在 HBase Wiki. 參考 Thrift API Javadoc.

10.2. REST

當前 REST大部分文檔在 HBase Wiki on REST.

10.3. Thrift

當前 Thrift大部分文檔在 HBase Wiki on Thrift.

10.3.1. 過濾器語言

10.3.1.1. 用例

注意: 本特性在 HBase 0.92 中加入。

This allows the user to perform server-side filtering when accessing HBase over Thrift. The user specifies a filter via a string. The string is parsed on the server to construct the filter

10.3.1.2. 通用過濾字符串語法

A simple filter expression is expressed as: “FilterName (argument, argument, ... , argument)”

You must specify the name of the filter followed by the argument list in parentheses. Commas separate the individual arguments.

If the argument represents a string, it should be enclosed in single quotes.

If it represents a boolean, an integer or a comparison operator like <, >, != etc. it should not be enclosed in quotes

The filter name must be one word. All ASCII characters are allowed except for whitespace, single quotes and parentheses.

The filter’s arguments can contain any ASCII character. If single quotes are present in the argument, they must be escaped by a preceding single quote

10.3.1.3. Compound Filters and Operators

Currently, two binary operators – AND/OR and two unary operators – WHILE/SKIP are supported.

Note: the operators are all in uppercase

AND – as the name suggests, if this operator is used, the key-value must pass both the filters

OR – as the name suggests, if this operator is used, the key-value must pass at least one of the filters

SKIP – For a particular row, if any of the key-values don’t pass the filter condition, the entire row is skipped

WHILE - For a particular row, it continues to emit key-values until a key-value is reached that fails the filter condition

Compound Filters: Using these operators, a hierarchy of filters can be created. For example: “(Filter1 AND Filter2) OR (Filter3 AND Filter4)”

10.3.1.4. Order of Evaluation

Parentheses have the highest precedence. The SKIP and WHILE operators are next and have the same precedence. The AND operator has the next highest precedence, followed by the OR operator.

For example:

A filter string of the form:“Filter1 AND Filter2 OR Filter3” will be evaluated as:“(Filter1 AND Filter2) OR Filter3”

A filter string of the form:“Filter1 AND SKIP Filter2 OR Filter3” will be evaluated as:“(Filter1 AND (SKIP Filter2)) OR Filter3”

10.3.1.5. 比較運算符

比較運算符可以是下面之一:

  1. LESS (<)

  2. LESS_OR_EQUAL (<=)

  3. EQUAL (=)

  4. NOT_EQUAL (!=)

  5. GREATER_OR_EQUAL (>=)

  6. GREATER (>)

  7. NO_OP (no operation)

客戶端應該使用 (<, <=, =, !=, >, >=) 來表達比較操作.

10.3.1.6. 比較器(Comparator)

比較器可以是下面之一:

  1. BinaryComparator - This lexicographically compares against the specified byte array using Bytes.compareTo(byte[], byte[])

  2. BinaryPrefixComparator - This lexicographically compares against a specified byte array. It only compares up to the length of this byte array.

  3. RegexStringComparator - This compares against the specified byte array using the given regular expression. Only EQUAL and NOT_EQUAL comparisons are valid with this comparator

  4. SubStringComparator - This tests if the given substring appears in a specified byte array. The comparison is case insensitive. Only EQUAL and NOT_EQUAL comparisons are valid with this comparator

The general syntax of a comparator is: ComparatorType:ComparatorValue

The ComparatorType for the various comparators is as follows:

  1. BinaryComparator - binary

  2. BinaryPrefixComparator - binaryprefix

  3. RegexStringComparator - regexstring

  4. SubStringComparator - substring

The ComparatorValue can be any value.

Example1: >, 'binary:abc' will match everything that is lexicographically greater than "abc"

Example2: =, 'binaryprefix:abc' will match everything whose first 3 characters are lexicographically equal to "abc"

Example3: !=, 'regexstring:ab*yz' will match everything that doesn't begin with "ab" and ends with "yz"

Example4: =, 'substring:abc123' will match everything that begins with the substring "abc123"

10.3.1.7.  PHP 客戶端編程使用過濾器示例

<?
$_SERVER['PHP_ROOT'] = realpath(dirname(__FILE__).'/..');
require_once $_SERVER['PHP_ROOT'].'/flib/__flib.php';
flib_init(FLIB_CONTEXT_SCRIPT);
require_module('storage/hbase');
$hbase = new HBase('<server_name_running_thrift_server>', <port on which thrift server is running>);
$hbase->open();
$client = $hbase->getClient();
$result = $client->scannerOpenWithFilterString('table_name',
    "(PrefixFilter ('row2') AND (QualifierFilter (>=, 'binary:xyz'))) AND (TimestampsFilter ( 123, 456))");
$to_print = $client->scannerGetList($result,1);
while ($to_print) {
  print_r($to_print);
  $to_print = $client->scannerGetList($result,1);
}
$client->scannerClose($result);
?>

10.3.1.8. 過濾字符串示例

  • “PrefixFilter (‘Row’) AND PageFilter (1) AND FirstKeyOnlyFilter ()” will return all key-value pairs that match the following conditions:

    1) The row containing the key-value should have prefix “Row”

    2) The key-value must be located in the first row of the table

    3) The key-value pair must be the first key-value in the row

  • “(RowFilter (=, ‘binary:Row 1’) AND TimeStampsFilter (74689, 89734)) OR ColumnRangeFilter (‘abc’, true, ‘xyz’, false)” will return all key-value pairs that match both the following conditions:

    1) The key-value is in a row having row key “Row 1”

    2) The key-value must have a timestamp of either 74689 or 89734.

    Or it must match the following condition:

    1) The key-value pair must be in a column that is lexicographically >= abc and < xyz 

  • “SKIP ValueFilter (0)” will skip the entire row if any of the values in the row is not 0

10.3.1.9. 獨有過濾器語法

  1. KeyOnlyFilter

    Description: This filter doesn’t take any arguments. It returns only the key component of each key-value.

    Syntax: KeyOnlyFilter ()

    Example: "KeyOnlyFilter ()"

  2. FirstKeyOnlyFilter

    Description: This filter doesn’t take any arguments. It returns only the first key-value from each row.

    Syntax: FirstKeyOnlyFilter ()

    Example: "FirstKeyOnlyFilter ()"

  3. PrefixFilter

    Description: This filter takes one argument – a prefix of a row key. It returns only those key-values present in a row that starts with the specified row prefix

    Syntax: PrefixFilter (‘<row_prefix>’)

    Example: "PrefixFilter (‘Row’)"

  4. ColumnPrefixFilter

    Description: This filter takes one argument – a column prefix. It returns only those key-values present in a column that starts with the specified column prefix. The column prefix must be of the form: “qualifier”

    Syntax:ColumnPrefixFilter(‘<column_prefix>’)

    Example: "ColumnPrefixFilter(‘Col’)"

  5. MultipleColumnPrefixFilter

    Description: This filter takes a list of column prefixes. It returns key-values that are present in a column that starts with any of the specified column prefixes. Each of the column prefixes must be of the form: “qualifier”

    Syntax:MultipleColumnPrefixFilter(‘<column_prefix>’, ‘<column_prefix>’, …, ‘<column_prefix>’)

    Example: "MultipleColumnPrefixFilter(‘Col1’, ‘Col2’)"

  6. ColumnCountGetFilter

    Description: This filter takes one argument – a limit. It returns the first limit number of columns in the table

    Syntax: ColumnCountGetFilter (‘<limit>’)

    Example: "ColumnCountGetFilter (4)"

  7. PageFilter

    Description: This filter takes one argument – a page size. It returns page size number of rows from the table.

    Syntax: PageFilter (‘<page_size>’)

    Example: "PageFilter (2)"

  8. ColumnPaginationFilter

    Description: This filter takes two arguments – a limit and offset. It returns limit number of columns after offset number of columns. It does this for all the rows

    Syntax: ColumnPaginationFilter(‘<limit>’, ‘<offset>’)

    Example: "ColumnPaginationFilter (3, 5)"

  9. InclusiveStopFilter

    Description: This filter takes one argument – a row key on which to stop scanning. It returns all key-values present in rows up to and including the specified row

    Syntax: InclusiveStopFilter(‘<stop_row_key>’)

    Example: "InclusiveStopFilter ('Row2')"

  10. TimeStampsFilter

    Description: This filter takes a list of timestamps. It returns those key-values whose timestamps matches any of the specified timestamps

    Syntax: TimeStampsFilter (<timestamp>, <timestamp>, ... ,<timestamp>)

    Example: "TimeStampsFilter (5985489, 48895495, 58489845945)"

  11. RowFilter

    Description: This filter takes a compare operator and a comparator. It compares each row key with the comparator using the compare operator and if the comparison returns true, it returns all the key-values in that row

    Syntax: RowFilter (<compareOp>, ‘<row_comparator>’)

    Example: "RowFilter (<=, ‘xyz)"

  12. FamilyFilter

    Description: This filter takes a compare operator and a comparator. It compares each column family name with the comparator using the compare operator and if the comparison returns true, it returns all the key-values in that column family

    Syntax: FamilyFilter (<compareOp>, ‘<family_comparator>’)

    Example: "FamilyFilter (=, ‘FamilyA’)"

  13. QualifierFilter

    Description: This filter takes a compare operator and a comparator. It compares each qualifier name with the comparator using the compare operator and if the comparison returns true, it returns all the key-values in that column

    Syntax: QualifierFilter (<compareOp>,‘<qualifier_comparator>’)

    Example: "QualifierFilter (=,‘Column1’)"

  14. ValueFilter

    Description: This filter takes a compare operator and a comparator. It compares each value with the comparator using the compare operator and if the comparison returns true, it returns that key-value

    Syntax: ValueFilter (<compareOp>,‘<value_comparator>’)

    Example: "ValueFilter (!=, ‘Value’)"

  15. DependentColumnFilter

    Description: This filter takes two arguments – a family and a qualifier. It tries to locate this column in each row and returns all key-values in that row that have the same timestamp. If the row doesn’t contain the specified column – none of the key-values in that row will be returned.

    The filter can also take an optional boolean argument – dropDependentColumn. If set to true, the column we were depending on doesn’t get returned.

    The filter can also take two more additional optional arguments – a compare operator and a value comparator, which are further checks in addition to the family and qualifier. If the dependent column is found, its value should also pass the value check and then only is its timestamp taken into consideration

    Syntax: DependentColumnFilter (‘<family>’, ‘<qualifier>’, <boolean>, <compare operator>, ‘<value comparator>’)

    Syntax: DependentColumnFilter (‘<family>’, ‘<qualifier>’, <boolean>)

    Syntax: DependentColumnFilter (‘<family>’, ‘<qualifier>’)

    Example: "DependentColumnFilter (‘conf’, ‘blacklist’, false, >=, ‘zebra’)"

    Example: "DependentColumnFilter (‘conf’, 'blacklist', true)"

    Example: "DependentColumnFilter (‘conf’, 'blacklist')"

  16. SingleColumnValueFilter

    Description: This filter takes a column family, a qualifier, a compare operator and a comparator. If the specified column is not found – all the columns of that row will be emitted. If the column is found and the comparison with the comparator returns true, all the columns of the row will be emitted. If the condition fails, the row will not be emitted.

    This filter also takes two additional optional boolean arguments – filterIfColumnMissing and setLatestVersionOnly

    If the filterIfColumnMissing flag is set to true the columns of the row will not be emitted if the specified column to check is not found in the row. The default value is false.

    If the setLatestVersionOnly flag is set to false, it will test previous versions (timestamps) too. The default value is true.

    These flags are optional; if used, you must set either both or neither.

    Syntax: SingleColumnValueFilter(<compare operator>, ‘<comparator>’, ‘<family>’, ‘<qualifier>’,<filterIfColumnMissing_boolean>, <latest_version_boolean>)

    Syntax: SingleColumnValueFilter(<compare operator>, ‘<comparator>’, ‘<family>’, ‘<qualifier>)

    Example: "SingleColumnValueFilter (<=, ‘abc’,‘FamilyA’, ‘Column1’, true, false)"

    Example: "SingleColumnValueFilter (<=, ‘abc’,‘FamilyA’, ‘Column1’)"

  17. SingleColumnValueExcludeFilter

    Description: This filter takes the same arguments and behaves same as SingleColumnValueFilter – however, if the column is found and the condition passes, all the columns of the row will be emitted except for the tested column value.

    Syntax: SingleColumnValueExcludeFilter(<compare operator>, '<comparator>', '<family>', '<qualifier>',<latest_version_boolean>, <filterIfColumnMissing_boolean>)

    Syntax: SingleColumnValueExcludeFilter(<compare operator>, '<comparator>', '<family>', '<qualifier>')

    Example: "SingleColumnValueExcludeFilter (‘<=’, ‘abc’,‘FamilyA’, ‘Column1’, ‘false’, ‘true’)"

    Example: "SingleColumnValueExcludeFilter (‘<=’, ‘abc’, ‘FamilyA’, ‘Column1’)"

  18. ColumnRangeFilter

    Description: This filter is used for selecting only those keys with columns that are between minColumn and maxColumn. It also takes two boolean variables to indicate whether to include the minColumn and maxColumn or not.

    If you don’t want to set the minColumn or the maxColumn – you can pass in an empty argument.

    Syntax: ColumnRangeFilter (‘<minColumn>’, <minColumnInclusive_bool>, ‘<maxColumn>’, <maxColumnInclusive_bool>)

    Example: "ColumnRangeFilter (‘abc’, true, ‘xyz’, false)"

10.4. C/C++ Apache HBase Client

Facebook的 Chip Turner 寫了個純 C/C++ 客戶端。 Check it out.

Chapter 11. 性能調優

11.1. 操作系統

11.1.1. 內存

RAM, RAM, RAM. 不要餓着 HBase.

11.1.2. 64-bit

使用 64-bit 平臺(和64-bit JVM).

11.1.3. 交換區

小心交換(swap)。建議將 swappiness 設爲 0。

11.2. 網絡

要避免網絡問題降低Hadoop和HBase的性能,最重要的因素也許就是所使用的交換機硬件。在項目範圍內,集羣規模翻倍或三倍甚至更多時,早期的決定可能導致嚴重的問題。

要考慮的重要事項:

  • 設備交換機容量
  • 系統連接數量
  • 上行容量

11.2.1. 單交換機

單交換機配置最重要的因素,是硬件的交換容量,即處理所有連接到交換機的系統所產生流量的能力。一些低價的商用硬件,相對於全交換(full backplane)的交換機,交換容量較低。

11.2.2. 多交換機

多交換機在系統結構中是潛在陷阱。低價硬件最常見的配置是用 1Gbps 上行鏈路連接到另一個交換機。這個常被忽略的窄帶寬點很容易成爲集羣通訊的瓶頸。特別是MapReduce任務通過該上行鏈路同時讀寫大量數據時,會導致鏈路飽和。

緩解該問題很簡單,可以通過多種途徑完成:

  • 針對要創建的集羣容量,採用合適硬件。
  • 採用更大單交換機配置,如單48 端口相較2x 24 端口爲優。
  • 配置上行端口聚合(port trunking)來利用多網絡接口增加交換機帶寬。(譯者注:port trunk:
    將交換機上的多個端口在物理上連接起來,在邏輯上捆綁在一起,形成一個擁有較大帶寬的端口,組成一個幹路,以達到平衡負載和提供備份線路,擴充帶寬的目的。 )

11.2.3. 多機架

多機架配置帶來多交換機同樣的潛在問題。導致性能降低的原因主要來自兩個方面:

  • 較低的交換機容量性能
  • 到其他機架的上行鏈路不足

如果機架上的交換機有合適的交換容量,可以處理所有主機全速通信,那麼下一個最可能出現的問題,就是集羣跨機架部署帶來的機架間通信問題。避免跨機架問題最簡單的辦法,是採用端口聚合來創建到其他機架的捆綁上行鏈路。然而該方法的缺點,是佔用了本可以接主機的端口。舉例:從機架A到機架B創建 8Gbps 端口通道,要佔用24個端口中的8個來做機架互連,ROI(投資回報率)很低;而採用太少的端口又意味着無法充分發揮集羣的能力。

機架間採用 10GbE 鏈接將極大提高性能。確保交換機支持 10GbE 上行鏈路或支持擴展卡,後者可以讓你把端口留給機器使用,而不是都用在上行鏈路上。

11.2.4. 網絡接口

所有網絡接口功能正常嗎?你確定?參考故障診斷用例:Section 13.3.1, “Case Study #1 (Performance Issue On A Single Node)”.

 

可以從 wiki Performance Tuning看起。這個文檔講了一些主要的影響性能的方面:RAM, 壓縮, JVM 設置, 等等。然後,可以看看下面的補充內容。

打開RPC-level日誌

在區域服務器打開RPC-level的日誌對於深度的優化是有好處的。一旦打開,日誌將噴涌而出。所以不建議長時間打開,只能看一小段時間。要想啓用RPC-level的日誌,可以使用區域服務器 UI點擊Log Level。將 org.apache.hadoop.ipc 的日誌級別設爲DEBUG。然後tail 區域服務器的日誌,進行分析。

要想關閉,只要把日誌級別設爲INFO就可以了.

11.3. Java

11.3.1. 垃圾收集和HBase

11.3.1.1. 長時間GC停頓

在這個PPT Avoiding Full GCs with MemStore-Local Allocation Buffers 中,Todd Lipcon描述了在HBase中常見的兩種stop-the-world的GC操作,尤其是在loading的時候。一種是CMS失敗的模式(譯者注:CMS是一種GC的算法),另一種是老一代的堆碎片導致的。要想解決第一種問題,只要將CMS執行的時間提前就可以了,加入-XX:CMSInitiatingOccupancyFraction參數,把值調低。可以先從60%和70%開始(這個值調的越低,觸發的GC次數就越多,消耗的CPU時間就越長)。要想解決第二種問題,Todd加入了一個實驗性的功能,在HBase 0.90.x中這個是要明確指定的(在0.92.x中,這個是默認項),將你的Configuration中的hbase.hregion.memstore.mslab.enabled設置爲true。詳細信息,可以看這個PPT.

11.4. 配置

參見Section 2.8.2, “推薦的配置”.

11.4.1. Regions的數目

HBase中region的數目可以根據Section 3.6.5, “更大的 Regions”調整.也可以參見 Section 12.3.1, “Region大小”

11.4.2. 管理壓縮

對於大型的系統,你需要考慮管理壓縮和分割

11.4.3. hbase.regionserver.handler.count

參見hbase.regionserver.handler.count.這個參數的本質是設置一個RegionServer可以同時處理多少請求。 如果定的太高,吞吐量反而會降低;如果定的太低,請求會被阻塞,得不到響應。你可以打開RPC-level日誌讀Log,來決定對於你的集羣什麼值是合適的。(請求隊列也是會消耗內存的)

11.4.4. hfile.block.cache.size

參見 hfile.block.cache.size. 對於區域服務器進程的內存設置。

11.4.5. hbase.regionserver.global.memstore.upperLimit

參見 hbase.regionserver.global.memstore.upperLimit. 這個內存設置是根據區域服務器的需要來設定。

11.4.6. hbase.regionserver.global.memstore.lowerLimit

參見 hbase.regionserver.global.memstore.lowerLimit. 這個內存設置是根據區域服務器的需要來設定。

11.4.7. hbase.hstore.blockingStoreFiles

參見hbase.hstore.blockingStoreFiles. 如果區域服務器的日誌中出現因StoreFile過多而阻塞(blocking)的信息,提高這個值是有幫助的。

11.4.8. hbase.hregion.memstore.block.multiplier

參見 hbase.hregion.memstore.block.multiplier. 如果有足夠的RAM,提高這個值。

11.5. ZooKeeper

配置ZooKeeper信息,請參考 Section 2.5, “ZooKeeper”  , 參看關於使用專用磁盤部分。

11.6. 模式設計

11.6.1.  列族的數目

參見 Section 6.2, “ On the number of column families ”.

11.6.2. 鍵和屬性長度

參考 Section 6.3.2, “Try to minimize row and column sizes”. 參考 also Section 11.6.7.1, “However...” for compression caveats.

11.6.3. 表的區域大小

The regionsize can be set on a per-table basis via setFileSize on HTableDescriptor in the event where certain tables require different regionsizes than the configured default regionsize.

參考 Section 11.4.1, “Number of Regions” for more information.

11.6.4. 布隆過濾

Bloom Filters can be enabled per-ColumnFamily. Use HColumnDescriptor.setBloomFilterType(NONE | ROW | ROWCOL) to enable blooms per Column Family. Default = NONE for no bloom filters. If ROW, the hash of the row will be added to the bloom on each insert. If ROWCOL, the hash of the row + column family + column family qualifier will be added to the bloom on each key insert.

參考 HColumnDescriptor and Section 9.7.6, “Bloom Filters” for more information.
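For illustration, a short fragment that enables a ROWCOL bloom when creating a table (the table and family names are placeholders; conf is an existing HBaseConfiguration, and the fragment assumes the 0.92/0.94-era StoreFile.BloomType enum):

HBaseAdmin admin = new HBaseAdmin(conf);
HColumnDescriptor cf = new HColumnDescriptor("cf");        // placeholder family name
cf.setBloomFilterType(StoreFile.BloomType.ROWCOL);         // NONE | ROW | ROWCOL
HTableDescriptor desc = new HTableDescriptor("mytable");   // placeholder table name
desc.addFamily(cf);
admin.createTable(desc);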

11.6.5. 列族塊大小

The blocksize can be configured for each ColumnFamily in a table, and this defaults to 64k. Larger cell values require larger blocksizes. There is an inverse relationship between blocksize and the resulting StoreFile indexes (i.e., if the blocksize is doubled then the resulting indexes should be roughly halved).

參考 HColumnDescriptor and Section 9.7.5, “Store”for more information.

11.6.6. 內存中的列族

ColumnFamilies can optionally be defined as in-memory. Data is still persisted to disk, just like any other ColumnFamily. In-memory blocks have the highest priority in the Section 9.6.4, “Block Cache”, but it is not a guarantee that the entire table will be in memory.

參考 HColumnDescriptor for more information.

11.6.7. 壓縮

Production systems should use compression with their ColumnFamily definitions. 參考 Appendix C, Compression In HBase for more information.

11.6.7.1. 然而...

Compression deflates data on disk. When it's in-memory (e.g., in the MemStore) or on the wire (e.g., transferring between RegionServer and Client) it's inflated. So while using ColumnFamily compression is a best practice, it's not going to completely eliminate the impact of over-sized Keys, over-sized ColumnFamily names, or over-sized Column names.

參考 Section 6.3.2, “Try to minimize row and column sizes” for schema design tips, and Section 9.7.5.4, “KeyValue” for more information on how HBase stores data internally.

 

11.7. 寫到 HBase

11.7.1. 批量裝載

如果可以的話,儘量使用批量導入工具,參見 Section 9.8, “批量裝載”.否則就要詳細看看下面的內容。

11.7.2. 表創建: 預創建區域(Region)

默認情況下HBase創建表會新建一個區域。執行批量導入,意味着所有的client會寫入這個區域,直到這個區域足夠大,以至於分裂。一個有效的提高批量導入的性能的方式,是預創建空的區域。最好稍保守一點,因爲過多的區域會實實在在的降低性能。下面是一個預創建區域的例子。 (注意:這個例子裏需要根據應用的key進行調整。):

public static boolean createTable(HBaseAdmin admin, HTableDescriptor table, byte[][] splits)
    throws IOException {
  try {
    admin.createTable(table, splits);
    return true;
  } catch (TableExistsException e) {
    logger.info("table " + table.getNameAsString() + " already exists");
    // the table already exists...
    return false;
  }
}

public static byte[][] getHexSplits(String startKey, String endKey, int numRegions) {
  byte[][] splits = new byte[numRegions-1][];
  BigInteger lowestKey = new BigInteger(startKey, 16);
  BigInteger highestKey = new BigInteger(endKey, 16);
  BigInteger range = highestKey.subtract(lowestKey);
  BigInteger regionIncrement = range.divide(BigInteger.valueOf(numRegions));
  lowestKey = lowestKey.add(regionIncrement);
  for (int i=0; i < numRegions-1; i++) {
    BigInteger key = lowestKey.add(regionIncrement.multiply(BigInteger.valueOf(i)));
    byte[] b = String.format("%016x", key).getBytes();
    splits[i] = b;
  }
  return splits;
}

11.7.3.  表創建: 延遲log刷寫

Puts的缺省行爲使用 Write Ahead Log (WAL),會導致 HLog 編輯立即寫盤。如果採用延遲刷寫,WAL編輯會保留在內存中,直到刷寫週期來臨。好處是集中和異步寫HLog,潛在問題是如果RegionServer退出,沒有刷寫的日誌將丟失。但這也比Puts時不使用WAL安全多了。

延遲log刷寫可以通過 HTableDescriptor 在表上設置,hbase.regionserver.optionallogflushinterval缺省值是1000ms.
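下面是一個示意片段(表名、列族名均爲假設,conf 爲已有的 HBaseConfiguration;假定使用 0.90/0.94 時代的 setDeferredLogFlush 接口),在建表時打開延遲刷寫:

HTableDescriptor desc = new HTableDescriptor("mytable");
desc.addFamily(new HColumnDescriptor("cf"));
desc.setDeferredLogFlush(true);             // 該表的 Put 採用延遲 WAL 刷寫
new HBaseAdmin(conf).createTable(desc);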

11.7.4. HBase 客戶端: 自動刷寫

 

當你進行大量的Put的時候,要確認你的HTable的setAutoFlush是關閉着的。否則的話,每執行一個Put就要向區域服務器發一個請求。通過 htable.put(Put) 和 htable.put(List<Put>) 將Put添加到寫緩衝中。如果 autoFlush = false,要等到寫緩衝都填滿的時候纔會發起請求。要想顯式地發起請求,可以調用flushCommits。在HTable實例上進行的close操作也會發起flushCommits。
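下面是一個示意片段(conf、表名和 puts 列表均爲假設),展示寫緩衝的典型用法:

HTable htable = new HTable(conf, "mytable");
htable.setAutoFlush(false);                    // 關閉自動刷寫
htable.setWriteBufferSize(1024 * 1024 * 12);   // 寫緩衝大小,示例值 12MB
for (Put put : puts) {
  htable.put(put);                             // 先進入客戶端寫緩衝
}
htable.flushCommits();                         // 顯式把緩衝中剩餘的 Put 發送出去
htable.close();                                // close 也會觸發 flushCommits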

11.7.5. HBase 客戶端: 在Puts上關閉WAL

一個經常討論的在Puts上增加吞吐量的選項是調用 writeToWAL(false)。關閉它意味着 RegionServer 不再將 Put 寫到 Write Ahead Log,僅寫到內存。然而後果是如果出現 RegionServer 失敗,將導致數據丟失。如果調用 writeToWAL(false),需保持高度警惕。如果你的負載已經很好地分佈到集羣中,你會發現關閉WAL帶來的提升其實並不明顯。

通常而言,最好對Puts使用WAL;如果更關注裝載吞吐量,應該改用批量裝載(bulk loading)等替代技術,而不是關閉WAL。
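如果確實要嘗試該選項,下面是一個示意片段(行鍵、列族、列名均爲假設,htable 爲已有的 HTable 實例),請務必清楚其數據丟失風險:

Put put = new Put(Bytes.toBytes("row1"));
put.add(Bytes.toBytes("cf"), Bytes.toBytes("qual"), Bytes.toBytes("value"));
put.setWriteToWAL(false);   // 跳過 WAL:吞吐可能提高,但 RegionServer 故障時該數據會丟失
htable.put(put);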

11.7.6. HBase 客戶端: Group Puts by RegionServer

In addition to using the writeBuffer, grouping Puts by RegionServer can reduce the number of client RPC calls per writeBuffer flush. There is a utility HTableUtil currently on TRUNK that does this, but you can either copy that or implement your own verison for those still on 0.90.x or earlier.

11.7.7. MapReduce: Skip The Reducer

When writing a lot of data to an HBase table from a MR job (e.g., with TableOutputFormat), and specifically where Puts are being emitted from the Mapper, skip the Reducer step. When a Reducer step is used, all of the output (Puts) from the Mapper will get spooled to disk, then sorted/shuffled to other Reducers that will most likely be off-node. It's far more efficient to just write directly to HBase.

For summary jobs where HBase is used as a source and a sink, then writes will be coming from the Reducer step (e.g., summarize values then write out result). This is a different processing problem than the above case.

11.7.8. Anti-Pattern: One Hot Region

If all your data is being written to one region at a time, then re-read the section on processing timeseries data.

Also, if you are pre-splitting regions and all your data is still winding up in a single region even though your keys aren't monotonically increasing, confirm that your keyspace actually works with the split strategy. There are a variety of reasons that regions may appear "well split" but won't work with your data. As the HBase client communicates directly with the RegionServers, this can be obtained via HTable.getRegionLocation.

參考 Section 11.7.2, “ Table Creation: Pre-Creating Regions ”, as well as Section 11.4, “HBase Configurations”

11.8. 從HBase讀

11.8.1. Scan 緩存

如果HBase的輸入源是一個MapReduce Job,要確保輸入的Scan的setCaching值要比默認值(1)大。使用默認值就意味着map-task每一行都會去請求一下region-server。可以把這個值設爲500,這樣就可以一次傳輸500行。當然這也是需要權衡的,過大的值會同時消耗客戶端和服務端很大的內存,不是越大越好。
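下面是一個示意片段(表名和 MyMapper 均爲假設,job 爲已創建的 MapReduce Job),在把 Scan 交給作業之前設置緩存:

Scan scan = new Scan();
scan.setCaching(500);          // 每次 RPC 取回 500 行
scan.setCacheBlocks(false);    // MapReduce 作業通常關閉塊緩存(參見 11.8.5)
TableMapReduceUtil.initTableMapperJob("mytable", scan,
    MyMapper.class,                              // 假設的 TableMapper 實現
    ImmutableBytesWritable.class, Result.class,  // map 輸出類型,按作業需要調整
    job);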

11.8.1.1. Scan Caching in MapReduce Jobs

Scan settings in MapReduce jobs deserve special attention. Timeouts can result (e.g., UnknownScannerException) in Map tasks if it takes longer to process a batch of records before the client goes back to the RegionServer for the next set of data. This problem can occur because there is non-trivial processing occurring per row. If you process rows quickly, set caching higher. If you process rows more slowly (e.g., lots of transformations per row, writes), then set caching lower.

Timeouts can also happen in a non-MapReduce use case (i.e., single threaded HBase client doing a Scan), but the processing that is often performed in MapReduce jobs tends to exacerbate this issue.

11.8.2. Scan 屬性選擇

當Scan用來處理大量的行的時候(尤其是作爲MapReduce的輸入),要注意的是選擇了什麼字段。如果調用了 scan.addFamily,這個列族的所有屬性都會返回。如果只是想過濾其中的一小部分,就指定那幾個column,否則就會造成很大浪費,影響性能。
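下面是一個示意片段(列族、列名均爲假設),只選取真正需要的列:

Scan scan = new Scan();
scan.addColumn(Bytes.toBytes("cf"), Bytes.toBytes("col1"));   // 只取需要的列
scan.addColumn(Bytes.toBytes("cf"), Bytes.toBytes("col2"));
// 相比之下,scan.addFamily(Bytes.toBytes("cf")) 會返回該列族的所有列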

11.8.3. MapReduce - Input Splits

For MapReduce jobs that use HBase tables as a source, if there a pattern where the "slow" map tasks seem to have the same Input Split (i.e., the RegionServer serving the data), see the Troubleshooting Case Study in Section 13.3.1, “Case Study #1 (Performance Issue On A Single Node)”.

11.8.4. 關閉 ResultScanners

這與其說是提高性能,倒不如說是避免發生性能問題。如果你忘記了關閉ResultScanners,會導致RegionServer出現問題。所以一定要把ResultScanner的處理包含在try/finally塊中,保證它被關閉...

Scan scan = new Scan();
// set attrs...
ResultScanner rs = htable.getScanner(scan);
try {
  for (Result r = rs.next(); r != null; r = rs.next()) {
    // process result...
  }
} finally {
  rs.close();  // always close the ResultScanner!
}
htable.close();

11.8.5. 塊緩存

Scan實例可以在RegionServer中使用塊緩存,可以由setCacheBlocks方法控制。如果Scan是MapReduce的輸入源,要將這個值設置爲 false。對於經常讀到的行,就建議使用塊緩衝。

11.8.6.  行鍵的負載優化

scan一個表的時候,如果僅僅需要行鍵(不需要families, qualifiers, values 和 timestamps),可以用Scan的setFilter方法設置一個採用MUST_PASS_ALL操作符(譯者注:相當於And操作符)的FilterList,該FilterList要包含一個 FirstKeyOnlyFilter 和一個 KeyOnlyFilter。通過這樣的filter組合,就算在最壞的情況下,RegionServer也只會從磁盤讀一個值,同時最小化客戶端的網絡帶寬佔用。
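下面是一個示意片段(htable 爲已有的 HTable 實例),組合這兩個過濾器只讀取行鍵:

Scan scan = new Scan();
FilterList filters = new FilterList(FilterList.Operator.MUST_PASS_ALL);
filters.addFilter(new FirstKeyOnlyFilter());   // 每行只返回第一個 KeyValue
filters.addFilter(new KeyOnlyFilter());        // 只返回鍵,不返回值
scan.setFilter(filters);
ResultScanner rs = htable.getScanner(scan);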

11.8.7. Concurrency: Monitor Data Spread

When performing a high number of concurrent reads, monitor the data spread of the target tables. If the target table(s) have too few regions then the reads could likely be served from too few nodes.

參考 Section 11.7.2, “ Table Creation: Pre-Creating Regions ”, as well as Section 11.4, “HBase Configurations”

11.8.8. Bloom Filters

Enabling Bloom Filters can save you having to go to disk and can help improve read latency.

Bloom filters were developed over in HBase-1200 Add bloomfilters.[28][29]

See also Section 11.6.4, “Bloom Filters”.

11.8.8.1. Bloom StoreFile footprint

Bloom filters add an entry to the StoreFile general FileInfo data structure and then two extra entries to the StoreFile metadata section.

11.8.8.1.1. BloomFilter in the StoreFile FileInfo data structure

FileInfo has a BLOOM_FILTER_TYPE entry which is set to NONE, ROW or ROWCOL.

11.8.8.1.2. BloomFilter entries in StoreFile metadata

BLOOM_FILTER_META holds Bloom Size, Hash Function used, etc. It's small in size and is cached on StoreFile.Reader load.

BLOOM_FILTER_DATA is the actual bloomfilter data. Obtained on-demand. Stored in the LRU cache, if it is enabled (it's enabled by default).

 

11.8.8.2. 布隆過濾(Bloom Filter) 配置

11.8.8.2.1. io.hfile.bloom.enabled 全局開關

配置文件中的 io.hfile.bloom.enabled 是一個在出問題時用來禁用布隆過濾的全局開關(kill switch)。Default = true.

11.8.8.2.2. io.hfile.bloom.error.rate

io.hfile.bloom.error.rate = 平均假陽性率(average false positive rate). 缺省 = 1%. 錯誤率每降低一半(如降到 0.5%),每個布隆條目就多佔用 1 位。

11.8.8.2.3. io.hfile.bloom.max.fold

io.hfile.bloom.max.fold = 保證最小摺疊速率(guaranteed minimum fold rate). 大多時候不要管. Default = 7, 或壓縮到原來大小的至少 1/128. 想獲取更多本選項的意義,參看本文檔 開發進程 節 BloomFilters in HBase 

 

 

11.9. 從HBase刪除

11.9.1. 將 HBase 表當 Queues

HBase tables are sometimes used as queues. In this case, special care must be taken to regularly perform major compactions on tables used in this manner. As is documented in Chapter 5, Data Model, marking rows as deleted creates additional StoreFiles which then need to be processed on reads. Tombstones only get cleaned up with major compactions.

參考 also Section 9.7.5.5, “Compaction” and HBaseAdmin.majorCompact.

11.9.2. 刪除的 RPC 行爲

Be aware that htable.delete(Delete) doesn't use the writeBuffer. It will execute a RegionServer RPC with each invocation. For a large number of deletes, consider htable.delete(List).

參考 http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/client/HTable.html#delete%28org.apache.hadoop.hbase.client.Delete%29
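For illustration, a minimal sketch of batching deletes (the htable instance and the rowsToDelete collection are placeholders):

List<Delete> deletes = new ArrayList<Delete>();
for (byte[] row : rowsToDelete) {
  deletes.add(new Delete(row));
}
htable.delete(deletes);    // batches the deletes instead of issuing one RPC per Delete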

11.10. HDFS

由於 HBase 在 Section 9.9, “HDFS” 上運行,it is important to understand how it works and how it affects HBase.

11.10.1. Current Issues With Low-Latency Reads

The original use-case for HDFS was batch processing. As such, low-latency reads were historically not a priority. With the increased adoption of HBase this is changing, and several improvements are already in development. 參考 the Umbrella Jira Ticket for HDFS Improvements for HBase.

11.10.2. Performance Comparisons of HBase vs. HDFS

A fairly common question on the dist-list is why HBase isn't as performant as HDFS files in a batch context (e.g., as a MapReduce source or sink). The short answer is that HBase is doing a lot more than HDFS (e.g., reading the KeyValues, returning the most current row or specified timestamps, etc.), and as such HBase is 4-5 times slower than HDFS in this processing context. Not that there isn't room for improvement (and this gap will, over time, be reduced), but HDFS will always be faster in this use-case.

11.11. Amazon EC2

Performance questions are common on Amazon EC2 environments because it is a shared environment. You will not see the same throughput as a dedicated server. In terms of running tests on EC2, run them several times for the same reason (i.e., it's a shared environment and you don't know what else is happening on the server).

If you are running on EC2 and post performance questions on the dist-list, please state this fact up-front, because EC2 issues are practically a separate class of performance issues.

11.12. Case Studies

For Performance and Troubleshooting Case Studies, see Chapter 13, Case Studies.

Chapter 12. HBase的故障排除和Debug

12.1. 一般準則

首先可以看看master的log。通常情況下,他總是一行一行的重複信息。如果不是這樣,說明有問題,可以Google或是用search-hadoop.com來搜索遇到的exception。

一個錯誤通常不是單獨出現在HBase中的,通常是某一個地方發生了異常,然後對其他的地方發生影響。到處都是exception和stack traces。遇到這樣的錯誤,最好的辦法是查日誌,找到最初的異常。例如Region會在abort的時候打印一些信息。Grep這個Dump就有可能找到最初的異常信息。

RegionServer的自殺是很“正常”的。當一些事情發生錯誤的,他們就會自殺。如果ulimit和xcievers(最重要的兩個設定,詳見Section 2.2.5, “ ulimit 和 nproc)沒有修改,HDFS將無法運轉正常,在HBase看來,HDFS死掉了。假想一下,你的MySQL突然無法訪問它的文件系統,他會怎麼做。同樣的事情會發生在HBase和HDFS上。還有一個造成RegionServer切腹(譯者注:竟然用日文詞)自殺的常見的原因是,他們執行了一個長時間的GC操作,這個時間超過了ZooKeeper的session timeout。關於GC停頓的詳細信息,參見Todd Lipcon的 3 part blog post by Todd Lipcon 和上面的 Section 11.3.1.1, “長時間GC停頓”.

12.2. Logs

重要日誌的位置( <user>是啓動服務的用戶,<hostname> 是機器的名字)

NameNode: $HADOOP_HOME/logs/hadoop-<user>-namenode-<hostname>.log

DataNode: $HADOOP_HOME/logs/hadoop-<user>-datanode-<hostname>.log

JobTracker: $HADOOP_HOME/logs/hadoop-<user>-jobtracker-<hostname>.log

TaskTracker: $HADOOP_HOME/logs/hadoop-<user>-tasktracker-<hostname>.log

HMaster: $HBASE_HOME/logs/hbase-<user>-master-<hostname>.log

RegionServer: $HBASE_HOME/logs/hbase-<user>-regionserver-<hostname>.log

ZooKeeper: TODO

12.2.1. Log 位置

對於單節點模式,Log都會在一臺機器上,但是對於生產環境,都會運行在一個集羣上。

12.2.1.1. NameNode

NameNode的日誌在NameNode server上。HBase Master 通常也運行在NameNode server上,ZooKeeper通常也是這樣。

對於小一點的機器,JobTracker也通常運行在NameNode server上面。

12.2.1.2. DataNode

每一臺DataNode server有一個HDFS的日誌,同時每臺RegionServer有一個HBase日誌。

每個DataNode server還有一份TaskTracker的日誌,來記錄MapReduce的Task信息。

12.2.2. Log Levels

12.2.2.1. Enabling RPC-level logging

Enabling the RPC-level logging on a RegionServer can often given insight on timings at the server. Once enabled, the amount of log spewed is voluminous. It is not recommended that you leave this logging on for more than short bursts of time. To enable RPC-level logging, browse to the RegionServer UI and click on Log Level. Set the log level to DEBUG for the package org.apache.hadoop.ipc (That's right, for hadoop.ipc, NOT hbase.ipc). Then tail the RegionServers log. Analyze.

To disable, set the logging level back to INFO level.

12.2.3. JVM Garbage Collection Logs

HBase is memory intensive, and using the default GC you can see long pauses in all threads including the Juliet Pause aka "GC of Death". To help debug this or confirm this is happening GC logging can be turned on in the Java virtual machine.

To enable, in hbase-env.sh add:

export HBASE_OPTS="-XX:+UseConcMarkSweepGC -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -Xloggc:/home/hadoop/hbase/logs/gc-hbase.log"

Adjust the log directory to wherever you log. Note: The GC log does NOT roll automatically, so you'll have to keep an eye on it so it doesn't fill up the disk.

At this point you should see logs like so:

64898.952: [GC [1 CMS-initial-mark: 2811538K(3055704K)] 2812179K(3061272K), 0.0007360 secs] [Times: user=0.00 sys=0.00, real=0.00 secs]
64898.953: [CMS-concurrent-mark-start]
64898.971: [GC 64898.971: [ParNew: 5567K->576K(5568K), 0.0101110 secs] 2817105K->2812715K(3061272K), 0.0102200 secs] [Times: user=0.07 sys=0.00, real=0.01 secs]

In this section, the first line indicates a 0.0007360 second pause for the CMS to initially mark. This pauses the entire VM, all threads for that period of time.

The third line indicates a "minor GC", which pauses the VM for 0.0101110 seconds - aka 10 milliseconds. It has reduced the "ParNew" from about 5.5m to 576k. Later on in this cycle we see:

64901.445: [CMS-concurrent-mark: 1.542/2.492 secs] [Times: user=10.49 sys=0.33, real=2.49 secs]
64901.445: [CMS-concurrent-preclean-start]
64901.453: [GC 64901.453: [ParNew: 5505K->573K(5568K), 0.0062440 secs] 2868746K->2864292K(3061272K), 0.0063360 secs] [Times: user=0.05 sys=0.00, real=0.01 secs]
64901.476: [GC 64901.476: [ParNew: 5563K->575K(5568K), 0.0072510 secs] 2869283K->2864837K(3061272K), 0.0073320 secs] [Times: user=0.05 sys=0.01, real=0.01 secs]
64901.500: [GC 64901.500: [ParNew: 5517K->573K(5568K), 0.0120390 secs] 2869780K->2865267K(3061272K), 0.0121150 secs] [Times: user=0.09 sys=0.00, real=0.01 secs]
64901.529: [GC 64901.529: [ParNew: 5507K->569K(5568K), 0.0086240 secs] 2870200K->2865742K(3061272K), 0.0087180 secs] [Times: user=0.05 sys=0.00, real=0.01 secs]
64901.554: [GC 64901.555: [ParNew: 5516K->575K(5568K), 0.0107130 secs] 2870689K->2866291K(3061272K), 0.0107820 secs] [Times: user=0.06 sys=0.00, real=0.01 secs]
64901.578: [CMS-concurrent-preclean: 0.070/0.133 secs] [Times: user=0.48 sys=0.01, real=0.14 secs]
64901.578: [CMS-concurrent-abortable-preclean-start]
64901.584: [GC 64901.584: [ParNew: 5504K->571K(5568K), 0.0087270 secs] 2871220K->2866830K(3061272K), 0.0088220 secs] [Times: user=0.05 sys=0.00, real=0.01 secs]
64901.609: [GC 64901.609: [ParNew: 5512K->569K(5568K), 0.0063370 secs] 2871771K->2867322K(3061272K), 0.0064230 secs] [Times: user=0.06 sys=0.00, real=0.01 secs]
64901.615: [CMS-concurrent-abortable-preclean: 0.007/0.037 secs] [Times: user=0.13 sys=0.00, real=0.03 secs]
64901.616: [GC[YG occupancy: 645 K (5568 K)]64901.616: [Rescan (parallel) , 0.0020210 secs]64901.618: [weak refs processing, 0.0027950 secs] [1 CMS-remark: 2866753K(3055704K)] 2867399K(3061272K), 0.0049380 secs] [Times: user=0.00 sys=0.01, real=0.01 secs]
64901.621: [CMS-concurrent-sweep-start]

The first line indicates that the CMS concurrent mark (finding garbage) has taken 2.4 seconds. But this is a _concurrent_ 2.4 seconds, Java has not been paused at any point in time.

There are a few more minor GCs, then there is a pause at the 2nd last line:

64901.616: [GC[YG occupancy: 645 K (5568 K)]64901.616: [Rescan (parallel) , 0.0020210 secs]64901.618: [weak refs processing, 0.0027950 secs] [1 CMS-remark: 2866753K(3055704K)] 2867399K(3061272K), 0.0049380 secs] [Times: user=0.00 sys=0.01, real=0.01 secs]

The pause here is 0.0049380 seconds (aka 4.9 milliseconds) to 'remark' the heap.

At this point the sweep starts, and you can watch the heap size go down:

64901.637: [GC 64901.637: [ParNew: 5501K->569K(5568K), 0.0097350 secs] 2871958K->2867441K(3061272K), 0.0098370 secs] [Times: user=0.05 sys=0.00, real=0.01 secs]
... lines removed ...
64904.936: [GC 64904.936: [ParNew: 5532K->568K(5568K), 0.0070720 secs] 1365024K->1360689K(3061272K), 0.0071930 secs] [Times: user=0.05 sys=0.00, real=0.01 secs]
64904.953: [CMS-concurrent-sweep: 2.030/3.332 secs] [Times: user=9.57 sys=0.26, real=3.33 secs]

At this point, the CMS sweep took 3.332 seconds, and heap went from about ~ 2.8 GB to 1.3 GB (approximate).

The key points here is to keep all these pauses low. CMS pauses are always low, but if your ParNew starts growing, you can see minor GC pauses approach 100ms, exceed 100ms and hit as high at 400ms.

This can be due to the size of the ParNew, which should be relatively small. If your ParNew is very large after running HBase for a while, in one example a ParNew was about 150MB, then you might have to constrain the size of ParNew (The larger it is, the longer the collections take but if its too small, objects are promoted to old gen too quickly). In the below we constrain new gen size to 64m.

Add this to HBASE_OPTS:

export HBASE_OPTS="-XX:NewSize=64m -XX:MaxNewSize=64m <cms options from above> <gc logging options from above>"

For more information on GC pauses, see the 3 part blog post by Todd Lipcon and Section 11.3.1.1, “Long GC pauses” above.

12.3. Resources

12.3.1. search-hadoop.com

search-hadoop.com indexes all the mailing lists and is great for historical searches. Search here first when you have an issue as it's more than likely someone has already had your problem.

12.3.2. Mailing Lists

Ask a question on the HBase mailing lists. The 'dev' mailing list is aimed at the community of developers actually building HBase and for features currently under development, and 'user' is generally used for questions on released versions of HBase. Before going to the mailing list, make sure your question has not already been answered by searching the mailing list archives first. Use Section 12.3.1, “search-hadoop.com”. Take some time crafting your question[28]; a quality question that includes all context and exhibits evidence the author has tried to find answers in the manual and out on lists is more likely to get a prompt response.

12.3.3. IRC

#hbase on irc.freenode.net

12.3.4. JIRA

JIRA is also really helpful when looking for Hadoop/HBase-specific issues.



[28] 參考 Getting Answers

12.4. 工具

12.4.1. Builtin Tools

12.4.1.1. Master Web Interface

The Master starts a web-interface on port 60010 by default.

The Master web UI lists created tables and their definition (e.g., ColumnFamilies, blocksize, etc.). Additionally, the available RegionServers in the cluster are listed along with selected high-level metrics (requests, number of regions, usedHeap, maxHeap). The Master web UI allows navigation to each RegionServer's web UI.

12.4.1.2. RegionServer Web Interface

RegionServers starts a web-interface on port 60030 by default.

The RegionServer web UI lists online regions and their start/end keys, as well as point-in-time RegionServer metrics (requests, regions, storeFileIndexSize, compactionQueueSize, etc.).

參考 Section 14.4, “HBase Metrics” for more information in metric definitions.

12.4.1.3. zkcli

zkcli is a very useful tool for investigating ZooKeeper-related issues. To invoke:

./hbase zkcli -server host:port <cmd> <args>

The commands (and arguments) are:

connect host:port
get path [watch]
ls path [watch]
set path data [version]
delquota [-n|-b] path
quit
printwatches on|off
create [-s] [-e] path data acl
stat path [watch]
close
ls2 path [watch]
history
listquota path
setAcl path acl
getAcl path
sync path
redo cmdno
addauth scheme auth
delete path [version]
setquota -n|-b val path

12.4.2. External Tools

12.4.2.1 tail

tail是一個命令行工具,可以用來看日誌的尾巴。加上"-f"參數後,就會在有新數據寫入的時候自動刷新。用它來看日誌很方便。例如,一臺機器需要花很多時間來啓動或關閉,你可以tail它的master log(也可以是region server的log)。

12.4.2.2 top

top是一個很重要的工具來看你的機器各個進程的資源佔用情況。下面是一個生產環境的例子:

top - 14:46:59 up 39 days, 11:55, 1 user, load average: 3.75, 3.57, 3.84
Tasks: 309 total, 1 running, 308 sleeping, 0 stopped, 0 zombie
Cpu(s): 4.5%us, 1.6%sy, 0.0%ni, 91.7%id, 1.4%wa, 0.1%hi, 0.6%si, 0.0%st
Mem: 24414432k total, 24296956k used, 117476k free, 7196k buffers
Swap: 16008732k total, 14348k used, 15994384k free, 11106908k cached

  PID USER   PR NI VIRT  RES  SHR S %CPU %MEM   TIME+  COMMAND
15558 hadoop 18 -2 3292m 2.4g 3556 S   79 10.4 6523:52 java
13268 hadoop 18 -2 8967m 8.2g 4104 S   21 35.1 5170:30 java
 8895 hadoop 18 -2 1581m 497m 3420 S   11  2.1 4002:32 java
…

這裏你可以看到系統的load average在最近5分鐘是3.75,意思就是說這5分鐘裏面平均有3.75個線程在CPU時間的等待隊列裏面。通常來說,最完美的情況是這個值和CPU核數相等,比這個值低意味着資源閒置,比這個值高就是過載了。這是一個重要的概念,要想理解的更多,可以看這篇文章http://www.linuxjournal.com/article/9001.

再看內存,我們可以看到系統已經幾乎使用了它的全部RAM,其中大部分都是用於OS cache(這是一件好事)。Swap只使用了一點點KB,這正是我們期望的,如果數值很高的話,就意味着在進行交換,這對Java程序的性能是致命的。另一種檢測交換的方法是看Load average是否過高(load average過高還可能是磁盤損壞或者其它什麼原因導致的)。

默認情況下進程列表不是很有用,我們可以看到3個Java進程使用了111%的CPU。要想知道哪個進程是什麼,可以輸入"c",每一行就會擴展信息。輸入“1”可以顯示CPU的每個核的具體狀況。

12.4.2.3  jps

jps是JDK集成的一個工具,可以用來看當前用戶的Java進程id。(如果是root,可以看到所有用戶的id),例如:

hadoop@sv4borg12:~$ jps
1322 TaskTracker
17789 HRegionServer
27862 Child
1158 DataNode
25115 HQuorumPeer
2950 Jps
19750 ThriftServer
18776 jmx

按順序看

  • Hadoop TaskTracker,管理本地的Task
  • HBase RegionServer,提供region的服務
  • Child, 一個 MapReduce task,無法看出詳細類型
  • Hadoop DataNode, 管理blocks
  • HQuorumPeer, ZooKeeper集羣的成員
  • Jps, 就是這個進程
  • ThriftServer, 當thrift server啓動後,就會有這個進程
  • jmx, 這個是本地監控平臺的進程。你可以不用這個。

你還可以看到啓動這個進程時的全部命令行信息:

hadoop@sv4borg12:~$ ps aux | grep HRegionServer hadoop 17789 155 35.2 9067824 8604364 ? S<l Mar04 9855:48 /usr/java/jdk1.6.0_14/bin/java -Xmx8000m -XX:+DoEscapeAnalysis -XX:+AggressiveOpts -XX:+UseConcMarkSweepGC -XX:NewSize=64m -XX:MaxNewSize=64m -XX:CMSInitiatingOccupancyFraction=88 -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCTimeStamps -Xloggc:/export1/hadoop/logs/gc-hbase.log -Dcom.sun.management.jmxremote.port=10102 -Dcom.sun.management.jmxremote.authenticate=true -Dcom.sun.management.jmxremote.ssl=false -Dcom.sun.management.jmxremote.password.file=/home/hadoop/hbase/conf/jmxremote.password -Dcom.sun.management.jmxremote -Dhbase.log.dir=/export1/hadoop/logs -Dhbase.log.file=hbase-hadoop-regionserver-sv4borg12.log -Dhbase.home.dir=/home/hadoop/hbase -Dhbase.id.str=hadoop -Dhbase.root.logger=INFO,DRFA -Djava.library.path=/home/hadoop/hbase/lib/native/Linux-amd64-64 -classpath /home/hadoop/hbase/bin/../conf:[many jars]:/home/hadoop/hadoop/conf org.apache.hadoop.hbase.regionserver.HRegionServer start

12.4.2.4  jstack

jstack 是一個最重要(除了看Log)的java工具,可以看到具體的Java進程的在做什麼。可以先用Jps看到進程的Id,然後就可以用jstack。他會按線程的創建順序顯示線程的列表,還有這個線程在做什麼。下面是例子:

這個主線程是一個RegionServer正在等master返回什麼信息。

"regionserver60020" prio=10 tid=0x0000000040ab4000 nid=0x45cf waiting on condition [0x00007f16b6a96000..0x00007f16b6a96a70] java.lang.Thread.State: TIMED_WAITING (parking) at sun.misc.Unsafe.park(Native Method) - parking to wait for <0x00007f16cd5c2f30> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject) at java.util.concurrent.locks.LockSupport.parkNanos(LockSupport.java:198) at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.awaitNanos(AbstractQueuedSynchronizer.java:1963) at java.util.concurrent.LinkedBlockingQueue.poll(LinkedBlockingQueue.java:395) at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:647) at java.lang.Thread.run(Thread.java:619) The MemStore flusher thread that is currently flushing to a file: "regionserver60020.cacheFlusher" daemon prio=10 tid=0x0000000040f4e000 nid=0x45eb in Object.wait() [0x00007f16b5b86000..0x00007f16b5b87af0] java.lang.Thread.State: WAITING (on object monitor) at java.lang.Object.wait(Native Method) at java.lang.Object.wait(Object.java:485) at org.apache.hadoop.ipc.Client.call(Client.java:803) - locked <0x00007f16cb14b3a8> (a org.apache.hadoop.ipc.Client$Call) at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:221) at $Proxy1.complete(Unknown Source) at sun.reflect.GeneratedMethodAccessor38.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.io.retry.RetryInvocationHandler.invokeMethod(RetryInvocationHandler.java:82) at org.apache.hadoop.io.retry.RetryInvocationHandler.invoke(RetryInvocationHandler.java:59) at $Proxy1.complete(Unknown Source) at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.closeInternal(DFSClient.java:3390) - locked <0x00007f16cb14b470> (a org.apache.hadoop.hdfs.DFSClient$DFSOutputStream) at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.close(DFSClient.java:3304) at org.apache.hadoop.fs.FSDataOutputStream$PositionCache.close(FSDataOutputStream.java:61) at org.apache.hadoop.fs.FSDataOutputStream.close(FSDataOutputStream.java:86) at org.apache.hadoop.hbase.io.hfile.HFile$Writer.close(HFile.java:650) at org.apache.hadoop.hbase.regionserver.StoreFile$Writer.close(StoreFile.java:853) at org.apache.hadoop.hbase.regionserver.Store.internalFlushCache(Store.java:467) - locked <0x00007f16d00e6f08> (a java.lang.Object) at org.apache.hadoop.hbase.regionserver.Store.flushCache(Store.java:427) at org.apache.hadoop.hbase.regionserver.Store.access$100(Store.java:80) at org.apache.hadoop.hbase.regionserver.Store$StoreFlusherImpl.flushCache(Store.java:1359) at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:907) at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:834) at org.apache.hadoop.hbase.regionserver.HRegion.flushcache(HRegion.java:786) at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:250) at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:224) at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.run(MemStoreFlusher.java:146)

一個處理線程是在等一些東西(例如put, delete, scan...):

"IPC Server handler 16 on 60020" daemon prio=10 tid=0x00007f16b011d800 nid=0x4a5e waiting on condition [0x00007f16afefd000..0x00007f16afefd9f0] java.lang.Thread.State: WAITING (parking) at sun.misc.Unsafe.park(Native Method) - parking to wait for <0x00007f16cd3f8dd8> (a java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject) at java.util.concurrent.locks.LockSupport.park(LockSupport.java:158) at java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await(AbstractQueuedSynchronizer.java:1925) at java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:358) at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1013)

有一個線程正在忙,在遞增一個counter(這個階段是正在創建一個scanner來讀最新的值):

"IPC Server handler 66 on 60020" daemon prio=10 tid=0x00007f16b006e800 nid=0x4a90 runnable [0x00007f16acb77000..0x00007f16acb77cf0] java.lang.Thread.State: RUNNABLE at org.apache.hadoop.hbase.regionserver.KeyValueHeap.<init>(KeyValueHeap.java:56) at org.apache.hadoop.hbase.regionserver.StoreScanner.<init>(StoreScanner.java:79) at org.apache.hadoop.hbase.regionserver.Store.getScanner(Store.java:1202) at org.apache.hadoop.hbase.regionserver.HRegion$RegionScanner.<init>(HRegion.java:2209) at org.apache.hadoop.hbase.regionserver.HRegion.instantiateInternalScanner(HRegion.java:1063) at org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:1055) at org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:1039) at org.apache.hadoop.hbase.regionserver.HRegion.getLastIncrement(HRegion.java:2875) at org.apache.hadoop.hbase.regionserver.HRegion.incrementColumnValue(HRegion.java:2978) at org.apache.hadoop.hbase.regionserver.HRegionServer.incrementColumnValue(HRegionServer.java:2433) at sun.reflect.GeneratedMethodAccessor20.invoke(Unknown Source) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.hbase.ipc.HBaseRPC$Server.call(HBaseRPC.java:560) at org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1027)

還有一個線程在從HDFS獲取數據。

"IPC Client (47) connection to sv4borg9/10.4.24.40:9000 from hadoop" daemon prio=10 tid=0x00007f16a02d0000 nid=0x4fa3 runnable [0x00007f16b517d000..0x00007f16b517dbf0] java.lang.Thread.State: RUNNABLE at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:215) at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) - locked <0x00007f17d5b68c00> (a sun.nio.ch.Util$1) - locked <0x00007f17d5b68be8> (a java.util.Collections$UnmodifiableSet) - locked <0x00007f1877959b50> (a sun.nio.ch.EPollSelectorImpl) at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:332) at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:157) at org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:155) at org.apache.hadoop.net.SocketInputStream.read(SocketInputStream.java:128) at java.io.FilterInputStream.read(FilterInputStream.java:116) at org.apache.hadoop.ipc.Client$Connection$PingInputStream.read(Client.java:304) at java.io.BufferedInputStream.fill(BufferedInputStream.java:218) at java.io.BufferedInputStream.read(BufferedInputStream.java:237) - locked <0x00007f1808539178> (a java.io.BufferedInputStream) at java.io.DataInputStream.readInt(DataInputStream.java:370) at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:569) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:477)

這裏是一個RegionServer死了,master正在試着恢復。

"LeaseChecker" daemon prio=10 tid=0x00000000407ef800 nid=0x76cd waiting on condition [0x00007f6d0eae2000..0x00007f6d0eae2a70] -- java.lang.Thread.State: WAITING (on object monitor) at java.lang.Object.wait(Native Method) at java.lang.Object.wait(Object.java:485) at org.apache.hadoop.ipc.Client.call(Client.java:726) - locked <0x00007f6d1cd28f80> (a org.apache.hadoop.ipc.Client$Call) at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220) at $Proxy1.recoverBlock(Unknown Source) at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.processDatanodeError(DFSClient.java:2636) at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.<init>(DFSClient.java:2832) at org.apache.hadoop.hdfs.DFSClient.append(DFSClient.java:529) at org.apache.hadoop.hdfs.DistributedFileSystem.append(DistributedFileSystem.java:186) at org.apache.hadoop.fs.FileSystem.append(FileSystem.java:530) at org.apache.hadoop.hbase.util.FSUtils.recoverFileLease(FSUtils.java:619) at org.apache.hadoop.hbase.regionserver.wal.HLog.splitLog(HLog.java:1322) at org.apache.hadoop.hbase.regionserver.wal.HLog.splitLog(HLog.java:1210) at org.apache.hadoop.hbase.master.HMaster.splitLogAfterStartup(HMaster.java:648) at org.apache.hadoop.hbase.master.HMaster.joinCluster(HMaster.java:572) at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:503)

12.4.2.5  OpenTSDB

OpenTSDB是一個Ganglia的很好的替代品,因爲他使用HBase來存儲所有的時序而不需要採樣。使用OpenTSDB來監控你的HBase是一個很好的實踐

這裏有一個例子,集羣正在同時進行上百個compaction,嚴重影響了IO性能。(TODO: 在這裏插入compactionQueueSize的圖片)(譯者注:囧)

給集羣構建一個圖表監控是一個很好的實踐。包括集羣和每臺機器。這樣就可以快速定位到問題。例如,在StumbleUpon,每個機器有一個圖表監控,包括OS和HBase,涵蓋所有的重要的信息。你也可以登錄到機器上,獲取更多的信息。

12.4.2.6  clusterssh+top

clusterssh+top,感覺是一個窮人用的監控系統,但是它確實很有效,當你只有幾臺機器的時候,很好設置。啓動clusterssh後,每臺機器都會有一個終端,另外還有一個主終端,你在主終端的操作都會反映到其他每一個終端上。這就意味着,你在一臺機器上執行"top",集羣中的所有機器都會給你全部的top信息。你還可以這樣tail全部的log,等等。

12.5. 客戶端

 

HBase 客戶端的更多信息, 參考 Section 9.3, “Client”.

12.5.1. ScannerTimeoutException 或 UnknownScannerException

當從客戶端到RegionServer的RPC請求超時。例如如果Scan.setCaching的值設置爲500,RPC請求就要去獲取500行的數據,每500次.next()操作獲取一次。因爲數據是以大塊的形式傳到客戶端的,就可能造成超時。將這個setCaching的值調小是一個解決辦法,但是這個值要是設的太小就會影響性能。

參考 Section 11.8.1, “Scan Caching”.

12.5.2. 普通操作時,Shell 或客戶端應用拋出很多不太重要的異常

Since 0.20.0 the default log level for org.apache.hadoop.hbase.* is DEBUG.

On your clients, edit $HBASE_HOME/conf/log4j.properties and change this: log4j.logger.org.apache.hadoop.hbase=DEBUG to this:log4j.logger.org.apache.hadoop.hbase=INFO, or even log4j.logger.org.apache.hadoop.hbase=WARN.

12.5.3. 壓縮時客戶端長時暫停

這是 HBase 郵件列表(dist-list)上經常被問到的問題。該場景一般發生在客戶端正在向相對未經優化的 HBase 集羣插入大量數據時。壓縮會加重暫停,儘管它並不是問題的源頭。

參考 Section 11.7.2, “Table Creation: Pre-Creating Regions”,關於預先創建區域的模式部分,並確認表沒有在單個區域中啓動。
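
下面是一個建表時預創建分區(pre-splitting)的簡單示意(表名 "myTable"、列族 "cf" 和分割點都只是假設,實際分割點應根據你的 rowkey 分佈確定):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.HColumnDescriptor;
import org.apache.hadoop.hbase.HTableDescriptor;
import org.apache.hadoop.hbase.client.HBaseAdmin;
import org.apache.hadoop.hbase.util.Bytes;

public class PreSplitExample {
  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    HBaseAdmin admin = new HBaseAdmin(conf);
    HTableDescriptor desc = new HTableDescriptor("myTable");   // 示例表名
    desc.addFamily(new HColumnDescriptor("cf"));               // 示例列族
    // 手工給出分割點,建表時即產生 4 個 region,避免所有寫入都壓在單個 region 上
    byte[][] splits = new byte[][] {
        Bytes.toBytes("25"), Bytes.toBytes("50"), Bytes.toBytes("75") };
    admin.createTable(desc, splits);
    admin.close();
  }
}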

參考 Section 11.4, “HBase Configurations” for cluster configuration, particularly hbase.hstore.blockingStoreFiles, hbase.hregion.memstore.block.multiplier, MAX_FILESIZE (region size), and MEMSTORE_FLUSHSIZE.

A slightly longer explanation of why pauses can happen is as follows: Puts are sometimes blocked on the MemStores which are blocked by the flusher thread which is blocked because there are too many files to compact because the compactor is given too many small files to compact and has to compact the same data repeatedly. This situation can occur even with minor compactions. Compounding this situation, HBase doesn't compress data in memory. Thus, the 64MB that lives in the MemStore could become a 6MB file after compression - which results in a smaller StoreFile. The upside is that more data is packed into the same region, but performance is achieved by being able to write larger files - which is why HBase waits until the flush size before writing a new StoreFile. And smaller StoreFiles become targets for compaction. Without compression the files are much bigger and don't need as much compaction, however this is at the expense of I/O.

For additional information, see this thread on Long client pauses with compression.

12.5.4. ZooKeeper 客戶端連接錯誤

錯誤類似於...

11/07/05 11:26:41 WARN zookeeper.ClientCnxn: Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect java.net.ConnectException: Connection refused: no further information at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source) at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1078) 11/07/05 11:26:43 INFO zookeeper.ClientCnxn: Opening socket connection to server localhost/127.0.0.1:2181 11/07/05 11:26:44 WARN zookeeper.ClientCnxn: Session 0x0 for server null, unexpected error, closing socket connection and attempting reconnect java.net.ConnectException: Connection refused: no further information at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method) at sun.nio.ch.SocketChannelImpl.finishConnect(Unknown Source) at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:1078) 11/07/05 11:26:45 INFO zookeeper.ClientCnxn: Opening socket connection to server localhost/127.0.0.1:2181

……這要麼是因爲 ZooKeeper 掛掉了,要麼是網絡問題導致其不可達。

工具 Section 12.4.1.3, “zkcli” 可以幫助調查 ZooKeeper 問題。

12.5.5. 客戶端內存耗盡,但堆大小看起來不太變化( off-heap/direct heap 在增長)

You are likely running into the issue that is described and worked through in the mail thread HBase, mail # user - Suspected memory leak and continued over in HBase, mail # dev - FeedbackRe: Suspected memory leak. A workaround is passing your client-side JVM a reasonable value for -XX:MaxDirectMemorySize. By default, the MaxDirectMemorySize is equal to your -Xmx max heapsize setting (if -Xmx is set). Try setting it to something smaller (for example, one user had success setting it to 1g when they had a client-side heap of 12g). If you set it too small, it will bring on FullGCs so keep it a bit hefty. You want to make this setting client-side only, especially if you are running the new experimental server-side off-heap cache since this feature depends on being able to use big direct buffers (You may have to keep separate client-side and server-side config dirs).

12.5.6. 客戶端變慢,在調用管理方法(flush, compact, 等)時發生

該客戶端問題已由 HBASE-5073 在 0.90.6 版中修復。客戶端存在 ZooKeeper 泄露,每額外調用一次管理 API,客戶端就會被更多的 ZooKeeper 事件持續衝擊。

12.5.7. 安全客戶端不能連接 ([由 GSSException 引起: No valid credentials provided (Mechanism level: Failed to find any Kerberos tgt)])

There can be several causes that produce this symptom.

First, check that you have a valid Kerberos ticket. One is required in order to set up communication with a secure HBase cluster. Examine the ticket currently in the credential cache, if any, by running the klist command line utility. If no ticket is listed, you must obtain a ticket by running the kinit command with either a keytab specified, or by interactively entering a password for the desired principal.

Then, consult the Java Security Guide troubleshooting section. The most common problem addressed there is resolved by setting the javax.security.auth.useSubjectCredsOnly system property value to false.

Because of a change in the format in which MIT Kerberos writes its credentials cache, there is a bug in the Oracle JDK 6 Update 26 and earlier that causes Java to be unable to read the Kerberos credentials cache created by versions of MIT Kerberos 1.8.1 or higher. If you have this problematic combination of components in your environment, to work around this problem, first log in with kinit and then immediately refresh the credential cache with kinit -R. The refresh will rewrite the credential cache without the problematic formatting.

Finally, depending on your Kerberos configuration, you may need to install the Java Cryptography Extension, or JCE. Ensure the JCE jars are on the classpath on both server and client systems.

You may also need to download the unlimited strength JCE policy files. Uncompress and extract the downloaded file, and install the policy jars into <java-home>/lib/security.

12.6. MapReduce

12.6.1. 你認爲自己在用集羣, 實際上你在用本地(Local)

如下的調用棧在使用 ImportTsv時發生,但同樣的事可以在錯誤配置的任何任務中發生。

WARN mapred.LocalJobRunner: job_local_0001 java.lang.IllegalArgumentException: Can't read partitions file at org.apache.hadoop.hbase.mapreduce.hadoopbackport.TotalOrderPartitioner.setConf(TotalOrderPartitioner.java:111) at org.apache.hadoop.util.ReflectionUtils.setConf(ReflectionUtils.java:62) at org.apache.hadoop.util.ReflectionUtils.newInstance(ReflectionUtils.java:117) at org.apache.hadoop.mapred.MapTask$NewOutputCollector.<init>(MapTask.java:560) at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:639) at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:210) Caused by: java.io.FileNotFoundException: File _partition.lst does not exist. at org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:383) at org.apache.hadoop.fs.FilterFileSystem.getFileStatus(FilterFileSystem.java:251) at org.apache.hadoop.fs.FileSystem.getLength(FileSystem.java:776) at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1424) at org.apache.hadoop.io.SequenceFile$Reader.<init>(SequenceFile.java:1419) at org.apache.hadoop.hbase.mapreduce.hadoopbackport.TotalOrderPartitioner.readPartitions(TotalOrderPartitioner.java:296)

.. 看到調用棧的關鍵部分了嗎?就是...

at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:210)

LocalJobRunner 意思就是任務跑在本地,不在集羣。

參考 http://hbase.apache.org/apidocs/org/apache/hadoop/hbase/mapreduce/package-summary.html#classpath for more information on HBase MapReduce jobs and classpaths.
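
下面是一個讀取 HBase 表的 MapReduce 作業骨架,僅作示意(表名 "myTable" 和 MyMapper 均爲假設)。TableMapReduceUtil.initTableMapperJob 會把 HBase 依賴的 jar 加入作業;而作業最終跑在集羣上還是退化爲 LocalJobRunner,取決於提交作業時 classpath 上是否有完整的集羣配置。

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil;
import org.apache.hadoop.hbase.mapreduce.TableMapper;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.mapreduce.Job;

public class MyReadJob {
  static class MyMapper extends TableMapper<NullWritable, NullWritable> {
    @Override
    protected void map(ImmutableBytesWritable row, Result value, Context context) {
      // 僅作示例:不輸出任何內容
    }
  }

  public static void main(String[] args) throws Exception {
    // HBaseConfiguration.create() 讀取 classpath 上的 hbase-site.xml;
    // 作業提交相關的集羣配置(mapred-site.xml 等)同樣需要在 classpath 上,
    // 否則作業會退化爲 LocalJobRunner 在本地運行
    Configuration conf = HBaseConfiguration.create();
    Job job = new Job(conf, "my-hbase-read-job");
    job.setJarByClass(MyReadJob.class);
    Scan scan = new Scan();
    scan.setCaching(500);
    scan.setCacheBlocks(false);   // MapReduce 作業一般建議關閉 block cache
    // initTableMapperJob 內部會調用 addDependencyJars,把 HBase 依賴加入分佈式緩存
    TableMapReduceUtil.initTableMapperJob("myTable", scan, MyMapper.class,
        NullWritable.class, NullWritable.class, job);
    job.setNumReduceTasks(0);
    System.exit(job.waitForCompletion(true) ? 0 : 1);
  }
}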

12.7. NameNode

NameNode 更多信息, 參考 Section 9.9, “HDFS”.

12.7.1. 表和區域的HDFS 工具

要確定HBase 用了HDFS多大空間,可在NameNode使用 hadoop shell命令,例如...

hadoop fs -dus /hbase/

...返回全部HBase對象磁盤佔用的情況。

hadoop fs -dus /hbase/myTable

...返回HBase表'myTable'磁盤佔用的情況。

hadoop fs -du /hbase/myTable

...返回HBase的'myTable'表的各區域列表的磁盤佔用情況。

更多關於 HDFS shell 命令的信息,參考 HDFS 文件系統 Shell 文檔.

12.7.2. 瀏覽 HDFS ,查看 HBase 對象

有時需要瀏覽HDFS上的 HBase對象 。對象包括WALs (Write Ahead Logs), 表,區域,存儲文件等。最簡易的方法是在NameNode web應用中查看,端口 50070。NameNode web 應用提供到集羣中所有 DataNode 的鏈接,可以無縫瀏覽。

存儲在HDFS集羣中的HBase表的目錄結構是...

/hbase
    /<Table>                    (Tables in the cluster)
        /<Region>               (Regions for the table)
            /<ColumnFamily>     (ColumnFamilies for the Region for the table)
                /<StoreFile>    (StoreFiles for the ColumnFamily for the Regions for the table)

HDFS 中的HBase WAL目錄結構是..

/hbase
    /.logs
        /<RegionServer>    (RegionServers)
            /<HLog>        (WAL HLog files for the RegionServer)

參考HDFS User Guide 獲取其他非Shell診斷工具如fsck.

12.7.2.1. 用例

查詢 HDFS 中的 HBase 對象有兩個常見用例,都與表的壓縮(compaction)狀況有關:如果每個列族有大量存儲文件(StoreFile),表示可能需要進行主壓縮;另外,如果主壓縮之後得到的存儲文件仍然很小,可能意味着應該減少該表的列族數量。
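
下面是一段按照上文目錄結構統計每個列族存儲文件數量的簡單示意(表目錄 /hbase/myTable 只是假設,並且爲簡化起見跳過了以 "." 開頭的特殊子目錄):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileStatus;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class StoreFileCounter {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();   // 需要 classpath 上有指向集羣的 core-site.xml / hdfs-site.xml
    FileSystem fs = FileSystem.get(conf);
    Path tableDir = new Path("/hbase/myTable"); // 示例:HBase 根目錄下的表目錄
    for (FileStatus region : fs.listStatus(tableDir)) {
      if (!region.isDir() || region.getPath().getName().startsWith(".")) continue;
      for (FileStatus family : fs.listStatus(region.getPath())) {
        if (!family.isDir() || family.getPath().getName().startsWith(".")) continue;
        int count = fs.listStatus(family.getPath()).length;   // 該列族目錄下的存儲文件個數
        System.out.println(region.getPath().getName() + "/"
            + family.getPath().getName() + " : " + count + " storefile(s)");
      }
    }
  }
}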

12.8. 網絡

12.8.1. 網絡峯值(Network Spikes)

如果看到週期性網絡峯值,你可能需要檢查compactionQueues,是不是主壓縮正在進行。

參考 Section 2.8.2.8, “Managed Compactions” ,獲取更多管理壓縮的信息。

12.8.2. 迴環IP(Loopback IP)

HBase 希望迴環 IP 地址是 127.0.0.1. 參考開始章節 Section 2.2.3, “Loopback IP”.

12.8.3. 網絡接口

所有網絡接口是否正常?你確定嗎?參考故障診斷用例研究 Section 12.14, “Case Studies”.

12.9. 區域服務器

RegionServer 的更多信息,參考 Section 9.6, “RegionServer”.

12.9.1. 啓動錯誤

12.9.1.1. 主服務器啓動了,但區域服務器沒有啓動

主服務器相信區域服務器有IP地址127.0.0.1 - 這是 localhost 並被解析到主服務器自己的localhost.

區域服務器錯誤地向主服務器報告它們的IP地址是127.0.0.1。

修改區域服務器的 /etc/hosts 從...

# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               fully.qualified.regionservername regionservername localhost.localdomain localhost
::1                     localhost6.localdomain6 localhost6

... 改到 (將主名稱從localhost中移掉)...

# Do not remove the following line, or various programs
# that require network functionality will fail.
127.0.0.1               localhost.localdomain localhost
::1                     localhost6.localdomain6 localhost6

12.9.1.2. Compression Link Errors

由於 LZO 壓縮算法需要在集羣中的每臺機器上安裝,這是一個常見的啓動失敗原因。如果你看到了如下信息

11/02/20 01:32:15 ERROR lzo.GPLNativeCodeLoader: Could not load native gpl library java.lang.UnsatisfiedLinkError: no gplcompression in java.library.path at java.lang.ClassLoader.loadLibrary(ClassLoader.java:1734) at java.lang.Runtime.loadLibrary0(Runtime.java:823) at java.lang.System.loadLibrary(System.java:1028)

就意味着你的壓縮庫出現了問題。參見配置章節的 LZO compression configuration.

12.9.2. 運行時錯誤

12.9.2.1. RegionServer Hanging

Are you running an old JVM (< 1.6.0_u21?)? When you look at a thread dump, does it look like threads are BLOCKED but no one holds the lock all are blocked on? 參考 HBASE 3622 Deadlock in HBaseServer (JVM bug?). Adding -XX:+UseMembar to the HBase HBASE_OPTS in conf/hbase-env.sh may fix it.

Also, are you using Section 9.3.4, “RowLocks”? These are discouraged because they can lock up the RegionServers if not managed properly.

12.9.2.2. java.io.IOException...(Too many open files)

If you see log messages like this...

2010-09-13 01:24:17,336 WARN org.apache.hadoop.hdfs.server.datanode.DataNode: Disk-related IOException in BlockReceiver constructor. Cause is java.io.IOException: Too many open files at java.io.UnixFileSystem.createFileExclusively(Native Method) at java.io.File.createNewFile(File.java:883)

... 參見快速入門的章節 ulimit and nproc configuration.

12.9.2.3. xceiverCount 258 exceeds the limit of concurrent xcievers 256

這個時常會出現在DataNode的日誌中。

參見快速入門章節的 xceivers configuration.

 

12.9.2.4. 系統不穩定,DataNode或者其他系統進程有 "java.lang.OutOfMemoryError: unable to create new native thread in exceptions"的錯誤

參見快速入門章節的 ulimit and nproc configuration. The default on recent Linux distributions is 1024 - which is far too low for HBase.

12.9.2.5. DFS不穩定或者RegionServer租期超時

如果你收到了如下的消息:

2009-02-24 10:01:33,516 WARN org.apache.hadoop.hbase.util.Sleeper: We slept xxx ms, ten times longer than scheduled: 10000 2009-02-24 10:01:33,516 WARN org.apache.hadoop.hbase.util.Sleeper: We slept xxx ms, ten times longer than scheduled: 15000 2009-02-24 10:01:36,472 WARN org.apache.hadoop.hbase.regionserver.HRegionServer: unable to report to master for xxx milliseconds - retrying

……或者看到了 full GC 操作,那麼你可能正在經歷一次長時間的 full GC。

12.9.2.6. "No live nodes contain current block" and/or YouAreDeadException

這個錯誤有可能是OS的文件句柄用盡,也可能是網絡故障導致節點無法訪問。

參見快速入門章節 ulimit and nproc configuration,檢查你的網絡。

12.9.2.7. ZooKeeper SessionExpired events

Master or RegionServers shutting down with messages like those in the logs:

WARN org.apache.zookeeper.ClientCnxn: Exception closing session 0x278bd16a96000f to sun.nio.ch.SelectionKeyImpl@355811ec java.io.IOException: TIMED OUT at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:906) WARN org.apache.hadoop.hbase.util.Sleeper: We slept 79410ms, ten times longer than scheduled: 5000 INFO org.apache.zookeeper.ClientCnxn: Attempting connection to server hostname/IP:PORT INFO org.apache.zookeeper.ClientCnxn: Priming connection to java.nio.channels.SocketChannel[connected local=/IP:PORT remote=hostname/IP:PORT] INFO org.apache.zookeeper.ClientCnxn: Server connection successful WARN org.apache.zookeeper.ClientCnxn: Exception closing session 0x278bd16a96000d to sun.nio.ch.SelectionKeyImpl@3544d65e java.io.IOException: Session Expired at org.apache.zookeeper.ClientCnxn$SendThread.readConnectResult(ClientCnxn.java:589) at org.apache.zookeeper.ClientCnxn$SendThread.doIO(ClientCnxn.java:709) at org.apache.zookeeper.ClientCnxn$SendThread.run(ClientCnxn.java:945) ERROR org.apache.hadoop.hbase.regionserver.HRegionServer: ZooKeeper session expired

The JVM is doing a long running garbage collecting which is pausing every threads (aka "stop the world"). Since the RegionServer's local ZooKeeper client cannot send heartbeats, the session times out. By design, we shut down any node that isn't able to contact the ZooKeeper ensemble after getting a timeout so that it stops serving data that may already be assigned elsewhere.

  • Make sure you give plenty of RAM (in hbase-env.sh), the default of 1GB won't be able to sustain long running imports.
  • Make sure you don't swap, the JVM never behaves well under swapping.
  • Make sure you are not CPU starving the RegionServer thread. For example, if you are running a MapReduce job using 6 CPU-intensive tasks on a machine with 4 cores, you are probably starving the RegionServer enough to create longer garbage collection pauses.
  • Increase the ZooKeeper session timeout

If you wish to increase the session timeout, add the following to your hbase-site.xml to increase the timeout from the default of 60 seconds to 120 seconds.

<property>
  <name>zookeeper.session.timeout</name>
  <value>120000</value>
</property>
<property>
  <name>hbase.zookeeper.property.tickTime</name>
  <value>6000</value>
</property>

Be aware that setting a higher timeout means that the regions served by a failed RegionServer will take at least that amount of time to be transferred to another RegionServer. For a production system serving live requests, we would instead recommend setting it lower than 1 minute and over-provision your cluster in order to lower the memory load on each machine (hence having less garbage to collect per machine).

If this is happening during an upload which only happens once (like initially loading all your data into HBase), consider bulk loading.

參考 Section 12.11.2, “ZooKeeper, The Cluster Canary” for other general information about ZooKeeper troubleshooting.

12.9.2.8. NotServingRegionException

This exception is "normal" when found in the RegionServer logs at DEBUG level. This exception is returned back to the client and then the client goes back to .META. to find the new location of the moved region.

However, if the NotServingRegionException is logged at ERROR, then the client ran out of retries and something is probably wrong.

12.9.2.9. Regions listed by domain name, then IP

Fix your DNS. In versions of HBase before 0.92.x, reverse DNS needs to give the same answer as forward lookup. 參考 HBASE 3431 RegionServer is not using the name given it by the master; double entry in master listing of servers for gory details.

12.9.2.10. Logs flooded with '2011-01-10 12:40:48,407 INFO org.apache.hadoop.io.compress.CodecPool: Got brand-new compressor' messages

We are not using the native versions of compression libraries. 參考 HBASE-1900 Put back native support when hadoop 0.21 is released. Copy the native libs from hadoop under hbase lib dir or symlink them into place and the message should go away.

12.9.2.11. Server handler X on 60020 caught: java.nio.channels.ClosedChannelException

If you see this type of message it means that the region server was trying to read/send data from/to a client but it already went away. Typical causes for this are if the client was killed (you see a storm of messages like this when a MapReduce job is killed or fails) or if the client receives a SocketTimeoutException. It's harmless, but you should consider digging in a bit more if you aren't doing something to trigger them.

12.9.3. 終止錯誤

12.10. Master

For more information on the Master, see Section 9.5, “Master”.

12.10.1. 啓動錯誤

12.10.1.1. Master says that you need to run the hbase migrations script

Upon running that, the hbase migrations script says no files in root directory.

HBase expects the root directory to either not exist, or to have already been initialized by hbase running a previous time. If you create a new directory for HBase using Hadoop DFS, this error will occur. Make sure the HBase root directory does not currently exist or has been initialized by a previous run of HBase. Sure fire solution is to just use Hadoop dfs to delete the HBase root and let HBase create and initialize the directory itself.

12.10.2. 終止錯誤

12.11. ZooKeeper

12.11.1. 啓動錯誤

12.11.1.1. Could not find my address: xyz in list of ZooKeeper quorum servers

A ZooKeeper server wasn't able to start, throws that error. xyz is the name of your server.

This is a name lookup problem. HBase tries to start a ZooKeeper server on some machine but that machine isn't able to find itself in the hbase.zookeeper.quorum configuration.

Use the hostname presented in the error message instead of the value you used. If you have a DNS server, you can set hbase.zookeeper.dns.interface and hbase.zookeeper.dns.nameserver in hbase-site.xml to make sure it resolves to the correct FQDN.

12.11.2. ZooKeeper, The Cluster Canary

ZooKeeper is the cluster's "canary in the mineshaft". It'll be the first to notice issues if any so making sure its happy is the short-cut to a humming cluster.

參考 the ZooKeeper Operating Environment Troubleshooting page. It has suggestions and tools for checking disk and networking performance; i.e. the operating environment your ZooKeeper and HBase are running in.

Additionally, the utility Section 12.4.1.3, “zkcli” may help investigate ZooKeeper issues.

 

12.12. Amazon EC2

12.12.1. ZooKeeper 在 Amazon EC2上看起來不工作?

HBase does not start when deployed as Amazon EC2 instances. Exceptions like the below appear in the Master and/or RegionServer logs:

2009-10-19 11:52:27,030 INFO org.apache.zookeeper.ClientCnxn: Attempting connection to server ec2-174-129-15-236.compute-1.amazonaws.com/10.244.9.171:2181 2009-10-19 11:52:27,032 WARN org.apache.zookeeper.ClientCnxn: Exception closing session 0x0 to sun.nio.ch.SelectionKeyImpl@656dc861 java.net.ConnectException: Connection refused

Security group policy is blocking the ZooKeeper port on a public address. Use the internal EC2 host names when configuring the ZooKeeper quorum peer list.

12.12.2. Instability on Amazon EC2

Questions on HBase and Amazon EC2 come up frequently on the HBase dist-list. Search for old threads using Search Hadoop

12.12.3. Remote Java Connection into EC2 Cluster Not Working

參考 Andrew's answer here, up on the user list: Remote Java client connection into EC2 instance.

12.13. HBase and Hadoop version issues

12.13.1. NoClassDefFoundError when trying to run 0.90.x on hadoop-0.20.205.x (or hadoop-1.0.x)

HBase 0.90.x does not ship with hadoop-0.20.205.x, etc. To make it run, you need to replace the hadoop jars that HBase shipped with in its lib directory with those of the Hadoop you want to run HBase on. If even after replacing Hadoop jars you get the below exception:

sv4r6s38: Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/commons/configuration/Configuration sv4r6s38: at org.apache.hadoop.metrics2.lib.DefaultMetricsSystem.<init>(DefaultMetricsSystem.java:37) sv4r6s38: at org.apache.hadoop.metrics2.lib.DefaultMetricsSystem.<clinit>(DefaultMetricsSystem.java:34) sv4r6s38: at org.apache.hadoop.security.UgiInstrumentation.create(UgiInstrumentation.java:51) sv4r6s38: at org.apache.hadoop.security.UserGroupInformation.initialize(UserGroupInformation.java:209) sv4r6s38: at org.apache.hadoop.security.UserGroupInformation.ensureInitialized(UserGroupInformation.java:177) sv4r6s38: at org.apache.hadoop.security.UserGroupInformation.isSecurityEnabled(UserGroupInformation.java:229) sv4r6s38: at org.apache.hadoop.security.KerberosName.<clinit>(KerberosName.java:83) sv4r6s38: at org.apache.hadoop.security.UserGroupInformation.initialize(UserGroupInformation.java:202) sv4r6s38: at org.apache.hadoop.security.UserGroupInformation.ensureInitialized(UserGroupInformation.java:177)

you need to copy under hbase/lib, the commons-configuration-X.jar you find in your Hadoop's lib directory. That should fix the above complaint.

12.14. 案例研究

For Performance and Troubleshooting Case Studies, see Chapter 13, Case Studies.


[29] 參考 Getting Answers

Chapter 13. 案例研究

13.1. 概述

This chapter will describe a variety of performance and troubleshooting case studies that can provide a useful blueprint on diagnosing cluster issues.

For more information on Performance and Troubleshooting, see Chapter 11, Performance Tuning and Chapter 12, Troubleshooting and Debugging HBase.

13.2. 模式設計

13.2.1. 數據列表

The following is an exchange from the user dist-list regarding a fairly common question: how to handle per-user list data in HBase.

*** QUESTION ***

We're looking at how to store a large amount of (per-user) list data in HBase, and we were trying to figure out what kind of access pattern made the most sense. One option is store the majority of the data in a key, so we could have something like:

<FixedWidthUserName><FixedWidthValueId1>:"" (no value) <FixedWidthUserName><FixedWidthValueId2>:"" (no value) <FixedWidthUserName><FixedWidthValueId3>:"" (no value)
The other option we had was to do this entirely using:
<FixedWidthUserName><FixedWidthPageNum0>:<FixedWidthLength><FixedIdNextPageNum><ValueId1><ValueId2><ValueId3>... <FixedWidthUserName><FixedWidthPageNum1>:<FixedWidthLength><FixedIdNextPageNum><ValueId1><ValueId2><ValueId3>...

where each row would contain multiple values. So in one case reading the first thirty values would be:

scan { STARTROW => 'FixedWidthUsername' LIMIT => 30}
And in the second case it would be
get 'FixedWidthUserName\x00\x00\x00\x00'

The general usage pattern would be to read only the first 30 values of these lists, with infrequent access reading deeper into the lists. Some users would have <= 30 total values in these lists, and some users would have millions (i.e. power-law distribution)

The single-value format seems like it would take up more space on HBase, but would offer some improved retrieval / pagination flexibility. Would there be any significant performance advantages to be able to paginate via gets vs paginating with scans?

My initial understanding was that doing a scan should be faster if our paging size is unknown (and caching is set appropriately), but that gets should be faster if we'll always need the same page size. I've ended up hearing different people tell me opposite things about performance. I assume the page sizes would be relatively consistent, so for most use cases we could guarantee that we only wanted one page of data in the fixed-page-length case. I would also assume that we would have infrequent updates, but may have inserts into the middle of these lists (meaning we'd need to update all subsequent rows).

Thanks for help / suggestions / follow-up questions.

*** ANSWER ***

If I understand you correctly, you're ultimately trying to store triples in the form "user, valueid, value", right? E.g., something like:

"user123, firstname, Paul", "user234, lastname, Smith"

(But the usernames are fixed width, and the valueids are fixed width).

And, your access pattern is along the lines of: "for user X, list the next 30 values, starting with valueid Y". Is that right? And these values should be returned sorted by valueid?

The tl;dr version is that you should probably go with one row per user+value, and not build a complicated intra-row pagination scheme on your own unless you're really sure it is needed.

Your two options mirror a common question people have when designing HBase schemas: should I go "tall" or "wide"? Your first schema is "tall": each row represents one value for one user, and so there are many rows in the table for each user; the row key is user + valueid, and there would be (presumably) a single column qualifier that means "the value". This is great if you want to scan over rows in sorted order by row key (thus my question above, about whether these ids are sorted correctly). You can start a scan at any user+valueid, read the next 30, and be done. What you're giving up is the ability to have transactional guarantees around all the rows for one user, but it doesn't sound like you need that. Doing it this way is generally recommended (see here #schema.smackdown).
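
As a rough sketch of the "tall" access pattern (the table name "lists", the single column family and the fixed-width user id below are assumptions for illustration, not part of the original question), reading the first page of a user's list is just a short scan starting at that user's prefix:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.ResultScanner;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.util.Bytes;

public class TallListRead {
  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    HTable table = new HTable(conf, "lists");              // hypothetical table name
    byte[] user = Bytes.toBytes("user0000001");            // fixed-width user name (assumption)
    Scan scan = new Scan(user);                            // start at this user's first row
    scan.setStopRow(Bytes.add(user, new byte[] { (byte) 0xFF }));  // stay within this user's rows
    scan.setCaching(30);                                   // one RPC fetches the whole "page"
    ResultScanner scanner = table.getScanner(scan);
    try {
      int n = 0;
      for (Result r : scanner) {
        // the valueid is the tail of the row key; the cell value is empty in this design
        System.out.println(Bytes.toString(r.getRow()));
        if (++n >= 30) break;
      }
    } finally {
      scanner.close();
      table.close();
    }
  }
}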

Your second option is "wide": you store a bunch of values in one row, using different qualifiers (where the qualifier is the valueid). The simple way to do that would be to just store ALL values for one user in a single row. I'm guessing you jumped to the "paginated" version because you're assuming that storing millions of columns in a single row would be bad for performance, which may or may not be true; as long as you're not trying to do too much in a single request, or do things like scanning over and returning all of the cells in the row, it shouldn't be fundamentally worse. The client has methods that allow you to get specific slices of columns.
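
For example, one way to read a fixed slice of columns out of a "wide" row is a Get with a ColumnPaginationFilter; the table name "lists" and the column family "v" are again just placeholders for illustration:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Get;
import org.apache.hadoop.hbase.client.HTable;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.filter.ColumnPaginationFilter;
import org.apache.hadoop.hbase.util.Bytes;

public class WideListRead {
  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    HTable table = new HTable(conf, "lists");               // hypothetical table name
    Get get = new Get(Bytes.toBytes("user0000001"));        // one row per user (assumption)
    get.addFamily(Bytes.toBytes("v"));                      // hypothetical column family
    get.setFilter(new ColumnPaginationFilter(30, 0));       // first 30 columns of the row
    Result result = table.get(get);
    System.out.println("columns returned: " + result.size());
    table.close();
  }
}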

Note that neither case fundamentally uses more disk space than the other; you're just "shifting" part of the identifying information for a value either to the left (into the row key, in option one) or to the right (into the column qualifiers in option 2). Under the covers, every key/value still stores the whole row key, and column family name. (If this is a bit confusing, take an hour and watch Lars George's excellent video about understanding HBase schema design: http://www.youtube.com/watch?v=_HLoH_PgrLk).

A manually paginated version has lots more complexities, as you note, like having to keep track of how many things are in each page, re-shuffling if new values are inserted, etc. That seems significantly more complex. It might have some slight speed advantages (or disadvantages!) at extremely high throughput, and the only way to really know that would be to try it out. If you don't have time to build it both ways and compare, my advice would be to start with the simplest option (one row per user+value). Start simple and iterate! :)

13.3. 性能/故障診斷

13.3.1. 用例 #1 (單節點性能問題)

13.3.1.1.場景

Following a scheduled reboot, one data node began exhibiting unusual behavior. Routine MapReduce jobs run against HBase tables which regularly completed in five or six minutes began taking 30 or 40 minutes to finish. These jobs were consistently found to be waiting on map and reduce tasks assigned to the troubled data node (e.g., the slow map tasks all had the same Input Split). The situation came to a head during a distributed copy, when the copy was severely prolonged by the lagging node.

13.3.1.2. 硬件

Datanodes:

  • Two 12-core processors
  • Six Enterprise SATA disks
  • 24GB of RAM
  • Two bonded gigabit NICs

Network:

  • 10 Gigabit top-of-rack switches
  • 20 Gigabit bonded interconnects between racks.

13.3.1.3. 假設

13.3.1.3.1. HBase "熱點" 區域

We hypothesized that we were experiencing a familiar point of pain: a "hot spot" region in an HBase table, where uneven key-space distribution can funnel a huge number of requests to a single HBase region, bombarding the RegionServer process and cause slow response time. Examination of the HBase Master status page showed that the number of HBase requests to the troubled node was almost zero. Further, examination of the HBase logs showed that there were no region splits, compactions, or other region transitions in progress. This effectively ruled out a "hot spot" as the root cause of the observed slowness.

13.3.1.3.2. HBase 分區具有非本地數據

Our next hypothesis was that one of the MapReduce tasks was requesting data from HBase that was not local to the datanode, thus forcing HDFS to request data blocks from other servers over the network. Examination of the datanode logs showed that there were very few blocks being requested over the network, indicating that the HBase region was correctly assigned, and that the majority of the necessary data was located on the node. This ruled out the possibility of non-local data causing a slowdown.

13.3.1.3.3. Excessive I/O Wait Due To Swapping Or An Over-Worked Or Failing Hard Disk

After concluding that the Hadoop and HBase were not likely to be the culprits, we moved on to troubleshooting the datanode's hardware. Java, by design, will periodically scan its entire memory space to do garbage collection. If system memory is heavily overcommitted, the Linux kernel may enter a vicious cycle, using up all of its resources swapping Java heap back and forth from disk to RAM as Java tries to run garbage collection. Further, a failing hard disk will often retry reads and/or writes many times before giving up and returning an error. This can manifest as high iowait, as running processes wait for reads and writes to complete. Finally, a disk nearing the upper edge of its performance envelope will begin to cause iowait as it informs the kernel that it cannot accept any more data, and the kernel queues incoming data into the dirty write pool in memory. However, using vmstat(1) and free(1), we could see that no swap was being used, and the amount of disk IO was only a few kilobytes per second.

13.3.1.3.4. Slowness Due To High Processor Usage

Next, we checked to see whether the system was performing slowly simply due to very high computational load. top(1) showed that the system load was higher than normal, but vmstat(1) and mpstat(1) showed that the amount of processor being used for actual computation was low.

13.3.1.3.5. Network Saturation (The Winner)

Since neither the disks nor the processors were being utilized heavily, we moved on to the performance of the network interfaces. The datanode had two gigabit ethernet adapters, bonded to form an active-standby interface. ifconfig(8) showed some unusual anomalies, namely interface errors, overruns, framing errors. While not unheard of, these kinds of errors are exceedingly rare on modern hardware which is operating as it should:

$ /sbin/ifconfig bond0 bond0 Link encap:Ethernet HWaddr 00:00:00:00:00:00 inet addr:10.x.x.x Bcast:10.x.x.255 Mask:255.255.255.0 UP BROADCAST RUNNING MASTER MULTICAST MTU:1500 Metric:1 RX packets:2990700159 errors:12 dropped:0 overruns:1 frame:6 <--- Look Here! Errors! TX packets:3443518196 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:0 RX bytes:2416328868676 (2.4 TB) TX bytes:3464991094001 (3.4 TB)

These errors immediately lead us to suspect that one or more of the ethernet interfaces might have negotiated the wrong line speed. This was confirmed both by running an ICMP ping from an external host and observing round-trip-time in excess of 700ms, and by running ethtool(8) on the members of the bond interface and discovering that the active interface was operating at 100Mb/s, full duplex.

$ sudo ethtool eth0 Settings for eth0: Supported ports: [ TP ] Supported link modes: 10baseT/Half 10baseT/Full 100baseT/Half 100baseT/Full 1000baseT/Full Supports auto-negotiation: Yes Advertised link modes: 10baseT/Half 10baseT/Full 100baseT/Half 100baseT/Full 1000baseT/Full Advertised pause frame use: No Advertised auto-negotiation: Yes Link partner advertised link modes: Not reported Link partner advertised pause frame use: No Link partner advertised auto-negotiation: No Speed: 100Mb/s <--- Look Here! Should say 1000Mb/s! Duplex: Full Port: Twisted Pair PHYAD: 1 Transceiver: internal Auto-negotiation: on MDI-X: Unknown Supports Wake-on: umbg Wake-on: g Current message level: 0x00000003 (3) Link detected: yes

In normal operation, the ICMP ping round trip time should be around 20ms, and the interface speed and duplex should read, "1000MB/s", and, "Full", respectively.

13.3.1.4. 結論

After determining that the active ethernet adapter was at the incorrect speed, we used the ifenslave(8) command to make the standby interface the active interface, which yielded an immediate improvement in MapReduce performance, and a 10 times improvement in network throughput:

On the next trip to the datacenter, we determined that the line speed issue was ultimately caused by a bad network cable, which was replaced.

13.3.2. 用例 #2 (性能研究 2012)

Investigation results of a self-described "we're not sure what's wrong, but it seems slow" problem. http://gbif.blogspot.com/2012/03/hbase-performance-evaluation-continued.html

13.3.3. Case Study #3 (Performance Research 2010))

Investigation results of general cluster performance from 2010. Although this research is on an older version of the codebase, this writeup is still very useful in terms of approach. http://hstack.org/hbase-performance-testing/

13.3.4. Case Study #4 (xcievers Config)

Case study of configuring xceivers, and diagnosing errors from mis-configurations. http://www.larsgeorge.com/2012/03/hadoop-hbase-and-xceivers.html

參考 also Section 2.3.2, “dfs.datanode.max.xcievers”.

Chapter 14. HBase 運維管理

This chapter will cover operational tools and practices required of a running HBase cluster. The subject of operations is related to the topics of Chapter 12, Troubleshooting and Debugging HBase, Chapter 11, Performance Tuning, and Chapter 2, Configuration, but is a distinct topic in itself.

14.1. HBase 工具和實用程序

Here we list HBase tools for administration, analysis, fixup, and debugging.

14.1.1. Driver

There is a Driver class executed by the HBase jar that can be used to invoke frequently accessed utilities. For example,

HADOOP_CLASSPATH=`${HBASE_HOME}/bin/hbase classpath` ${HADOOP_HOME}/bin/hadoop jar ${HBASE_HOME}/hbase-VERSION.jar

... will return...

An example program must be given as the first argument. Valid program names are:
  completebulkload: Complete a bulk data load.
  copytable: Export a table from local cluster to peer cluster
  export: Write table data to HDFS.
  import: Import data written by Export.
  importtsv: Import data in TSV format.
  rowcounter: Count rows in HBase table
  verifyrep: Compare the data from tables in two different clusters. WARNING: It doesn't work for incrementColumnValues'd cells since the timestamp is chan

... for allowable program names.

14.1.2. HBase hbck

An fsck for your HBase install

To run hbck against your HBase cluster run

$ ./bin/hbase hbck

At the end of the commands output it prints OK or INCONSISTENCY. If your cluster reports inconsistencies, pass -details to see more detail emitted. If inconsistencies, run hbck a few times because the inconsistency may be transient (e.g. cluster is starting up or a region is splitting). Passing -fix may correct the inconsistency (This latter is an experimental feature).

For more information, see Appendix B, hbck In Depth.

14.1.3. HFile 工具

參考 Section 9.7.5.2.2, “HFile Tool”.

14.1.4. WAL 工具

14.1.4.1. HLog tool

The main method on HLog offers manual split and dump facilities. Pass it WALs or the product of a split, the content of the recovered.edits. directory.

You can get a textual dump of a WAL file content by doing the following:

$ ./bin/hbase org.apache.hadoop.hbase.regionserver.wal.HLog --dump hdfs://example.org:8020/hbase/.logs/example.org,60020,1283516293161/10.10.21.10%3A60020.1283973724012

The return code will be non-zero if there are issues with the file, so you can test the health of the file by redirecting STDOUT to /dev/null and testing the program return code.

Similarly you can force a split of a log file directory by doing:

$ ./bin/hbase org.apache.hadoop.hbase.regionserver.wal.HLog --split hdfs://example.org:8020/hbase/.logs/example.org,60020,1283516293161/
14.1.4.1.1. HLogPrettyPrinter

HLogPrettyPrinter is a tool with configurable options to print the contents of an HLog.

14.1.5. Compression Tool

參考 Section C.1, “CompressionTest Tool”.

14.1.6. CopyTable

CopyTable is a utility that can copy part or all of a table, either to the same cluster or another cluster. The usage is as follows:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.CopyTable [--starttime=X] [--endtime=Y] [--new.name=NEW] [--peer.adr=ADR] tablename

Options:

  • starttime Beginning of the time range. Without endtime means starttime to forever.
  • endtime End of the time range.
  • versions Number of cell versions to copy.
  • new.name New table's name.
  • peer.adr Address of the peer cluster given in the format hbase.zookeeper.quorum:hbase.zookeeper.client.port:zookeeper.znode.parent
  • families Comma-separated list of ColumnFamilies to copy.
  • all.cells Also copy delete markers and uncollected deleted cells (advanced option).

Args:

  • tablename Name of table to copy.

Example of copying 'TestTable' to a cluster that uses replication for a 1 hour window:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.CopyTable --starttime=1265875194289 --endtime=1265878794289 --peer.adr=server1,server2,server3:2181:/hbase TestTable

Scanner Caching

Caching for the input Scan is configured via hbase.client.scanner.caching in the job configuration.

參考 Jonathan Hsieh's Online HBase Backups with CopyTable blog post for more on CopyTable.

14.1.7. 導出

導出實用工具可以將表的內容輸出成HDFS的序列化文件,如下調用:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.Export <tablename> <outputdir> [<versions> [<starttime> [<endtime>]]]

Note: caching for the input Scan is configured via hbase.client.scanner.caching in the job configuration.

14.1.8. 導入

導入實用工具可以加載導出的數據回到HBase,如下調用:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.Import <tablename> <inputdir>

14.1.9. ImportTsv

ImportTsv is a utility that will load data in TSV format into HBase. It has two distinct usages: loading data from TSV format in HDFS into HBase via Puts, and preparing StoreFiles to be loaded via the completebulkload.

To load data via Puts (i.e., non-bulk loading):

$ bin/hbase org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.columns=a,b,c <tablename> <hdfs-inputdir>

To generate StoreFiles for bulk-loading:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.ImportTsv -Dimporttsv.columns=a,b,c -Dimporttsv.bulk.output=hdfs://storefile-outputdir <tablename> <hdfs-data-inputdir>

These generated StoreFiles can be loaded into HBase via Section 14.1.10, “CompleteBulkLoad”.

14.1.9.1. ImportTsv 選項

Running ImportTsv with no arguments prints brief usage information:
Usage: importtsv -Dimporttsv.columns=a,b,c <tablename> <inputdir>

Imports the given input directory of TSV data into the specified table.

The column names of the TSV data must be specified using the -Dimporttsv.columns option.
This option takes the form of comma-separated column names, where each column name
is either a simple column family, or a columnfamily:qualifier. The special column name
HBASE_ROW_KEY is used to designate that this column should be used as the row key for
each imported record. You must specify exactly one column to be the row key, and you
must specify a column name for every column that exists in the input data.

By default importtsv will load data directly into HBase. To instead generate HFiles of
data to prepare for a bulk data load, pass the option:
  -Dimporttsv.bulk.output=/path/for/output
  Note: if you do not use this option, then the target table must already exist in HBase

Other options that may be specified with -D include:
  -Dimporttsv.skip.bad.lines=false - fail if encountering an invalid line
  '-Dimporttsv.separator=|' - eg separate on pipes instead of tabs
  -Dimporttsv.timestamp=currentTimeAsLong - use the specified timestamp for the import
  -Dimporttsv.mapper.class=my.Mapper - A user-defined Mapper to use instead of org.apache.hadoop.hbase.mapreduce.TsvImporterMapper

14.1.9.2. ImportTsv 示例

For example, assume that we are loading data into a table called 'datatsv' with a ColumnFamily called 'd' with two columns "c1" and "c2".

Assume that an input file exists as follows:

row1	c1	c2 row2	c1	c2 row3	c1	c2 row4	c1	c2 row5	c1	c2 row6	c1	c2 row7	c1	c2 row8	c1	c2 row9	c1 c2 row10	c1	c2

For ImportTsv to use this input file, the command line needs to look like this:

HADOOP_CLASSPATH=`${HBASE_HOME}/bin/hbase classpath` ${HADOOP_HOME}/bin/hadoop jar ${HBASE_HOME}/hbase-VERSION.jar importtsv -Dimporttsv.columns=HBASE_ROW_KEY,d:c1,d:c2 -Dimporttsv.bulk.output=hdfs://storefileoutput datatsv hdfs://inputfile

... and in this example the first column is the rowkey, which is why the HBASE_ROW_KEY is used. The second and third columns in the file will be imported as "d:c1" and "d:c2", respectively.

14.1.9.3. ImportTsv Warning

If you are preparing a lot of data for bulk loading, make sure the target HBase table is pre-split appropriately.

14.1.9.4. 參考

For more information about bulk-loading HFiles into HBase, see Section 9.8, “Bulk Loading”

14.1.10. CompleteBulkLoad

completebulkload 實用工具可以將產生的存儲文件移動到HBase表。該工具經常和Section 14.1.9, “ImportTsv” 的輸出聯合使用。

兩種方法調用該工具,帶顯式類名或通過驅動:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.LoadIncrementalHFiles <hdfs://storefileoutput> <tablename>

.. 通過驅動..

HADOOP_CLASSPATH=`${HBASE_HOME}/bin/hbase classpath` ${HADOOP_HOME}/bin/hadoop jar ${HBASE_HOME}/hbase-VERSION.jar completebulkload <hdfs://storefileoutput> <tablename>

批量導入 HFiles 到 HBase的更多信息 ,參考 Section 9.8, “Bulk Loading”.

14.1.11. WALPlayer

WALPlayer 實用工具可以重放 WAL 文件到 HBase.

The WAL can be replayed for a set of tables or all tables, and a timerange can be provided (in milliseconds). The WAL is filtered to this set of tables. The output can optionally be mapped to another set of tables.

WALPlayer can also generate HFiles for later bulk importing, in that case only a single table and no mapping can be specified.

Invoke via:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.WALPlayer [options] <wal inputdir> <tables> [<tableMappings>]>

For example:

$ bin/hbase org.apache.hadoop.hbase.mapreduce.WALPlayer /backuplogdir oldTable1,oldTable2 newTable1,newTable2

14.1.12. RowCounter

RowCounter 實用工具可以統計表的行數。這是一個好工具,如果擔心元數據可能存在不一致,可以用於確認HBase可以讀取表的所有分塊。

$ bin/hbase org.apache.hadoop.hbase.mapreduce.RowCounter <tablename> [<column1> <column2>...]

Note: caching for the input Scan is configured via hbase.client.scanner.caching in the job configuration.

14.2. 區域管理

14.2.1. 主壓縮

主壓縮可以通過HBase shell 或 HBaseAdmin.majorCompact 進行。

注意:主壓縮並不進行區域合併。更多關於壓縮的信息,參考 Section 9.7.5.5, “Compaction”
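
下面是通過客戶端 API 觸發主壓縮的一個簡單示意(表名 "myTable" 僅爲示例;在 HBase shell 中等價的命令是 major_compact 'myTable'):

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.HBaseAdmin;

public class MajorCompactExample {
  public static void main(String[] args) throws Exception {
    Configuration conf = HBaseConfiguration.create();
    HBaseAdmin admin = new HBaseAdmin(conf);
    // 對整張表觸發主壓縮;也可以傳入單個 region 的名字
    admin.majorCompact("myTable");   // "myTable" 僅爲示例表名
    admin.close();
  }
}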

14.2.2. 合併

Merge is a utility that can merge adjoining regions in the same table (see org.apache.hadoop.hbase.util.Merge).

$ bin/hbase org.apache.hadoop.hbase.util.Merge <tablename> <region1> <region2>

If you feel you have too many regions and want to consolidate them, Merge is the utility you need. Merge must be run when the cluster is down. 參考 the O'Reilly HBase Book for an example of usage.

Additionally, there is a Ruby script attached to HBASE-1621 for region merging.

14.3. 節點管理

14.3.1. 節點下線

你可以在HBase的特定的節點上運行下面的腳本來停止RegionServer:

$ ./bin/hbase-daemon.sh stop regionserver

RegionServer會首先關閉所有的region,然後把它自己關閉。在停止的過程中,RegionServer會向ZooKeeper報告說它已經過期了。Master會發現RegionServer已經死了,把它當作崩潰的server來處理,並將其上的region分配到其他的節點上去。

在下線節點之前要停止Load Balancer

如果在運行load balancer的時候,一個節點要關閉, 則Load Balancer和Master的recovery可能會爭奪這個要下線的Regionserver。爲了避免這個問題,先將load balancer停止,參見下面的 Load Balancer.

RegionServer下線有一個缺點,就是其中的Region會有好一會兒離線。Region是按順序關閉的。如果一個server上有很多region,從第一個region被下線,到最後一個region被關閉,並且Master確認該RegionServer已經死亡,第一個region纔可以重新上線,整個過程要花很長時間。在HBase 0.90.2中,我們加入了一個功能,可以讓節點逐漸擺脫它的負載,最後關閉。HBase 0.90.2加入了 graceful_stop.sh腳本,可以這樣用:

$ ./bin/graceful_stop.sh
Usage: graceful_stop.sh [--config <conf-dir>] [--restart] [--reload] [--thrift] [--rest] <hostname>
 thrift      If we should stop/start thrift before/after the hbase stop/start
 rest        If we should stop/start rest before/after the hbase stop/start
 restart     If we should restart after graceful stop
 reload      Move offloaded regions back on to the stopped server
 debug       Move offloaded regions back on to the stopped server
 hostname    Hostname of server we are to stop

要下線一臺RegionServer可以這樣做

$ ./bin/graceful_stop.sh HOSTNAME

這裏的HOSTNAME是你想下線的那臺RegionServer的hostname。

On HOSTNAME

傳遞給graceful_stop.sh的HOSTNAME必須和HBase使用的hostname一致,HBase用它來區分RegionServer。可以用Master的UI來檢查RegionServer的id,通常是hostname,也可能是FQDN。不管HBase使用的是哪一個,你都應該把它傳給graceful_stop.sh腳本;目前腳本還不支持用IP地址來推斷hostname,所以傳IP會讓它認爲該server不在運行,也就沒有辦法下線了。

graceful_stop.sh 腳本會一個一個地將region從RegionServer中移出去,以減少該RegionServer的負載。它會先移除一個region,把這個region安置到一個新的地方,再移除下一個,直到全部移除。最後graceful_stop.sh腳本會讓RegionServer停止。Master會注意到RegionServer已經下線,而這時所有的region已經重新部署好,RegionServer就可以乾乾淨淨地結束,沒有WAL日誌需要分割。

Load Balancer

當執行graceful_stop腳本的時候,要將Region Load Balancer關掉(否則balancer和下線腳本會在region部署的問題上存在衝突):

hbase(main):001:0> balance_switch false true 0 row(s) in 0.3590 seconds

上面是將balancer關掉,要想開啓:

hbase(main):001:0> balance_switch true false 0 row(s) in 0.3590 seconds

14.3.2. 依次重啓

你還可以讓這個腳本重啓一個RegionServer,不改變上面的Region的位置。要想保留數據的位置,你可以依次重啓(Rolling Restart),就像這樣:

$ for i in `cat conf/regionservers|sort`; do ./bin/graceful_stop.sh --restart --reload --debug $i; done &> /tmp/log.txt &

Tail /tmp/log.txt來看腳本的運行過程.上面的腳本只對RegionServer進行操作。要確認load balancer已經關掉。還需要在之前更新master。下面是一段依次重啓的僞腳本,你可以借鑑它:

  1. 確認你的版本,保證配置已經rsync到整個集羣中。如果版本是0.90.2,需要打上HBASE-3744 和 HBASE-3756兩個補丁。

  2. 運行hbck確保你的集羣是一致的

    $ ./bin/hbase hbck

    當發現不一致的時候,可以修復他。

  3. 重啓Master:

    $ ./bin/hbase-daemon.sh stop master; ./bin/hbase-daemon.sh start master

  4. 關閉region balancer:

    $ echo "balance_switch false" | ./bin/hbase

  5. 在每個RegionServer上運行graceful_stop.sh

    $ for i in `cat conf/regionservers|sort`; do ./bin/graceful_stop.sh --restart --reload --debug $i; done &> /tmp/log.txt &

    如果你在RegionServer還開起來thrift和rest server。還需要加上--thrift or --rest 選項 (參見 graceful_stop.sh 腳本的用法).

  6. 再次重啓Master.這會把已經死亡的server列表清空,重新開啓balancer.

  7. 運行 hbck 保證集羣是一致的

14.4. HBase 度量

14.4.1. 度量安裝

參見 Metrics,可以獲得如何啓用 Metrics 輸出(metrics emission)的指導。

14.4.2. 區域服務器度量

14.4.2.1. hbase.regionserver.blockCacheCount

內存中的Block cache item數量。這個是存儲文件(HFiles)的緩存中的數量。

14.4.2.2. hbase.regionserver.blockCacheEvictedCount

Number of blocks that had to be evicted from the block cache due to heap size constraints.

14.4.2.3. hbase.regionserver.blockCacheFree

內存中的Block cache memory 剩餘 (單位 bytes).

14.4.2.4. hbase.regionserver.blockCacheHitCachingRatio

Block cache hit caching ratio (0 to 100). The cache-hit ratio for reads configured to look in the cache (i.e., cacheBlocks=true).

14.4.2.5. hbase.regionserver.blockCacheHitCount

Number of blocks of StoreFiles (HFiles) read from the cache.

14.4.2.6. hbase.regionserver.blockCacheHitRatio

Block cache 命中率(0 到 100). Includes all read requests, although those with cacheBlocks=false will always read from disk and be counted as a "cache miss"

 

14.4.2.7. hbase.regionserver.blockCacheMissCount

StoreFiles (HFiles)請求的未在緩存中的分塊數量。

14.4.2.8. hbase.regionserver.blockCacheSize

內存中的Block cache 大小 (單位 bytes). i.e., memory in use by the BlockCache

14.4.2.9. hbase.regionserver.compactionQueueSize

compaction隊列的大小. 這個值是需要進行compaction的region數目

14.4.2.10. hbase.regionserver.flushQueueSize

Number of enqueued regions in the MemStore awaiting flush.

14.4.2.11. hbase.regionserver.fsReadLatency_avg_time

文件系統延遲 (ms). 這個值是平均讀HDFS的延遲時間

14.4.2.12. hbase.regionserver.fsReadLatency_num_ops

文件系統讀操作。

14.4.2.13. hbase.regionserver.fsSyncLatency_avg_time

文件系統同步延遲(ms). Latency to sync the write-ahead log records to the filesystem.

14.4.2.14. hbase.regionserver.fsSyncLatency_num_ops

Number of operations to sync the write-ahead log records to the filesystem.

14.4.2.15. hbase.regionserver.fsWriteLatency_avg_time

文件系統寫延遲(ms). Total latency for all writers, including StoreFiles and write-head log.

14.4.2.16. hbase.regionserver.fsWriteLatency_num_ops

Number of filesystem write operations, including StoreFiles and write-ahead log.

14.4.2.17. hbase.regionserver.memstoreSizeMB

所有的RegionServer的memstore大小 (MB)

14.4.2.18. hbase.regionserver.regions

RegionServer服務的regions數量

14.4.2.19. hbase.regionserver.requests

讀寫請求的全部數量。請求是指RegionServer的RPC數量,因此一次Get一個請求,但一個緩存設爲1000的Scan也會在每次調用'next'時導致一個請求。一個批量load是一個Hfile一個請求。

14.4.2.20. hbase.regionserver.storeFileIndexSizeMB

當前RegionServer的storefile索引的總大小(MB)

14.4.2.21. hbase.regionserver.stores

RegionServer打開的stores數量。一個stores對應一個列族。例如,一個包含列族的表有3個region在這個RegionServer上,對應一個 列族就會有3個store.

14.4.2.22. hbase.regionserver.storeFiles

RegionServer打開的存儲文件(HFile)數量。這個值一定大於等於store的數量。

14.5. HBase 監控

14.5.1. 概述

實踐證明,下面這組度量對每個區域服務器的宏觀監控最爲重要,特別是在像 OpenTSDB 這樣的系統中。如果你的集羣有性能問題,你很可能需要參考這組信息。

HBase:

  • Requests
  • Compactions queue

OS:

  • IO Wait
  • User CPU

Java:

  • GC

HBase度量的更多信息,參考 Section 14.4, “HBase Metrics”.

14.5.2. 查詢太慢的日誌

HBase 的慢查詢日誌由可分析的 JSON 結構描述,記錄那些運行時間太長或輸出太多的客戶端操作(Gets、Puts、Deletes 等)的屬性。“運行太久”和“輸出太多”的門限可以配置,如後面所述。輸出產生在主區域服務器日誌中,以便和其他日誌事件一起發現更多細節。日誌前面還會加上 (responseTooSlow)、(responseTooLarge)、(operationTooSlow)、(operationTooLarge) 這類區分標籤,以便當用戶只希望看到慢查詢時用 grep 過濾。

14.5.2.1. 配置

有兩個配置節可用於調整查詢太慢日誌的門限。

  • hbase.ipc.warn.response.time 不被記錄太慢日誌的查詢執行的最大毫秒數(millisecond) 。缺省10000, 即 10 秒。可設 -1 禁止通過時間長短記入日誌。
  • hbase.ipc.warn.response.size 不被記錄日誌的查詢可返回的最大字節數。缺省 100 MB,可設爲 -1 禁止通過大小記入日誌。

14.5.2.2. 度量

查詢太慢日誌暴露給了JMX 度量。

  • hadoop.regionserver_rpc_slowResponse 是一個全局度量,反映所有超過門限而被記入日誌的響應。
  • hadoop.regionserver_rpc_methodName.aboveOneSec 度量反映所有耗時超過一秒的響應。

14.5.2.3. 輸出

輸出以操作做標籤,如 (operationTooSlow)。如果調用是客戶端操作,如 Put、Get 或 Delete,會暴露詳細的指紋信息;否則標籤爲 (responseTooSlow),同樣提供可分析的 JSON 輸出,但細節較少,完全依賴 RPC 自身的超時和超量設置。如果是響應大小觸發了日誌記錄,則標籤中用 TooLarge 代替 TooSlow;在大小和時長同時觸發時,也使用 TooLarge。

14.5.2.4. 示例

2011-09-08 10:01:25,824 WARN org.apache.hadoop.ipc.HBaseServer: (operationTooSlow): {"tables":{"riley2":{"puts":[{"totalColumns":11,"families":{"actions":[{"timestamp":1315501284459,"qualifier":"0","vlen":9667580},{"timestamp":1315501284459,"qualifier":"1","vlen":10122412},{"timestamp":1315501284459,"qualifier":"2","vlen":11104617},{"timestamp":1315501284459,"qualifier":"3","vlen":13430635}]},"row":"cfcd208495d565ef66e7dff9f98764da:0"}],"families":["actions"]}},"processingtimems":956,"client":"10.47.34.63:33623","starttimems":1315501284456,"queuetimems":0,"totalPuts":1,"class":"HRegionServer","responsesize":0,"method":"multiPut"}

注意,在"tables"結構裏的所有東西,是MultiPut的指紋打印的輸出。其餘的信息是RPC相關的,如處理時間和客戶端IP/port。 客戶端的其他操作的模式和通用結構與此相同,但根據單個操作的類型會有一定的不同。如果調用不是客戶端操作,則指紋細節信息將完全沒有。

對本示例而言,指出緩慢的原因很可能只是一個超大(約 100MB)的 multiput:multiPut 中每個 put 的 “vlen”(即 value length)域給出了該信息。

14.6. 集羣複製

參見 集羣複製.

14.7. HBase 備份

有兩種常用策略進行 HBase 備份:停止整個集羣再備份,以及在正在使用的集羣上備份。每一種途徑都有優缺點。

更多信息,參考Sematext的Blog HBase Backup Options.

14.7.1. 全停止備份

一些環境可以容忍暫時停止 HBase 集羣,例如集羣只用於後臺容量分析,並不提供前臺頁面。好處是 NameNode/Master 和 RegionServer 都是停止的,不會有任何機會丟失正在寫入的存儲文件或元數據變更。明顯的壞處是集羣被關閉。步驟包括:

14.7.1.1. 停止 HBase

14.7.1.2. Distcp

Distcp 既用於將HDFS裏面的HBase 目錄下的內容拷貝到當前集羣的另一個目錄,也可以拷貝到另一個集羣。

注意: Distcp 工作的情形是集羣關閉,沒有正在改變的文件。Distcp 不推薦用於正工作着的集羣。

14.7.1.3. 恢復 (如有必要)

通過distcp,從 HDFS備份的數據被拷貝到 '真實' 的hbase 目錄。 複製動作產生新的 HDFS 元數據, 所以並不需要從備份的NameNode 元數據恢復, 因爲是通過distcp從一個特定的 HDFS 目錄 (如, HBase 部分)複製, 不是整個HDFS 文件系統。

14.7.2. 工作集羣備份 - Replication

這種方法假設有另一個集羣。參考HBase的 replication 。

14.7.3. 工作集羣備份 - CopyTable

14.1.6節, “CopyTable” 工具,既可用於將一個表複製到同集羣的另一個表,也可將表複製到另一個集羣的表。

由於集羣在工作,有丟失正在改變的數據的風險。

14.7.4. 工作集羣備份 - Export

14.1.7節, “Export” 是一種將表的內容導出到同一集羣 HDFS 的方法。恢復數據時,可以使用 14.1.8節, “Import” 工具。

由於集羣在工作,有丟失正在改變的數據的風險。

14.8. 容量計劃

14.8.1.存儲

一個常見問題是 HBase 管理員需要估算一個 HBase 集羣要用多大存儲量。可以從幾個方面去考慮,最重要的是集羣要加載什麼數據。先從可靠地瞭解 HBase 內部如何存儲數據(KeyValue)開始。

14.8.1.1. KeyValue

HBase storage will be dominated by KeyValues. 參考 Section 9.7.5.4, “KeyValue” and Section 6.3.2, “Try to minimize row and column sizes” for how HBase stores data internally.

It is critical to understand that there is a KeyValue instance for every attribute stored in a row, and the rowkey-length, ColumnFamily name-length and attribute lengths will drive the size of the database more than any other factor.
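
As a back-of-the-envelope sketch (the byte counts below follow the KeyValue layout described in Section 9.7.5.4 and ignore compression, block overhead and HDFS replication), the per-cell storage cost can be estimated like this:

public class KeyValueSizeEstimate {
  // Rough size of one KeyValue as stored in a StoreFile (approximation only).
  static long keyValueSize(int rowLen, int familyLen, int qualifierLen, int valueLen) {
    long keyLen = 2 + rowLen      // 2-byte row length + rowkey
                + 1 + familyLen   // 1-byte family length + ColumnFamily name
                + qualifierLen    // column qualifier
                + 8               // timestamp
                + 1;              // key type
    return 4 + 4 + keyLen + valueLen;   // 4-byte key length + 4-byte value length prefixes
  }

  public static void main(String[] args) {
    // Example: 16-byte rowkey, 2-byte family name, 8-byte qualifier, 50-byte value
    System.out.println(keyValueSize(16, 2, 8, 50) + " bytes per cell (approx.)");
  }
}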

14.8.1.2. StoreFiles and Blocks

KeyValue instances are aggregated into blocks, and the blocksize is configurable on a per-ColumnFamily basis. Blocks are aggregated into StoreFile's. 參考 Section 9.7, “Regions”.

14.8.1.3. HDFS Block Replication

Because HBase runs on top of HDFS, factor in HDFS block replication into storage calculations.
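
For example, with the default HDFS replication factor of 3, a table whose StoreFiles total roughly 2 TB will occupy on the order of 6 TB of raw HDFS capacity (before any compression is taken into account).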

14.8.2. 區域

Another common question for HBase administrators is determining the right number of regions per RegionServer. This affects both storage and hardware planning. 參考 Section 11.4.1, “Number of Regions”.

15.2. IDEs

15.2.1. Eclipse

15.2.1.1. Code Formatting

參考 HBASE-3678 Add Eclipse-based Apache Formatter to HBase Wiki for an Eclipse formatter to help ensure your code conforms to HBase'y coding convention. The issue includes instructions for loading the attached formatter.

In addition to the automatic formatting, make sure you follow the style guidelines explained in Section 15.10.5, “Common Patch Feedback”

Also, no @author tags - that's a rule. Quality Javadoc comments are appreciated. And include the Apache license.

15.2.1.2. Subversive Plugin

Download and install the Subversive plugin.

Set up an SVN Repository target from Section 15.1.1, “SVN”, then check out the code.

15.2.1.3. Git Plugin

If you cloned the project via git, download and install the Git plugin (EGit). Attach to your local git repo (via the Git Repositories window) and you'll be able to see file revision history, generate patches, etc.

15.2.1.4. HBase Project Setup in Eclipse

The easiest way is to use the m2eclipse plugin for Eclipse. Eclipse Indigo or newer has m2eclipse built-in, or it can be found here: http://www.eclipse.org/m2e/. M2Eclipse provides Maven integration for Eclipse - it even lets you use the direct Maven commands from within Eclipse to compile and test your project.

To import the project, you merely need to go to File->Import...Maven->Existing Maven Projects and then point Eclipse at the HBase root directory; m2eclipse will automatically find all the hbase modules for you.

If you install m2eclipse and import HBase in your workspace, you will have to fix your eclipse Build Path. Remove the target folder, add the target/generated-jamon and target/generated-sources/java folders. You may also remove from your Build Path the exclusions on the src/main/resources and src/test/resources to avoid error message in the console 'Failed to execute goal org.apache.maven.plugins:maven-antrun-plugin:1.6:run (default) on project hbase: 'An Ant BuildException has occured: Replace: source file .../target/classes/hbase-default.xml doesn't exist'. This will also reduce the eclipse build cycles and make your life easier when developing.

15.2.1.5. Import into eclipse with the command line

For those not inclined to use m2eclipse, you can generate the Eclipse files from the command line. First, run (you should only have to do this once):

mvn clean install -DskipTests

and then close Eclipse and execute...

mvn eclipse:eclipse

... from your local HBase project directory in your workspace to generate some new .project and .classpath files. Then reopen Eclipse, and import the .project file in the HBase directory to a workspace.

15.2.1.6. Maven Classpath Variable

The M2_REPO classpath variable needs to be set up for the project. This needs to be set to your local Maven repository, which is usually ~/.m2/repository

If this classpath variable is not configured, you will see compile errors in Eclipse like this...
Description	Resource	Path	Location	Type The project cannot be built until build path errors are resolved	hbase	 Unknown	Java Problem Unbound classpath variable: 'M2_REPO/asm/asm/3.1/asm-3.1.jar' in project 'hbase'	hbase	 Build path	Build Path Problem Unbound classpath variable: 'M2_REPO/com/github/stephenc/high-scale-lib/high-scale-lib/1.1.1/high-scale-lib-1.1.1.jar' in project 'hbase'	hbase	 Build path	Build Path Problem Unbound classpath variable: 'M2_REPO/com/google/guava/guava/r09/guava-r09.jar' in project 'hbase'	hbase	 Build path	Build Path Problem Unbound classpath variable: 'M2_REPO/com/google/protobuf/protobuf-java/2.3.0/protobuf-java-2.3.0.jar' in project 'hbase'	hbase	 Build path	Build Path Problem Unbound classpath variable:

15.2.1.7. Eclipse Known Issues

Eclipse will currently complain about Bytes.java. It is not possible to turn these errors off.

Description	Resource	Path	Location	Type Access restriction: The method arrayBaseOffset(Class) from the type Unsafe is not accessible due to restriction on required library /System/Library/Java/JavaVirtualMachines/1.6.0.jdk/Contents/Classes/classes.jar	Bytes.java /hbase/src/main/java/org/apache/hadoop/hbase/util	line 1061	Java Problem Access restriction: The method arrayIndexScale(Class) from the type Unsafe is not accessible due to restriction on required library /System/Library/Java/JavaVirtualMachines/1.6.0.jdk/Contents/Classes/classes.jar Bytes.java	/hbase/src/main/java/org/apache/hadoop/hbase/util	line 1064	Java Problem Access restriction: The method getLong(Object, long) from the type Unsafe is not accessible due to restriction on required library /System/Library/Java/JavaVirtualMachines/1.6.0.jdk/Contents/Classes/classes.jar	Bytes.java /hbase/src/main/java/org/apache/hadoop/hbase/util	line 1111	Java Problem

15.2.1.8. Eclipse - More Information

For additional information on setting up Eclipse for HBase development on Windows, see Michael Morello's blog on the topic.

15.3. 創建HBase

This section will be of interest only to those building HBase from source.

15.3.1. Building in snappy compression support

Pass -Dsnappy to trigger the snappy maven profile for building snappy native libs into hbase.
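
For example, an illustrative invocation (the -Dsnappy flag can be appended to any of the build commands shown in this chapter):

mvn clean install -DskipTests -Dsnappy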

15.3.2. Building the HBase tarball

Do the following to build the HBase tarball. Passing the -Prelease will generate javadoc and run the RAT plugin to verify licenses on source.

% MAVEN_OPTS="-Xmx2g" mvn clean site install assembly:single -Dmaven.test.skip -Prelease

15.3.3. Adding an HBase release to Apache's Maven Repository

Follow the instructions at Publishing Maven Artifacts. The 'trick' to making it all work is answering the questions put to you by the mvn release plugin properly, making sure it is using the actual branch AND, before doing the mvn release:perform step, VERY IMPORTANT, checking and if necessary hand editing the release.properties file that was put under ${HBASE_HOME} by the previous step, release:prepare. You need to edit it to make it point at the right locations in SVN.

Use maven 3.0.x.

At the mvn release:perform step, before starting, if you are for example releasing hbase 0.92.0, you need to make sure the pom.xml version is 0.92.0-SNAPSHOT. This needs to be checked in. Since we do the maven release after actual release, I've been doing this checkin into a particular tag rather than into the actual release tag. So, say we released hbase 0.92.0 and now we want to do the release to the maven repository, in svn, the 0.92.0 release will be tagged 0.92.0. Making the maven release, copy the 0.92.0 tag to 0.92.0mvn. Check out this tag and change the version therein and commit.
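
An illustrative sketch of that tag copy (the repository URLs are assumptions; substitute the actual HBase SVN locations):

$ svn copy https://svn.apache.org/repos/asf/hbase/tags/0.92.0 \
    https://svn.apache.org/repos/asf/hbase/tags/0.92.0mvn -m "Copy 0.92.0 tag for the maven release"
$ svn checkout https://svn.apache.org/repos/asf/hbase/tags/0.92.0mvn
# then edit the pom.xml version therein to 0.92.0-SNAPSHOT and commit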

Here is how I'd answer the questions at release:prepare time:

What is the release version for "HBase"? (org.apache.hbase:hbase) 0.92.0: :
What is SCM release tag or label for "HBase"? (org.apache.hbase:hbase) hbase-0.92.0: : 0.92.0mvn
What is the new development version for "HBase"? (org.apache.hbase:hbase) 0.92.1-SNAPSHOT: :
[INFO] Transforming 'HBase'...

A strange issue I ran into was the one where the upload into the apache repository was being sprayed across multiple apache machines making it so I could not release. 參考 INFRA-4482 Why is my upload to mvn spread across multiple repositories?.

Here is my ~/.m2/settings.xml.

<settings xmlns="http://maven.apache.org/SETTINGS/1.0.0"
  xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
  xsi:schemaLocation="http://maven.apache.org/SETTINGS/1.0.0
                      http://maven.apache.org/xsd/settings-1.0.0.xsd">
  <servers>
    <!-- To publish a snapshot of some part of Maven -->
    <server>
      <id>apache.snapshots.https</id>
      <username>YOUR_APACHE_ID</username>
      <password>YOUR_APACHE_PASSWORD</password>
    </server>
    <!-- To publish a website using Maven -->
    <!-- To stage a release of some part of Maven -->
    <server>
      <id>apache.releases.https</id>
      <username>YOUR_APACHE_ID</username>
      <password>YOUR_APACHE_PASSWORD</password>
    </server>
  </servers>
  <profiles>
    <profile>
      <id>apache-release</id>
      <properties>
        <gpg.keyname>YOUR_KEYNAME</gpg.keyname>
        <!-- Keyname is something like this ... 00A5F21E ... do gpg --list-keys to find it -->
        <gpg.passphrase>YOUR_KEY_PASSWORD</gpg.passphrase>
      </properties>
    </profile>
  </profiles>
</settings>

When you run release:perform, pass -Papache-release else it will not 'sign' the artifacts it uploads.

If you run into the below, it's because you need to edit the version in the pom.xml and add -SNAPSHOT to it (and commit).

[INFO] Scanning for projects...
[INFO] Searching repository for plugin with prefix: 'release'.
[INFO] ------------------------------------------------------------------------
[INFO] Building HBase
[INFO]    task-segment: [release:prepare] (aggregator-style)
[INFO] ------------------------------------------------------------------------
[INFO] [release:prepare {execution: default-cli}]
[INFO] ------------------------------------------------------------------------
[ERROR] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] You don't have a SNAPSHOT project in the reactor projects list.
[INFO] ------------------------------------------------------------------------
[INFO] For more information, run Maven with the -e switch
[INFO] ------------------------------------------------------------------------
[INFO] Total time: 3 seconds
[INFO] Finished at: Sat Mar 26 18:11:07 PDT 2011
[INFO] Final Memory: 35M/423M
[INFO] ------------------------------------------------------------------------

15.3.4. Build Gotchas

If you see Unable to find resource 'VM_global_library.vm', ignore it. It's not an error. It is officially ugly though.

15.4. 發佈 hbase.apache.org 新版

Set up your apache credentials and the target site location locally in a place and form that maven can pick it up, in ~/.m2/settings.xml. See the ~/.m2/settings.xml example in Section 15.3.3, “Adding an HBase release to Apache's Maven Repository” above. Next, run the following:

$ mvn -DskipTests -Papache-release site site:deploy

You will be asked for your password. It can take a little time. Remember that it can take a few hours for your site changes to show up.

15.6. 測試

Developers, at a minimum, should familiarize themselves with the unit test detail; unit tests in HBase have a character not usually seen in other projects.

15.6.1. HBase 模塊

As of 0.96, HBase is split into multiple modules which creates "interesting" rules for how and where tests are written. If you are writing code for hbase-server, see Section 15.6.2, “Unit Tests” for how to write your tests; these tests can spin up a minicluster and will need to be categorized. For any other module, for example hbase-common, the tests must be strict unit tests and just test the class under test - no use of the HBaseTestingUtility or minicluster is allowed (or even possible given the dependency tree).

15.6.1.1. 在其他模塊中運行測試

If the module you are developing in has no other dependencies on other HBase modules, then you can cd into that module and just run:
mvn test
which will just run the tests IN THAT MODULE. If there are other dependencies on other modules, then you will have to run the command from the ROOT HBASE DIRECTORY. This will run the tests in the other modules, unless you specify to skip the tests in that module. For instance, to skip the tests in the hbase-server module, you would run:
mvn clean test -Dskip-server-tests
from the top level directory to run all the tests in modules other than hbase-server. Note that you can specify to skip tests in multiple modules as well as just for a single module. For example, to skip the tests in hbase-server and hbase-common, you would run:
mvn clean test -Dskip-server-tests -Dskip-common-tests

Also, keep in mind that if you are running tests in the hbase-server module you will need to apply the maven profiles discussed in Section 15.6.2.4, “Running tests” to get the tests to run properly.

15.6.2. 單元測試

HBase unit tests are subdivided into three categories: small, medium and large, with corresponding JUnit categories: SmallTests, MediumTests, LargeTests. JUnit categories are denoted using java annotations and look like this in your unit test code.

...
@Category(SmallTests.class)
public class TestHRegionInfo {
  @Test
  public void testCreateHRegionInfoName() throws Exception {
    // ...
  }
  ...
  @org.junit.Rule
  public org.apache.hadoop.hbase.ResourceCheckerJUnitRule cu =
    new org.apache.hadoop.hbase.ResourceCheckerJUnitRule();
}

The above example shows how to mark a test as belonging to the small category. The @org.junit.Rule lines on the end are also necessary. Add them to each new unit test file. They are needed by the categorization process. HBase uses a patched maven surefire plugin and maven profiles to implement its unit test characterizations.

15.6.2.1. Small Tests

Small tests are executed in a shared JVM. We put in this category all the tests that can be executed quickly in a shared JVM. The maximum execution time for a small test is 15 seconds, and small tests should not use a (mini)cluster.

15.6.2.2. Medium Tests

Medium tests represent tests that must be executed before proposing a patch. They are designed to run in less than 30 minutes altogether, and are quite stable in their results. They are designed to last less than 50 seconds individually. They can use a cluster, and each of them is executed in a separate JVM.

15.6.2.3. Large Tests

Large tests are everything else. They are typically integration-like tests, regression tests for specific bugs, timeout tests, performance tests. They are executed before a commit on the pre-integration machines. They can be run on the developer machine as well.

15.6.2.4. Running tests

Below we describe how to run the HBase junit categories.

15.6.2.4.1. Default: small and medium category tests

Running

mvn test

will execute all small tests in a single JVM (no fork) and then medium tests in a separate JVM for each test instance. Medium tests are NOT executed if there is an error in a small test. Large tests are NOT executed. There is one report for small tests, and one report for medium tests if they are executed. To run small and medium tests with the security profile enabled, do

mvn test -P security

15.6.2.4.2. Running all tests

Running

mvn test -P runAllTests

will execute small tests in a single JVM then medium and large tests in a separate JVM for each test. Medium and large tests are NOT executed if there is an error in a small test. Large tests are NOT executed if there is an error in a small or medium test. There is one report for small tests, and one report for medium and large tests if they are executed

15.6.2.4.3. Running a single test or all tests in a package

To run an individual test, e.g. MyTest, do

mvn test -P localTests -Dtest=MyTest

You can also pass multiple, individual tests as a comma-delimited list:

mvn test -P localTests -Dtest=MyTest1,MyTest2,MyTest3

You can also pass a package, which will run all tests under the package:

mvn test -P localTests -Dtest=org.apache.hadoop.hbase.client.*

To run a single test with the security profile enabled:

mvn test -P security,localTests -Dtest=TestGet

The -P localTests will remove the JUnit category effect (without this specific profile, the categories are taken into account). It will actually use the official release of surefire and the old connector (The HBase build uses a patched version of the maven surefire plugin). Each junit test class is executed in a separate JVM (a fork per test class). There is no parallelization when the localTests profile is set. You will see a new message at the end of the report: "[INFO] Tests are skipped". It's harmless.

15.6.2.4.4. Other test invocation permutations

Running

mvn test -P runSmallTests

will execute small tests only, in a single JVM.

Running

mvn test -P runMediumTests

will execute medium tests in a single JVM.

Running

mvn test -P runLargeTests

will execute large tests in a single JVM.

15.6.2.4.5. hbasetests.sh

It's also possible to use the script hbasetests.sh. This script runs the medium and large tests in parallel with two maven instances, and provides a single report. This script does not use the hbase version of surefire so no parallelization is being done other than the two maven instances the script sets up. It must be executed from the directory which contains the pom.xml.

For example running

./dev-support/hbasetests.sh

will execute small and medium tests. Running

./dev-support/hbasetests.sh runAllTests

will execute all tests. Running

./dev-support/hbasetests.sh replayFailed

will rerun the failed tests a second time, in a separate jvm and without parallelisation.

15.6.2.5. Writing Tests

15.6.2.5.1. General rules
  • As much as possible, tests should be written as category small tests.
  • All tests must be written to support parallel execution on the same machine, hence they should not use shared resources such as fixed ports or fixed file names.
  • Tests should not over-log. More than 100 lines/second makes the logs hard to read and uses I/O that is then not available to the other tests.
  • Tests can be written with HBaseTestingUtility. This class offers helper functions to create a temp directory and do the cleanup, or to start a cluster.
Categories and execution time
  • All tests must be categorized, if not they could be skipped.
  • All tests should be written to be as fast as possible.
  • Small category tests should last less than 15 seconds, and must not have any side effect.
  • Medium category tests should last less than 50 seconds.
  • Large category tests should last less than 3 minutes. This should ensure a good parallelization for people using it, and ease the analysis when the test fails.
15.6.2.5.2. Sleeps in tests

Whenever possible, tests should not use Thread.sleep, but rather wait for the real event they need. This is faster and clearer for the reader. Tests should not do a Thread.sleep without testing an ending condition, so that it is clear what the test is waiting for; moreover, the test will then work whatever the machine performance is. Sleeps should be minimal to be as fast as possible. Waiting for a variable should be done in a 40ms sleep loop. Waiting for a socket operation should be done in a 200 ms sleep loop.
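
As an illustration only, the wait-loop pattern described above might look like the following sketch (the helper and the condition are made up for the example, not an HBase API):

import static org.junit.Assert.fail;

// Sketch: poll the real ending condition in a short sleep loop instead of one
// long Thread.sleep. "condition" stands in for whatever event the test waits on.
private static void waitFor(java.util.concurrent.Callable<Boolean> condition, long timeoutMs)
    throws Exception {
  long deadline = System.currentTimeMillis() + timeoutMs;
  while (!condition.call()) {
    if (System.currentTimeMillis() > deadline) {
      fail("condition not met within " + timeoutMs + " ms");
    }
    Thread.sleep(40);   // ~40 ms loop for a variable, ~200 ms for socket operations
  }
}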

15.6.2.5.3. Tests using a cluster

Tests using an HRegion do not have to start a cluster: a region can use the local file system. Starting/stopping a cluster costs around 10 seconds. Clusters should not be started per test method but per test class. A started cluster must be shut down using HBaseTestingUtility#shutdownMiniCluster, which cleans up the directories. As much as possible, tests should use the default settings for the cluster. When they don't, they should document it. This will make it possible to share the cluster later.

15.7. Maven Build Commands

All commands executed from the local HBase project directory.

Note: use Maven 3 (Maven 2 may work but we suggest you use Maven 3).

15.7.1. Compile

mvn compile

15.7.2. Running all or individual Unit Tests

參考 the Section 15.6.2.4, “Running tests” section above in Section 15.6.2, “Unit Tests”

15.7.3. Building against various hadoop versions.

As of 0.96, HBase supports building against hadoop versions: 1.0.3, 2.0.0-alpha and 3.0.0-SNAPSHOT. By default, we will build with Hadoop-1.0.3. To change the version to run with Hadoop-2.0.0-alpha, you would run:

mvn -Dhadoop.profile=2.0 ...

That is, pass hadoop.profile=2.0 to build against hadoop 2.0. Tests may not all pass as of this writing, so you may need to pass -DskipTests unless you are inclined to fix the failing tests.
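
For instance, a full build against hadoop 2.0 that skips the tests might look like this:

mvn clean install -DskipTests -Dhadoop.profile=2.0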

Similarly, for 3.0, you would just replace the profile value. Note that Hadoop-3.0.0-SNAPSHOT does not currently have a deployed maven artifact - you will need to build and install your own in your local maven repository if you want to run against this profile.

In earlier versions of HBase, you can build against older versions of hadoop, notably, Hadoop 0.22.x and 0.23.x. If you are running, for example, HBase 0.94 and want to build against Hadoop 0.23.x, you would run:

mvn -Dhadoop.profile=22 ...

15.8. Getting Involved

HBase gets better only when people contribute!

As HBase is an Apache Software Foundation project, see Appendix H, HBase and the Apache Software Foundation for more information about how the ASF functions.

15.8.1. Mailing Lists

Sign up for the dev-list and the user-list. 參考 the mailing lists page. Posing questions - and helping to answer other people's questions - is encouraged! There are varying levels of experience on both lists so patience and politeness are encouraged (and please stay on topic.)

15.8.2. Jira

Check for existing issues in Jira. If it's either a new feature request, enhancement, or a bug, file a ticket.

15.8.2.1. Jira Priorities

The following is a guideline on setting Jira issue priorities:

  • Blocker: Should only be used if the issue WILL cause data loss or cluster instability reliably.
  • Critical: The issue described can cause data loss or cluster instability in some cases.
  • Major: Important but not tragic issues, like updates to the client API that will add a lot of much-needed functionality or significant bugs that need to be fixed but that don't cause data loss.
  • Minor: Useful enhancements and annoying but not damaging bugs.
  • Trivial: Useful enhancements but generally cosmetic.

15.8.2.2. Code Blocks in Jira Comments

A commonly used macro in Jira is {code}. If you do this in a Jira comment...

{code} code snippet {code}

... Jira will format the code snippet like code, instead of a regular comment. It improves readability.

15.9. Developing

15.9.1. Codelines

Most development is done on TRUNK. However, there are branches for minor releases (e.g., 0.90.1, 0.90.2, and 0.90.3 are on the 0.90 branch).

If you have any questions on this just send an email to the dev dist-list.

15.9.2. Unit Tests

In HBase we use JUnit 4. If you need to run miniclusters of HDFS, ZooKeeper, HBase, or MapReduce testing, be sure to check out the HBaseTestingUtility. Alex Baranau of Sematext describes how it can be used in HBase Case-Study: Using HBaseTestingUtility for Local Testing and Development (2010).
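
For instance, a rough sketch of the per-test-class minicluster pattern (the class is invented for illustration; the category and import locations reflect the layout of this era and should be treated as assumptions):

import org.apache.hadoop.hbase.HBaseTestingUtility;
import org.apache.hadoop.hbase.MediumTests;
import org.junit.AfterClass;
import org.junit.BeforeClass;
import org.junit.Test;
import org.junit.experimental.categories.Category;

@Category(MediumTests.class)
public class TestWithMiniCluster {
  private static final HBaseTestingUtility UTIL = new HBaseTestingUtility();

  @BeforeClass
  public static void setUpBeforeClass() throws Exception {
    UTIL.startMiniCluster();        // spins up HDFS, ZooKeeper and HBase in one JVM
  }

  @AfterClass
  public static void tearDownAfterClass() throws Exception {
    UTIL.shutdownMiniCluster();     // also cleans up the temp directories it created
  }

  @Test
  public void testSomethingAgainstTheMiniCluster() throws Exception {
    // e.g. UTIL.createTable(...), then exercise the client API against it
  }
}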

15.9.2.1. Mockito

Sometimes you don't need a full running server to unit test. For example, some methods can make do with an org.apache.hadoop.hbase.Server instance or an org.apache.hadoop.hbase.master.MasterServices Interface reference rather than a full-blown org.apache.hadoop.hbase.master.HMaster. In these cases, you may be able to get away with a mocked Server instance. For example:

TODO...
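
Until a real example lands here, the following sketch illustrates the idea with Mockito (the test class is invented for illustration; treat the exact API details as assumptions):

import static org.mockito.Mockito.mock;
import static org.mockito.Mockito.when;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.Server;
import org.junit.Test;

public class TestWithMockedServer {
  @Test
  public void testUsingMockedServer() throws Exception {
    // Hand the code under test a mocked Server rather than a full-blown HMaster.
    Server server = mock(Server.class);
    Configuration conf = HBaseConfiguration.create();
    when(server.getConfiguration()).thenReturn(conf);
    // ... pass "server" into the method under test and assert on the result ...
  }
}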

15.9.3. Code Standards

參考 Section 15.2.1.1, “Code Formatting” and Section 15.10.5, “Common Patch Feedback”.

Also, please pay attention to the interface stability/audience classifications that you will see all over our code base. They look like this at the head of the class:

@InterfaceAudience.Public @InterfaceStability.Stable

If the InterfaceAudience is Private, we can change the class (and we do not need to include an InterfaceStability mark). If a class is marked Public but its InterfaceStability is marked Unstable, we can change it. If it's marked Public/Evolving, we're allowed to change it but should try not to. If it's Public and Stable, we can't change it without a deprecation path or a really GREAT reason.

When you add new classes, mark them with the annotations above if publicly accessible. If you are not clear on how to mark your additions, ask up on the dev list.

This convention comes from our parent project Hadoop.

15.9.4. Running In-Situ

If you are developing HBase, frequently it is useful to test your changes against a more-real cluster than what you find in unit tests. In this case, HBase can be run directly from the source in local-mode. All you need to do is run:

${HBASE_HOME}/bin/start-hbase.sh

This will spin up a full local-cluster, just as if you had packaged up HBase and installed it on your machine.

Keep in mind that you will need to have installed HBase into your local maven repository for the in-situ cluster to work properly. That is, you will need to run:

mvn clean install -DskipTests

to ensure that maven can find the correct classpath and dependencies. Generally, the above command is just a good thing to try running first, if maven is acting oddly.

15.10. Submitting Patches

15.10.1. Create Patch

Patch files can be easily generated from Eclipse, for example by selecting "Team -> Create Patch". Patches can also be created by git diff and svn diff.

Please submit one patch-file per Jira. For example, if multiple files are changed make sure the selected resource when generating the patch is a directory. Patch files can reflect changes in multiple files.

Make sure you review Section 15.2.1.1, “Code Formatting” for code style.

15.10.2. Patch File Naming

The patch file should have the HBase Jira ticket in the name. For example, if a patch was submitted for Foo.java, then a patch file called Foo_HBASE_XXXX.patch would be acceptable where XXXX is the HBase Jira number.

If you are generating from a branch, then including the target branch in the filename is advised, e.g., HBASE-XXXX-0.90.patch.
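
For example, from a git checkout of the 0.90 branch, generating and naming the patch could look like this (XXXX is a placeholder for the Jira number):

$ git diff > HBASE-XXXX-0.90.patch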

15.10.3. Unit Tests

Yes, please. Please try to include unit tests with every code patch (and especially new classes and large changes). Make sure unit tests pass locally before submitting the patch.

Also, see Section 15.9.2.1, “Mockito”.

If you are creating a new unit test class, notice how other unit test classes have classification/sizing annotations at the top and a static method on the end. Be sure to include these in any new unit test files you generate. 參考 Section 15.6, “測試” for more on how the annotations work.

15.10.4. Attach Patch to Jira

The patch should be attached to the associated Jira ticket "More Actions -> Attach Files". Make sure you click the ASF license inclusion, otherwise the patch can't be considered for inclusion.

Once attached to the ticket, click "Submit Patch" and the status of the ticket will change. Committers will review submitted patches for inclusion into the codebase. Please understand that not every patch may get committed, and that feedback will likely be provided on the patch. Fear not, though, because the HBase community is helpful!

15.10.5. Common Patch Feedback

The following items are representative of common patch feedback. Your patch process will go faster if these are taken into account before submission.

參考 the Java coding standards for more information on coding conventions in Java.

15.10.5.1. Space Invaders

Rather than do this...

if ( foo.equals( bar ) ) { // don't do this

... do this instead...

if (foo.equals(bar)) {

Also, rather than do this...

foo = barArray[ i ]; // don't do this

... do this instead...

foo = barArray[i];

15.10.5.2. Auto Generated Code

Auto-generated code in Eclipse often looks like this...

public void readFields(DataInput arg0) throws IOException {   // don't do this
  foo = arg0.readUTF();                                        // don't do this

... do this instead ...

public void readFields(DataInput di) throws IOException {
  foo = di.readUTF();

參考 the difference? 'arg0' is what Eclipse uses for arguments by default.

15.10.5.3. Long Lines

Keep lines less than 100 characters.

Bar bar = foo.veryLongMethodWithManyArguments(argument1, argument2, argument3, argument4, argument5, argument6, argument7, argument8, argument9); // don't do this

... do something like this instead ...

Bar bar = foo.veryLongMethodWithManyArguments(
  argument1, argument2, argument3, argument4, argument5,
  argument6, argument7, argument8, argument9);

15.10.5.4. Trailing Spaces

This happens more than people would imagine.

Bar bar = foo.getBar(); <--- imagine there's an extra space(s) after the semicolon instead of a line break.

Make sure there's a line-break after the end of your code, and also avoid lines that have nothing but whitespace.

15.10.5.5. Implementing Writable

Every class returned by RegionServers must implement Writable. If you are creating a new class that needs to implement this interface, don't forget the default constructor.
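
For illustration, a minimal Writable with the required default constructor might look like this (the class and field are invented for the example):

import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

import org.apache.hadoop.io.Writable;

public class FooMessage implements Writable {
  private String foo = "";

  public FooMessage() {
    // default constructor: required so the RPC layer can instantiate the class and call readFields()
  }

  public FooMessage(String foo) {
    this.foo = foo;
  }

  public void write(DataOutput out) throws IOException {
    out.writeUTF(foo);
  }

  public void readFields(DataInput in) throws IOException {
    foo = in.readUTF();
  }
}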

15.10.5.6. Javadoc

This is also a very common feedback item. Don't forget Javadoc!

15.10.5.7. Javadoc - Useless Defaults

Don't just leave the @param arguments the way your IDE generated them. Don't do this...

/**
 *
 * @param bar             <---- don't do this!!!!
 * @return                <---- or this!!!!
 */
public Foo getFoo(Bar bar);

... either add something descriptive to the @param and @return lines, or just remove them. But the preference is to add something descriptive and useful.
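
For example, the same signature with descriptive javadoc (the descriptions are invented for illustration):

/**
 * Looks up the Foo associated with the given Bar.
 * @param bar the bar to resolve; must not be null
 * @return the matching Foo, or null if none exists
 */
public Foo getFoo(Bar bar);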

15.10.5.8. One Thing At A Time, Folks

If you submit a patch for one thing, don't do auto-reformatting or unrelated reformatting of code on a completely different area of code.

Likewise, don't add unrelated cleanup or refactorings outside the scope of your Jira.

15.10.5.9. Ambiguous Unit Tests

Make sure that you're clear about what you are testing in your unit tests and why.

15.10.6. ReviewBoard

Larger patches should go through ReviewBoard.

For more information on how to use ReviewBoard, see the ReviewBoard documentation.

15.10.7. Committing Patches

Committers do this. 參考 How To Commit in the HBase wiki.

Committers will also resolve the Jira, typically after the patch passes a build.

Chapter 16. ZooKeeper

一個分佈式運行的HBase依賴一個zookeeper集羣。所有的節點和客戶端都必須能夠訪問zookeeper。默認的情況下HBase會管理一個zookeeper集羣。這個集羣會隨着HBase的啓動而啓動。當然,你也可以自己管理一個zookeeper集羣,但需要配置HBase。你需要修改conf/hbase-env.sh裏面的HBASE_MANAGES_ZK 來切換。這個值默認是true的,作用是讓HBase啓動的時候同時也啓動zookeeper.

當HBase管理zookeeper的時候,你可以通過修改zoo.cfg來配置zookeeper,一個更加簡單的方法是在 conf/hbase-site.xml裏面修改zookeeper的配置。Zookeeper的配置是作爲property寫在 hbase-site.xml裏面的,選項名以 hbase.zookeeper.property. 爲前綴。打個比方, clientPort 配置在xml裏面的名字是 hbase.zookeeper.property.clientPort。所有的默認值都是HBase決定的,包括zookeeper,參見 Section 2.6.1.1, “HBase 默認配置”。可以查找 hbase.zookeeper.property 前綴,找到關於zookeeper的配置。[33]

對於zookeeper的配置,你至少要在 hbase-site.xml中列出zookeeper的ensemble servers,具體的字段是 hbase.zookeeper.quorum。這個字段的默認值是 localhost,這個值對於分佈式應用顯然是不可以的(遠程連接無法使用)。

我需要運行幾個zookeeper?

你運行一個zookeeper也是可以的,但是在生產環境中,你最好部署3,5,7個節點。部署的越多,可靠性就越高,當然只能部署奇數個,偶數個是不可以的。你需要給每個zookeeper 1G左右的內存,如果可能的話,最好有獨立的磁盤。 (獨立磁盤可以確保zookeeper是高性能的。).如果你的集羣負載很重,不要把Zookeeper和RegionServer運行在同一臺機器上面。就像DataNodes 和 TaskTrackers一樣

舉個例子,HBase管理着的ZooKeeper集羣在節點 rs{1,2,3,4,5}.example.com, 監聽2222 端口(默認是2181),並確保conf/hbase-env.sh文件中 HBASE_MANAGES_ZK的值是true ,再編輯 conf/hbase-site.xml 設置 hbase.zookeeper.property.clientPort 和 hbase.zookeeper.quorum。你還可以設置 hbase.zookeeper.property.dataDir屬性來把ZooKeeper保存數據的目錄地址改掉。默認值是 /tmp ,這裏在重啓的時候會被操作系統刪掉,可以把它修改到 /usr/local/zookeeper.

<configuration>
  ...
  <property>
    <name>hbase.zookeeper.property.clientPort</name>
    <value>2222</value>
    <description>Property from ZooKeeper's config zoo.cfg.
    The port at which the clients will connect.
    </description>
  </property>
  <property>
    <name>hbase.zookeeper.quorum</name>
    <value>rs1.example.com,rs2.example.com,rs3.example.com,rs4.example.com,rs5.example.com</value>
    <description>Comma separated list of servers in the ZooKeeper Quorum.
    For example, "host1.mydomain.com,host2.mydomain.com,host3.mydomain.com".
    By default this is set to localhost for local and pseudo-distributed modes
    of operation. For a fully-distributed setup, this should be set to a full
    list of ZooKeeper quorum servers. If HBASE_MANAGES_ZK is set in hbase-env.sh
    this is the list of servers which we will start/stop ZooKeeper on.
    </description>
  </property>
  <property>
    <name>hbase.zookeeper.property.dataDir</name>
    <value>/usr/local/zookeeper</value>
    <description>Property from ZooKeeper's config zoo.cfg.
    The directory where the snapshot is stored.
    </description>
  </property>
  ...
</configuration>

16.1. 使用現有的 ZooKeeper 集羣

讓HBase使用一個現有的不被HBase託管的Zookeeper集羣,需要設置 conf/hbase-env.sh文件中的HBASE_MANAGES_ZK 屬性爲 false

  ...
  # Tell HBase whether it should manage it's own instance of Zookeeper or not.
  export HBASE_MANAGES_ZK=false

接下來,指明Zookeeper的host和端口。可以在 hbase-site.xml中設置, 也可以在HBase的CLASSPATH下面加一個zoo.cfg配置文件。 HBase 會優先加載 zoo.cfg 裏面的配置,把hbase-site.xml裏面的覆蓋掉.

當HBase託管ZooKeeper的時候,Zookeeper集羣的啓動是HBase啓動腳本的一部分。但現在,你需要自己去運行。你可以這樣做

${HBASE_HOME}/bin/hbase-daemons.sh {start,stop} zookeeper

你也可以用這條命令單獨啓動ZooKeeper而不啓動HBase。如果你想讓ZooKeeper在HBase重啓時繼續運行(即HBase關閉時不把ZooKeeper一起停掉),請確保 HBASE_MANAGES_ZK 的值是 false。

對於獨立運行的Zookeeper集羣的問題,你可以在 ZooKeeper 入門指南(Getting Started Guide)得到幫助。

16.2.  通過ZooKeeper 的SASL 認證

新版 HBase (>= 0.92)將支持連接到 ZooKeeper Quorum 進行SASL 認證。 ( Zookeeper 3.4.0 以上可用).

這裏描述如何設置 HBase,以同 ZooKeeper Quorum實現互相認證. ZooKeeper/HBase 互相認證 (HBASE-2418) 是 HBase安全配置所必不可少的一部分 (HBASE-3025)。 爲簡化說明, 本節忽略所必需的額外配置 ( HDFS 安全和 Coprocessor 配置)。 推薦使用HBase內置 Zookeeper 配置 (相對獨立 Zookeeper quorum) 以簡化學習.

16.2.1. 操作系統預置

需要一個可用的 Kerberos KDC 配置。每個運行 ZooKeeper 服務器的 $HOST,都應該有一個 principal:zookeeper/$HOST。對每個這樣的主機,爲 zookeeper/$HOST 添加一個 service key(使用 kadmin 或 kadmin.local 工具的 ktadd 命令),將該 keytab 文件複製到 $HOST,並設置爲僅對在該 $HOST 上運行 zookeeper 的用戶可讀。記下文件位置,我們將在下面以 $PATH_TO_ZOOKEEPER_KEYTAB 引用它。
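
下面是一個示意性的操作過程(主機名與 keytab 路徑均爲假設值,僅作說明):

kadmin: addprinc -randkey zookeeper/host1.example.com
kadmin: ktadd -k /etc/zookeeper/zookeeper.keytab zookeeper/host1.example.com

生成的 zookeeper.keytab 即下文的 $PATH_TO_ZOOKEEPER_KEYTAB;複製到對應的 $HOST 後,將其設置爲僅運行 zookeeper 的用戶可讀。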

Similarly, for each $HOST that will run an HBase server (master or regionserver), you should have a principle: hbase/$HOST. For each host, add a keytab file called hbase.keytab containing a service key for hbase/$HOST, copy this file to $HOST, and make it readable only to the user that will run an HBase service on $HOST. Note the location of this file, which we will use below as $PATH_TO_HBASE_KEYTAB.

Each user who will be an HBase client should also be given a Kerberos principal. This principal should usually have a password assigned to it (as opposed to, as with the HBase servers, a keytab file) which only this user knows. The client's principal's maxrenewlife should be set so that it can be renewed enough so that the user can complete their HBase client processes. For example, if a user runs a long-running HBase client process that takes at most 3 days, we might create this user's principal within kadmin with: addprinc -maxrenewlife 3days. The Zookeeper client and server libraries manage their own ticket refreshment by running threads that wake up periodically to do the refreshment.

On each host that will run an HBase client (e.g. hbase shell), add the following file to the HBase home directory's conf directory:

Client {
  com.sun.security.auth.module.Krb5LoginModule required
  useKeyTab=false
  useTicketCache=true;
};

We'll refer to this JAAS configuration file as $CLIENT_CONF below.

16.2.2. HBase內置的 Zookeeper 配置

在每個要運行 zookeeper、master 或 regionserver 的節點上,在 HBASE_HOME 的 conf 目錄中創建如下所示的 JAAS 配置文件:

Server {
  com.sun.security.auth.module.Krb5LoginModule required
  useKeyTab=true
  keyTab="$PATH_TO_ZOOKEEPER_KEYTAB"
  storeKey=true
  useTicketCache=false
  principal="zookeeper/$HOST";
};
Client {
  com.sun.security.auth.module.Krb5LoginModule required
  useKeyTab=true
  useTicketCache=false
  keyTab="$PATH_TO_HBASE_KEYTAB"
  principal="hbase/$HOST";
};
其中 $PATH_TO_HBASE_KEYTAB 和 $PATH_TO_ZOOKEEPER_KEYTAB 是上面創建的 keytab 文件,$HOST 是該節點的主機名。

Server 節由 Zookeeper quorum 服務器使用,Client 節由 HBase master 和 regionserver 使用。The path to this file should be substituted for the text $HBASE_SERVER_CONF in the hbase-env.sh listing below.

The path to this file should be substituted for the text $CLIENT_CONF in the hbase-env.sh listing below.

Modify your hbase-env.sh to include the following:

export HBASE_OPTS="-Djava.security.auth.login.config=$CLIENT_CONF"
export HBASE_MANAGES_ZK=true
export HBASE_ZOOKEEPER_OPTS="-Djava.security.auth.login.config=$HBASE_SERVER_CONF"
export HBASE_MASTER_OPTS="-Djava.security.auth.login.config=$HBASE_SERVER_CONF"
export HBASE_REGIONSERVER_OPTS="-Djava.security.auth.login.config=$HBASE_SERVER_CONF"
where $HBASE_SERVER_CONF and $CLIENT_CONF are the full paths to the JAAS configuration files created above.

Modify your hbase-site.xml on each node that will run zookeeper, master or regionserver to contain:

<configuration>
  <property>
    <name>hbase.zookeeper.quorum</name>
    <value>$ZK_NODES</value>
  </property>
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.authProvider.1</name>
    <value>org.apache.zookeeper.server.auth.SASLAuthenticationProvider</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.kerberos.removeHostFromPrincipal</name>
    <value>true</value>
  </property>
  <property>
    <name>hbase.zookeeper.property.kerberos.removeRealmFromPrincipal</name>
    <value>true</value>
  </property>
</configuration>

where $ZK_NODES is the comma-separated list of hostnames of the Zookeeper Quorum hosts.

Start your hbase cluster by running one or more of the following set of commands on the appropriate hosts:

bin/hbase zookeeper start
bin/hbase master start
bin/hbase regionserver start

16.2.3. 外部 Zookeeper 配置

增加 JAAS 配置文件:

Client {
  com.sun.security.auth.module.Krb5LoginModule required
  useKeyTab=true
  useTicketCache=false
  keyTab="$PATH_TO_HBASE_KEYTAB"
  principal="hbase/$HOST";
};

$PATH_TO_HBASE_KEYTAB 是上面爲在本主機運行的 HBase 服務創建的 keytab,$HOST 是該節點的 hostname。將該配置文件放到 HBase home 的配置目錄,下面以 $HBASE_SERVER_CONF 引用它的全路徑。

修改 hbase-env.sh 增加如下項:

export HBASE_OPTS="-Djava.security.auth.login.config=$CLIENT_CONF"
export HBASE_MANAGES_ZK=false
export HBASE_MASTER_OPTS="-Djava.security.auth.login.config=$HBASE_SERVER_CONF"
export HBASE_REGIONSERVER_OPTS="-Djava.security.auth.login.config=$HBASE_SERVER_CONF"

修改每個節點的 hbase-site.xml 包含下面內容:

<configuration>
  <property>
    <name>hbase.zookeeper.quorum</name>
    <value>$ZK_NODES</value>
  </property>
  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
</configuration>

$ZK_NODES是逗號分隔的Zookeeper Quorum主機名列表。

每個Zookeeper Quorum節點增加 zoo.cfg 包含下列內容:

authProvider.1=org.apache.zookeeper.server.auth.SASLAuthenticationProvider
kerberos.removeHostFromPrincipal=true
kerberos.removeRealmFromPrincipal=true

在每個主機創建 JAAS 配置幷包含:

Server {
  com.sun.security.auth.module.Krb5LoginModule required
  useKeyTab=true
  keyTab="$PATH_TO_ZOOKEEPER_KEYTAB"
  storeKey=true
  useTicketCache=false
  principal="zookeeper/$HOST";
};

$HOST 是每個 Quorum 節點的主機名。我們會在下面以 $ZK_SERVER_CONF 引用本文件的全路徑。

在每個 Zookeeper Quorum主機啓動 Zookeeper:

SERVER_JVMFLAGS="-Djava.security.auth.login.config=$ZK_SERVER_CONF" bin/zkServer start

啓動HBase 集羣。在適當的節點運行下面的一到多個命令:

bin/hbase master start
bin/hbase regionserver start

16.2.4. Zookeeper 服務端認證 Log

如果上面配置成功, 你應該可以看到如下所示的Zookeeper 服務器 log:

11/12/05 22:43:39 INFO zookeeper.Login: successfully logged in.
11/12/05 22:43:39 INFO server.NIOServerCnxnFactory: binding to port 0.0.0.0/0.0.0.0:2181
11/12/05 22:43:39 INFO zookeeper.Login: TGT refresh thread started.
11/12/05 22:43:39 INFO zookeeper.Login: TGT valid starting at: Mon Dec 05 22:43:39 UTC 2011
11/12/05 22:43:39 INFO zookeeper.Login: TGT expires: Tue Dec 06 22:43:39 UTC 2011
11/12/05 22:43:39 INFO zookeeper.Login: TGT refresh sleeping until: Tue Dec 06 18:36:42 UTC 2011
..
11/12/05 22:43:59 INFO auth.SaslServerCallbackHandler: Successfully authenticated client: authenticationID=hbase/[email protected]; authorizationID=hbase/[email protected].
11/12/05 22:43:59 INFO auth.SaslServerCallbackHandler: Setting authorizedID: hbase
11/12/05 22:43:59 INFO server.ZooKeeperServer: adding SASL authorization for authorizationID: hbase

16.2.5. Zookeeper 客戶端認證 Log

Zookeeper 客戶端側 (HBase master 或 regionserver), 應該可以看到如下所示的東西:

11/12/05 22:43:59 INFO zookeeper.ZooKeeper: Initiating client connection, connectString=ip-10-166-175-249.us-west-1.compute.internal:2181 sessionTimeout=180000 watcher=master:60000
11/12/05 22:43:59 INFO zookeeper.ClientCnxn: Opening socket connection to server /10.166.175.249:2181
11/12/05 22:43:59 INFO zookeeper.RecoverableZooKeeper: The identifier of this process is 14851@ip-10-166-175-249
11/12/05 22:43:59 INFO zookeeper.Login: successfully logged in.
11/12/05 22:43:59 INFO client.ZooKeeperSaslClient: Client will use GSSAPI as SASL mechanism.
11/12/05 22:43:59 INFO zookeeper.Login: TGT refresh thread started.
11/12/05 22:43:59 INFO zookeeper.ClientCnxn: Socket connection established to ip-10-166-175-249.us-west-1.compute.internal/10.166.175.249:2181, initiating session
11/12/05 22:43:59 INFO zookeeper.Login: TGT valid starting at: Mon Dec 05 22:43:59 UTC 2011
11/12/05 22:43:59 INFO zookeeper.Login: TGT expires: Tue Dec 06 22:43:59 UTC 2011
11/12/05 22:43:59 INFO zookeeper.Login: TGT refresh sleeping until: Tue Dec 06 18:30:37 UTC 2011
11/12/05 22:43:59 INFO zookeeper.ClientCnxn: Session establishment complete on server ip-10-166-175-249.us-west-1.compute.internal/10.166.175.249:2181, sessionid = 0x134106594320000, negotiated timeout = 180000

16.2.6. 從頭開始配置

在當前標準的 Amazon Linux AMI 上測試通過。先按上面的描述配置 KDC 並創建 principal。然後獲取代碼並執行檢測:
git clone git://git.apache.org/hbase.git
cd hbase
mvn -Psecurity,localTests clean test -Dtest=TestZooKeeperACL
再按上面描述配置 HBase。手工編輯 target/cached_classpath.txt(參見下面 16.2.7.1 節),然後啓動:
bin/hbase zookeeper &
bin/hbase master &
bin/hbase regionserver &

16.2.7. 未來改進

16.2.7.1. 完善 target/cached_classpath.txt

必須在 target/cached_classpath.txt 重寫標準 hadoop-core jar 文件爲包含 HADOOP-7070 修改的版本。可以使用下列腳本完成:

echo `find ~/.m2 -name "*hadoop-core*7070*SNAPSHOT.jar"` ':' `cat target/cached_classpath.txt` | sed 's/ //g' > target/tmp.txt
mv target/tmp.txt target/cached_classpath.txt

16.2.7.2. 用程序配置 JAAS

這樣可以避免需要一個單獨的、包含 HADOOP-7070 修復的 Hadoop jar 文件。

16.2.7.3. 排除 kerberos.removeHostFromPrincipal 和 kerberos.removeRealmFromPrincipal

[33] For the full list of ZooKeeper configurations, see ZooKeeper's zoo.cfg. HBase does not ship with a zoo.cfg so you will need to browse the conf directory in an appropriate ZooKeeper download.

Chapter 17. Community

17.1. Decisions

17.1.1. Feature Branches

Feature Branches are easy to make. You do not have to be a committer to make one. Just request that the name of your branch be added to JIRA up on the developer's mailing list and a committer will add it for you. Thereafter you can file issues against your feature branch in Apache HBase (TM) JIRA. You keep your code elsewhere -- it should be public so it can be observed -- and you can update the dev mailing list on progress. When the feature is ready for commit, 3 +1s from committers will get your feature merged.[34]

17.1.2. Patch +1 Policy

The below policy is something we put in place 09/2012. It is a suggested policy rather than a hard requirement. We want to try it first to see if it works before we cast it in stone.

Apache HBase is made of components. Components have one or more Section 17.2.1, “Component Owner”s. See the 'Description' field on the components JIRA page for who the current owners are by component.

Patches that fit within the scope of a single Apache HBase component require, at least, a +1 by one of the component's owners before commit. If owners are absent -- busy or otherwise -- two +1s by non-owners will suffice.

Patches that span components need at least two +1s before they can be committed, preferably +1s by owners of components touched by the x-component patch (TODO: This needs tightening up but I think fine for first pass).

Any -1 on a patch by anyone vetoes a patch; it cannot be committed until the justification for the -1 is addressed.

17.2. Community Roles

17.2.1. Component Owner

Component owners are listed in the description field on this Apache HBase JIRA components page. The owners are listed in the 'Description' field rather than in the 'Component Lead' field because the latter only allows us to list one individual whereas it is encouraged that components have multiple owners.

Owners are volunteers who are (usually, but not necessarily) expert in their component domain and may have an agenda on how they think their Apache HBase component should evolve.

Duties include:

  1. Owners will try and review patches that land within their component's scope.

  2. If applicable, if an owner has an agenda, they will publish their goals or the design toward which they are driving their component

If you would like to volunteer as a component owner, just write the dev list and we'll sign you up. Owners do not need to be committers.

 

Appendix A. FAQ

A.1. General
When should I use HBase?
Are there other HBase FAQs?
Does HBase support SQL?
How can I find examples of NoSQL/HBase?
What is the history of HBase?
A.2. Architecture
How does HBase handle Region-RegionServer assignment and locality?
A.3. Configuration
How can I get started with my first cluster?
Where can I learn about the rest of the configuration options?
A.4. Schema Design / Data Access
How should I design my schema in HBase?
How can I store (fill in the blank) in HBase?
How can I handle secondary indexes in HBase?
Can I change a table's rowkeys?
What APIs does HBase support?
A.5. MapReduce
How can I use MapReduce with HBase?
A.6. Performance and Troubleshooting
How can I improve HBase cluster performance?
How can I troubleshoot my HBase cluster?
A.7. Amazon EC2
I am running HBase on Amazon EC2 and...
A.8. Operations
How do I manage my HBase cluster?
How do I back up my HBase cluster?
A.9. HBase in Action
Where can I find interesting videos and presentations on HBase?

A.1. General

When should I use HBase?
Are there other HBase FAQs?
Does HBase support SQL?
How can I find examples of NoSQL/HBase?
What is the history of HBase?

When should I use HBase?

 

參考 the Section 9.1, “概述” in the Architecture chapter.

Are there other HBase FAQs?

 

參考 the FAQ that is up on the wiki, HBase Wiki FAQ.

Does HBase support SQL?

 

Not really. SQL-ish support for HBase via Hive is in development, however Hive is based on MapReduce which is not generally suitable for low-latency requests. 參考 the Chapter 5, Data Model section for examples on the HBase client.

How can I find examples of NoSQL/HBase?

 

參考 the link to the BigTable paper in Appendix F, Other Information About HBase in the appendix, as well as the other papers.

What is the history of HBase?

 

參考 Appendix G, HBase History.

A.2. Architecture

How does HBase handle Region-RegionServer assignment and locality?

How does HBase handle Region-RegionServer assignment and locality?

 

參考 Section 9.7, “Regions”.

A.3. Configuration

How can I get started with my first cluster?
Where can I learn about the rest of the configuration options?

How can I get started with my first cluster?

 

參考 Section 1.2, “Quick Start”.

Where can I learn about the rest of the configuration options?

 

參考 Chapter 2, Configuration.

A.4. Schema Design / Data Access

How should I design my schema in HBase?
How can I store (fill in the blank) in HBase?
How can I handle secondary indexes in HBase?
Can I change a table's rowkeys?
What APIs does HBase support?

How should I design my schema in HBase?

 

參考 Chapter 5, Data Model and Chapter 6, HBase and Schema Design

How can I store (fill in the blank) in HBase?

 

參考 Section 6.5, “ Supported Datatypes ”.

How can I handle secondary indexes in HBase?

 

參考 Section 6.9, “ Secondary Indexes and Alternate Query Paths ”

Can I change a table's rowkeys?

 

This is a very common question. You can't. 參考 Section 6.3.5, “Immutability of Rowkeys”.

What APIs does HBase support?

 

參考 Chapter 5, Data Model, Section 9.3, “Client” and Section 10.1, “非Java 語言和 JVM 通話”.

A.5. MapReduce

How can I use MapReduce with HBase?

How can I use MapReduce with HBase?

 

參考 Chapter 7, HBase and MapReduce

A.6. Performance and Troubleshooting

How can I improve HBase cluster performance?
How can I troubleshoot my HBase cluster?

How can I improve HBase cluster performance?

 

參考 Chapter 11, Performance Tuning.

How can I troubleshoot my HBase cluster?

 

參考 Chapter 12, Troubleshooting and Debugging HBase.

A.7. Amazon EC2

I am running HBase on Amazon EC2 and...

I am running HBase on Amazon EC2 and...

 

EC2 issues are a special case. 參考 Troubleshooting Section 12.12, “Amazon EC2” and Performance Section 11.11, “Amazon EC2” sections.

A.8. Operations

How do I manage my HBase cluster?
How do I back up my HBase cluster?

How do I manage my HBase cluster?

 

參考 Chapter 14, HBase Operational Management

How do I back up my HBase cluster?

 

參考 Section 14.7, “HBase Backup”

A.9. HBase in Action

Where can I find interesting videos and presentations on HBase?

Where can I find interesting videos and presentations on HBase?

 

參考 Appendix F, Other Information About HBase

Appendix B. hbck In Depth

HBaseFsck (hbck) is a tool for checking for region consistency and table integrity problems and repairing a corrupted HBase. It works in two basic modes -- a read-only inconsistency identifying mode and a multi-phase read-write repair mode.

B.1. Running hbck to identify inconsistencies

To check to see if your HBase cluster has corruptions, run hbck against your HBase cluster:
$ ./bin/hbase hbck

At the end of the command's output it prints OK or tells you the number of INCONSISTENCIES present. You may also want to run hbck a few times because some inconsistencies can be transient (e.g. cluster is starting up or a region is splitting). Operationally you may want to run hbck regularly and set up an alert (e.g. via nagios) if it repeatedly reports inconsistencies. A run of hbck will report a list of inconsistencies along with a brief description of the regions and tables affected. Using the -details option will report more details including a representative listing of all the splits present in all the tables.

$ ./bin/hbase hbck -details

B.2. Inconsistencies

If after several runs, inconsistencies continue to be reported, you may have encountered a corruption. These should be rare, but in the event they occur newer versions of HBase include the hbck tool enabled with automatic repair options.

There are two invariants that when violated create inconsistencies in HBase:

  • HBase’s region consistency invariant is satisfied if every region is assigned and deployed on exactly one region server, and all places where this state is kept are in accordance.
  • HBase’s table integrity invariant is satisfied if for each table, every possible row key resolves to exactly one region.

Repairs generally work in three phases -- a read-only information gathering phase that identifies inconsistencies, a table integrity repair phase that restores the table integrity invariant, and then finally a region consistency repair phase that restores the region consistency invariant. Starting from version 0.90.0, hbck could detect region consistency problems and report on a subset of possible table integrity problems. It also included the ability to automatically fix the most common inconsistency, region assignment and deployment consistency problems. This repair could be done by using the -fix command line option. These fixes close regions if they are open on the wrong server or on multiple region servers and also assign regions to region servers if they are not open.

Starting from HBase versions 0.90.7, 0.92.2 and 0.94.0, several new command line options are introduced to aid repairing a corrupted HBase. This hbck sometimes goes by the nickname “uberhbck”. Each particular version of uber hbck is compatible with HBase releases of the same major version (the 0.90.7 uberhbck can repair a 0.90.4). However, versions <=0.90.6 and versions <=0.92.1 may require restarting the master or failing over to a backup master.

B.3. Localized repairs

When repairing a corrupted HBase, it is best to repair the lowest risk inconsistencies first. These are generally region consistency repairs -- localized single region repairs, that only modify in-memory data, ephemeral zookeeper data, or patch holes in the META table. Region consistency requires that the HBase instance has the state of the region’s data in HDFS (.regioninfo files), the region’s row in the .META. table, and the region’s deployment/assignments on region servers and the master in accordance. Options for repairing region consistency include:

  • -fixAssignments (equivalent to the 0.90 -fix option) repairs unassigned, incorrectly assigned or multiply assigned regions.
  • -fixMeta which removes meta rows when corresponding regions are not present in HDFS and adds new meta rows if the regions are present in HDFS but not in META.

To fix deployment and assignment problems you can run this command:

$ ./bin/hbase hbck -fixAssignments
To fix deployment and assignment problems as well as repairing incorrect meta rows you can run this command:.
$ ./bin/hbase hbck -fixAssignments -fixMeta
There are a few classes of table integrity problems that are low risk repairs. The first two are degenerate (startkey == endkey) regions and backwards regions (startkey > endkey). These are automatically handled by sidelining the data to a temporary directory (/hbck/xxxx). The third low-risk class is hdfs region holes. This can be repaired by using the:
  • -fixHdfsHoles option for fabricating new empty regions on the file system. If holes are detected you can use -fixHdfsHoles and should include -fixMeta and -fixAssignments to make the new region consistent.
$ ./bin/hbase hbck -fixAssignments -fixMeta -fixHdfsHoles
Since this is a common operation, we’ve added the -repairHoles flag that is equivalent to the previous command:
$ ./bin/hbase hbck -repairHoles
If inconsistencies still remain after these steps, you most likely have table integrity problems related to orphaned or overlapping regions.

B.4. Region Overlap Repairs

Table integrity problems can require repairs that deal with overlaps. This is a riskier operation because it requires modifications to the file system, requires some decision making, and may require some manual steps. For these repairs it is best to analyze the output of a hbck -details run so that you isolate repair attempts to only the problems the checks identify. Because this is riskier, there are safeguards that should be used to limit the scope of the repairs. WARNING: These options are relatively new and have only been tested on online but idle HBase instances (no reads/writes). Use at your own risk in an active production environment! The options for repairing table integrity violations include:
  • -fixHdfsOrphans option for “adopting” a region directory that is missing a region metadata file (the .regioninfo file).
  • -fixHdfsOverlaps ability for fixing overlapping regions
When repairing overlapping regions, a region’s data can be modified on the file system in two ways: 1) by merging regions into a larger region or 2) by sidelining regions by moving data to a “sideline” directory where data could be restored later. Merging a large number of regions is technically correct but could result in an extremely large region that requires a series of costly compactions and splitting operations. In these cases, it is probably better to sideline the regions that overlap with the most other regions (likely the largest ranges) so that merges can happen on a more reasonable scale. Since these sidelined regions are already laid out in HBase’s native directory and HFile format, they can be restored by using HBase’s bulk load mechanism. The default safeguard thresholds are conservative. These options let you override the default thresholds and enable the large region sidelining feature.
  • -maxMerge <n> maximum number of overlapping regions to merge
  • -sidelineBigOverlaps if more than maxMerge regions are overlapping, attempt to sideline the regions overlapping with the most other regions.
  • -maxOverlapsToSideline <n> if sidelining large overlapping regions, sideline at most n regions.
Since often times you would just want to get the tables repaired, you can use this option to turn on all repair options:
  • -repair includes all the region consistency options and only the hole repairing table integrity options.
Finally, there are safeguards to limit repairs to only specific tables. For example the following command would only attempt to repair table TableFoo and TableBar.
$ ./bin/hbase hbck -repair TableFoo TableBar

B.4.1. Special cases: Meta is not properly assigned

There are a few special cases that hbck can handle as well. Sometimes the meta table’s only region is inconsistently assigned or deployed. In this case there is a special -fixMetaOnly option that can try to fix meta assignments.
$ ./bin/hbase hbck -fixMetaOnly -fixAssignments

B.4.2. Special cases: HBase version file is missing

HBase’s data on the file system requires a version file in order to start. If this file is missing, you can use the -fixVersionFile option to fabricate a new HBase version file. This assumes that the version of hbck you are running is the appropriate version for the HBase cluster.

B.4.3. Special case: Root and META are corrupt.

The most drastic corruption scenario is the case where the ROOT or META is corrupted and HBase will not start. In this case you can use the OfflineMetaRepair tool to create new ROOT and META regions and tables. This tool assumes that HBase is offline. It then marches through the existing HBase home directory, loads as much information from region metadata files (.regioninfo files) as possible from the file system. If the region metadata has proper table integrity, it sidelines the original root and meta table directories, and builds new ones with pointers to the region directories and their data.
$ ./bin/hbase org.apache.hadoop.hbase.util.OfflineMetaRepair
NOTE: This tool is not as clever as uberhbck but can be used to bootstrap repairs that uberhbck can complete. If the tool succeeds you should be able to start hbase and run online repairs if necessary.

B.2. Inconsistencies

If after several runs, inconsistencies continue to be reported, you may have encountered a corruption. These should be rare, but in the event they occur newer versions of HBase include the hbck tool enabled with automatic repair options.

There are two invariants that when violated create inconsistencies in HBase:

  • HBase’s region consistency invariant is satisfied if every region is assigned and deployed on exactly one region server, and all places where this state kept is in accordance.
  • HBase’s table integrity invariant is satisfied if for each table, every possible row key resolves to exactly one region.

Repairs generally work in three phases -- a read-only information gathering phase that identifies inconsistencies, a table integrity repair phase that restores the table integrity invariant, and then finally a region consistency repair phase that restores the region consistency invariant. Starting from version 0.90.0, hbck could detect region consistency problems report on a subset of possible table integrity problems. It also included the ability to automatically fix the most common inconsistency, region assignment and deployment consistency problems. This repair could be done by using the -fix command line option. These problems close regions if they are open on the wrong server or on multiple region servers and also assigns regions to region servers if they are not open.

Starting from HBase versions 0.90.7, 0.92.2 and 0.94.0, several new command line options are introduced to aid repairing a corrupted HBase. This hbck sometimes goes by the nickname “uberhbck”. Each particular version of uber hbck is compatible with the HBase’s of the same major version (0.90.7 uberhbck can repair a 0.90.4). However, versions <=0.90.6 and versions <=0.92.1 may require restarting the master or failing over to a backup master.

B.3. Localized repairs

When repairing a corrupted HBase, it is best to repair the lowest risk inconsistencies first. These are generally region consistency repairs -- localized single region repairs, that only modify in-memory data, ephemeral zookeeper data, or patch holes in the META table. Region consistency requires that the HBase instance has the state of the region’s data in HDFS (.regioninfo files), the region’s row in the .META. table., and region’s deployment/assignments on region servers and the master in accordance. Options for repairing region consistency include:

  • -fixAssignments (equivalent to the 0.90 -fix option) repairs unassigned, incorrectly assigned or multiply assigned regions.
  • -fixMeta which removes meta rows when corresponding regions are not present in HDFS and adds new meta rows if they regions are present in HDFS while not in META.

To fix deployment and assignment problems you can run this command:

$ ./bin/hbase hbck -fixAssignments
To fix deployment and assignment problems as well as repairing incorrect meta rows you can run this command:.
$ ./bin/hbase hbck -fixAssignments -fixMeta
There are a few classes of table integrity problems that are low risk repairs. The first two are degenerate (startkey == endkey) regions and backwards regions (startkey > endkey). These are automatically handled by sidelining the data to a temporary directory (/hbck/xxxx). The third low-risk class is hdfs region holes. This can be repaired by using the:
  • -fixHdfsHoles option for fabricating new empty regions on the file system. If holes are detected you can use -fixHdfsHoles and should include -fixMeta and -fixAssignments to make the new region consistent.
$ ./bin/hbase hbck -fixAssignments -fixMeta -fixHdfsHoles
Since this is a common operation, we’ve added a the -repairHoles flag that is equivalent to the previous command:
$ ./bin/hbase hbck -repairHoles
If inconsistencies still remain after these steps, you most likely have table integrity problems related to orphaned or overlapping regions.

B.4. Region Overlap Repairs

Table integrity problems can require repairs that deal with overlaps. This is a riskier operation because it requires modifications to the file system, requires some decision making, and may require some manual steps. For these repairs it is best to analyze the output of a hbck -details run so that you isolate repairs attempts only upon problems the checks identify. Because this is riskier, there are safeguard that should be used to limit the scope of the repairs. WARNING: This is a relatively new and have only been tested on online but idle HBase instances (no reads/writes). Use at your own risk in an active production environment! The options for repairing table integrity violations include:
  • -fixHdfsOrphans option for “adopting” a region directory that is missing a region metadata file (the .regioninfo file).
  • -fixHdfsOverlaps option for fixing overlapping regions
When repairing overlapping regions, a region’s data can be modified on the file system in two ways: 1) by merging regions into a larger region or 2) by sidelining regions, moving the data to a “sideline” directory from which it can be restored later. Merging a large number of regions is technically correct but could result in an extremely large region that requires a series of costly compactions and splitting operations. In these cases, it is probably better to sideline the regions that overlap with the most other regions (likely the largest ranges) so that merges can happen on a more reasonable scale. Since these sidelined regions are already laid out in HBase’s native directory and HFile format, they can be restored by using HBase’s bulk load mechanism. The default safeguard thresholds are conservative. The following options let you override the default thresholds and enable the large region sidelining feature (an example invocation follows the option list below).
  • -maxMerge <n> maximum number of overlapping regions to merge
  • -sidelineBigOverlaps if more than maxMerge regions are overlapping, attempt to sideline the regions overlapping with the most other regions.
  • -maxOverlapsToSideline <n> if sidelining large overlapping regions, sideline at most n regions.
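For example, a possible invocation combining the safeguards above could look like the following (the threshold values 5 and 2 are purely illustrative; choose values that fit your cluster):
$ ./bin/hbase hbck -fixHdfsOverlaps -maxMerge 5 -sidelineBigOverlaps -maxOverlapsToSideline 2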
Since oftentimes you would just want to get the tables repaired, you can use this option to turn on all repair options:
  • -repair includes all the region consistency options and only the hole repairing table integrity options.
Finally, there are safeguards to limit repairs to only specific tables. For example, the following command would only attempt to repair tables TableFoo and TableBar.
$ ./bin/hbase hbck -repair TableFoo TableBar

B.4.1. Special cases: Meta is not properly assigned

There are a few special cases that hbck can handle as well. Sometimes the meta table’s only region is inconsistently assigned or deployed. In this case there is a special -fixMetaOnly option that can try to fix meta assignments.
$ ./bin/hbase hbck -fixMetaOnly -fixAssignments

B.4.2. Special cases: HBase version file is missing

HBase’s data on the file system requires a version file in order to start. If this file is missing, you can use the -fixVersionFile option to fabricate a new HBase version file. This assumes that the version of hbck you are running is the appropriate version for the HBase cluster.

B.4.3. Special case: Root and META are corrupt.

The most drastic corruption scenario is the case where ROOT or META is corrupted and HBase will not start. In this case you can use the OfflineMetaRepair tool to create new ROOT and META regions and tables. This tool assumes that HBase is offline. It then marches through the existing HBase home directory and loads as much information from the region metadata files (.regioninfo files) as possible from the file system. If the region metadata has proper table integrity, it sidelines the original root and meta table directories and builds new ones with pointers to the region directories and their data.
$ ./bin/hbase org.apache.hadoop.hbase.util.OfflineMetaRepair
NOTE: This tool is not as clever as uberhbck, but it can be used to bootstrap repairs that uberhbck can complete. If the tool succeeds you should be able to start HBase and run online repairs if necessary.

Appendix C. Compression In HBase

C.1. CompressionTest Tool

HBase includes a tool for testing that your compression setup works. To run it, type ./bin/hbase org.apache.hadoop.hbase.util.CompressionTest. This will print usage information for the tool.

C.2.  hbase.regionserver.codecs

If your compression installation is broken, the test will fail or the region server will fail to start. You can add the hbase.regionserver.codecs property to your hbase-site.xml, with its value set to the codecs you require. For example, if the value of hbase.regionserver.codecs is lzo,gz and LZO is missing or incorrectly installed, the RegionServer will fail to start and report the misconfiguration.
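
As an illustration, a minimal hbase-site.xml entry might look like the following sketch (the lzo,gz value simply repeats the example above; list whichever codecs your cluster must have available):

<!-- hbase-site.xml: refuse to start a RegionServer if these codecs are unavailable -->
<property>
  <name>hbase.regionserver.codecs</name>
  <value>lzo,gz</value>
</property>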

Administrators should be careful when adding new machines to the cluster: the new machines may also need the particular compression codecs installed.

C.3.  LZO

Unfortunately, HBase is Apache-licensed while LZO is GPL-licensed, so HBase cannot ship with LZO; LZO must be installed separately before you start using HBase. See Using LZO Compression for how to use LZO with HBase.

A common problem is that users get LZO working when they first set up the cluster, but months later, when administrators add machines to the cluster, they forget about LZO. Since 0.90.0, the region server should fail to start in this situation, but it may not.

See Section C.2, “hbase.regionserver.codecs” for a feature that helps protect against a failed LZO install.

C.4.  GZIP

GZIP compresses better than LZO but is slower, and in some setups compression ratio is the priority. Java will use Java's built-in GZIP implementation unless the Hadoop native libraries are on the CLASSPATH; in that case it is better to use the native compressors. (If the native libraries are not available, you will see a lot of Got brand-new compressor messages in the logs; see Q: .)

C.5.  SNAPPY

If snappy is installed, HBase can make use of it (courtesy of hadoop-snappy [29]).

  1. Build and install snappy on all nodes of your cluster.

  2. Use CompressionTest to verify snappy support is enabled and the libs can be loaded ON ALL NODES of your cluster:

    $ hbase org.apache.hadoop.hbase.util.CompressionTest hdfs://host/path/to/hbase snappy

  3. Create a column family with snappy compression and verify it in the hbase shell:

    hbase> create 't1', { NAME => 'cf1', COMPRESSION => 'SNAPPY' }
    hbase> describe 't1'

    In the output of the "describe" command, you need to ensure it lists "COMPRESSION => 'SNAPPY'"



[29] See Alejandro's note up on the list on the difference between Snappy in Hadoop and Snappy in HBase.

C.6. Changing Compression Schemes

A frequent question on the dist-list is how to change compression schemes for ColumnFamilies. This is actually quite simple, and can be done via an alter command. Because the compression scheme is encoded at the block level in StoreFiles, the table does not need to be re-created and the data does not need to be copied anywhere else. Just make sure the old codec is still available until you are sure that all of the old StoreFiles have been compacted.
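
A minimal hbase shell sketch, assuming a table 't1' with column family 'cf1' (both names are illustrative), GZ as the new codec, and a cluster where the table must be disabled before altering (i.e. online schema changes are not enabled):

hbase> disable 't1'
hbase> alter 't1', { NAME => 'cf1', COMPRESSION => 'GZ' }
hbase> enable 't1'
hbase> major_compact 't1'

The major compaction rewrites the existing StoreFiles with the new codec; as noted above, keep the old codec installed until that compaction has completed.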

Appendix D. YCSB: The Yahoo! Cloud Serving Benchmark and HBase

TODO: Describe how YCSB does a poor job of driving up cluster load.

TODO: Describe how to set up YCSB for HBase.

Ted Dunning redid YCSB: it is now mavenized and adds the ability to verify workloads. See Ted Dunning's YCSB.

Appendix E. HFile format version 2

E.1. Motivation

Note: this feature was introduced in HBase 0.92

We found it necessary to revise the HFile format after encountering high memory usage and slow startup times caused by large Bloom filters and block indexes in the region server. Bloom filters can get as large as 100 MB per HFile, which adds up to 2 GB when aggregated over 20 regions. Block indexes can grow as large as 6 GB in aggregate size over the same set of regions. A region is not considered opened until all of its block index data is loaded. Large Bloom filters produce a different performance problem: the first get request that requires a Bloom filter lookup will incur the latency of loading the entire Bloom filter bit array.

To speed up region server startup we break Bloom filters and block indexes into multiple blocks and write those blocks out as they fill up, which also reduces the HFile writer’s memory footprint. In the Bloom filter case, “filling up a block” means accumulating enough keys to efficiently utilize a fixed-size bit array, and in the block index case we accumulate an “index block” of the desired size. Bloom filter blocks and index blocks (we call these “inline blocks”) become interspersed with data blocks, and as a side effect we can no longer rely on the difference between block offsets to determine data block length, as it was done in version 1.

HFile is a low-level file format by design, and it should not deal with application-specific details such as Bloom filters, which are handled at StoreFile level. Therefore, we call Bloom filter blocks in an HFile "inline" blocks. We also supply HFile with an interface to write those inline blocks.

Another format modification aimed at reducing the region server startup time is to use a contiguous “load-on-open” section that has to be loaded in memory at the time an HFile is being opened. Currently, as an HFile opens, there are separate seek operations to read the trailer, data/meta indexes, and file info. To read the Bloom filter, there are two more seek operations for its “data” and “meta” portions. In version 2, we seek once to read the trailer and seek again to read everything else we need to open the file from a contiguous block.

E.2. HFile format version 1 overview

As we will be discussing the changes we are making to the HFile format, it is useful to give a short overview of the previous (HFile version 1) format. An HFile in the existing format is structured as follows: HFile Version 1 [30]

E.2.1.  Block index format in version 1

The block index in version 1 is very straightforward. For each entry, it contains:

  1. Offset (long)

  2. Uncompressed size (int)

  3. Key (a serialized byte array written using Bytes.writeByteArray)

    1. Key length as a variable-length integer (VInt)

    2. Key bytes

The number of entries in the block index is stored in the fixed file trailer, and has to be passed in to the method that reads the block index. One of the limitations of the block index in version 1 is that it does not provide the compressed size of a block, which turns out to be necessary for decompression. Therefore, the HFile reader has to infer this compressed size from the offset difference between blocks. We fix this limitation in version 2, where we store on-disk block size instead of uncompressed size, and get uncompressed size from the block header.
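
That inference can be illustrated with a small sketch (this is not the actual reader code; the array name is illustrative). Assuming consecutive blocks are laid out back to back, the on-disk (compressed) size of block i is simply the gap between its offset and the next recorded offset:

// Sketch: inferring a version 1 block's on-disk size from consecutive offsets.
// blockOffsets holds the offsets of all data blocks plus the offset of whatever
// immediately follows the last data block.
static long inferredOnDiskSize(long[] blockOffsets, int i) {
  return blockOffsets[i + 1] - blockOffsets[i];
}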



[30] Image courtesy of Lars George, hbase-architecture-101-storage.html.

E.3.  HBase file format with inline blocks (version 2)

E.3.1. Overview

The version of HBase introducing the above features reads both version 1 and 2 HFiles, but only writes version 2 HFiles. A version 2 HFile is structured as follows: HFile Version 2

E.3.2. Unified version 2 block format

In version 2, every block in the data section contains the following fields:

  1. 8 bytes: Block type, a sequence of bytes equivalent to version 1's "magic records". Supported block types are:

    1. DATA – data blocks

    2. LEAF_INDEX – leaf-level index blocks in a multi-level block index

    3. BLOOM_CHUNK – Bloom filter chunks

    4. META – meta blocks (not used for Bloom filters in version 2 anymore)

    5. INTERMEDIATE_INDEX – intermediate-level index blocks in a multi-level block index

    6. ROOT_INDEX – root-level index blocks in a multi-level block index

    7. FILE_INFO – the “file info” block, a small key-value map of metadata

    8. BLOOM_META – a Bloom filter metadata block in the load-on-open section

    9. TRAILER – a fixed-size file trailer. As opposed to the above, this is not an HFile v2 block but a fixed-size (for each HFile version) data structure

    10. INDEX_V1 – this block type is only used for legacy HFile v1 blocks

  2. Compressed size of the block's data, not including the header (int).

    Can be used for skipping the current data block when scanning HFile data.

  3. Uncompressed size of the block's data, not including the header (int)

    This is equal to the compressed size if the compression algorithm is NONE

  4. File offset of the previous block of the same type (long)

    Can be used for seeking to the previous data/index block

  5. Compressed data (or uncompressed data if the compression algorithm is NONE).

The above format of blocks is used in the following HFile sections:

  1. Scanned block section. The section is named so because it contains all data blocks that need to be read when an HFile is scanned sequentially.  Also contains leaf block index and Bloom chunk blocks.

  2. Non-scanned block section. This section still contains unified-format v2 blocks but it does not have to be read when doing a sequential scan. This section contains “meta” blocks and intermediate-level index blocks.

We are supporting “meta” blocks in version 2 the same way they were supported in version 1, even though we do not store Bloom filter data in these blocks anymore.
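
As a concrete illustration of the per-block header fields listed in this section, here is a minimal Java sketch of walking one unified version 2 block header (this is not the actual HFile reader, which lives in org.apache.hadoop.hbase.io.hfile; the class and variable names are illustrative):

import java.io.DataInputStream;
import java.io.IOException;

public class BlockHeaderSketch {
  // Reads the header fields of a single unified v2 block from a stream
  // positioned at the start of that block.
  public static void readHeader(DataInputStream in) throws IOException {
    byte[] blockType = new byte[8];                    // 1. block type "magic" (DATA, LEAF_INDEX, ...)
    in.readFully(blockType);
    int onDiskSizeWithoutHeader = in.readInt();        // 2. compressed size, header not included
    int uncompressedSizeWithoutHeader = in.readInt();  // 3. uncompressed size, header not included
    long prevBlockOffset = in.readLong();              // 4. offset of the previous block of the same type
    // 5. the (possibly compressed) block data itself follows the header
    System.out.printf("type=%s onDisk=%d uncompressed=%d prev=%d%n",
        new String(blockType), onDiskSizeWithoutHeader,
        uncompressedSizeWithoutHeader, prevBlockOffset);
  }
}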

E.3.3.  Block index in version 2

There are three types of block indexes in HFile version 2, stored in two different formats (root and non-root):

  1. Data index — version 2 multi-level block index, consisting of:

    1. Version 2 root index, stored in the data block index section of the file

    2. Optionally, version 2 intermediate levels, stored in the non-root format in the data index section of the file. Intermediate levels can only be present if leaf level blocks are present

    3. Optionally, version 2 leaf levels, stored in the non-root format inline with data blocks

  2. Meta index — version 2 root index format only, stored in the meta index section of the file

  3. Bloom index — version 2 root index format only, stored in the “load-on-open” section as part of Bloom filter metadata.

E.3.4.  Root block index format in version 2

This format applies to:

  1. Root level of the version 2 data index

  2. Entire meta and Bloom indexes in version 2, which are always single-level.

A version 2 root index block is a sequence of entries of the following format, similar to entries of a version 1 block index, but storing on-disk size instead of uncompressed size.

  1. Offset (long)

    This offset may point to a data block or to a deeper-level index block.

  2. On-disk size (int)

  3. Key (a serialized byte array stored using Bytes.writeByteArray)

    1. Key length as a variable-length integer (VInt)

    2. Key bytes

A single-level version 2 block index consists of just a single root index block. To read a root index block of version 2, one needs to know the number of entries. For the data index and the meta index the number of entries is stored in the trailer, and for the Bloom index it is stored in the compound Bloom filter metadata.

For a multi-level block index we also store the following fields in the root index block in the load-on-open section of the HFile, in addition to the data structure described above:

  1. Middle leaf index block offset

  2. Middle leaf block on-disk size (meaning the leaf index block containing the reference to the “middle” data block of the file)

  3. The index of the mid-key (defined below) in the middle leaf-level block.

These additional fields are used to efficiently retrieve the mid-key of the HFile used in HFile splits, which we define as the first key of the block with a zero-based index of (n – 1) / 2, if the total number of blocks in the HFile is n. This definition is consistent with how the mid-key was determined in HFile version 1, and is reasonable in general, because blocks are likely to be the same size on average, but we don’t have any estimates on individual key/value pair sizes.
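
A tiny sketch of that arithmetic (purely illustrative):

// Zero-based index of the data block whose first key is the HFile's mid-key,
// where n is the total number of data blocks in the file.
static int midBlockIndex(int n) {
  return (n - 1) / 2;   // e.g. n = 9 data blocks -> mid-key is the first key of block 4
}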

When writing a version 2 HFile, the total number of data blocks pointed to by every leaf-level index block is kept track of. When we finish writing and the total number of leaf-level blocks is determined, it is clear which leaf-level block contains the mid-key, and the fields listed above are computed.  When reading the HFile and the mid-key is requested, we retrieve the middle leaf index block (potentially from the block cache) and get the mid-key value from the appropriate position inside that leaf block.

E.3.5.  Non-root block index format in version 2

This format applies to intermediate-level and leaf index blocks of a version 2 multi-level data block index. Every non-root index block is structured as follows.

  1. numEntries: the number of entries (int).

  2. entryOffsets: the “secondary index” of offsets of entries in the block, to facilitate a quick binary search on the key (numEntries + 1 int values). The last value is the total length of all entries in this index block. For example, in a non-root index block with entry sizes 60, 80, 50 the “secondary index” will contain the following int array: {0, 60, 140, 190}.

  3. Entries. Each entry contains:

    1. Offset of the block referenced by this entry in the file (long)

    2. On-disk size of the referenced block (int)

    3. Key. The length can be calculated from entryOffsets (see the sketch below).
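
Putting the entry layout together, the key length follows directly from the secondary index, since each entry starts with an 8-byte offset and a 4-byte on-disk size (a sketch; the method name is illustrative):

// Key length of entry i in a non-root index block, recovered from the
// "secondary index" of entry offsets described above.
static int keyLength(int[] entryOffsets, int i) {
  int entrySize = entryOffsets[i + 1] - entryOffsets[i];
  return entrySize - (8 + 4);   // subtract the long offset and the int on-disk size
}

For the earlier example ({0, 60, 140, 190}), entry 1 occupies 140 - 60 = 80 bytes, so its key is 80 - 12 = 68 bytes long.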

E.3.6.  Bloom filters in version 2

In contrast with version 1, in a version 2 HFile Bloom filter metadata is stored in the load-on-open section of the HFile for quick startup.

  1. A compound Bloom filter.

    1. Bloom filter version = 3 (int). There used to be a DynamicByteBloomFilter class that had the Bloom filter version number 2

    2. The total byte size of all compound Bloom filter chunks (long)

    3. Number of hash functions (int)

    4. Type of hash functions (int)

    5. The total key count inserted into the Bloom filter (long)

    6. The maximum total number of keys in the Bloom filter (long)

    7. The number of chunks (int)

    8. Comparator class used for Bloom filter keys, a UTF-8 encoded string stored using Bytes.writeByteArray

    9. Bloom block index in the version 2 root block index format

E.3.7. File Info format in versions 1 and 2

The file info block is a serialized HBaseMapWritable (essentially a map from byte arrays to byte arrays) with the following keys, among others. StoreFile-level logic adds more keys to this.

hfile.LASTKEY

The last key of the file (byte array)

hfile.AVG_KEY_LEN

The average key length in the file (int)

hfile.AVG_VALUE_LEN

The average value length in the file (int)

File info format did not change in version 2. However, we moved the file info to the final section of the file, which can be loaded as one block at the time the HFile is being opened. Also, we no longer store the comparator in the version 2 file info; instead, we store it in the fixed file trailer, because we need to know the comparator when parsing the load-on-open section of the HFile.

E.3.8.  Fixed file trailer format differences between versions 1 and 2

The following table shows common and different fields between fixed file trailers in versions 1 and 2. Note that the size of the trailer is different depending on the version, so it is “fixed” only within one version. However, the version is always stored as the last four-byte integer in the file.

Fields common to both versions:

  • File info offset (long)
  • Number of data index entries (int)
  • Number of meta index entries (int)
  • Total uncompressed bytes (long)
  • Compression codec: 0 = LZO, 1 = GZ, 2 = NONE (int)

Fields that differ between the versions:

  • Data index offset (long) in version 1 is replaced in version 2 by loadOnOpenOffset (long), the offset of the section that we need to load when opening the file.
  • metaIndexOffset (long) in version 1 (this field is not being used by the version 1 reader, so we removed it from version 2) is replaced in version 2 by uncompressedDataIndexSize (long), the total uncompressed size of the whole data block index, including root-level, intermediate-level, and leaf-level blocks.
  • numEntries is an int in version 1 and a long in version 2.
  • The trailing version field is Version: 1 (int) in version 1 and Version: 2 (int) in version 2.

Fields present only in version 2:

  • The number of levels in the data block index (int)
  • firstDataBlockOffset (long), the offset of the first data block. Used when scanning.
  • lastDataBlockEnd (long), the offset of the first byte after the last key/value data block. We don't need to go beyond this offset when scanning.

Appendix F. Other Information About HBase

F.1. HBase Videos

Introduction to HBase

Building Real Time Services at Facebook with HBase by Jonathan Gray (Hadoop World 2011).

HBase and Hadoop, Mixing Real-Time and Batch Processing at StumbleUpon by JD Cryans (Hadoop World 2010).

F.2. HBase Presentations (Slides)

Advanced HBase Schema Design by Lars George (Hadoop World 2011).

Introduction to HBase by Todd Lipcon (Chicago Data Summit 2011).

Getting The Most From Your HBase Install by Ryan Rawson, Jonathan Gray (Hadoop World 2009).

F.3. HBase Papers

BigTable by Google (2006).

HBase and HDFS Locality by Lars George (2010).

No Relation: The Mixed Blessings of Non-Relational Databases by Ian Varley (2009).

F.4. HBase Sites

Cloudera's HBase Blog has a lot of links to useful HBase information.

  • CAP Confusion is a relevant entry for background information on distributed storage systems.

HBase Wiki has a page with a number of presentations.

F.5. HBase Books

HBase: The Definitive Guide by Lars George.

F.6. Hadoop Books

Hadoop: The Definitive Guide by Tom White.

Appendix H. HBase and the Apache Software Foundation

HBase is a project in the Apache Software Foundation and as such there are responsibilities to the ASF to ensure a healthy project.

H.1. ASF Development Process

See the Apache Development Process page for all sorts of information on how the ASF is structured (e.g., PMC, committers, contributors), for tips on contributing and getting involved, and for how open source works at the ASF.

H.2. ASF Board Reporting

Once a quarter, each project in the ASF portfolio submits a report to the ASF board. This is done by the HBase project lead and the committers. See ASF board reporting for more information.

Appendix I. Enabling Dapper-like Tracing in HBase

HBASE-6449 added support for tracing requests through HBase, using the open source tracing library HTrace. Setting up tracing is quite simple; however, it currently requires some very minor changes to your client code (it would not be very difficult to remove this requirement).

I.1. SpanReceivers

The tracing system works by collecting information in structs called ‘Spans’. It is up to you to choose how you want to receive this information by implementing the SpanReceiver interface, which defines one method:

public void receiveSpan(Span span);

This method serves as a callback whenever a span is completed. HTrace allows you to use as many SpanReceivers as you want so you can easily send trace information to multiple destinations.
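
For instance, a minimal receiver could simply print each completed span. The sketch below takes the interface exactly as described above; the import package is an assumption and differs between HTrace versions, so check the jars shipped with your HBase:

import org.cloudera.htrace.Span;          // assumed package; adjust to your HTrace version
import org.cloudera.htrace.SpanReceiver;  // assumed package; adjust to your HTrace version

// Illustrative receiver that dumps every completed span to stdout.
// Real receivers, such as HBaseLocalFileSpanReceiver, write structured output instead.
public class StdoutSpanReceiver implements SpanReceiver {
  public void receiveSpan(Span span) {  // callback invoked when a span is completed
    System.out.println(span);           // rely on Span's toString() for a one-line summary
  }
}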

Configure which SpanReceivers you’d like to use by setting the hbase-site.xml property hbase.trace.spanreceiver.classes to a comma-separated list of the fully-qualified class names of classes implementing SpanReceiver.

HBase includes an HBaseLocalFileSpanReceiver that writes all span information to local files in a JSON-based format. The HBaseLocalFileSpanReceiver looks in hbase-site.xml for a hbase.trace.spanreceiver.localfilespanreceiver.filename property whose value names the file to which nodes should write their span information.
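
A hedged hbase-site.xml sketch wiring up the included receiver (the fully-qualified class name and the output path are assumptions for illustration; substitute the class name from your HBase distribution and a path of your choosing):

<property>
  <name>hbase.trace.spanreceiver.classes</name>
  <!-- assumed package; verify against your HBase jars -->
  <value>org.apache.hadoop.hbase.trace.HBaseLocalFileSpanReceiver</value>
</property>
<property>
  <name>hbase.trace.spanreceiver.localfilespanreceiver.filename</name>
  <!-- illustrative output path -->
  <value>/tmp/htrace-spans.json</value>
</property>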

If you do not want to use the included HBaseLocalFileSpanReceiver, you are encouraged to write your own receiver (take a look at HBaseLocalFileSpanReceiver for an example). If you think others would benefit from your receiver, file a JIRA or send a pull request to HTrace.

I.2. Client Modifications

Currently, you must turn on tracing in your client code. To do this, you simply turn on tracing for requests you think are interesting, and turn it off when the request is done.

For example, if you wanted to trace all of your get operations, you change this:

HTable table = new HTable(...);
Get get = new Get(...);

into:

Span getSpan = Trace.startSpan("doing get", Sampler.ALWAYS);
try {
  HTable table = new HTable(...);
  Get get = new Get(...);
  ...
} finally {
  getSpan.stop();
}

If you wanted to trace half of your ‘get’ operations, you would pass in:

new ProbabilitySampler(0.5)

in lieu of Sampler.ALWAYS to Trace.startSpan(). See the HTrace README for more information on Samplers.

 

Index

C

Cells, Cells
Column Family, Column Family
Column Family Qualifier, Column Family
Compression, Compression In HBase
 

H

Hadoop, hadoop

L

LargeTests

M

MediumTests,
MSLAB, Long GC pauses

N

nproc, ulimit 和 nproc

S

SmallTests,

U

ulimit, ulimit 和 nproc

V

Versions, 版本

Z

ZooKeeper, ZooKeeper