Storm(一)什麼是Storm

這裏寫圖片描述

Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reliably process unbounded streams of data, doing for realtime processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun to use!

Storm是免費開源的分佈式實時計算框架。Hadoop用於對數據的批處理,而Storm可以可靠地處理海量的流式數據。Storm是簡單易用的,可以與多種編程語言結合使用,學習和使用storm的過程是快樂的!

Storm has many use cases: realtime analytics, online machine learning, continuous computation, distributed RPC, ETL, and more. Storm is fast: a benchmark clocked it at over a million tuples processed per second per node. It is scalable, fault-tolerant, guarantees your data will be processed, and is easy to set up and operate.

Storm有很多使用場景:實時分析,線上機器學習,持續計算,分佈式RPC,ETL等等。Storm單個節點每秒可以處理百萬級別的tuple。Storm是可靠的、易用的、可擴展的、高容錯的。

Storm integrates with the queueing and database technologies you already use. A Storm topology consumes streams of data and processes those streams in arbitrarily complex ways, repartitioning the streams between each stage of the computation however needed. Read more in the tutorial.

Storm集成了隊列和數據庫技術。一個Storm topology可以以任意複雜的方式來處理數據流,並且可以在計算的過程中可以對數據流重新分區。


參考鏈接:http://storm.apache.org/

發表評論
所有評論
還沒有人評論,想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.
相關文章