Hive BI analytics: Visual Reporting

2020-02-23 12:43

Filed under: Hadoop, Hive, HPC, Java world — Tags: Analytics Hive, BI, Hadoop, Hive, Hive JDBC, Intellicus, large data, MapReduce, Pentaho — indoos @ 5:23 pm

I had earlier written about using Hive as a data source for industry-proven BI reporting tools. Here is a list of the official announcements from Pentaho, Intellicus, Talend and MicroStrategy:

Pentaho – http://www.pentaho.com/hadoop/, Business Intelligence Player Pentaho Embraces Hadoop
Intellicus – Intellicus to support Hadoop framework for Large Data
Talend – Hadoop Support Available from Industry’s Leading Open Source Data Integration Provider. However, Hadoop features will only be in Talend’s Integration Suite and will not, according to a company spokesperson, be available in Talend’s open source core product, Open Studio.
MicroStrategy (with Cloudera) – MicroStrategy Announces Support for Apache Hadoop
Cloudera is also working with the Quest team on an Oracle connector: http://www.theregister.co.uk/2010/06/21/cloudera_and_quest_team_on_oracle_connector/

The topic is close to my heart since I firmly believe that while Hadoop and Hive are true large-data analytics tools, their power is currently limited to software programmers. The arrival of BI tools in the Hadoop/Hive world should bring that power closer to the real end users: business users.

I am not yet sure how these BI reporting tools decide how much of the analytics to leave in MapReduce and how much to do in the reporting tool itself; I guess it will take time to find the right balance. Chances are that I will find out a bit earlier than others, as I am working closely (read here) with the Intellicus team to get changes into the Hive JDBC driver for Intellicus' interoperability with Hive.
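To make the trade-off concrete, here is a minimal sketch of the two extremes: pushing the aggregation down into the query engine versus fetching raw rows and summing them in the reporting layer. It is only an illustration. Python's built-in sqlite3 stands in for Hive so that the code runs anywhere, and the `sales` table, its columns and the function names are all hypothetical. With a real Hive connection the pushed-down GROUP BY would be executed as MapReduce work on the cluster.

```python
import sqlite3


def make_demo_connection():
    """Build an in-memory database that stands in for Hive (assumption)."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical table and data, used only for illustration.
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO sales VALUES (?, ?)",
        [("east", 10.0), ("west", 5.0), ("east", 2.5), ("west", 1.5)],
    )
    return conn


def report_pushed_down(conn):
    """The engine aggregates; only one row per region reaches the tool."""
    cur = conn.execute("SELECT region, SUM(amount) FROM sales GROUP BY region")
    return dict(cur.fetchall())


def report_in_tool(conn):
    """Raw rows are fetched and the reporting layer does the summing."""
    totals = {}
    rows_fetched = 0
    for region, amount in conn.execute("SELECT region, amount FROM sales"):
        totals[region] = totals.get(region, 0.0) + amount
        rows_fetched += 1
    return totals, rows_fetched


conn = make_demo_connection()
pushed = report_pushed_down(conn)
in_tool, rows_fetched = report_in_tool(conn)

# Same answer either way; what differs is how many rows cross the wire.
print(pushed == in_tool, len(pushed), rows_fetched)
```

Both paths give the same totals. The difference is the number of rows that travel to the reporting tool, and on large data that is usually the deciding factor.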

February 8, 2010

BI with MapReduce

Filed under: Advanced computing, Hadoop — Tags: Aster data, BI, coulmn based database, Hadoop, Hive, Pentaho — indoos @ 2:12 pm

Have any of you used map reduce in the context of business intelligence?

While collating my thoughts on this LinkedIn Hadoop discussion, I found that I needed more visuals to explain it to myself first.

So, here are the ways in which Hadoop MapReduce offers an alternative in the big, big BI world:

Scenario 1: Use Hadoop and Hive as the interface to BI tools. Pentaho reporting is already supported as of Hive 0.4.0.

Scenario 2: Use Hadoop for initial data polishing, and then dump to a SQL-supporting column-based database for near-real-time BI reporting. Aster Data, Vertica and Greenplum sell themselves by heavily advertising MapReduce connectors (or similar). The cost of such a database is the only pain point here (plus the risk of how these actually scale versus what they promise).

Scenario 3: Use Hadoop for initial data polishing, and then dump to a SQL-supporting column-based database for near-real-time BI reporting. For real-time reporting, the data can be polished further and moved from the column-based database to a fast regular RDBMS with BI support.

Scenario 4: The free way :) Use Hadoop for initial data polishing, and then dump to a regular SQL database with BI support. The export from HDFS can be done the un-Sqoop way. The onus is then on the developer to dump only ready-for-report data (a smaller volume), with most of the BI already completed in further MR steps.
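As a rough sketch of what a hand-rolled export in Scenario 4 could look like: a small script reads already-aggregated reducer output (tab-separated key/value lines, the default text output format in Hadoop) and loads it into a regular SQL database. Everything specific here is an assumption made for illustration: sqlite3 stands in for the regular RDBMS, the `region_totals` table and the function names are hypothetical, and the sample lines stand in for a part file copied out of HDFS.

```python
import sqlite3


def parse_reducer_output(lines):
    """Yield (key, value) pairs from 'key<TAB>value' text lines."""
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue  # skip blank lines
        key, value = line.split("\t", 1)
        yield key, float(value)


def load_report_rows(conn, rows):
    """Load ready-for-report rows into a hypothetical reporting table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS region_totals "
        "(region TEXT PRIMARY KEY, total REAL)"
    )
    with conn:  # commit on success, roll back on error
        conn.executemany(
            "INSERT OR REPLACE INTO region_totals VALUES (?, ?)", rows
        )


# Stand-in for the contents of a reducer part file copied out of HDFS.
part_file = ["east\t12.5\n", "west\t6.5\n", "\n"]

db = sqlite3.connect(":memory:")  # sqlite3 stands in for the regular RDBMS
load_report_rows(db, parse_reducer_output(part_file))

report = db.execute(
    "SELECT region, total FROM region_totals ORDER BY region"
).fetchall()
print(report)
```

Because the heavy lifting already happened in MapReduce, the database only ever sees a handful of summary rows, which is what keeps this route cheap.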

The important point to note is that moving the major chunk of BI data analysis to programmatic interfaces (SQL or MR) might carry additional costs.

I am not much of a fallen-in-love-with-databases type, so I do like the way Hive can emerge as a potential BI reporting tool.
