From Fedora Project Wiki
(Created page with "= Apache Ambari = == Summary == [http://ambari.apache.org/ Apache Ambari] is a cluster management tool for Apache Hadoop. == Owner == * Name: [[User:pmackinn| Peter MacKinno...")
 
m (Detailed Description)
Line 15: Line 15:
  
 
== Detailed Description ==
 
== Detailed Description ==
Apache Pig is a platform for analysing large data sets that consists of a high-level language for expressing data analysis programs, coupled with infrastructure for evaluating these programs. The salient property of Pig programs is that their structure is amenable to substantial parallelization,  
+
The Apache Ambari project is aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its RESTful APIs.
which in turns enables them to handle very large data sets.
 
  
 
== Benefit to Fedora ==
 
== Benefit to Fedora ==

Revision as of 15:20, 7 April 2014

Apache Ambari

Summary

Apache Ambari is a cluster management tool for Apache Hadoop.

Owner

Current status

  • Targeted release: Fedora 21
  • Last updated: 7 April 2014
  • Tracker bug: <will be assigned by the Wrangler>

Detailed Description

The Apache Ambari project is aimed at making Hadoop management simpler by developing software for provisioning, managing, and monitoring Apache Hadoop clusters. Ambari provides an intuitive, easy-to-use Hadoop management web UI backed by its RESTful APIs.

Benefit to Fedora

Apache Pig is a data analysis tool used by many parts of the Hadoop ecosystem. Including it in Fedora increases usefulness of the Apache Hadoop package that is already in Fedora.

Scope

  • Proposal owners: The Pig package has been accepted into Fedora and provides all the functionality from the upstream release with the exception of jython (version) and parquet (unpackaged) support.
  • Other developers: N/A (not a System Wide Change)
  • Release engineering: N/A (not a System Wide Change)
  • Policies and guidelines: N/A (not a System Wide Change)

Upgrade/compatibility impact

N/A (not a System Wide Change)

How To Test

An upstream quickstart guide is available here which describes setup and simple examples.

User Experience

Users should be able to write and run applications that use Apache Pig for analysis of large data sets stored in Hadoop.

Dependencies

Apache HBase

Contingency Plan

  • Contingency mechanism: N/A (not a System Wide Change)
  • Contingency deadline: N/A (not a System Wide Change)
  • Blocks release? N/A (not a System Wide Change)
  • Blocks product? N/A

Documentation

N/A (not a System Wide Change)

Release Notes

Fedora 21 includes Apache Pig, the Hadoop data analysis tool.