Apache Ambari is a tool for provisioning, managing, and monitoring Apache Hadoop clusters.
Apache Phoenix is a SQL skin over HBase delivered as a client-embedded JDBC driver targeting low latency queries over HBase data.
Greenplum Database - Massively Parallel PostgreSQL for Analytics. An open-source massively parallel data platform for analytics, machine learning and AI.
Apache Livy is an open source REST interface for interacting with Apache Spark from anywhere.
The Apache TEZ project is aimed at building an application framework which allows for a complex directed-acyclic-graph of tasks for processing data
Apache Pig is a platform for analyzing large data sets.
A free and open source distributed realtime computation system
A Distributed Storage System for Structured Data
Quantcast File System (QFS) is a high-performance, fault-tolerant, distributed file system developed to support MapReduce processing, or other applications reading and writing large files sequentially.
The goal of the Apache Mahout™ project is to build an environment for quickly creating scalable, performant machine learning applications.
Apache Ignite In-Memory Computing, Database and Caching Platform
Apache Giraph is an iterative graph processing system built for high scalability.
A software platform for processing vast amounts of data
Apache ranger is a framework to enable,monitor and comprehensive data security across the Hadoop platform.
Apache atlas framework is an extensible set of core foundational governace services.