Hadoop

Hadoop
4.3 3.9 4.1
Smile Visitors Global

Hadoop has become the benchmark platform for writing applications to store and process decentralised data in batch mode.

Read full page
top

Hive

Hive
4.5 3.2 3.9
Smile Visitors Global

Like Pig, Hive allows developers not proficient in Java to write data processing tasks. But where Pig has defined a procedural language used to work the cluster, Hive can define SQL type structured tables and feed them with data either from the cluster or from outside sources.

Read full page
top

Pig

Pig
4.2 No rating 4.2
Smile Visitors Global

Pig is a data processing tool that is part of the Hadoop suite. It provides for the writing of scripts executed on the Hadoop infrastructure without having to first write Java tasks using the MapReduce framework. In addition, it has functionalities for loading data from an outside source to the HDFS cluster and others for exporting data for use by third party applications.

Read full page
top

Sqoop

sqoop
4.2 No rating 4.2
Smile Visitors Global

Sqoop is an Apache Foundation project whose goal is to improve the cohabitation of traditional DBMS type systems with the Hadoop platform.

Read full page
top