Overview: Apache HBase is one of the most popular non-relational databases built on top of Hadoop and HDFS (Hadoop Distributed File System). It is also known as the Hadoop database. As an Apache project, HBase is an open-source, versioned and distributed…
Apache Pig and Hadoop platform – How to process your data?
Overview: Apache Pig is a high-level scripting language and part of the Apache Hadoop ecosystem. Pig scripting is mainly used for data analysis and manipulation on top of the Hadoop platform. We know that MapReduce is a programming model used…
Hadoop Basic concepts – Learn it now
Introduction: In this series, we will discuss some of the basic concepts in Hadoop and big data. We have tried to cover the basic concepts and explain them so that they are easy to learn and implement. We will keep adding…
What is Hadoop distributed file system (HDFS)?
Overview: In this article I will discuss HDFS, the underlying file system of the Apache Hadoop framework. Hadoop Distributed File System (HDFS) is a distributed storage layer that spans thousands of commodity machines. This file system provides…
What is the importance of Hadoop Architecture in Production Success?
Overview: Hadoop is a platform that is almost synonymous with Big Data. It is an open-source framework that allows the storage and processing of large data sets across clusters. Primarily, the Hadoop architecture is known to comprise…