Tổng quan: There are a bunch of working definitions for Big Data yet to me it is put as data collection so substantial and complex that it ends up troublesome or difficult to process those utilizing conventional databases. For a little…
Lớn dữ liệu: A Big Bad Data or a Game Changer
Tổng quan: Data is evolving everywhere- from a single voice search you just made to locate a nearby restaurant in your neighbor to the last weekend’s party pictures which you just threw over Facebook account. “A report reveals that 2.5 quintillion…
Why Apache Spark is the future platform for big data?
Tổng quan: As big data becomes one of the most important assets an enterprise can possess, enterprises are demanding more out of the data. Enterprises expect data to provide complex and multidimensional insights at high speeds. To provide such insights, companies…
Khám phá HBase NoSQL DB,en
Tổng quan: Apache HBase is one of the most popular non-relational databases built on top of Hadoop and HDFS (Hadoop Distributed File system). It is also known as Hadoop database. As an Apache project, HBase is an open-source, versioned and distributed…
Apache Pig and Hadoop platform – How to process your data?
Tổng quan: Apache Pig is a high level scripting language and a part of Apache Hadoop Eco-system. Pig scripting is mainly used for data analysis and manipulation on top of Hadoop platform. We know that MapReduce is a programming model used…
Các khái niệm cơ bản của Hadoop - Tìm hiểu ngay bây giờ,en
Giới thiệu: In this series, we will discuss some of the basic concepts in Hadoop and big data. We have tried to cover basic concepts and explain them to make it easy to learn and implement. We will keep on adding…
Steps to work with Windows Azure HDInsight
Tổng quan: Hadoop has made big data handling simpler and it goes without saying that in the context of the huge importance big data is being given, Hadoop is viewed as a key tool in big data management. However, tổ chức có thể…
Hadoop installation modes – Let’s explore
Tổng quan: Apache Hadoop can be installed in different modes as per the requirement. These different modes are configured during installation. Theo mặc định, Hadoop is installed in Standalone mode. The other modes are Pseudo distributed mode and distributed mode. The purpose…
What is HDFS federation?
Tổng quan: We are well aware of the features of Hadoop and HDFS. In this document we will talk about the HDFS federation which helps us to enhance an existing HDFS architecture. It provides a clear separation between namespace and storage…
Mùa xuân cho Apache Hadoop là gì?
Tổng quan: Mùa xuân là một trong những khuôn khổ sử dụng rộng rãi trong việc phát triển các ứng dụng doanh nghiệp. Mùa xuân có các thành phần khác nhau như Spring ORM, Mùa xuân JDBC vv để hỗ trợ các tính năng khác nhau. Mùa xuân cho Apache Hadoop là khuôn khổ để hỗ trợ xây dựng ứng dụng với các thành phần Hadoop…
What are the latest trends in big data and analytics?
Tổng quan: Big data technology is coming up with best practices and better trends every day. Big data is gradually coming into main stream projects also and gaining momentum. With big data, analytics is also getting much importance, vì nó là…
What is Hadoop distributed file system (HDFS)?
Tổng quan: In this article I will discuss about HDFS, which is the underlying file system of Apache Hadoop framework. Hadoop Distributed File System (HDFS) is a distributed storage space that spans across thousands of commodity hardware. hệ thống tập tin này cung cấp…
How Hadoop Streaming works?
Tổng quan: Hadoop streaming is one of the most important utility in Hadoop distribution. The Streaming interface of Hadoop allows you to write Map-Reduce program in any language of your choice, which can work with STDIN and STDOUT. So, streaming thể…