Yleiskatsaus: Big Data -toiminnon määritelmiä on vielä joukko, mutta minulle se on niin merkittävä ja monimutkainen, että se on hankalaa tai vaikeaa käsitellä niitä, jotka käyttävät tavanomaisia tietokantoja,,en,Hieman,,en. For a little…
Big Data: A Big Bad Data or a Game Changer
Yleiskatsaus: Data is evolving everywhere- from a single voice search you just made to locate a nearby restaurant in your neighbor to the last weekend’s party pictures which you just threw over Facebook account. “A report reveals that 2.5 quintillion…
Why Apache Spark is the future platform for big data?
Yleiskatsaus: As big data becomes one of the most important assets an enterprise can possess, enterprises are demanding more out of the data. Enterprises expect data to provide complex and multidimensional insights at high speeds. To provide such insights, companies…
Exploring HBase NoSQL DB
Yleiskatsaus: Apache HBase is one of the most popular non-relational databases built on top of Hadoop and HDFS (Hadoop Distributed File system). It is also known as Hadoop database. As an Apache project, HBase is an open-source, versioned and distributed…
Apache Pig and Hadoop platform – How to process your data?
Yleiskatsaus: Apache Pig is a high level scripting language and a part of Apache Hadoop Eco-system. Pig scripting is mainly used for data analysis and manipulation on top of Hadoop platform. We know that MapReduce is a programming model used…
Hadoop Basic concepts – Learn it now
Käyttöönotto: In this series, we will discuss some of the basic concepts in Hadoop and big data. We have tried to cover basic concepts and explain them to make it easy to learn and implement. We will keep on adding…
Steps to work with Windows Azure HDInsight
Yleiskatsaus: Hadoop has made big data handling simpler and it goes without saying that in the context of the huge importance big data is being given, Hadoop is viewed as a key tool in big data management. However, organizations might…
Hadoop asennus tilaa - Tutkitaan
Yleiskatsaus: Apache Hadoop voidaan asentaa eri tiloissa kohti vaatimus. Nämä eri tilat on määritetty asennuksen aikana. Oletuksena, Hadoop is installed in Standalone mode. The other modes are Pseudo distributed mode and distributed mode. The purpose…
Mikä on HDFS liittovaltio?
Yleiskatsaus: Olemme hyvin tietoisia ominaisuudet Hadoop ja HDFS. Tässä asiakirjassa me puhumme HDFS liittovaltio, joka auttaa meitä parantamaan olemassa HDFS arkkitehtuuri. It provides a clear separation between namespace and storage…
Mikä on Spring Apache Hadoop?
Yleiskatsaus: Spring is one of the widely used frameworks in enterprise applications development. Spring has different components like Spring ORM, Spring JDBC jne tukemaan erilaisia ominaisuuksia. Spring for Apache Hadoop is the framework to support application building with Hadoop components…
Mitkä ovat uusimpia suuntauksia iso tiedot ja analyysit?
Yleiskatsaus: Big data tekniikka on keksiä parhaita käytäntöjä ja parempia trendejä päivittäin. Big data on vähitellen tulossa valtavirtaa projekteja myös ja elpyessä. Iso data, analytiikka on myös saada paljon merkitystä, as it is…
What is Hadoop distributed file system (HDFS)?
Yleiskatsaus: In this article I will discuss about HDFS, which is the underlying file system of Apache Hadoop framework. Hadoop Distributed File System (HDFS) is a distributed storage space that spans across thousands of commodity hardware. This file system provides…
How Hadoop Streaming works?
Yleiskatsaus: Hadoop streaming is one of the most important utility in Hadoop distribution. The Streaming interface of Hadoop allows you to write Map-Reduce program in any language of your choice, which can work with STDIN and STDOUT. So, Streaming can…