Introduction: Since Big Data was introduced, it has gone through multiple phases of evolution. Hadoop was introduced in 2005 with some initial features such as the MapReduce processing engine, which allowed large-scale data processing workloads distributed…
What is Apache Kudu?
Overview: Kudu is a new open source project that provides updatable storage. It complements HDFS/HBase, which provide sequential and read-only storage. Kudu is more suitable for fast analytics on fast data, which is the demand of…
Big Data and Education Industry
Overview: Big data has been driving revolutionary changes in education. There hardly remains an area in education not impacted by big data. You can notice the changes in the ways educational institutions are governed, course quality is managed and student…
Big Data characteristics and pain points
Overview: Big data is defined by its three most important characteristics: volume, velocity and variety. It comes in different forms and structures. Big data analytics has a significant impact on business decisions. But it comes with some pain points.…
How Open Data Platform simplifies Hadoop adoption?
Overview: The Open Data Platform (ODP) is an industry initiative focused on simplifying the adoption of Apache Hadoop by enterprises and enabling Big Data solutions to thrive with better ecosystem interoperability. It builds on the strengths of the Apache…
Exploring Zeta Architecture
Overview: The Zeta Architecture is a new way of setting up your solution and enterprise architecture. When you deploy the Zeta Architecture, you combine your solution and enterprise architecture, unlike the way architecture systems are used now. Traditionally, the…
Measuring the ROI in Hadoop adoption
Overview: Nowadays, people seem to be seriously misinformed about Hadoop, mainly because of the many half-truths circulating about it in the market. However, these half-truths are to be expected, as Hadoop is said to be one of the best…
SQL on Hadoop – How does it work?
Overview: SQL on Hadoop is a group of analytical application tools that combine SQL-style querying and processing of data with the most recent Hadoop data framework elements. The emergence of SQL on Hadoop is an important development for big…
Want to know about Big Data myths?
Overview: Big data, data science, and big data analytics are perhaps some of the hottest terms in today’s technology world. But at the same time, there is a lot of misunderstanding and confusion about those terms. So people start thinking…
Big data (Hadoop) as a service – How does it work?
Overview: In today’s technology world, software as a service (SaaS) is a common model. The service is offered to subscribers on an as-needed basis. Big data is also following the service model. In this article, I will talk about the…
Pairing of IOT and Hadoop
Overview: The Internet of Things (IoT) is the new reality that is going to penetrate our lives. According to Gartner, the number of interconnected devices will increase to 6.4 billion in 2016, up by 30% from 2015, thus generating…
Big Data: A Big Bad Data or a Game Changer
Overview: Data is evolving everywhere, from a single voice search you just made to locate a nearby restaurant in your neighborhood to last weekend’s party pictures that you just posted to your Facebook account. “A report reveals that 2.5 quintillion…
Why Apache Spark is the future platform for big data?
Overview: As big data becomes one of the most important assets an enterprise can possess, enterprises are demanding more out of the data. Enterprises expect data to provide complex and multidimensional insights at high speeds. To provide such insights, companies…