Dark data is a subset of big data but it constitutes the biggest portion of the total volume of big data collected by organizations in a year. Níl sonraí Dark anailís de ghnáth nó próiseáilte mar gheall ar chúiseanna éagsúla ag…
How is big data helping build smart cities?
There has been a lot of activity around the concept of Smart City for some time. Cities are being identified as future smart cities. Theoretically at least, smart cities can fundamentally change our lives at many levels such as less…
What is the success rate in Hadoop adoption?
There has been a lot of hype around Hadoop for a long time. This hype was expected because Hadoop is perceived an extremely efficient big data processing tool. But time has come to look at some cold, hard facts. It…
What are the top big data analytics pain points?
Big Data offers business enterprises a never-before opportunity to improve productivity and their revenue. However, enterprises have been struggling with the task of getting the best out of the Big Data they collect. A survey conducted in 2012 on 300…
What is the impact of big data in home health care?
Big data represents an unprecedented opportunity for the healthcare industry to move to the next level of service quality. While a lot of discussion on the relation between big data and the healthcare industry tend to circle around the services…
How big data analytics can help in Personal Fitness Devices?
The personal fitness device industry is changing with the advent of the Internet of Things (IoT). Previously, the personal fitness devices were just devices, isolated, doing a specific job such as recording your blood pressure. You could either view the…
How Big Data is Influencing Data Driven Advertising?
Big data has been significantly influencing data driven advertising. Originally, big data is a good fit for data driven advertising because this type of advertising mainly depends upon data. A survey conducted by BlueKai, a leading big data platform found…
How can you manage large volume of data using Apache Cassandra NoSQL database?
Forbhreathnú: Apache Cassandra is one of the most popular and scalable open source NoSQL database. Cassandra is an ideal database for managing huge volume of unstructured, semi-structured and structured data across multiple data centers and the cloud environment. Cassandra delivers…
What is Apache Spark?
Forbhreathnú: Apache spark is a high performance general engine used to process large scale data. It is an open source framework used for cluster computing. Aim of this framework is to make the data analytic faster – both in terms…
What is Apache Shark?
Forbhreathnú: Apache shark is a distributed query engine developed by the open source community. This query engine is mainly used for Hadoop data. It provides enhanced performance and high-end analytical results to Hive users. In this document, I will talk…
How to process your data using Apache Pig?
Forbhreathnú: Apache Pig is a platform and a part of BigData eco-system. The platform is used to process large volume of data set in a parallel way. The pig platform works on top of Apache Hadoop and MapReduce Platform. As…
What Are The Advanced Apache Hadoop MapReduce Features?
Forbhreathnú: The basic MapReduce programming explains the work flow details. But it does not cover the actual working details inside the MapReduce programming framework. This article will explain the data movement through the MapReduce architecture and the API calls used…
How NoSQL integrates with Hadoop eco-system?
Apache Hadoop is an open source big data processing platform. It has its own eco-system products to support various needs. Different big data products/platforms can integrate Hadoop and NoSQL into one platform so it provides better performance and a single source of…