Dark data is a subset of big data but it constitutes the biggest portion of the total volume of big data collected by organizations in a year. Dark data is not usually analysed or processed because of various reasons by…
Sut y mae data mawr helpu i adeiladu dinasoedd smart?
There has been a lot of activity around the concept of Smart City for some time. Cities are being identified as future smart cities. Theoretically at least, smart cities can fundamentally change our lives at many levels such as less…
Beth yw'r gyfradd llwyddiant o ran mabwysiadu Hadoop?
There has been a lot of hype around Hadoop for a long time. This hype was expected because Hadoop is perceived an extremely efficient big data processing tool. But time has come to look at some cold, hard facts. It…
Beth yw'r analytics data mawr bwyntiau poen top?
Data Mawr yn cynnig cyfle byth-cyn i wella cynhyrchiant ac mae eu refeniw mentrau busnes. However, o fentrau wedi bod yn cael trafferth gyda'r dasg o gael y gorau allan o'r Data Big maent yn ei gasglu. Mae arolwg a gynhaliwyd yn 2012 on 300…
Beth yw effaith y data mawr mewn gofal iechyd cartref?
data Mawr yn gyfle digynsail ar gyfer y diwydiant gofal iechyd i symud i'r lefel nesaf o ansawdd y gwasanaeth. While a lot of discussion on the relation between big data and the healthcare industry tend to circle around the services…
Sut analytics data mawr yn gallu helpu mewn Dyfeisiau Ffitrwydd Personol?
The personal fitness device industry is changing with the advent of the Internet of Things (IoT). Previously, the personal fitness devices were just devices, isolated, doing a specific job such as recording your blood pressure. You could either view the…
How Big Data is Influencing Data Driven Advertising?
Big data has been significantly influencing data driven advertising. Originally, big data is a good fit for data driven advertising because this type of advertising mainly depends upon data. A survey conducted by BlueKai, a leading big data platform found…
How can you manage large volume of data using Apache Cassandra NoSQL database?
Trosolwg: Apache Cassandra is one of the most popular and scalable open source NoSQL database. Cassandra is an ideal database for managing huge volume of unstructured, semi-structured and structured data across multiple data centers and the cloud environment. Cassandra delivers…
What is Apache Spark?
Trosolwg: Apache spark is a high performance general engine used to process large scale data. It is an open source framework used for cluster computing. Aim of this framework is to make the data analytic faster – both in terms…
What is Apache Shark?
Trosolwg: Apache shark is a distributed query engine developed by the open source community. This query engine is mainly used for Hadoop data. It provides enhanced performance and high-end analytical results to Hive users. In this document, I will talk…
How to process your data using Apache Pig?
Trosolwg: Apache Pig is a platform and a part of BigData eco-system. The platform is used to process large volume of data set in a parallel way. The pig platform works on top of Apache Hadoop and MapReduce Platform. As…
What Are The Advanced Apache Hadoop MapReduce Features?
Trosolwg: The basic MapReduce programming explains the work flow details. But it does not cover the actual working details inside the MapReduce programming framework. This article will explain the data movement through the MapReduce architecture and the API calls used…
How NoSQL integrates with Hadoop eco-system?
Apache Hadoop is an open source big data processing platform. It has its own eco-system products to support various needs. Different big data products/platforms can integrate Hadoop and NoSQL into one platform so it provides better performance and a single source of…