Overview: Gradle is an automated project building tool which uses the concepts of both Apache Ant and Apache Maven. Gradle is based on a domain specific language rather than the traditional XML approach used by Apache Ant and Apache Maven.…
What is DooPHP?
Overview: DooPHP is a high performance open source PHP framework. It is also a rapid development framework for PHP application development. It uses common design patterns like MVC and ORM. The framework helps to write less code for performing tasks…
What are the practical advantages and disadvantages of Cloud Computing?
Overview: Cloud computing is nothing but delivery of computing as a service rather than as a product. When we use the word computing, it includes the cost of CPU, the memory, the storage, network and other software required to create…
How can you manage large volume of data using Apache Cassandra NoSQL database?
Overview: Apache Cassandra is one of the most popular and scalable open source NoSQL database. Cassandra is an ideal database for managing huge volume of unstructured, semi-structured and structured data across multiple data centers and the cloud environment. Cassandra delivers…
What is Apache Spark?
Overview: Apache spark is a high performance general engine used to process large scale data. It is an open source framework used for cluster computing. Aim of this framework is to make the data analytic faster – both in terms…
What is Apache Shark?
Overview: Apache shark is a distributed query engine developed by the open source community. This query engine is mainly used for Hadoop data. It provides enhanced performance and high-end analytical results to Hive users. In this document, I will talk…
How to process your data using Apache Pig?
Overview: Apache Pig is a platform and a part of BigData eco-system. The platform is used to process large volume of data set in a parallel way. The pig platform works on top of Apache Hadoop and MapReduce Platform. As…
How to create your first HIVE script?
Overview: Apache Hive is an integral part of Hadoop eco-system. Hive can be defined as a data warehouse like software which facilitates query and large data management on HDFS (Hadoop distributed file system). One must remember that Hive is not…
What is Apache HBase and when should you use it?
Overview: Apache HBase can be defined as the Hadoop database. It is a distributed, non-relational and open source database written in Java. It is developed based on the Google BigTable framework and runs on HDFS (Hadoop distributed file system). Apache…
What Are The Advanced Apache Hadoop MapReduce Features?
Overview: The basic MapReduce programming explains the work flow details. But it does not cover the actual working details inside the MapReduce programming framework. This article will explain the data movement through the MapReduce architecture and the API calls used…
How to run JUNIT testing framework using build tool ANT?
Overview: In this document we will discuss about the build tool ant and unit testing framework Junit. Both of these have become an integral part of java development. Both ant and Junit are widely used in the java world. Most…
What is downtime and importance of third party SLAs?
SLAs (Service Level Agreement) are the most important and critical component of any vendor contract. This component describes all the service level expectations (services and quality) from the vendor. It also clearly defines the penalties or the alternate solution if…
What is Streaming Application Testing?
In this current age of data explosion, real time traffic over the internet is growing rapidly. Streaming applications are one of the most important areas where real time data is captured and delivered instantly. The streaming information is not only…