Overview: To anyone applying AI in any form, the response to the heading above might be “Duh!” That’s an obvious statement to those engrossed at the coalface, but for many others (especially on the client side), they have yet to…
How Open Data Platform simplifies Hadoop adoption?
Overview The Open Data Platform (ODP) is an industry initiative focused on simplifying the adoption of Apache Hadoop by the Enterprise and enabling Big Data solutions to thrive with better ecosystem interoperability. It builds on the strengths of the Apache…
Why Apache Spark is the future platform for big data?
Overview: As big data becomes one of the most important assets an enterprise can possess, enterprises are demanding more out of the data. Enterprises expect data to provide complex and multidimensional insights at high speeds. To provide such insights, companies…
Introduction to Apache Spark with Examples and Use Cases
BY RADEK OSTROWSKI – SOFTWARE ENGINEER @ TOPTAL I first heard of Spark in late 2013 when I became interested in Scala, the language in which Spark is written. Sometime later, I did a fun data science project trying to predict survival on the…
Why Apache Flink and Apache Spark are used for Processing Streaming Data?
The demand for faster data processing has been increasing and real-time streaming data processing appears to be the answer. While Apache Spark is still being used in a lot of organizations for big data processing, Apache Flink has been coming…