Big Data

Big Data refers to datasets that are too large to be dealt-with by using traditional data processing techniques. Big Data is also defined as data that contains greater variety, arriving in increasing volumes and with more velocity. This is also known as the three V’s:

  • Variety
  • Volume
  • Velocity

Big Data processing is a key part of Data Science, and has been evolving since its definition due to the large amount of information generated on a daily basis.

This section will discuss Big Data historical context, the main pillars of the discipline, its relation with Data Science, Data Analysis & Data Engineering, the mail tools used for Big Data manipulation, processing & analysis, and more.

Over the last two articles of this series, we have discussed different Big Data file formats and their overall…
In the first part of this 3-article series, we introduced the concepts of columnar file formats & row-based file formats. We also…
A Big Data file format is designed to store high volumes of variable data optimally. This can be achieved…

Request Full Resume