- 40 min
Over the last two articles of this series, we have discussed different Big Data file formats and their overall…
Big Data refers to datasets that are too large to be dealt-with by using traditional data processing techniques. Big Data is also defined as data that contains greater variety, arriving in increasing volumes and with more velocity. This is also known as the three V’s:
Big Data processing is a key part of Data Science, and has been evolving since its definition due to the large amount of information generated on a daily basis.
This section will discuss Big Data historical context, the main pillars of the discipline, its relation with Data Science, Data Analysis & Data Engineering, the mail tools used for Big Data manipulation, processing & analysis, and more.
This blog was created because I firmly believe in open source technology and free learning resources. If you’d like to complement the content I create, you’re welcome to drop a message using the contact form.
© Pablo Aguirre 2023.
All rights reserved.