
by carlos bernal, 1 year ago


the 4 Vs of Big Data

The concept of Big Data revolves around managing, processing, and analyzing large and complex data sets. Four primary characteristics define Big Data: velocity, veracity, volume, and variety.


VERACITY OF BIG DATA

Refers to the quality of the data that is being analyzed.

EXAMPLE:

Data from a medical experiment or trial.

Noise: the non-valuable data in a data set.

Low veracity

Contains a high percentage of meaningless data.

High veracity

Has many records that are valuable to analyze and that contribute in a meaningful way to the overall results.
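
One rough way to make the low/high distinction concrete is to measure the share of records that are meaningful. The sketch below is only an illustration: the function name `veracity_ratio`, the sample readings, and the rule for what counts as noise are all assumptions, not part of the original map.

```python
def veracity_ratio(records, is_meaningful):
    """Return the fraction of records judged meaningful (0.0 for an empty set)."""
    records = list(records)
    if not records:
        return 0.0
    meaningful = sum(1 for record in records if is_meaningful(record))
    return meaningful / len(records)


# Hypothetical trial readings; missing (None) or negative values are
# treated as noise here purely for the sake of the example.
readings = [72.0, None, 68.5, -1.0, 70.2]
ratio = veracity_ratio(readings, lambda r: r is not None and r >= 0)
```

A ratio close to 1 would suggest high veracity, and a ratio close to 0 a data set dominated by noise. Where to draw the line depends on the analysis.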

VARIETY OF BIG DATA

Refers to the range of data types involved, generally one of three: structured, semi-structured, and unstructured data. Each type frequently requires distinct processing capabilities and specialist algorithms.


EXAMPLE:

CCTV audio and video files that are generated at various locations in a city.
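
To illustrate why each type tends to need its own processing path, here is a minimal sketch that routes a payload to a different parser depending on its type. The handler names, the `process` function, and the choice of CSV, JSON, and free text as stand-ins for the three types are assumptions made for this example.

```python
import csv
import io
import json


def parse_structured(text):
    """Structured: fixed columns, read here as CSV rows."""
    return list(csv.DictReader(io.StringIO(text)))


def parse_semi_structured(text):
    """Semi-structured: self-describing but flexible, read here as JSON."""
    return json.loads(text)


def parse_unstructured(text):
    """Unstructured: no schema at all, so only split free text into tokens."""
    return text.split()


# Hypothetical dispatch table: one handler per type of data.
HANDLERS = {
    "structured": parse_structured,
    "semi-structured": parse_semi_structured,
    "unstructured": parse_unstructured,
}


def process(kind, payload):
    """Route a payload to the parser that matches its type."""
    return HANDLERS[kind](payload)
```

Real unstructured sources such as audio and video would need far more specialised algorithms than the token split used here.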

Because of these characteristics of the data, the knowledge domain that deals with the storage, processing, and analysis of these data sets has been labeled Big Data.

VELOCITY OF BIG DATA

The speed at which data is generated, at a pace that requires distinct (distributed) processing techniques.

EXAMPLE:

Twitter messages or Facebook posts.
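
Velocity can be expressed as events per second over a recent time window. The sketch below assumes a single process and a hypothetical `RateWindow` class; it only shows the measurement, not the distributed processing that high-velocity data would actually call for.

```python
from collections import deque


class RateWindow:
    """Track event timestamps and report the rate over a sliding window."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self._timestamps = deque()

    def _evict(self, now):
        # Drop events that fall outside the window ending at `now`.
        cutoff = now - self.window_seconds
        while self._timestamps and self._timestamps[0] <= cutoff:
            self._timestamps.popleft()

    def add(self, timestamp):
        """Record one event; timestamps are assumed to arrive in order."""
        self._timestamps.append(timestamp)
        self._evict(timestamp)

    def rate(self, now):
        """Return events per second within the window ending at `now`."""
        self._evict(now)
        return len(self._timestamps) / self.window_seconds


# Hypothetical message arrival times, in seconds.
window = RateWindow(window_seconds=10)
for arrival in [0, 1, 2, 11, 12]:
    window.add(arrival)
current_rate = window.rate(now=12)
```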

VOLUME OF BIG DATA

The size of the data sets that need to be analyzed and processed.

EXAMPLE:

All credit card transactions on a day within Europe.