What is big data?
providers about it
There is no exact indication of the amount of data from which one is talking about big data. As a general rule, big data is mass data in the multi-digit terabyte or petabyte range and beyond. Regardless of a certain amount of data, this definition of big data has prevailed: big data refers to amounts of data that are so large and complex that traditional techniques and methods of data processing and data analysis are no longer applicable.
English term big data It literally means “collective data” and is intended to indicate that it relates to the processing of particularly large amounts of data. The amount of data one is talking about above is often discussed about big data. However, there is no fixed threshold above which the term big data is used. As a rule, it is group data consisting of several numbers terabyte-area or petabyteThe region and beyond.
Due to technical advances, the amount of data to be processed in the big data environment is constantly increasing. No matter the amount of data, the definition prevails that the traditional techniques and methods of data processing and data analysis can no longer be used for big data. The amounts of data are too large, too complex, poorly organized, or too short-lived to be managed and evaluated in the traditional way. Data no longer fits on individual hard drives and cannot be processed in the required time using traditional technologies. Therefore, methods that distribute data across multiple systems, load balancing and process parallelism are used.
Big data systems often run in parallel with many processors or servers. The characteristics of these systems are processing many data records in a short time, fast imports of large amounts of data, real-time queries, fast processing of complex query commands, execution of many parallel queries and processing of structured, unstructured or half-structured data and different data formats.
Five in BigData’s
Many definitions speak of the five “V”s that characterize big data. These five “V” are:
the sound stands for large amount of data. diverse Diversity means and describes the different data formats (text, images, and videos) in which the data is available and the different data structures such as structured, unstructured, or semi-structured data. directional speed (“Speed”) is intended to mean that data is generated at a high speed and can be processed quickly or in real time. reliability (“Truthfulness”) represents the accuracy and quality of the data. The data comes from many different sources and is processed qualitatively. term Values Finally, (“value”) means that the vast amounts of data to be processed and analyzed by an organization or company results in added value.