This section looks at the areas where large volumes of data, including unstructured and semi-structured data, are generated at a fast pace or need to be processed with low latency.
The following data sets were analyzed for telcos:
| Data Set | Volume | Variety | Velocity | Remarks |
|---|---|---|---|---|
| Order | Medium | Medium | Low | Customer application form is semi-structured |
| CDRs | High | Low | High | Billions of records per day |
| Payments | Medium | Low | Medium | |
| Network Data | High | Medium | High | Mostly structured data for call usage; very high volumes; semi-structured if web data (deep packet inspection) is included |
| Subscriber | Medium | Low | Low | |
| Products | Low | Low | Low | Telcos moving towards simpler products |
The ratings show that CDRs, network data, and web usage data (web surfing, social interactions, and content consumption such as video, audio, and shopping) are the data sets most relevant to Big Data, since they rate highest on volume and velocity.
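As a rough illustration of how the ratings lead to that conclusion, the sketch below encodes the table and picks out the data sets rated High on at least one of the three Vs. The "any High" rule and the names `RATINGS` and `big_data_candidates` are assumptions made for this example; the analysis itself does not state a formal selection rule.

```python
# Ratings copied from the table: (volume, variety, velocity).
RATINGS = {
    "Order": ("Medium", "Medium", "Low"),
    "CDRs": ("High", "Low", "High"),
    "Payments": ("Medium", "Low", "Medium"),
    "Network Data": ("High", "Medium", "High"),
    "Subscriber": ("Medium", "Low", "Low"),
    "Products": ("Low", "Low", "Low"),
}


def big_data_candidates(ratings):
    """Return the data sets rated High on at least one dimension.

    The "any High" rule is an assumption for this sketch, not a rule
    taken from the analysis.
    """
    return [name for name, levels in ratings.items() if "High" in levels]


print(big_data_candidates(RATINGS))  # ['CDRs', 'Network Data']
```

Web usage data does not appear as a separate row, so it is covered here by the Network Data entry, whose remarks mention web data and deep packet inspection.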