Usually defined by three elements:
- Volume
- Velocity (speed)
- Variety
Proper organization and use of them is as much important as having them.
Organizations
- Relational Data Model: RDBMS (Relational Database Management System), mainly implemently by SQL (Structured Query Language).
- Entity-Relationship Data Model (ER): . . . It added additional abstraction to increase the usability of the data. In the model, each item was defined independently of its use. Therefore, developers could create new relationships between data sources without complex programming .
- Data warehouse in 90s
- Beginning of unstructured data use -- BLOBs (Binary Large Objects)
- Object Database Management System (ODBMS).
The above has shown Structured -> Unstructured data. Therefore, building, organizing, integrating, analyzing, and deciding (utilizing the data) become extremely important.
- COIAA -- Capture, Organize, Integrate, Analyze, Act
Capture
- Setting architectural foundation
[JPG image (43.71 KB)]
- Computer-generated or Machine-generated Data
- Human-generated Data
- Hybrid Data . . . .
- Sensor data . . . . RFID tags, Smart meters, medical devices, GPS data, etc.
- Web log data . . . Google analytics,
- Point-of-sale data . . . Cashiers' swipes . . . .
- Financial data