Frameworks and Tools
Computation
Hadoop, Spark, Samza, Flink, Hive, Pig, Drill, etcTransportation
Kafka, Flume, Sqoop, Scribe, RabbitMQ, ZeroMQ, IronMQ, etcStorage
HBase, Cassandra, CouchDB, MongoDB, etcCoordination
Zookeeper, Consul, Etcd, Eureka, etcScheduling
Mesos, Yarn, Oozie, etcCommon Data Infrastructure
Data Ingestion Layer
- High throughput
- Simple processing logic, merely a pass through
- Cannot serve as a storage layer
Data Storage Layer
(Operational Store [Indexed] + File System [Un-Indexed])- High availability
- Fault tolerance
- Handles high data volume
- Able to handle various type of data
OLTP vs OLAP
"Online transaction processing" vs "Online analytical processing"
No comments:
Post a Comment