Big Data in the Microsoft Platform
- 21. ETL Tools BI Reporting RDBMS
Zookeepr (Coordination)
Pig (Data Flow) Hive (SQL) Sqoop
Avro (Serialization)
MapReduce (Job Scheduling/Execution System)
HBase (key-value store) (Streaming/Pipes APIs)
HDFS
(Hadoop Distributed File System)
- 26. Block Size = 64MB
Replication Factor = 3
Cost/GB is a few ¢/month
vs $/month