Current Slide

Small screen detected. You are viewing the mobile version of SlideWiki. If you wish to edit slides you will need to use a larger device.

Big Data Ecosystem

‹#›
File system
HDFS, NFS
Resource manager
Mesos, Yarn
Coordination
Zookeeper
Data Acquisition
Apache Flume, Apache Sqoop
Data Stores
MongoDB, Cassandra, Hbase, Hive
Data Processing
 
Frameworks
Hadoop MapReduce, Apache Spark, Apache Storm, Apache Flink
Tools
Apache Pig, Apache Hive
Libraries
SparkR, Apache Mahout, MlLib, etc
Data Integration
 
Message Passing
Managing data heterogeneity
Apache Kafka
SemaGrow, Strabon
Operational Frameworks
 
Monitoring
Apache Ambari

Content Tools

Sources

There are currently no sources for this slide.