This flashcard is just one of a free flashcard set. See all flashcards!
2
Hadoop
– Enables distributed parallel processing of big data across inexpensive computers
– Key services
• Hadoop Distributed File System (HDFS): data storage
• MapReduce: breaks data into clusters for work
• Hbase: NoSQL database
– Used Yahoo, NextBio
– Key services
• Hadoop Distributed File System (HDFS): data storage
• MapReduce: breaks data into clusters for work
• Hbase: NoSQL database
– Used Yahoo, NextBio