Apache Hadoop has become the de facto platform for developing large-scale, data-intensive applications. It is widely used in both academia and industry for research and data mining. Our Hadoop experts will provide an opportunity to understand the latest trends and the roadmap for the Hadoop platform and its ecosystem, and how Hadoop is leveraged in various domains.
MongoDB is a document database that provides high performance, high availability, and easy scalability. Instead of storing data in tables, as is done in a "classical" relational database, MongoDB stores structured data as JSON-like documents with dynamic schemas, making the integration of data in certain types of applications easier and faster.
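To make the idea of JSON-like documents with dynamic schemas concrete, here is a minimal sketch in plain Python. It does not use MongoDB or any MongoDB driver: an ordinary list of dictionaries stands in for a collection, and the `users` collection, its field names, and the `find` helper are all hypothetical names chosen for illustration.

```python
import json

# Assumption: a hypothetical "users" collection, modeled as a plain list.
# Each document is a dict; the two documents deliberately have different
# fields, which a fixed relational table schema would not allow.
users = [
    {"_id": 1, "name": "Ada", "email": "ada@example.com"},
    {
        "_id": 2,
        "name": "Lin",
        "languages": ["Java", "Python"],   # array-valued field
        "address": {"city": "Pune"},       # nested sub-document
    },
]


def find(collection, **criteria):
    """Return the documents whose top-level fields equal every criterion.

    This is an illustrative stand-in for a query, not MongoDB's API.
    """
    return [
        doc
        for doc in collection
        if all(doc.get(key) == value for key, value in criteria.items())
    ]


if __name__ == "__main__":
    # Each document serializes directly to JSON, one object per line.
    for doc in users:
        print(json.dumps(doc))

    # Query by a field that every document happens to share.
    print(find(users, name="Lin"))
```

The point of the sketch is that the second document adds an array and a nested object without any change to the first one, which is the flexibility the paragraph above describes.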
HBase is an open source, non-relational, distributed database modeled after Google's Bigtable and written in Java. It originated within the Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed File System), providing Bigtable-like capabilities for Hadoop.
Apache Mahout is an open source machine learning library from Apache. Its goal is to build scalable machine learning libraries. The algorithms Mahout implements fall under the broad umbrella of machine learning or collective intelligence, and the project aims to produce free implementations of distributed or otherwise scalable machine learning algorithms on the Hadoop platform.