International Business , Data Mining , DataWarehousing, Artificial Intelligence - Big Data Systems Section 1

16. ___________ is a distributed machine learning framework on top of Spark.

Cancel reply

Your email address will not be published. Required fields are marked *


Cancel reply

Your email address will not be published. Required fields are marked *


17. ________ is a resource management platform responsible for managing compute resources in the cluster and using them in order to schedule users and applications.

Cancel reply

Your email address will not be published. Required fields are marked *


Cancel reply

Your email address will not be published. Required fields are marked *


18. ________ is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data.

Cancel reply

Your email address will not be published. Required fields are marked *


Cancel reply

Your email address will not be published. Required fields are marked *


19. Which of the following tool is designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases.

Cancel reply

Your email address will not be published. Required fields are marked *


Cancel reply

Your email address will not be published. Required fields are marked *


20. _______ brings scalable parallel database technology to Hadoop and allows users to submit low latencies queries to the data that's stored within the HDFS or the Hbase without acquiring a ton of data movement and manipulation.

Cancel reply

Your email address will not be published. Required fields are marked *


Cancel reply

Your email address will not be published. Required fields are marked *