Examples of Test Questions

Big Data Basics

  1. What is it Big Data?
  2. Properties of Big Data ? Give the short explanation for each of them.

Hadoop Basics

  1. What is Apache Hadoop? Which problem does it solve?
  2. Hadoop Architecture. What are the core components of Hadoop Framework?
  3. What is Hadoop HDFS component? HDFS Architecture?
  4. What are the main principles of MapReduce programming paradigm?

Apache Spark Basics

  1. What is Apache Spark? What are the main differences between Hadoop and Spark systems?
  2. Apache Spark architecture? Give an short explanation for each of the components
  3. What is it Spark RDD? Which operations does it support? Give an example of these operations?
  4. Possible operations on RDD (Actions, Transformations)
  5. Differences between map and flatMap operations
  6. How could we change the number of partitions of a RDD?
  7. Apache Spark Context? What is it? How to create it? (Give an example of the source code)
  8. Give few examples , how RDD can be created using SparkContext
  9. What is Lazy evaluated RDD mean?
  10. Apache Spark SQLContext(SparkSession)? What is it? For which purposes does it ne
  11. How to define Spark SQLContext(SparkSession)? (Give an example of the source code)
  12. What is Spark DataFrame?
  13. How to download JSON file into Spark SQLContext? Algorithm and source code.
  14. How to download CSV file into Spark SQLContext? Algorithm and source code.
  15. What is Apache Spark Streaming?
  16. What is Streaming DStream? How to create DStream?

results matching ""

    No results matching ""