Examples of Test Questions

Big Data Basics
- What is it Big Data?
- Properties of Big Data ? Give the short explanation for each of them.
Hadoop Basics
- What is Apache Hadoop? Which problem does it solve?
- Hadoop Architecture. What are the core components of Hadoop Framework?
- What is Hadoop HDFS component? HDFS Architecture?
- What are the main principles of MapReduce programming paradigm?
Apache Spark Basics
- What is Apache Spark? What are the main differences between Hadoop and Spark systems?
- Apache Spark architecture? Give an short explanation for each of the components
- What is it Spark RDD? Which operations does it support? Give an example of these operations?
- Possible operations
on RDD (Actions, Transformations)
- Differences between map and flatMap operations
- How could we change the number of partitions of a RDD?
- Apache Spark Context? What is it? How to create it? (Give an example of the source code)
- Give few examples , how RDD can be created using SparkContext
- What is Lazy evaluated RDD mean?
- Apache Spark SQLContext(SparkSession)? What is it? For which purposes does it ne
- How to define Spark SQLContext(SparkSession)? (Give an example of the source code)
- What is Spark DataFrame?
- How to download JSON file into Spark SQLContext? Algorithm and source code.
- How to download CSV file into Spark SQLContext? Algorithm and source code.
- What is Apache Spark Streaming?
- What is Streaming DStream? How to create DStream?