Skip to main content

Spark Hadoop: the essential questions you may need to know

In this series of blogs contain some essential questions about Spark and Hadoop, which are frequently being asked in many interviews. At the same time, those Q&A will help you better understand the big data framework, ecoystem, especially know hot to better apply them
August 2019 · 1 MIN · bigdata · framework

Eco-system Hadoop
#

Basic Concept
#

  1. How do you know about RedHat, Apache. I know, it is straightforward google-it questions. however

  2. Hadoop ecosystem

HDFS
#

  1. what is the mechanism. Can we change a part of it?

[structure] Namenode, Datanode

namenode containes the metadata, import so, usually it has at least

sequentialized

  1. what is a block? in windows, the block is x KB

RDD
#

— RELATED —
Spark多节点配置
Apache Spark安装和环境配置
Using Arrow to Handle Errors and Exceptions in Kotlin
10 Basic Tips on Bash Scripting to Optimize Your Operation Work