BASICS OF BIG DATA

Question
Which of the following statements is/are true?
S1: Apache Spark GraphX provides the following property operators: mapVertices(), mapEdges(), mapTriplets().
S2: RDDs in Spark depend on one or more other RDDs. The representation of the dependencies between RDDs is known as the lineage graph. Lineage graph information is used to compute each RDD on demand, so that whenever part of a persisted RDD is lost, the lost data can be recovered using the lineage graph.
A
Only S1 is true
B
Only S2 is true
C
Both S1 and S2 are true
D
None of the mentioned
Explanation: 
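Statement S2 can be illustrated with a toy model of lineage-based recovery. The sketch below is plain Python, not the Spark API: ToyRDD and its methods are hypothetical names chosen for illustration, and real Spark tracks lineage and recomputes lost data per partition rather than for a whole dataset.

```python
# Toy model of lineage-based recovery; ToyRDD is hypothetical, not a Spark class.
class ToyRDD:
    def __init__(self, compute, parents=()):
        self.compute = compute          # function: list of parent results -> list
        self.parents = list(parents)    # lineage: the datasets this one depends on
        self.cache = None               # persisted data, which may be lost

    def collect(self):
        # Use persisted data if it is still there; otherwise recompute from lineage.
        if self.cache is not None:
            return self.cache
        inputs = [p.collect() for p in self.parents]
        return self.compute(inputs)

    def persist(self):
        self.cache = self.collect()
        return self

    def map(self, f):
        return ToyRDD(lambda ins: [f(x) for x in ins[0]], [self])

    def filter(self, pred):
        return ToyRDD(lambda ins: [x for x in ins[0] if pred(x)], [self])


source = ToyRDD(lambda _: [1, 2, 3, 4, 5])
squares = source.map(lambda x: x * x).persist()
evens = squares.filter(lambda x: x % 2 == 0)

before = evens.collect()
squares.cache = None      # simulate losing the persisted data
after = evens.collect()   # recomputed by walking the lineage: source -> map -> filter
```

Because every dataset records its parents and the function that derives it from them, dropping the cached copy changes only how much work collect() does, not the result.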

Detailed explanation-1: -Property operators (mapVertices(), mapEdges(), mapTriplets()) modify the vertex or edge properties using a user-defined map function and produce a new graph.
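
The behaviour of the three property operators named in S1 can be sketched with a small in-memory graph. This is a plain-Python toy, not GraphX: ToyGraph, map_vertices, map_edges and map_triplets are hypothetical names, and the callback signatures are simplified assumptions rather than the real Scala signatures. The point it shows is that each operator returns a new graph with the same structure and transformed properties.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToyGraph:
    vertices: dict   # vertex id -> vertex property
    edges: tuple     # (src id, dst id, edge property) triples

    def map_vertices(self, f):
        # New graph: each vertex property becomes f(id, property); edges are unchanged.
        return ToyGraph({vid: f(vid, p) for vid, p in self.vertices.items()}, self.edges)

    def map_edges(self, f):
        # New graph: each edge property becomes f(src, dst, property); vertices are unchanged.
        return ToyGraph(self.vertices,
                        tuple((s, d, f(s, d, p)) for s, d, p in self.edges))

    def map_triplets(self, f):
        # Like map_edges, but f also sees the properties of both endpoints.
        return ToyGraph(self.vertices,
                        tuple((s, d, f(self.vertices[s], p, self.vertices[d]))
                              for s, d, p in self.edges))


g = ToyGraph({1: "alice", 2: "bob"}, ((1, 2, 5),))
upper = g.map_vertices(lambda vid, name: name.upper())
doubled = g.map_edges(lambda s, d, w: w * 2)
labeled = g.map_triplets(lambda sp, w, dp: f"{sp}->{dp}:{w}")
```

The original graph g is left untouched by all three calls, which mirrors the "produces a new graph" wording in the explanation.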

Detailed explanation-2: -GraphX is Apache Spark’s API for graphs and graph-parallel computation. GraphX unifies ETL (Extract, Transform and Load), exploratory analysis, and iterative graph computation within a single system.

Detailed explanation-3: -Spark GraphX is a library for building and processing graphs in Apache Spark. A graph is typically constructed from an RDD of vertices and an RDD of edges. The vertices represent entities, and the edges represent relationships between them.

Detailed explanation-4: -The core aggregation operation in GraphX is aggregateMessages. This operator applies a user-defined sendMsg function to each edge triplet in the graph and then uses the mergeMsg function to aggregate those messages at their destination vertex.
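
The send-then-merge pattern described above can be sketched in plain Python. This is a toy under stated assumptions, not the GraphX API: aggregate_messages, send_msg and merge_msg are hypothetical Python names, and send_msg here simply returns (target vertex id, message) pairs instead of using a context object. The example computes each vertex's in-degree by sending 1 along every edge to its destination and summing.

```python
def aggregate_messages(vertices, edges, send_msg, merge_msg):
    """vertices: dict id -> property; edges: iterable of (src, dst, property)."""
    inbox = {}
    for src, dst, prop in edges:
        # send_msg is applied to every edge triplet and yields (target id, message) pairs.
        for target, msg in send_msg(src, vertices[src], dst, vertices[dst], prop):
            # merge_msg combines messages that arrive at the same vertex.
            inbox[target] = merge_msg(inbox[target], msg) if target in inbox else msg
    return inbox


vertices = {1: "a", 2: "b", 3: "c"}
edges = [(1, 2, None), (3, 2, None), (2, 3, None)]

in_degree = aggregate_messages(
    vertices,
    edges,
    send_msg=lambda s, sp, d, dp, p: [(d, 1)],   # send 1 to the destination vertex
    merge_msg=lambda a, b: a + b,                # sum the messages per vertex
)
```

Vertex 1 has no incoming edges, so it receives no message and does not appear in the result.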
