|
With the development of complex network and the increase of data scale, the performance of data analysis becomes more and more important. In this paper, we present a new approach for network analysis about the flight data based on Hadoop which is an implementation of the MapReduce parallel framework: Firstly, we identity and group the information of passengers and flights. Secondly, we extract graph nodes, which represent the passengers, and graph edges, which represent the relationship between the passengers. Finally we visualize the passenger’s egocentric network to help network analysis. |
|
Keywords:Hadoop;Complex network;Data analysis;Visualization |
|