Self Emergence of Knowledge Trees:

Extraction of the Wikipedia hidden hierarchies

Lev Muchnik 1, Royi Itzhak 2, Sorin Solomon 3,4 and Yoram Louzoun 2*

  1. Physics department, Bar Ilan University, Ramat Gan, Israel 52900

  2. Math department, Bar Ilan University, Ramat Gan, Israel, 52900

  3. Racah Institute of Physics Hebrew University, Jerusalem, Israel.

  4. ISI Torino I-10133, Italy.


* To whom correspondence and requests should be sent: Yoram Louzoun, Math department, Bar Ilan University, Ramat Gan, Israel, 52900, phone: 972-3-5317610



The rapid accumulation of knowledge and the recent emergence of new, dynamic and practically unmoderated information repositories have made the classical concept of a hierarchical knowledge structure irrelevant and impossible to impose manually. This has led to modern methods of locating data, such as browsing or searching, which conceal the underlying information structure. Here we propose new methods designed to automatically construct a hierarchy from a network of related terms. We apply these methods to Wikipedia, compare the hierarchy obtained from the network of articles to the complementary acyclic category layer of Wikipedia, and show an excellent fit. We verify our methods on two networks with no a priori hierarchy, the E. coli genetic regulatory network and the C. elegans neural network, and reproduce a known functional order.

Data & Results


Genetic Network

File Name             Details & Explanation
E.Coli Hierarchy      Attraction Basin Hierarchy applied to E.Coli Genetic Network


Neural Network

File Name             Details & Explanation
Neural Net Hierarchy  Hierarchal Intermediacy (betweenness-based) applied to C-elegans Neural Network
Neural Net Hierarchy  Local Hierarchy applied to C-elegans Neural Network
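
To give a rough feel for what a betweenness-based ordering of a directed network can look like, here is a minimal sketch. This page does not define Hierarchal Intermediacy, so the code below is only an assumption: it computes the standard shortest-path betweenness centrality (Brandes' algorithm) and ranks nodes by it, which may differ from the measure used to produce the files above. The function names and the toy graph are hypothetical and are not taken from the data.

```python
from collections import deque


def betweenness(graph):
    """Shortest-path betweenness for an unweighted directed graph.

    graph: dict mapping each node to an iterable of its successors.
    Standard Brandes algorithm; an assumed stand-in, not the paper's measure.
    """
    nodes = set(graph)
    for targets in graph.values():
        nodes.update(targets)
    score = {v: 0.0 for v in nodes}

    for s in nodes:
        order = []                        # nodes in order of BFS discovery
        preds = {v: [] for v in nodes}    # shortest-path predecessors
        sigma = {v: 0 for v in nodes}     # number of shortest paths from s
        dist = {v: -1 for v in nodes}     # -1 marks "not reached yet"
        sigma[s] = 1
        dist[s] = 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in graph.get(v, ()):
                if dist[w] < 0:           # first time w is reached
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)

        # Accumulate dependencies from the farthest nodes back towards s.
        delta = {v: 0.0 for v in nodes}
        for w in reversed(order):
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                score[w] += delta[w]
    return score


def rank_by_betweenness(graph):
    """Return nodes sorted from highest to lowest betweenness (ties by name)."""
    score = betweenness(graph)
    return sorted(score, key=lambda v: (-score[v], str(v)))


# Hypothetical toy network: "a" feeds "b", which feeds "c" and "d".
toy = {"a": ["b"], "b": ["c", "d"]}
ranking = rank_by_betweenness(toy)
```

In the toy network every path from "a" to "c" or "d" passes through "b", so "b" is the only node with non-zero betweenness and comes first in the ranking.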


The work of LM and YL was supported by the Yeshaya Horowitz. The work of YL and RI is also supported by the co3 NEST PATHFINDER of the EU 6th Framework.



