Bio4j 0.7, some numbers

Hi everyone!

There have already been a good few posts showing different uses and applications of Bio4j, but what about the Bio4j data itself? Today I’m going to show you some basic statistics about the different types of nodes and relationships Bio4j is made up of. To start with, here are the overall numbers for Bio4j 0.7:

  • Number of Relationships: 530,642,683
  • Number of Nodes: 76,071,411
  • Relationship types: 139
  • Node types: 38

OK, but how are those relationships and nodes distributed among the different types? In this chart you can see the first 20 relationship types (click on the image below to open the interactive chart):

Here is the same thing for the first 20 node types (click on the image below to open the interactive chart):

You can also check these two files including the numbers from all existing types:

All this data was obtained with the program GetNodeAndRelsStatistics.
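
To give an idea of the kind of tally behind these numbers, here is a minimal Python sketch. It is not the GetNodeAndRelsStatistics program itself and does not touch the Bio4j or Neo4j APIs: it simply assumes you already have one type name per node or relationship, and the function names and the type names in the sample data are made up for illustration.

```python
from collections import Counter


def count_by_type(type_names):
    """Tally how many elements carry each type name."""
    return Counter(type_names)


def top_types(counts, n=20):
    """Return the n most frequent (type, count) pairs, largest first."""
    return counts.most_common(n)


# Hypothetical toy data: one (made-up) type name per relationship in a tiny graph.
sample_relationship_types = [
    "PROTEIN_ORGANISM", "PROTEIN_ORGANISM", "PROTEIN_GO",
    "PROTEIN_GO", "PROTEIN_GO", "TAXON_PARENT",
]

relationship_counts = count_by_type(sample_relationship_types)

# Print the two most common types, one "type<TAB>count" pair per line.
for type_name, count in top_types(relationship_counts, n=2):
    print(f"{type_name}\t{count}")
```

The same two functions would work for node types; only the input list changes. On the real database the type names would have to be streamed from the graph rather than held in a list.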

Have a good day!



  • Patrick Durusau: Excellent! Question: when I checked PubMed, I did not find Neo4j cited in any of the medical literature. I am not a medical professional, but I am interested in what might promote Bio4j in the medical research community. It is too good a resource to go unnoticed. Patrick

    • ppareja: Hi Patrick, I’m glad you liked the post. It’s true that Bio4j may not yet have caught the attention of many people who could definitely make good use of it. What are the reasons for that? Well, I think it could be a mixture of factors. Some people don’t much like learning new technologies, strategies or workflows, and tend to stick to things they already know for as long as possible, which is totally respectable and understandable. Other people, though, may simply not have found out about it yet. It’s also possible that, due to the lack of well-structured project documentation, potential users lose their way when trying to figure out what Bio4j is about and/or miss the parts they could be interested in. I could keep going with more possible reasons that come to mind, but I still couldn’t be really objective: it’s me who created this project :D The point you bring up is actually one of the reasons why we value any sort of feedback for the project so much (especially constructive ‘bad’ feedback that helps us realize its weaknesses). Let me know if you come up with an idea to let more people know about Bio4j! Pablo