A new computational tool called ProtFus, developed by researchers at the Azrieli Faculty of Medicine of Bar-Ilan University, screens scientific literature to validate predictions about the activity of fusion proteins — proteins encoded by the joining of two genes that previously encoded two separate proteins.
ProtFus was recently presented by Somnath Tagore, from the lab of Dr. Milana Frenkel-Morgenstern, and colleagues in PLOS Computational Biology.
Different kinds of fusion proteins can arise naturally in the human body, sometimes leading to cancer. Understanding interactions between fusion proteins and other proteins can help improve personalized cancer treatment. However, the number of scientific papers discussing these interactions is growing rapidly, and there is no standard format for presenting this information. Thus, organizing and keeping abreast of this knowledge poses a major challenge.
ProtFus addresses this challenge by using computational strategies — such as text mining and machine learning — to analyze scientific literature from the online search engine PubMed. It is able to identify fusion proteins that may go by multiple names, and it can identify experimentally verified interactions between fusion proteins and other proteins. When applied to a test set of 1,817 fusion proteins, ProtFus identified 2,908 interactions across 18 cancer types that had been published in scientific texts in PubMed.
ProtFus also builds on a tool previously developed by the researchers in order to predict a given fusion protein’s interactions based on the known properties of its two parental proteins. ProtFus takes a fusion protein of interest, uses the previously developed tool (Chimeric protein-Protein-Interactions, or ChiPPI) to predict its interactions, and then validates these interactions by means of a PubMed search.
“Our findings demonstrate the potential for text mining of large-scale scientific articles using a novel big-data infrastructure, with real-time updating from articles published daily,” says Dr. Frenkel-Morgenstern, Head of the Cancer Genomics and BioComputing of Complex Diseases Lab at the Azrieli Faculty of Medicine and corresponding author of the study. “ProtFus can promote studying alterations of protein networks for individual cancer patients in a fully personalized manner,” highlights Tagore, first author and previous postdoc in Morgenstern’s lab (currently a postdoc at Columbia University, New York).