A biomarker identification model from protein-protein interaction network using natural language processing and graph convolutional network