Semantic integrated document clustering for improved text mining