content analysis, similarity relationships, and text analysis