Инд. авторы: | Mansurova M., Barakhnin V., Aubakirov S., Khibatkhanuly E., Musina A. |
Заглавие: | Parallel text document clustering based on genetic algorithm |
Библ. ссылка: | Mansurova M., Barakhnin V., Aubakirov S., Khibatkhanuly E., Musina A. Parallel text document clustering based on genetic algorithm // Математические и информационные технологии, MIT-2016: Справочник конференции / Conference Information. - 2016: Друштво математичара Косова и МетохијеПриродно-математички факултетКосовска Митровица. - P.133-134. |
Внешние системы: | РИНЦ: 27179115; |
Реферат: | eng: This work describes parallel implementation of algorithm FRIS-Tax for clustering of a corpus of documents. The time of FRIS-Tax operation increases exponentially with the increase in the amount of articles. In this relation, to speed up the work at two stages of the algorithm, technologies of parallel computations were used. First, when choosing individuals in a genetic algorithm. The parallel genetic algorithm is implemented on high performance platform MPJ Express. Secondly, during direct implementation of the clustering algorithm. The loading test revealed two slowest stages in FRIS-Tax algorithm. They appeared to be finding of the first pillar and finding of the next pillar. To speed up these stages, the technology Streams JAVA 8 was used. For monitoring of the algorithm implementation, we developed a web interface which allows observing the current values of genetic parameters and achieved values of the fitness function. The work presents quantitative values of the process execution time demonstrating the advantage of parallel implementation of the algorithm. |
Издано: | 2016 |
Физ. характеристика: | с.133-134 |
Конференция: | Название: Международная конференция «Математические и информационные технологии, MIT-2016» Аббревиатура: MIT-2016 Город: Врнячка Баня, Будва Страна: Сербия, Черногория Даты проведения: 2016-08-28 - 2016-09-05 Ссылка: http://conf.nsc.ru/MIT-2016 |
Цитирование: | 1. Borisova I.A., Zagoruiko N.G. Functions rival similarity in the problem of taxonomy, Proc. Conf. with international participation “Knowledge -Ontology -Theory” (Umbrella-07). Novosibirsk, 2007. T. 2. P. 67-76. 2. Barakhnin V.B., Nekhaeva V.A., Fedotov A.M. On the statement of the similarity measure for the clustering of text documents, Vestn. Novosib. state. Univ. Series: Information technology. 2008. T. 6, no. 1. S. 3-9. 3. Zagoruiko N.G., Barakhnin V.B., Borisova I.A., Tkachev D.A. Clustering of text documents from an electronic database of publications algorithm FRiS-Tax, Computational technologies. -T. 18, number 6, 2013. C. 62-74. 4. Gladkov L.A. Kureichik V.V., V.M. Kureichik Genetic algorithms, Ed. V.M. Kureichik. -2nd ed., Rev. and add. -M.: FIZMATLIT, 2006. -320 p. |