Artificial intelligence

Scientific journal

ISSN 2710-1673

ONLINE: ISSN 2710-1681

Select your language


The use of lexemes fields in data mining of texts arrays

Pavlyshenko B.1
1 Ivan Franko Lviv National University

Full text (PDF)

UDC: 519.765:519.767:004.89
Publication Language: Ukrainian
Stuc. intelekt. 2013; 18(1):98–109

Abstract: The model of semantic and thematic lexemes fields for data mining of text documents has been proposed. The vector model of text documents in the semantic space was considered. The basis of this space is formed by frequency-distributional characteristics of semantic and thematic fields. The experimental analysis of texts samples showed high efficiency of lexemes fields usage in the classification analysis of texts authorship.

Keywords: data mining, Bayesian classification, semantic and thematic fields, vector space model of text documents, texts classification

References:

  1. Pantel P. Journal of Artificial Intelligence Research. 2010. Vol.37. P.141-188.
  2. Brasegyan A.A. Analiz dannyh i protsessov: ucheb. posobie. SPb.:BHV-Peterburg,2009. 512s.
  3. Sebastiani F. ACM Computing Surveys. 2002. Vol. 34. № 1. P. 1-47.
  4. Manning C. D. Prabhakar Raghavan and Hinrich Schütze, Introduction to Information Retrieval.Cambridge University Press. 2008. 496p.
  5. Pavlyshenko B. M. Elektronіka ta іnformatsіyni tehnologіi. 2011. Vypusk 1. S. 212-222.
  6. Pavlyshenko B. Komp’yuternі nauky ta іnformatsіynі tehnologіi : zbіrnyk naukovyh prats’. L’vіv :Vydavnystvo L’vіvs’koi polіtehnіky. 2011. № 710. S. 215-218.
  7. Pavlyshenko B. M. Matematychnі mashyny і systemy. 2012. №1. S. 69-76.
  8. Pavlyshenko B.M. Elektronyka ta іnformatsіynі tehnologіi. 2012. Vypusk 2 . S.164-172.
  9. Verdieva Z.N. Semanticheskie polya v sovremennom angliyskom yazyke. M.: Vysshaya shkola. 1986. 120s.
  10. Polevye struktury v sisteme yazyka./kollektivnaya monografiya pod.red. prof. Z.D.Popova. Voronezh.:Izd-vo Voronezhskogo un-ta.1989. 197s.
  11. Kuznetsova E. V. Leksiko-semanticheskie gruppy russkih glagolov. Irkutsk: Izd-vo Irkut. Un-ta. 1989. 180s.
  12. Ufimtseva A.A. Opyt izucheniya leksiki kak sistemy (na materiale angliyskogo yazyka). M.: Izdatel’stvoAkademii nauk SSSR. 1962. 176s.
  13. Rusanіvs/ky V.M. Іnformatsіyno-lіngvіstychnі osnovy tlumachnoi leksykografіi. Movoznavstvo. K.2002. №6. S.7-31.
  14. Shyrokov V.A. Semantychnі stany movnyh odynyts' ta ih zastosuvannya v kognіtyvnіy leksykografіi.Movoznavstvo. 2005. №3-4. S.47- 62.
  15. Skorohod’ko E.F. Sіtkove modeluvannya leksyky: lіngvіstychna іnterpretatsіya parametrіv semantichnoiskladnostі. Movoznavstvo. 1995. №6. S.19-28.
  16. Gliozzo A. Semantic Domains in Computational Linguistics. Alfio Gliozzo, Carlo Strapparava. Springer. 2009.132 p.
  17. Gol’dberg V.B. Kontrastivnyj analiz leksiko-semanticheskih grup (na materiale angliyskogo, russkogo inemetskogo yazykov). Tambov: TGPI. 1988. 56 s.
  18. Fellbaum C. WordNet. An Electronic Lexical Database. Cambridge. MA: MIT Press. 1998. 432 p.
  19. Mirkin B.G. Analiz kachestvennyh priznakov i struktur. M.: Statistika. 1980. 319 s.

View full text (PDF)