Semantic clustering of subject-oriented languages's texts (morphology and syntax)
D.V. Mikhailov, G.M. Emelyanov

State Educational Institution of Higher Vocational Education "Yaroslav-the-Wise Novgorod State University"

Full text of article: Russian language.

Abstract:
The problem considered is the semantic clustering of texts in Subject-Oriented Natural Language. The approach offered is to elaborate perfomance criteria for syntactic analysis as a toolbox to reveal objects and attributes. Especial attention is given to the Splintered Values and conversives within nouns's syntactic contexts.

Key words: text mining, natural language, subject area, semantic equivalence, knowledge clustering, lattice theory.

References:

  1. Tikhomirov, I.A. Integration of linguistic and statistic methods in search engine "Exactus" [Electronic resource] / I.A. Tikhomirov, I.V. Smirnov // Computional linguistics and intellectual technologies: International Conference "Dialogue-2008". http://www.dialog?21.ru/dialog2008/materials/html/80.htm. -  (in Russian, access date: 18.11.2009).
  2. Vasilev, V.I. Methodological rules of designing of computer tests [Text] / V.I. Vasilev, A.N. Demidov, N.G. Malyshev, T.N. Tjagunova - Moscow: MSUPA, 2000. – 64 p. – (in Russian).
  3. Mel'chuk, I.A. An Attempt at a Theory of "MeaningÛText" Linguistic Models: Semantics, Syntax [Text] / I.A. Mel'chuk. – Moscow: Languages of Slavonic Culture, 1999. – 345 p. – (in Russian).
  4. Mikhailov, D.V. Formation and clustering of Russian's nouns's contexts within the frameworks of Splintered Values [Text] / D.V. Mikhailov, G.M. Emelyanov, N.A.  Stepanova // 9th Int. Conf. "Pattern Recognition and Image Analysis: New Information Technologies" (PRIA-9-2008). – Nizhni Novgorod. – NNSU. – 2008. – Vol.2. – P. 39-42.
  5. Osipov, G.S. Knowledge acquisition by intellectual systems: fundamentals of theory and technology [Text] / G.S.  Osipov. – Moscow: Nauka, 1997. – 112 p. – (in Russian).
  6. Nozhov, I.M. Syntactic analysis [Electronic resource] / I.M. Nozhov // Computerra. – 2002. – No21 (446). http://www.computerra.ru/offline/2002/446/18250/. - (in Russian, access date: 18.11.2009).
  7. Emelyanov, G.M. Conceptually-situational modeling of process of synonymic transformation of the natural-language statements as machine learning on the basis of precedents [Text] / G.M. Emelyanov, A.N. Kornyshov, D.V. Mikhailov // Scientific-theoretical magazine «Artificial intelligence». – 2006. - No2. – P. 72-75. – (in Russian).
  8. Kibrik, A.E. Sketches on the general and applied questions of linguistics [Text] / А.Е. Кибрик. – Moscow: KomKniga, 2005. – 332 p. – (in Russian).
  9. Ganter, B. Formal Concept Analysis – Mathematical Foundations [Текст] / Ganter B. and Wille R. - Berlin : Springer-Verlag, 1999. - 284 p.
  10. Software package of syntactic analysis and machine translation [Electronic resource] // http://cs.isa.ru:10000/dwarf/. - (in Russian, access date: 18.11.2009).
  11. The Concept Explorer [Electronic resource] // http://conexp.sourceforge.net. - (access date: 18.11.2009).
  12. Gusev, V.D. Algorithm of revealing of set expressions with taking into account their variability (morphological and combinatorial) [Electronic resource] / V.D. Gusev, N.V. Salomatina // Computional linguistics and intellectual technologies: International Conference "Dialogue-2004". http://www.dialog-21.ru/Archive/2004/Salomatina.htm. - (in Russian, access date: 18.11.2009).

© 2009, ИСОИ РАН
Россия, 443001, Самара, ул. Молодогвардейская, 151; электронная почта: ko@smr.ru ; тел: +7 (846 2) 332-56-22, факс: +7 (846 2) 332-56-20